{"id":2218,"date":"2012-03-09T06:06:18","date_gmt":"2012-03-09T10:06:18","guid":{"rendered":"http:\/\/blogs.law.harvard.edu\/sj\/?p=2218"},"modified":"2012-03-09T06:06:18","modified_gmt":"2012-03-09T10:06:18","slug":"j-b-s-haldane-on-statistical-fraud","status":"publish","type":"post","link":"https:\/\/archive.blogs.harvard.edu\/sj\/2012\/03\/09\/j-b-s-haldane-on-statistical-fraud\/","title":{"rendered":"J. B. S. Haldane on Statistical Fraud"},"content":{"rendered":"<p><em>From <a href=\"http:\/\/en.wikipedia.org\/wiki\/J._B._S._Haldane\">Haldane<\/a>&#8216;s 1941 essay in <\/em>Eureka<em>\u00a0#6 on &#8220;<\/em><a href=\"http:\/\/www.archim.org.uk\/eureka\/27\/faking.html\">The Faking of Genetical Results<\/a><em>&#8220;, reproduced here with appropriate corrections and hyperlinks.<\/em><\/p>\n<div style=\"padding:10px 10px 2px 10px;border:1px solid #eee\">\n<a href=\"http:\/\/en.wikipedia.org\/wiki\/John_Scott_Haldane\">MY\u00a0FATHER<\/a>\u00a0published a number of papers on blood analysis. In the proofs of one of them the following sentence, or something very like it, occurred: &#8220;<em>Unless the blood is very thoroughly faked, it will be found that duplicate determinations rarely agree.<\/em>&#8221; Every biochemist will sympathise with this opinion. I may add that the verb &#8220;to lake,&#8221; when applied to blood, means to break up the corpuscles so that it becomes transparent.<\/p>\n<p>In genetical work also, duplicates rarely agree unless they are faked. Thus I may mate two brother black mice, both sons of a black father and a white mother, with two white sisters, and one will beget 10 black and 15 white young; the other 15 black and 10 white. To the ingenuous biologist this appears to be a bad agreement. A mathematician will tell him that where the same ratio of black to white is expected in each family, so large a discrepancy (though how best to compare discrepancies is not obvious) will occur in about 26 percent of all cases. If the mathematician is a rigorist he will say the same thing a little more accurately in a great many more words.<\/p>\n<p>A biologist who has no mathematical knowledge, and, what is vastly more serious, no scientific honour, will be tempted to fake his results, and say that he got 12 black and 13 white in one family, and 13 black and 12 white in the other. The temptation is generally more subtle. In one of a number of families where equality is expected he gets 19 black and 6 white mice. It looks much more like a ratio of 3 black to 1 white. How is he to explain it? Wasn&#8217;t that the cage whose door once seemed to be insecurely fastened? Perhaps the female got out for a while or some other mouse got in. Anyway he had better reject the family. The total gives a better fit to expectation if he does so, by the way. Our poor friend has forgotten the binomial theorem. A study of the expansion of\u00a0<strong>(1+x\/2)<sup>25<\/sup><\/strong>\u00a0would have shown him that as bad a fit or worse would be obtained with a probability of 122753 x 2<sup>-23<\/sup>, or <strong>.0146<\/strong>. There is nothing at all surprising in getting one family as aberrant as this in a set of 20. But he is now on a slippery slope.<\/p>\n<p>He gets his Ph.D. \u00a0He wants a fellowship, and time is short. But he has been reading\u00a0<em>Nature<\/em>\u00a0and noticed two letters*\u00a0to that journal of which I was joint author, in which I might appear to have hinted at faking by my genetical colleagues. Thoroughly alarmed, he goes to a venal mathematician. Cambridge is full of mathematicians who have been so corrupted by quantum mechanics that they use series which are clearly <a href=\"http:\/\/en.wikipedia.org\/wiki\/Divergent_series\">divergent<\/a>, and not even proved to be summable. Interrupting such a one in the midst of an orgy of <a href=\"http:\/\/en.wikipedia.org\/wiki\/Homi_J._Bhabha\">Bhabha<\/a> and benzedrine, our villain asks for a treatise on faking.<\/p>\n<div style=\"padding:6px 10px 4px 10px;border:1px solid #ddd;color:#666\">&#8220;I am trying to reconcile <a href=\"http:\/\/en.wikipedia.org\/wiki\/Edward_Arthur_Milne\">Milne<\/a>, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Max_Born\">Born<\/a> and <a href=\"http:\/\/en.wikipedia.org\/wiki\/Paul_Dirac\">Dirac<\/a>, not to mention some facts which don&#8217;t seem to agree with any of them, or with Eddington,&#8221; replies the debauchee, &#8220;and I feel discontinuous in every interval; but here goes.<\/p>\n<p>&#8220;I suppose you know the hypothesis you want to prove. It wouldn&#8217;t be a bad thing to grow a few mice or flies or parrots or cucumbers or whatever you&#8217;re supposed to be working on, to see if your hypothesis is anywhere near the facts. Suppose in a given series of families you expect to get four classes of hedgehogs or whatnot with frequencies\u00a0<em>p<\/em><sub>1<\/sub>,\u00a0<em>p<\/em><sub>2<\/sub>,\u00a0<em>p<\/em><sub>3<\/sub>,\u00a0<em>p<\/em><sub>4<\/sub>, and your total is\u00a0<strong><em>S<\/em><\/strong>, I shouldn&#8217;t advise you to say you got just\u00a0<em>Sp<\/em><sub>1<\/sub>,\u00a0<em>Sp<\/em><sub>2<\/sub>,\u00a0<em>Sp<\/em><sub>3<\/sub>\u00a0and\u00a0<em>Sp<\/em><sub>4<\/sub>, or even the nearest whole number. Here is what you&#8217;d better do. Say you got\u00a0<em>A<\/em><sub>1<\/sub>,\u00a0<em>A<\/em><sub>2<\/sub>,\u00a0<em>A<\/em><sub>3<\/sub>\u00a0and\u00a0<em>A<\/em><sub>4<\/sub>, and evaluate<\/p>\n<p align=\"center\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-eq2.png?resize=290%2C42\" alt=\"\\chi^2 = ((A_1 - Sp_1)^2 \/ Sp_1) + ((A_2 - Sp_2)^2 \/ Sp_2) + ...\" width=\"290\" height=\"42\" \/><\/p>\n<p>Your\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-chi2.png?resize=21%2C19\" alt=\"\\chi^2\" width=\"21\" height=\"19\" \/>\u00a0has three degrees of freedom. That is to say you can say you got\u00a0<em>A<\/em><sub>1<\/sub>\u00a0red,\u00a0<em>A<\/em><sub>2<\/sub>\u00a0green and\u00a0<em>A<\/em><sub>3<\/sub>\u00a0blue hedgehogs. But you will then have to say you got\u00a0<strong><em>S<\/em><\/strong>&#8211;<em>A<\/em><sub>1<\/sub>&#8211;<em>A<\/em><sub>2<\/sub>&#8211;<em>A<\/em><sub>3<\/sub>\u00a0purple ones. Hence the expected value of\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-chi2.png?resize=21%2C19\" alt=\"\\chi^2\" width=\"21\" height=\"19\" \/>\u00a0is 3, and its standard error is\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-sqrt6.png?resize=29%2C19\" alt=\"\\sqrt{6}\" width=\"29\" height=\"19\" \/>; so choose your\u00a0<em>A<\/em>&#8216;s so as to give a\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-chi2.png?resize=21%2C19\" alt=\"\\chi^2\" width=\"21\" height=\"19\" \/>\u00a0anywhere between 1 and 6. This is called faking of the first order. It isn&#8217;t really necessary. You might have\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-eq3.png?resize=61%2C36\" alt=\"p_1 = 9\/16\" width=\"61\" height=\"36\" \/>,\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogs.law.harvard.edu\/sj\/files\/2012\/03\/p2-equals-p3.png?resize=92%2C36\" alt=\"p_2 = p_3 = 3\/16\" width=\"92\" height=\"36\" \/>,\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-eq5.png?resize=61%2C36\" alt=\"p_4 = 1\/16\" width=\"61\" height=\"36\" \/>\u00a0and\u00a0<strong><em>A<\/em><sub>1<\/sub>=9,\u00a0<em>A<\/em><sub>2<\/sub>=<em>A<\/em><sub>3<\/sub>=3,\u00a0<em>A<\/em><sub>4<\/sub>=1<\/strong>. The probability of getting this is\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-eq6.png?resize=99%2C43\" alt=\"(16! 3^24) \/ (9! (3!)^2 1! 16^16)\" width=\"99\" height=\"43\" \/>, which is only just under <strong>.04<\/strong>. \u00a0However, it looks better not to get the exact numbers expected, and if you do it on a population of hundreds or thousands you may be caught out.<br \/>\n&#8220;Your second order faking is the same sort of thing. Supposing your total is made up of\u00a0<strong><em>n<\/em><\/strong>\u00a0families, and you say the\u00a0<strong><em>r<\/em><\/strong>th consisted of\u00a0<em>a<\/em><sub><em>r<\/em>1<\/sub>,\u00a0<em>a<\/em><sub><em>r<\/em>2<\/sub>,\u00a0<em>a<\/em><sub><em>r<\/em>3<\/sub>,\u00a0<em>a<\/em><sub><em>r<\/em>4<\/sub>\u00a0members of the four classes,\u00a0<strong><em>s<\/em><sub><em>r<\/em><\/sub><\/strong>\u00a0in all: you take<\/p>\n<p align=\"center\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-eq7.png?resize=260%2C42\" alt=\"((a_{r1} - s_r p_1)^2 \/ s_r p_1) + ((a_{r2} - s_r p_2)^2 \/ s_r p_2) + ...\" width=\"260\" height=\"42\" \/><\/p>\n<p>and sum for all values of\u00a0<strong><em>r<\/em><\/strong>. Your total ought to be somewhere near <strong>3<em>n<\/em><\/strong>. The standard error is\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-sqrt6n.png?resize=40%2C19\" alt=\"\\sqrt{6n}\" width=\"40\" height=\"19\" \/>, and it&#8217;s better to be too high than too low. A chap called <a href=\"http:\/\/en.wikipedia.org\/wiki\/Franz_Moewus\">Moewus<\/a> in Berlin who counted different types of <a href=\"http:\/\/en.wikipedia.org\/wiki\/Chlamydomonas\">algae<\/a> (or so he said), got such a magnificent agreement between observed and theoretical results, that if every member of the human race had repeated his work once a month for <strong>10<sup>12<\/sup><\/strong>\u00a0years, they might expect as good a fit on one occasion (though not with great confidence). So Moewus certainly hadn&#8217;t done any second order faking. Of course I don&#8217;t suggest that he did any faking at all. He <a href=\"http:\/\/books.google.com\/books\/about\/Where_the_Truth_Lies.html?id=1rDCaJ25KRAC\">may have run into<\/a>\u00a0one of those theoretically possible miracles, like the monkey typing out the text of Hamlet by mere luck. But I shouldn&#8217;t have a miracle like that in your fellowship dissertation.<\/p>\n<p>&#8220;There is also third order faking. The <strong>4<em>n<\/em><\/strong>\u00a0different components of\u00a0<img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/www.archim.org.uk\/eureka\/27\/faking-chi2.png?resize=21%2C19\" alt=\"\\chi^2\" width=\"21\" height=\"19\" \/>\u00a0should be distributed round their mean in the proper way. That is to say, not merely their mean, but their mean square, cube and so on, should be near the expected values (but not too near). But I shouldn&#8217;t worry too much about the higher orders. The only examiner who is likely to spot that you haven&#8217;t done them is <a href=\"http:\/\/en.wikipedia.org\/wiki\/J._B._S._Haldane\">Haldane<\/a>, and he&#8217;ll probably be interned as a Red before you send your thesis in. Of course you might get <a href=\"http:\/\/en.wikipedia.org\/wiki\/Ronald_Fisher\">R. A. Fisher<\/a>, which would be quite as bad. So if you are worried about it you&#8217;d better come back and see me later.&#8221;<\/div>\n<p>Man is an orderly animal. He finds it very hard to imitate the disorder of nature. In fact the situation is the exact opposite of what the reader of <a href=\"http:\/\/en.wikipedia.org\/wiki\/William Paley\">Paley<\/a>&#8216;s <em>Evidences<\/em>\u00a0might expect. But the problem is an interesting one, because it raises in a sharp and concrete way the question of what is meant by randomness, a question which, I believe, has not been fully worked out. The number of independent numerical criteria of randomness which can be applied increases with the number of observations, but much more slowly, perhaps as its logarithm. The criteria now in use have been developed to search for excessive irregularity, that is to say, unduly bad fit between observation and hypothesis. It does not follow that they are so well adapted to a search for an unduly good fit. Here, I believe, is a real problem for students of probability. Its solution might lead to a better set of axioms for that very far from rigorous but none the less fascinating branch of mathematics.<\/p>\n<p>* see U. Philip and J. B. S. Haldane (1939).\u00a0<em>Nature<\/em>,\u00a0<strong>143<\/strong>, p. 334. \u00a0and<br \/>\n&nbsp; Hans Gr\u00fcneberg and J. B. S. Haldane (1940).\u00a0<em>Nature<\/em>,\u00a0<strong>145<\/strong>, p. 704.\n<\/div>\n<p>Two closing comments by <a href=\"http:\/\/en.wikipedia.org\/wiki\/Thomas_William_K%C3%B6rner\">T. W. K\u00f6rner<\/a>, who found Haldane&#8217;s essay worth reprinting in his brilliant <a href=\"http:\/\/www.amazon.com\/Fourier-Analysis-T-246-rner\/dp\/0521389917\">textbook<\/a> on Fourier analysis:  <\/p>\n<blockquote><p>&#8220;<em>The reluctance of the scientific community to accept the possibility of fraud is illustrated by the fact that Moewus was still cited in the literature (and even spoken of as a possible Nobel prize winner) until 1953.  However, no one else ever succeeded in repeating his experiments&#8230;<\/p>\n<p>Unfortunately the statistical war against fraud is now over and the cheaters have won.  The kind of tests proposed by Haldane depended on the fact that &#8216;higher order faking&#8217; required a great deal of computational work. The invention and accessibility of the computer means that the computational work involved has ceased to be a problem for the dishonest scientist.  In the physical and biological sciences the possibility that others will attempt to replicate experiments may act as a sufficient deterrent but in purely statistical subjects like sociology and experimental psychology the poblems raised by potential fraud have still to be faced<\/em>.&#8221;<\/p><\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>From Haldane&#8216;s 1941 essay in Eureka\u00a0#6 on &#8220;The Faking of Genetical Results&#8220;, reproduced here with appropriate corrections and hyperlinks. MY\u00a0FATHER\u00a0published a number of papers on blood analysis. In the proofs of one of them the following sentence, or something very like it, occurred: &#8220;Unless the blood is very thoroughly faked, it will be found that [&hellip;]<\/p>\n","protected":false},"author":1202,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[210,213,209,1],"tags":[5553,3826,60479,136,291,4497],"class_list":["post-2218","post","type-post","status-publish","format-standard","hentry","category-chain-gang","category-metrics","category-popular-demand","category-uncategorized","tag-analysis","tag-fraud","tag-haldane","tag-mathematics","tag-science","tag-statistics"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p7iVvB-zM","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/posts\/2218","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/users\/1202"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/comments?post=2218"}],"version-history":[{"count":13,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/posts\/2218\/revisions"}],"predecessor-version":[{"id":2232,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/posts\/2218\/revisions\/2232"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/media?parent=2218"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/categories?post=2218"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/tags?post=2218"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}