
<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Always Watch Out for Number One</title>
	<atom:link href="http://www.damninteresting.com/always-watch-out-for-number-one/feed" rel="self" type="application/rss+xml" />
	<link>http://www.damninteresting.com/always-watch-out-for-number-one</link>
	<description>A collection of Damn Interesting things</description>
	<lastBuildDate>Tue, 16 Mar 2010 09:41:34 -0700</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: allduerespect88</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-22988</link>
		<dc:creator>allduerespect88</dc:creator>
		<pubDate>Fri, 03 Oct 2008 12:51:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-22988</guid>
		<description>A1c: You had ice-cream for dinner?
You&#039;re my hero</description>
		<content:encoded><![CDATA[<p>A1c: You had ice-cream for dinner?<br />
You&#8217;re my hero</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: a1c</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-22480</link>
		<dc:creator>a1c</dc:creator>
		<pubDate>Sun, 10 Aug 2008 01:50:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-22480</guid>
		<description>Many universities accept term papers only electronically, and share them with other universities to pattern-match for cheaters.

Also, anyone forced to give a number between 1 and 10 often gives 7, with 3 being the 2nd most likely statistic.

In other news, I had 1,234 scoops of ice cream for dinner last night.</description>
		<content:encoded><![CDATA[<p>Many universities accept term papers only electronically, and share them with other universities to pattern-match for cheaters.</p>
<p>Also, anyone forced to give a number between 1 and 10 often gives 7, with 3 being the 2nd most likely statistic.</p>
<p>In other news, I had 1,234 scoops of ice cream for dinner last night.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bc5431</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-20510</link>
		<dc:creator>bc5431</dc:creator>
		<pubDate>Mon, 17 Mar 2008 22:00:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-20510</guid>
		<description>[quote]rexodus said: &quot;For instance, look at average height (in feet) for 20-year-old males. It&#039;s a bell curve distribution (or close to one) with 5 as the most common first digit.  Sure, you could convert this to meters, and the most common digit would be 1. But that&#039;s just a matter of shifting the curve. If you shifted it again into inches, the most common first digit is 6.&quot;[/quote]

I think the problem with your logic here is that you data set is too constrained, which will, as Alan pointed out at the beginning of the article, cause it to fail Benford&#039;s Law. It is not to constrained because it is too small a data set, but because the range is too small. That is, the range between smallest number and largest number is too small. According to Wikipedia, the shortest fully grown man is 2&#039;11&quot; while the tallest man in recorded history was 8&#039;11&quot;. Thus there is only an approximately 300% increase from shortest to tallest, regardless of what unit system you use. Compare this to the range in river lengths or yearly incomes, to go back to the two examples so heavily cited above. To run the full range of starting-digit-1 to starting-digit-9
requires a 900% increase in value, again regardless of what unit system is used. Bedford&#039;s Law requires that the numbers involved traverse this range at least once , otherwise at least 1 number will always have a 0% occurrence in the opening position and the number set will thus immediately fail the test.

So, I think the bell curve distributions will still follow Bedford&#039;s Law, but only when the distribution covers a sufficiently wide enough range of numbers, not just a sufficiently large enough set. That, it appears, is a key distinction to &quot;too constrained&quot; which it does not seem was made above.</description>
		<content:encoded><![CDATA[<p>[quote]rexodus said: &#8220;For instance, look at average height (in feet) for 20-year-old males. It&#8217;s a bell curve distribution (or close to one) with 5 as the most common first digit.  Sure, you could convert this to meters, and the most common digit would be 1. But that&#8217;s just a matter of shifting the curve. If you shifted it again into inches, the most common first digit is 6.&#8221;[/quote]</p>
<p>I think the problem with your logic here is that you data set is too constrained, which will, as Alan pointed out at the beginning of the article, cause it to fail Benford&#8217;s Law. It is not to constrained because it is too small a data set, but because the range is too small. That is, the range between smallest number and largest number is too small. According to Wikipedia, the shortest fully grown man is 2&#8242;11&#8243; while the tallest man in recorded history was 8&#8242;11&#8243;. Thus there is only an approximately 300% increase from shortest to tallest, regardless of what unit system you use. Compare this to the range in river lengths or yearly incomes, to go back to the two examples so heavily cited above. To run the full range of starting-digit-1 to starting-digit-9<br />
requires a 900% increase in value, again regardless of what unit system is used. Bedford&#8217;s Law requires that the numbers involved traverse this range at least once , otherwise at least 1 number will always have a 0% occurrence in the opening position and the number set will thus immediately fail the test.</p>
<p>So, I think the bell curve distributions will still follow Bedford&#8217;s Law, but only when the distribution covers a sufficiently wide enough range of numbers, not just a sufficiently large enough set. That, it appears, is a key distinction to &#8220;too constrained&#8221; which it does not seem was made above.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rexodus</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-19737</link>
		<dc:creator>rexodus</dc:creator>
		<pubDate>Tue, 29 Jan 2008 19:32:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-19737</guid>
		<description>[quote]ahecht said: &quot;The reason it works for something natural like rivers (or mountain), but not for truely random numbers like all the digits from 1 to 100 is that Bedford&#039;s Law only applies to data sets that have a normal &quot;bell curve&quot; distribution. In other words, if higher numbers are more rare than lower numbers, you can apply the law.&quot;[/quote]

Actually, this Law does NOT work for bell curve distributions. In a perfect bell curve, the median and the mode are the same value. This means high numbers and low numbers are equally rare, with numbers somewhere in between being most common. For instance, look at average height (in feet) for 20-year-old males. It&#039;s a bell curve distribution (or close to one) with 5 as the most common first digit.  Sure, you could convert this to meters, and the most common digit would be 1. But that&#039;s just a matter of shifting the curve. If you shifted it again into inches, the most common first digit is 6. 

As long as the curve is bell-shaped, the most common first digit will be whatever the first digit of the median is.

But river length is not a bell curve distribution. Neither is amounts on a tax form. In both of those distributions, the median and the mode are not the same. If the distribution were unimodal, which it may not be, the mode would not be the same as the median. There would be a large number of rivers a little bit shorter than the median, and a few rivers extremely longer than the median. The distribution is much more likely to look like this: http://www.gatsby.ucl.ac.uk/~turner/Benford&#039;s%20law/Benford%20Tea%20Talk_files/Benfords%20law3.png
This is because short rivers are much more common than long ones. And, similarly, small dollar amounts are much more likely than large ones.

Take a look at that distribution and adjust the scale however you like. The most common data points will begin with 1 in every scale.</description>
		<content:encoded><![CDATA[<p>[quote]ahecht said: &#8220;The reason it works for something natural like rivers (or mountain), but not for truely random numbers like all the digits from 1 to 100 is that Bedford&#8217;s Law only applies to data sets that have a normal &#8220;bell curve&#8221; distribution. In other words, if higher numbers are more rare than lower numbers, you can apply the law.&#8221;[/quote]</p>
<p>Actually, this Law does NOT work for bell curve distributions. In a perfect bell curve, the median and the mode are the same value. This means high numbers and low numbers are equally rare, with numbers somewhere in between being most common. For instance, look at average height (in feet) for 20-year-old males. It&#8217;s a bell curve distribution (or close to one) with 5 as the most common first digit.  Sure, you could convert this to meters, and the most common digit would be 1. But that&#8217;s just a matter of shifting the curve. If you shifted it again into inches, the most common first digit is 6. </p>
<p>As long as the curve is bell-shaped, the most common first digit will be whatever the first digit of the median is.</p>
<p>But river length is not a bell curve distribution. Neither is amounts on a tax form. In both of those distributions, the median and the mode are not the same. If the distribution were unimodal, which it may not be, the mode would not be the same as the median. There would be a large number of rivers a little bit shorter than the median, and a few rivers extremely longer than the median. The distribution is much more likely to look like this: <a href="http://www.gatsby.ucl.ac.uk/~turner/Benford" rel="nofollow">http://www.gatsby.ucl.ac.uk/~turner/Benford</a>&#8217;s%20law/Benford%20Tea%20Talk_files/Benfords%20law3.png<br />
This is because short rivers are much more common than long ones. And, similarly, small dollar amounts are much more likely than large ones.</p>
<p>Take a look at that distribution and adjust the scale however you like. The most common data points will begin with 1 in every scale.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alchemist</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-17935</link>
		<dc:creator>Alchemist</dc:creator>
		<pubDate>Tue, 23 Oct 2007 21:12:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-17935</guid>
		<description>[quote]Alan Bellows said: &quot;Good point.

0.00000164 lightyears = 9,640,739.7 miles… that&#039;s one LONG river.&quot;[/quote]

which is 1/10 or (.1) the approximate distance from the earth to the sun.</description>
		<content:encoded><![CDATA[<p>[quote]Alan Bellows said: &#8220;Good point.</p>
<p>0.00000164 lightyears = 9,640,739.7 miles… that&#8217;s one LONG river.&#8221;[/quote]</p>
<p>which is 1/10 or (.1) the approximate distance from the earth to the sun.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mb799</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-16020</link>
		<dc:creator>mb799</dc:creator>
		<pubDate>Thu, 28 Jun 2007 11:44:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-16020</guid>
		<description>&lt;p&gt;There are a number of articles on the use of &lt;a href=&quot;http://ezrstats.com&quot; rel=&quot;nofollow&quot;&gt;Benford&#039;s law&lt;/a&gt; , in addition to software and tutorials on the use of the law.  It has extensive applicability in auditing and forensic accounting.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>There are a number of articles on the use of <a href="http://ezrstats.com" rel="nofollow">Benford&#8217;s law</a> , in addition to software and tutorials on the use of the law.  It has extensive applicability in auditing and forensic accounting.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mr_chol</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-11345</link>
		<dc:creator>mr_chol</dc:creator>
		<pubDate>Thu, 28 Dec 2006 17:17:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-11345</guid>
		<description>&lt;p&gt;&lt;span class=&#039;quote&#039;&gt;chswartz said: &quot;Look, this Law is easy to explain. Have you ever seen a Log plot? Check out http://www.science-projects.com/SemiLogUse.htm&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span class=&#039;quote&#039;&gt;
Hmm. . notice how the coverage between 1 and 2 is a lot bigger than between 8 and 9? In fact, it&#039;s about 30% of the area from 1 to 10.
You see, this is just a matter of things GROWING, natural or artificial. When a river is forming, it might grow 10% longer in a century. So if it&#039;s a 100 miles long, it could easily grow by 10miles. But a ONE mile long river is not likely to grow by ten miles. So we get a bunch of short rivers and a few long ones, which looks wierd on a plot, but nice and uniform on a LOG plot! The log plot has wider 1&#039;s, so the (now uniform) data is more likely to be from 1 to 2 than from 8 to 9. 

Am I right here??&quot;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;


No!

Feel free to comment me if I&#039;m mistaken.

1) The law does not just apply rivers. The length of rivers was used as an example, I believe the author started with tax returns...

However, on the subject of rivers...

2) Although I agree that it takes longer to move from 1 to 2 exponentially (or on a log plot) than 8 to 9, you must remember that our system of measurement is essentially arbitrary. If a river is 1.1 miles long, it will take a long time to reach 2 miles exponentially. But 1.1 miles is also 1.76km. it will take a much shorter time to reach &quot;2&quot; by exponential growth. It doesnt mean that the river is any longer or growing any faster.

In answer to the other person&#039;s comment above, I suspect that if measurements were converted from miles to km or vice versa, the law would still hold.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p><span class='quote'>chswartz said: &#8220;Look, this Law is easy to explain. Have you ever seen a Log plot? Check out <a href="http://www.science-projects.com/SemiLogUse.htm" rel="nofollow">http://www.science-projects.com/SemiLogUse.htm</a></span></p>
<p><span class='quote'><br />
Hmm. . notice how the coverage between 1 and 2 is a lot bigger than between 8 and 9? In fact, it&#8217;s about 30% of the area from 1 to 10.<br />
You see, this is just a matter of things GROWING, natural or artificial. When a river is forming, it might grow 10% longer in a century. So if it&#8217;s a 100 miles long, it could easily grow by 10miles. But a ONE mile long river is not likely to grow by ten miles. So we get a bunch of short rivers and a few long ones, which looks wierd on a plot, but nice and uniform on a LOG plot! The log plot has wider 1&#8217;s, so the (now uniform) data is more likely to be from 1 to 2 than from 8 to 9. </p>
<p>Am I right here??&#8221;</span></p>
<p>No!</p>
<p>Feel free to comment me if I&#8217;m mistaken.</p>
<p>1) The law does not just apply rivers. The length of rivers was used as an example, I believe the author started with tax returns&#8230;</p>
<p>However, on the subject of rivers&#8230;</p>
<p>2) Although I agree that it takes longer to move from 1 to 2 exponentially (or on a log plot) than 8 to 9, you must remember that our system of measurement is essentially arbitrary. If a river is 1.1 miles long, it will take a long time to reach 2 miles exponentially. But 1.1 miles is also 1.76km. it will take a much shorter time to reach &#8220;2&#8243; by exponential growth. It doesnt mean that the river is any longer or growing any faster.</p>
<p>In answer to the other person&#8217;s comment above, I suspect that if measurements were converted from miles to km or vice versa, the law would still hold.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: chswartz</title>
		<link>http://www.damninteresting.com/always-watch-out-for-number-one#comment-6750</link>
		<dc:creator>chswartz</dc:creator>
		<pubDate>Thu, 29 Jun 2006 07:31:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.damninteresting.com/?p=210#comment-6750</guid>
		<description>&lt;p&gt;Look, this Law is easy to explain.  Have you ever seen a Log plot? Check out  http://www.science-projects.com/SemiLogUse.htm&lt;/p&gt;&lt;p&gt;Hmm. . notice how the coverage between 1 and 2 is a lot bigger than between 8 and 9?  In fact, it&#039;s about 30% of the area from 1 to 10.  
You see, this is just a matter of things GROWING, natural or artificial.  When a river is forming, it might grow 10% longer in a century.  So if it&#039;s a 100 miles long, it could easily grow by 10miles.  But a ONE mile long river is not likely to grow by ten miles.   So we get a bunch of short rivers and a few long ones, which looks wierd on a plot, but nice and uniform on a LOG plot!  The log plot has wider 1&#039;s, so the (now uniform) data is more likely to be from 1 to 2 than from 8 to 9.    &lt;/p&gt;&lt;p&gt;Am I right here??&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Look, this Law is easy to explain.  Have you ever seen a Log plot? Check out  <a href="http://www.science-projects.com/SemiLogUse.htm" rel="nofollow">http://www.science-projects.com/SemiLogUse.htm</a></p>
<p>Hmm. . notice how the coverage between 1 and 2 is a lot bigger than between 8 and 9?  In fact, it&#8217;s about 30% of the area from 1 to 10.<br />
You see, this is just a matter of things GROWING, natural or artificial.  When a river is forming, it might grow 10% longer in a century.  So if it&#8217;s a 100 miles long, it could easily grow by 10miles.  But a ONE mile long river is not likely to grow by ten miles.   So we get a bunch of short rivers and a few long ones, which looks wierd on a plot, but nice and uniform on a LOG plot!  The log plot has wider 1&#8217;s, so the (now uniform) data is more likely to be from 1 to 2 than from 8 to 9.    </p>
<p>Am I right here??</p>
]]></content:encoded>
	</item>
</channel>
</rss>
