"The distribution of heights is not Gaussian"

In summary, a discussion on the distribution of human heights was sparked on Physics Forums, with one member stating that the average height follows a Gaussian distribution. Another member argued that the distribution is actually closer to a Maxwell-Boltzmann or lognormal distribution. The debate centered around whether a Gaussian model accurately represents the distribution of heights, and how to properly fit a model to the data. Reference was made to the work of Adolphe Quetelet, who proposed a normal distribution for human traits, and the use of body mass index as a measure of weight relative to height. Ultimately, the distribution of human heights may vary depending on factors such as age and gender, and a large enough sample size is needed to accurately
  • #1
bluemoonKY
131
16
I was browsing old threads at Physics Forums, and I came across some information in this thread from 2008 that got my interest. The thread is titled "Child molester avoids prison because he is short." PF member stickythighs wrote the following: "Since the average American man is 5'10", there are about equal numbers of men in Florida 5'5" and shorter as there are men 6'3" and taller."In post #25 on the thread, PF Member Gokul43201 responded: "Not true. The distribution of heights is not Gaussian. It's almost Boltzmannian, and 5 inches is way bigger than the standard deviation - so a Gaussian approximation could be quite off when you go that far away from the mean. And it is..."

The distribution of human heights is the classic example that statistics textbooks and other textbooks use to show a Gaussian Distribution. By the way, a Guassian distribution = a Normal Distribution. I admit that the distribution of human heights is not Gaussian at the tails. In reality, there are far more people at 5+ standard deviations both above and below the mean than a graph of a 100% Gaussian Distribution of human heights would show. In other words, a graph of a 100% Guassian Distribution of human heights would show less people at 5 SD from the mean than there would actually be in real life.

However, in the example that stickythighs and gokul were discussing, the comparison was between male heights of 5'5" and 6'3". Human height distribution IS Gaussian when you are so close to the mean as 5'5" and 6'3". Therefore, why did Gokul deny that the human height distribution is Gaussian in the 5'5"-6'3" range?

Here is a link to the thread that I am referencing: https://www.physicsforums.com/threads/child-molester-avoids-prison-because-he-is-short.249825/page-2

Why did Gokul say that the distribution of human heights is almost Boltzmannian? Clearly it's not.

Note to moderators: The topic of the thread that I am referencing is about a child molester avoiding prison because he is short. The main topic of the thread that I am referencing is NOT about whether or not the distribution of human height is Gaussian or not. The correct etiquette and protocol for a digression in another area is to create a new thread on the digression, not to hijack the previous thread. There is no thread that I am aware of specifically about whether or not the distribution of human heights is Gaussian. Therefore, I should not be breaking any rules by creating this thread. It's a new topic.
 
Biology news on Phys.org
  • #2
Have a look at the graph here.
https://www.khanacademy.org/science.../a/what-is-the-maxwell-boltzmann-distribution

This is a distribution that has a lower limit like zero, and a maximum going out the x axis. Maxwell-Boltzmann distribution.

So, with normal human adults there is a minimum height, and a larger maximum. I'm excluding dwarfism and gigantism due to abnormalities. And there is a skewness to the result; the graph is not symmetric across the mean, it is skewed.

What gokul43210 (no longer active on the forums) said was that it was a poor fit to a Gaussian curve, and close (but not really) a M-B distribution.
His link to what he cites as a model of the distribution is broken.

How you find a model to fit an existing distribution is interesting. @Dale works with this kind of thing. Maybe he can help clarify what you do with 'almost-fits' situations.

There is also this: Limpert, E; Stahel, W; Abbt, M (2001). "Lognormal distributions across the sciences: keys and clues". BioScience. 51 (5): 341–352
which I cannot get to show in a link. It says that a good human height distribution model is lognormal.

Also, it is not valid to claim that a part of the data is distribution A, and another part is distribution B. And then apply statistical methods on each part as if they were separate.
Since I cannot get all the facts, I cannot give you a good answer.
 
Last edited:
  • #3
bluemoonKY said:
The distribution of human heights is the classic example that statistics textbooks and other textbooks use to show a Gaussian Distribution.

You can reference Lambert Adolphe Jacques Quetelet for bringing statistical studies into the humanities. His conclusion is that the traits of the average man follow a normal distribution. That has been followed ever since in many areas.

https://en.wikipedia.org/wiki/Adolphe_Quetelet
In his 1835 text on social physics, in which he presented his theory of human variance around the average, with human traits being distributed according to a normal curve, he proposed that normal variation provided a basis for the idea that populations produce sufficient variation for artificial or natural selection to operate.[7]

In terms of influence over later public health agendas, one of Quetelet's lasting legacies was the establishment of a simple measure for classifying people's weight relative to an ideal for their height. His proposal, the body mass index (or Quetelet index), has endured with minor variations to the present day.[8] Anthropometric data is used in modern applications and referenced in the development of every consumer-based product

You may also want to read this.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831262/
Our data are based on complete enumerations, not samples. The number of young men whose heights were tabulated in the Torre reports rose from about 250,000 individuals for the cohorts born before 1860 to over half a million for those born after 1905 ). Overall, the heights of over 21 million individuals were tabulated in these records.
See figure 3 for the tabulated distribution of 20 year old males (1900 ) - raw data, adjusted and fitted normal distribution.

For other populations, the height curve could well not follow a normal distribution.
ie ages 0 to old age ( 60, 70 ... ) whole population - what's that going to look like??
ie sex - male and female follow two difference curves. Bring them together and one gets a flat-ish top
 
  • Like
Likes jim mcnamara
  • #4
bluemoonKY said:
Human height distribution IS Gaussian when you are so close to the mean as 5'5" and 6'3".
If you have a large enough sample then even the tiniest departures from normality become significant. With N on the order of a billion I am sure that it is not normal.

The question isn't really whether or not something is normal, just whether or not the approximation is close enough that you can use the nice simplifying assumption that normality provides.
 
  • Like
Likes jim mcnamara
  • #5
Closed threads should not be re-opened without moderator approval. I think this has been answered.
 

FAQ: "The distribution of heights is not Gaussian"

1. What does it mean when the distribution of heights is not Gaussian?

When the distribution of heights is not Gaussian, it means that the data does not follow a normal or bell-shaped curve. This indicates that the data is not evenly distributed and may have a different pattern or shape.

2. Why is it important to know if the distribution of heights is not Gaussian?

Knowing if the distribution of heights is not Gaussian is important because it can impact how we interpret and analyze the data. It may require different statistical methods or techniques to accurately describe and make conclusions about the data.

3. What factors can cause the distribution of heights to not be Gaussian?

There are several factors that can cause the distribution of heights to not be Gaussian, including a small sample size, outliers, and skewed data. Additionally, cultural or genetic differences in height can also contribute to a non-Gaussian distribution.

4. Can the distribution of heights ever be perfectly Gaussian?

It is highly unlikely for the distribution of heights to be perfectly Gaussian in a real-world setting. This is because there will always be some variation in height due to genetics, environment, and other factors. However, with a large enough sample size, the distribution can closely approximate a normal curve.

5. How can we analyze data with a non-Gaussian distribution?

To analyze data with a non-Gaussian distribution, we can use non-parametric statistical tests or transform the data to try and make it more normally distributed. We can also use descriptive statistics, such as the median and interquartile range, instead of the mean and standard deviation to describe the data.

Back
Top