Derivation of Gaussian Distribution

In summary, the Gaussian distribution is a probability distribution that can be derived as the large-n limit of the binomial distribution, which is a special case of the central limit theorem. The binomial distribution is given by [tex]f(x) = \frac{n!\,p^{x}(1-p)^{n-x}}{x!\,(n-x)!}[/tex], where x is the number of trials out of n that "succeeded" and p is the probability of success on a single trial. One reply also offers a physical picture: the average energy density of particles around the n chosen particles sets the amount of energy available within the system of n particles.
  • #1
dimensionless

Homework Statement


Derive the equation for the Gaussian distribution.

Homework Equations



The probability density function for the Gaussian distribution:
[tex]f(x) = \frac{1}{\sigma \sqrt{2\pi} } e^{ -\frac{(x-\mu)^2}{2\sigma ^2} }[/tex]

The Attempt at a Solution



It is my understanding that the Gaussian distribution is derived from the binomial distribution by means of the central limit theorem. The binomial distribution is given by the following:

[tex]f(x) = \frac{n!p^{x}(1-p)^{n-x}}{x!(n-x)!}[/tex]

If this is true, I don't know how to go about doing it.
 
  • #2
It helps to know the relation between the 'x' in the Gaussian and the 'x' in the binomial distribution. Just to keep things clear, put k instead of x in the binomial distribution. Then k = x*sqrt(n*p*(1-p)) + n*p (just x times the standard deviation plus the mean). Now your binomial distribution is in terms of n and x. Let n go to infinity. I did this by taking the log of the binomial distribution and using Stirling's approximation to replace the factorials. I hope this is just a curiosity question. I did this once on a 5hr plane flight to pass the time, and I think it took me most of the flight to get everything to come out. It wasn't much fun. Maybe there's a cleverer way. But it does work.
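
If you want to see the limit numerically before attempting the algebra, here is a minimal Python sketch. The function names (binomial_pmf, standard_normal_pdf, max_gap) and the choice p = 0.3 are illustrative assumptions, not anything from the thread. It compares sigma times the binomial probability at k with the standard normal density at x = (k - n*p)/sigma, where sigma = sqrt(n*p*(1-p)).

```python
import math

def binomial_pmf(k, n, p):
    """Binomial probability of k successes in n trials (requires 0 < p < 1).

    Computed through logarithms so the factorials do not overflow for large n.
    """
    log_pmf = (
        math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
        + k * math.log(p) + (n - k) * math.log(1 - p)
    )
    return math.exp(log_pmf)

def standard_normal_pdf(x):
    """Gaussian density with mean 0 and standard deviation 1."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def max_gap(n, p):
    """Largest difference between sigma * P(k) and the normal density at
    x = (k - mean) / sigma, taken over all k from 0 to n."""
    mean = n * p
    sigma = math.sqrt(n * p * (1 - p))
    return max(
        abs(sigma * binomial_pmf(k, n, p) - standard_normal_pdf((k - mean) / sigma))
        for k in range(n + 1)
    )

# The gap should shrink as n grows (p = 0.3 is an arbitrary choice).
for n in (10, 100, 1000):
    print(n, max_gap(n, 0.3))
```

The factor of sigma is there because neighbouring values of k are 1/sigma apart in the x variable, so a probability per k has to be multiplied by sigma to be compared with a density per unit x.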
 
  • #3
okay cool.

What does the quantity 'k' represent? This is not actually a HW problem, I'm just trying to make a little more sense of the world.
 
  • #4
k is the number of trials out of n that 'succeeded', i.e. the k in the combinatorial part C(n,k). What you originally had as 'x' in the binomial distribution. I just suggested you replace it because having two potentially different 'x' around didn't seem like a good idea.
 
  • #5
That's an impressive derivation. How did you manage to find the limit as n was increased to infinity? I have trouble with a lot of complicated functions.
 
  • #6
It breaks up into a pile of simple functions which have to be rearranged in such a way as to get a finite limit. No individual part is all that bad, it's just complicated. Wanna try it?
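
For anyone who does want to try it, here is a rough outline of the main steps rather than a full proof. The shorthand q = 1 - p and σ = sqrt(npq) is notation introduced here for convenience, and terms that vanish as n goes to infinity are dropped without comment.

Stirling's approximation for a large integer m:
[tex]\ln m! \approx m\ln m - m + \tfrac{1}{2}\ln(2\pi m)[/tex]

Applying it to the three factorials in the binomial distribution (with k in place of x) and collecting terms:
[tex]\ln f(k) \approx -\tfrac{1}{2}\ln\!\left(\frac{2\pi k(n-k)}{n}\right) - k\ln\frac{k}{np} - (n-k)\ln\frac{n-k}{nq}[/tex]

Substituting k = np + xσ, so that n - k = nq - xσ, and expanding ln(1+u) ≈ u - u²/2:
[tex]k\ln\frac{k}{np} \approx x\sigma + \frac{x^2\sigma^2}{2np}, \qquad (n-k)\ln\frac{n-k}{nq} \approx -x\sigma + \frac{x^2\sigma^2}{2nq}[/tex]

The linear terms cancel, and the quadratic terms add up to x²σ²/(2npq) = x²/2. In the prefactor, k(n-k)/n tends to npq = σ², which leaves
[tex]f(k) \approx \frac{1}{\sigma\sqrt{2\pi}}\, e^{-x^2/2} = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(k-np)^2}{2\sigma^2}}[/tex]

This has the form of the Gaussian density with mean μ = np and variance σ² = np(1-p).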
 
  • #7
Dick said:
Wanna try it?
I'm not sure. It looks like a very tedious process with lots of opportunities for typographical errors to occur.
 
  • #8
dimensionless said:
I'm not sure. It looks like a very tedious process with lots of opportunities for typographical errors to occur.

Exactly.
 
  • #9
I'm not sure if this will help for intuition behind deriving the Gaussian Distribution, but here's the reason why it follows from the binomial distribution:

Think of a collection of an infinite number of particles distributed throughout space. The Gaussian Distribution can be thought of as randomly choosing n particles out of the infinitely many. What we are interested in (for now) is the expected total thermal energy in the system of n particles chosen randomly, or at least the distribution of chance that a collection of n particles will have a particular thermal energy.
After doing this a few times, we find the average thermal energy for a collection of "n" particles to be E, which we can now take to be constant (think of this stage as being something like an induction hypothesis: we can use this to determine how the energy is distributed later).

We can now look at how that total energy E is distributed within the system of n particles:

Create a "histogram" of possible thermal energies for the n particles, with bin width "e". The histogram sorts the particles within the system of n particles: each "bin" [tex]b_i[/tex] will contain [tex]n_i[/tex] particles, so that [tex]\sum_i n_i = n[/tex].

You can imagine that when you give a certain total energy to the system of n particles, we can (with a few assumptions) create a finite number of energy distributions within the n particles that allow the system to have energy E.

Count unique configurations for every allowable energy value (k*e) to get the binomial distribution as a function of energy level. (for the "0" state, all the particles are in the lowest bin: this is 1 configuration; for the "1" state, one particle is in the 1st energy level while the rest are in the lowest bin: this gives n configurations; etc.).

Here are variables to consider:

The average energy density of particles around the n particles chosen (this gives you the amount of energy you have to work with within the system of n particles).

The (classical) fact that you can choose an energy partition e(E) so that there are as many energy "bins" as there are particles. You then take the limit as n goes to infinity and e goes to 0; with the preceding assumption, e = E/n, where E is the "total energy" of the system given by the average energy density.
 

FAQ: Derivation of Gaussian Distribution

What is Gaussian distribution?

Gaussian distribution, also known as normal distribution, is a probability distribution that is widely used in statistics. It is a continuous distribution that is symmetrical and bell-shaped, with a mean, median, and mode that are all equal.

What is the formula for Gaussian distribution?

The formula for Gaussian distribution is f(x) = 1 / (σ√(2π)) * e^(-(1/2) * ((x-μ)/σ)^2), where μ is the mean and σ is the standard deviation.
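
As a quick sanity check, the formula can be written as a small Python function and integrated numerically to confirm that the total area is 1. This is only a sketch: the names gaussian_pdf and integrate, the trapezoidal rule, and the example values μ = 2.0 and σ = 1.5 are all illustrative choices.

```python
import math

def gaussian_pdf(x, mu=0.0, sigma=1.0):
    """Gaussian density with mean mu and standard deviation sigma > 0."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))

def integrate(f, a, b, steps=10_000):
    """Trapezoidal-rule approximation of the integral of f from a to b."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h

# Integrate over mu +/- 10 sigma; the tails beyond that are negligible.
mu, sigma = 2.0, 1.5
area = integrate(lambda x: gaussian_pdf(x, mu, sigma), mu - 10 * sigma, mu + 10 * sigma)
print(area)  # should be very close to 1
```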

What are the properties of Gaussian distribution?

The properties of Gaussian distribution include: it is symmetric about the mean; the mean, median, and mode are all equal; the curve approaches the x-axis but never touches it; about 68% of values fall within one standard deviation of the mean; and about 95% of values fall within two standard deviations of the mean.
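
The 68% and 95% figures can be checked with the error function, since for a normal variable the probability of landing within z standard deviations of the mean is erf(z/√2). A minimal Python sketch (the helper name prob_within is an illustrative choice):

```python
import math

def prob_within(z):
    """Probability that a normal variable lies within z standard deviations
    of its mean. The result does not depend on the mean or the standard
    deviation themselves."""
    return math.erf(z / math.sqrt(2))

for z in (1, 2, 3):
    print(z, round(prob_within(z), 4))
# prints:
# 1 0.6827
# 2 0.9545
# 3 0.9973
```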

What is the importance of Gaussian distribution?

Gaussian distribution is important because it is used to model, at least approximately, many natural phenomena in the real world, such as height, weight, and IQ scores. It is also the basis for many statistical tests and methods, making it a fundamental concept in statistics.

How is Gaussian distribution derived?

Gaussian distribution arises from the Central Limit Theorem, which states that the suitably standardized sum of a large number of independent and identically distributed random variables with finite variance is approximately normally distributed. A common proof manipulates the moment generating function (or characteristic function) of the random variables and takes the limit as the number of variables approaches infinity.
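
A rough sketch of the moment generating function argument, assuming the moment generating function exists near t = 0 (the symbols Y_i, Z_n, and M are notation introduced here): let X_1, ..., X_n be independent and identically distributed with mean μ and variance σ², and standardize each one as

[tex]Y_i = \frac{X_i - \mu}{\sigma}, \qquad Z_n = \frac{1}{\sqrt{n}}\sum_{i=1}^{n} Y_i[/tex]

Each Y_i has mean 0 and variance 1, so its moment generating function satisfies M(t) = 1 + t²/2 + o(t²) for small t. By independence,

[tex]M_{Z_n}(t) = \left[M\!\left(\frac{t}{\sqrt{n}}\right)\right]^{n} = \left[1 + \frac{t^2}{2n} + o\!\left(\frac{1}{n}\right)\right]^{n} \longrightarrow e^{t^2/2} \quad (n \to \infty)[/tex]

and e^{t²/2} is the moment generating function of the standard normal distribution.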
