The Normal Distribution and the Central Limit Theorem
The following was implemented in Maple by Marcus Davidsson (2008) davidsson_marcus@hotmail.com
and is based upon the work by Zietz (2004) Dynamic Programming: An Introduction by Example
1) Theoretical Distribution
We start by noting that
If we toss the coin one time we can either get
 |
(1) |
If we toss the coin two time we can either get
 |
(2) |
If we toss the coin three time we can either get
 |
(3) |
We now note that we can calculate the total number of combinations after a n coin toss as
 |
(4) |
for example
for a one coin toss the total number of combinations are given by
 |
(5) |
for a two coin toss the total number of combinations are given by
 |
(6) |
for a three coin toss the total number of combinations are given by
 |
(7) |
We now note that
If we toss the coin one time we can either get
 |
(8) |
which means that we can either get 1 zero head or 1 one head
If we toss the coin two time we can either get
 |
(9) |
which means that we can either get 1 zero head, 1 two heads or 2 one heads
If we toss the coin three time we can either get
 |
(10) |
which means that we can either get 1 three heads, 1 zero heads, 3 two heads or 3 one heads
The above argument can be generalized into the Pascal Triangle as seen below
The first thing to note is that each cell is the sum of the two cells above.
The second thing to note is that each element in the triangle is number of successful combinations.
for example the left-hand side of the triangle gives us the total number of outcomes that has zero heads for different coin tosses
if we move one step to the right we get the total number of outcomes that has one heads for different coin tosses
We now note that we could have calculated the value in each cell (the number of successful combinations) by using the below formula
where ! is the factorial notation for example
 |
(11) |
for example when n=4 and x=1 we get
 |
(12) |
which is also the number we can find in the above triangle where n=4 and x=1
Now if we now want to find the probability of finding one head (x=1) after we toss the coin four times (x=4) we can simply look at the ratio of number
of successful combinations / number of total combinations. We can therefore rewrite the Pascal triangle triangle as seen below.
We now note that
the probability of 2 heads (or zero heads) after 2 tosses
 |
(13) |
the probability of 3 heads (or zero heads) after 3 tosses
 |
(14) |
the probability of 4 heads (or zero heads) after 4 tosses
 |
(15) |
We can calculate the probability of finding x heads in a n random coin toss (p=0.5) as follows:
 |
(16) |
for example
We can calculate the probability of finding 3 heads (x) after 3 coin toss (n) as
 |
(17) |
We can calculate the probability of finding 2 heads (x) after 3 coin toss (n) as
 |
(18) |
We can calculate the probability of finding 4 heads after 4 coin toss as
 |
(19) |
We now note that when the number of coin tosses is large the Binominal distribution will converge to the normal distribution
which is outlined below.
2) Hypothesis Testing
We now note that we can also calculate the cumulative probability of finding x heads after a n coin toss
Example-1
We can calculate the probability of finding 10 heads in a 20 coin toss as
 |
(20) |
We can calculate the probability of finding 10 heads or more in a 20 coin toss as
 |
(21) |
If that probability is lower than 0.05 then we can claim with 95% certainty that the coin most likely is not random. In this case the coin is random
Example-2
We can calculate the probability of finding 14 heads in a 20 coin toss as
 |
(22) |
We can calculate the probability of finding 14 heads or more in a 20 coin toss as
 |
(23) |
If that probability is lower than 0.05 then we can claim with 95% certainty that the coin most likely is not random. In this case the coin is random
Example-3
We can calculate the probability of finding 15 heads in a 20 coin toss as
 |
(24) |
We can calculate the probability of finding 15 heads or more in a 20 coin toss as
 |
(25) |
If that probability is lower than 0.05 then we can claim with 95% certainty that the coin most likely is not random. In this case the coin is not random
Gambling Fallacy
If we conclude that the coin is random then the outcome in each period is completely random.
This means that we cannot quantify the probability of getting a head in the next period because the outcome an all periods are completely random
However if we can prove that the outcome has not been generated by a random process then we can quantify the probability of success without falling in to the gambling fallacy.
3) Empirical Distribution
The law of large numbers and the central limit theorem are two important theorems in statistics.
I will illustrate the basic mechanics by again considering a simple coin toss example.
We assume that we have a room filled with
number of people. Each person is asked to flip a coin 10 times and write down the total number of heads they get.
We can now plot the distribution of the total number of heads that each person observed by assigning the total number of observed heads to the x-axis and the frequency
of the observed number of heads on the y-axis
We note that when the number of observations is large the distribution will converge to the theoretical distribution suggested by the Pascal triangle (normal distribution).
3.1) When the number of people (nn) is small
The room contain 5 people each person is asked to toss the coin 10 times and write down the number of heads they get.
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_59.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_60.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_61.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_62.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_63.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_64.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_65.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_66.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_67.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_68.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 5); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); -1;...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_69.gif)
We can see that our distribution is both unstable (changes every time we run the code) and does not resemble the normal distribution suggested by the Pascal triangle
3.2) When the number of people (nn) is large
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_72.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_73.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_74.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_75.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_76.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_77.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_78.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_79.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_80.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_81.gif)
![restart; -1; `:=`(n, 10); -1; `:=`(nn, 1000); -1; randomize(); -1; `:=`(coin, rand(0 .. 1)); -1; `:=`(coin_1, proc (n) seq(coin(), i = 1 .. n) end proc); -1; `:=`(x_1, seq([coin_1(n)], i = 1 .. nn)); ...](/view.aspx?SI=6879/The_Normal_Distribution_and_Central_Limit_Theorem_82.gif)
We can now see that our distribution is stable (does not change every time we run the code) and it resembles the normal distribution suggested by the Pascal triangle
We can also see that the distribution is centered around the mean equal to 5. This also makes sense since for a random coin we should get head approximately half of the time and
tails approximately half of the time when n is large.
Legal Notice: The copyright for this application is owned by the authors. Neither Maplesoft nor the authors are responsible for any errors contained within and are not liable for any damages resulting from the use of this material.. This application is intended for non-commercial, non-profit use only. Contact the authors for permission if you wish to use this application in for-profit activities.