What is what between binomial many flip outcomes and statistical observation?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
0
down vote

favorite












I am trying to get my head around normal distribution which evolves as a good approximation for a binomial problem (like coin flips).



Theoretical outcome:



When I have say 50 flips, there is a deterministic no of binomial outcomes (X = no of heads in any outcome sequence), which can be visualized, and graphed. I tried to graph the frequency of X, that is, $n(X_k)$ vs $X_k$, where the $n(X_k)$ are the binomial coefficients. And also the resultant probability $p(X_k)$ vs $X_k$. I also tried to overlay normal approximation for both graphs.



No of flips: 50

p = 0.75 (probability of heads per flip)



enter image description here



Along with applying regular Normal distribution formula on probability curve, I also tried to normal-approximate on frequency curve $n(X_k)$ on LHS as shown below. That is why the term $max(n(X_k))$ multiplied by exponential denoting the red curve which is obviously misplaced regarding which I have questions below.



Statistical outcome:



I also tried to simulate statistically with p(H)=0.75. Since practically we only get one outcome out of $2^50$ possibilities every time, I ran this experiment for say 2000 times. I then collected the $n(X_k)$, and also statistical $p(X_k)$ by just dividing each $n(X_k)$ with total no of outcomes (sum of all $n(X_k)$ ) and plotted them. I also tried to approximate that with normal curves. I get this.



No of flips: 50

No of experiments : 2000 ( 1 experiment $rightarrow$ 50 flips $rightarrow$ 1 output sequence )

p = 0.75



enter image description here



My questions:



  1. I get vaguely, why $n(X_k)$ in Theoretical outcome still has 25 as mean, because no probability has been associated there yet or inherently we assume equal probability for all? In statistical outcome, you can see $n(X_k)$ has shifted mean to around 37 rightly thus resulting in pdf also on RHS. Theoretical outcome RHS also has pdf sitting on proper mean 37, but LHS looks misplaced.

  2. If my inference is correct, that by nature, possible outcomes $n(X_k)$ is always probability neutral or independent, how could I normally approximate such theoretical outcomes with varying probability?

  3. If my inference is wrong, what is missing piece that prevents my binomial outcomes (not their probability which is already in place 37) to shift to mean 37?

  4. What are proper terminology for both type of probabilities above? Top one is from theoretical outcome, while bottom is statistical. People use one of these to explain normal approximation, so which is correct or better?

On big picture, I am trying to understand how normal distribution evolves from sample distribution and the underlying cause for it.enter preformatted text here







share|cite|improve this question

















  • 1




    It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
    – callculus
    Aug 2 at 12:45











  • Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
    – Paari Vendhan
    Aug 2 at 13:36











  • You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
    – Aaron Montgomery
    Aug 4 at 1:45














up vote
0
down vote

favorite












I am trying to get my head around normal distribution which evolves as a good approximation for a binomial problem (like coin flips).



Theoretical outcome:



When I have say 50 flips, there is a deterministic no of binomial outcomes (X = no of heads in any outcome sequence), which can be visualized, and graphed. I tried to graph the frequency of X, that is, $n(X_k)$ vs $X_k$, where the $n(X_k)$ are the binomial coefficients. And also the resultant probability $p(X_k)$ vs $X_k$. I also tried to overlay normal approximation for both graphs.



No of flips: 50

p = 0.75 (probability of heads per flip)



enter image description here



Along with applying regular Normal distribution formula on probability curve, I also tried to normal-approximate on frequency curve $n(X_k)$ on LHS as shown below. That is why the term $max(n(X_k))$ multiplied by exponential denoting the red curve which is obviously misplaced regarding which I have questions below.



Statistical outcome:



I also tried to simulate statistically with p(H)=0.75. Since practically we only get one outcome out of $2^50$ possibilities every time, I ran this experiment for say 2000 times. I then collected the $n(X_k)$, and also statistical $p(X_k)$ by just dividing each $n(X_k)$ with total no of outcomes (sum of all $n(X_k)$ ) and plotted them. I also tried to approximate that with normal curves. I get this.



No of flips: 50

No of experiments : 2000 ( 1 experiment $rightarrow$ 50 flips $rightarrow$ 1 output sequence )

p = 0.75



enter image description here



My questions:



  1. I get vaguely, why $n(X_k)$ in Theoretical outcome still has 25 as mean, because no probability has been associated there yet or inherently we assume equal probability for all? In statistical outcome, you can see $n(X_k)$ has shifted mean to around 37 rightly thus resulting in pdf also on RHS. Theoretical outcome RHS also has pdf sitting on proper mean 37, but LHS looks misplaced.

  2. If my inference is correct, that by nature, possible outcomes $n(X_k)$ is always probability neutral or independent, how could I normally approximate such theoretical outcomes with varying probability?

  3. If my inference is wrong, what is missing piece that prevents my binomial outcomes (not their probability which is already in place 37) to shift to mean 37?

  4. What are proper terminology for both type of probabilities above? Top one is from theoretical outcome, while bottom is statistical. People use one of these to explain normal approximation, so which is correct or better?

On big picture, I am trying to understand how normal distribution evolves from sample distribution and the underlying cause for it.enter preformatted text here







share|cite|improve this question

















  • 1




    It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
    – callculus
    Aug 2 at 12:45











  • Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
    – Paari Vendhan
    Aug 2 at 13:36











  • You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
    – Aaron Montgomery
    Aug 4 at 1:45












up vote
0
down vote

favorite









up vote
0
down vote

favorite











I am trying to get my head around normal distribution which evolves as a good approximation for a binomial problem (like coin flips).



Theoretical outcome:



When I have say 50 flips, there is a deterministic no of binomial outcomes (X = no of heads in any outcome sequence), which can be visualized, and graphed. I tried to graph the frequency of X, that is, $n(X_k)$ vs $X_k$, where the $n(X_k)$ are the binomial coefficients. And also the resultant probability $p(X_k)$ vs $X_k$. I also tried to overlay normal approximation for both graphs.



No of flips: 50

p = 0.75 (probability of heads per flip)



enter image description here



Along with applying regular Normal distribution formula on probability curve, I also tried to normal-approximate on frequency curve $n(X_k)$ on LHS as shown below. That is why the term $max(n(X_k))$ multiplied by exponential denoting the red curve which is obviously misplaced regarding which I have questions below.



Statistical outcome:



I also tried to simulate statistically with p(H)=0.75. Since practically we only get one outcome out of $2^50$ possibilities every time, I ran this experiment for say 2000 times. I then collected the $n(X_k)$, and also statistical $p(X_k)$ by just dividing each $n(X_k)$ with total no of outcomes (sum of all $n(X_k)$ ) and plotted them. I also tried to approximate that with normal curves. I get this.



No of flips: 50

No of experiments : 2000 ( 1 experiment $rightarrow$ 50 flips $rightarrow$ 1 output sequence )

p = 0.75



enter image description here



My questions:



  1. I get vaguely, why $n(X_k)$ in Theoretical outcome still has 25 as mean, because no probability has been associated there yet or inherently we assume equal probability for all? In statistical outcome, you can see $n(X_k)$ has shifted mean to around 37 rightly thus resulting in pdf also on RHS. Theoretical outcome RHS also has pdf sitting on proper mean 37, but LHS looks misplaced.

  2. If my inference is correct, that by nature, possible outcomes $n(X_k)$ is always probability neutral or independent, how could I normally approximate such theoretical outcomes with varying probability?

  3. If my inference is wrong, what is missing piece that prevents my binomial outcomes (not their probability which is already in place 37) to shift to mean 37?

  4. What are proper terminology for both type of probabilities above? Top one is from theoretical outcome, while bottom is statistical. People use one of these to explain normal approximation, so which is correct or better?

On big picture, I am trying to understand how normal distribution evolves from sample distribution and the underlying cause for it.enter preformatted text here







share|cite|improve this question













I am trying to get my head around normal distribution which evolves as a good approximation for a binomial problem (like coin flips).



Theoretical outcome:



When I have say 50 flips, there is a deterministic no of binomial outcomes (X = no of heads in any outcome sequence), which can be visualized, and graphed. I tried to graph the frequency of X, that is, $n(X_k)$ vs $X_k$, where the $n(X_k)$ are the binomial coefficients. And also the resultant probability $p(X_k)$ vs $X_k$. I also tried to overlay normal approximation for both graphs.



No of flips: 50

p = 0.75 (probability of heads per flip)



enter image description here



Along with applying regular Normal distribution formula on probability curve, I also tried to normal-approximate on frequency curve $n(X_k)$ on LHS as shown below. That is why the term $max(n(X_k))$ multiplied by exponential denoting the red curve which is obviously misplaced regarding which I have questions below.



Statistical outcome:



I also tried to simulate statistically with p(H)=0.75. Since practically we only get one outcome out of $2^50$ possibilities every time, I ran this experiment for say 2000 times. I then collected the $n(X_k)$, and also statistical $p(X_k)$ by just dividing each $n(X_k)$ with total no of outcomes (sum of all $n(X_k)$ ) and plotted them. I also tried to approximate that with normal curves. I get this.



No of flips: 50

No of experiments : 2000 ( 1 experiment $rightarrow$ 50 flips $rightarrow$ 1 output sequence )

p = 0.75



enter image description here



My questions:



  1. I get vaguely, why $n(X_k)$ in Theoretical outcome still has 25 as mean, because no probability has been associated there yet or inherently we assume equal probability for all? In statistical outcome, you can see $n(X_k)$ has shifted mean to around 37 rightly thus resulting in pdf also on RHS. Theoretical outcome RHS also has pdf sitting on proper mean 37, but LHS looks misplaced.

  2. If my inference is correct, that by nature, possible outcomes $n(X_k)$ is always probability neutral or independent, how could I normally approximate such theoretical outcomes with varying probability?

  3. If my inference is wrong, what is missing piece that prevents my binomial outcomes (not their probability which is already in place 37) to shift to mean 37?

  4. What are proper terminology for both type of probabilities above? Top one is from theoretical outcome, while bottom is statistical. People use one of these to explain normal approximation, so which is correct or better?

On big picture, I am trying to understand how normal distribution evolves from sample distribution and the underlying cause for it.enter preformatted text here









share|cite|improve this question












share|cite|improve this question




share|cite|improve this question








edited Aug 2 at 13:34
























asked Aug 2 at 7:11









Paari Vendhan

355




355







  • 1




    It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
    – callculus
    Aug 2 at 12:45











  • Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
    – Paari Vendhan
    Aug 2 at 13:36











  • You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
    – Aaron Montgomery
    Aug 4 at 1:45












  • 1




    It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
    – callculus
    Aug 2 at 12:45











  • Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
    – Paari Vendhan
    Aug 2 at 13:36











  • You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
    – Aaron Montgomery
    Aug 4 at 1:45







1




1




It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
– callculus
Aug 2 at 12:45





It´s not clear what you have done to get the theoretical outcome graphs. What does $max(n(x))...$ mean in this context and why do you use this? And what is $n(X_k)$? Please be more specific and provide a numerical example.
– callculus
Aug 2 at 12:45













Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
– Paari Vendhan
Aug 2 at 13:36





Sorry I was not clear enough. $n(X_k)$ is the LHS graph, how many number of $X_k$, per $X_k$. $max(n(X_k))$ is just the maximum number that occurs at mean value, that is when $X_k = 37.5$. I used that function trying to approximate binomial outcomes, but they seem to be misplaced by the mean. I have graphed for numerical problem of No of flips = 50 in theoretical case with probability 0.75 for heads. In statistical case, one experiment is flipping 50 times, and observing no.of heads in resulting sequence. Likewise, I repeated experiment for 2000 times. I have updated now. Is it clearer?
– Paari Vendhan
Aug 2 at 13:36













You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
– Aaron Montgomery
Aug 4 at 1:45




You asked me to comment on this question, but I'm afraid it's not at all clear to me what you've done, so what I can say is limited. In your upper left graph, it appears that your binomial distribution has $p = 0.5$, but your normal approximation clearly uses $p = 0.75$. There is certainly nothing more interesting to that story than that; the two don't match and shouldn't be expected to match. I don't understand your "My questions" part well enough to be able to comment further. Sorry :-/
– Aaron Montgomery
Aug 4 at 1:45















active

oldest

votes











Your Answer




StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);








 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2869773%2fwhat-is-what-between-binomial-many-flip-outcomes-and-statistical-observation%23new-answer', 'question_page');

);

Post as a guest



































active

oldest

votes













active

oldest

votes









active

oldest

votes






active

oldest

votes










 

draft saved


draft discarded


























 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2869773%2fwhat-is-what-between-binomial-many-flip-outcomes-and-statistical-observation%23new-answer', 'question_page');

);

Post as a guest













































































Comments

Popular posts from this blog

What is the equation of a 3D cone with generalised tilt?

Relationship between determinant of matrix and determinant of adjoint?

Color the edges and diagonals of a regular polygon