Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Hypothesis Testing … understanding statistical claims Doc Martin is at it again! After his line of Brain Pills was shown to be no better than a placebo, Doc Martin decided to devote his business genius to marketing his new line of ski wax – “Wax 2 the Max”. In a recent competition between conventional and the new spaceage W2tMax, when applied to the skis of 12 cross country skiers, 9 of the winners were using the Wax-2-the-Max skis! Wax-2-the-Max! Try it . My wax is proven to work! How can we test this claim? Either it works or it “don’t” • The null hypothesis: – H0: The wax makes no difference • The alternate hypothesis: – Ha: The wax improves performance If we accept the null hypothesis, then it would mean that winning the 9 (or more) races occurred merely by chance. How likely is this? Wax-2-the-Max • You either win or don’t – what kind of distribution does this suggest? B(12,0.5) • What is the probability of this happening by chance? P = P(9)+P(10)+P(11)+P(12) P = 0.0537+0.0161+0.0029+0.0002 P = 0.0729 What’s this mean? Bad news for Doc Martin! • The null hypothesis occurs with a probability of about 7.3% • If the null hypothesis occurs with 5% or lower chance then you would reject H0 and conclude that the alternate hypothesis is statistically significant • If the null hypothesis occurs with 1% or lower chance then Ha is strongly significant Rats! Foiled again! Null Hypothesis and Statistical Significance • It is usual to create and test a null hypothesis which essentially asks “how likely is the effect that we are testing” due to chance alone. For example: – In testing the claim that my ski wax provided a significant advantage we tested the null hypothesis that is did not and the effect I claimed could be explained as a result of chance. – A probability threshold called the statistical significance level is set to decide to accept or reject the null hypothesis. Significance Levels… • Symbol a denotes the significance level. An a of 5% or 0.05 means that events have a 1/20 chance of occurring by chance • US Supreme Court sets statistical significance at 2s or 3s away from the mean: – 2s a = 0.0223 – 3s a = 0.0013 Stats 300 Causes Stress! • An un-named student (whose initials are Carl) claims that Stats 300 causes stress. To prove this he measured the blood pressure of a SRS of 100 subjects at King’s between the ages of 18 and 36 and found a mean systolic blood pressure of 122 with a standard deviation if 12. He then took the blood pressure of the entire class and found a blood pressure of 128 and assumes the same standard deviation of 12 for each reading. Does this evidence support Carl’s claim? • We are making the assumption that Carl’s original SRS was normally distributed as is the Stats 300 class – Null Hypothesis The Stats 300 class has the same blood pressure as the SRS, ie: No effect on stress. : H0: m = 122 – The Alternative Hypothesis: A one-sided alternative Ha: m > 122 • We will test at the significance level by first X m 128 122 z 3.122 s 12 39 n So … what’s this mean? • A blood pressure of 128 is 3.122s above the mean. Either: – A) H0 is true and we just got 128 by chance – B) H0 is false – STATS 300 really does cause stress! • So how likely is A)? P(Z >= 3.122) implies that this only occurs with a probability of p = 1 – 0.9991 = 0.0009! H0 is false!! (Another way of thinking about this is that 99.91% of the readings expected would be less than 128 – getting this by chance is pretty unlikely!) One-Sided and Two-Sided Alternatives • There are three possible scenarios for the alternative hypothesis: • Ha: m > mo • Ha: m < mo • Ha: m ≠ mo One-sided Two-sided One-sided and two-sided alternative hypotheses have slightly different probability formulae Probability Formulae… • Ha: m > mo : P-value for H0 is P(Z ≥ z) • Ha: m < mo : P-value for H0 is P(Z ≤ z) • Ha: m ≠ mo : P-value for H0 is 2P(Z ≥ |z|) Closer look … example 6.13 • Make the null hypothesis: “sample contains 0.86% of the active ingredient” or H0: m = 0.86 • Alternative is Ha: m ≠ 0.86 • P = 2P(Z ≥ |z|) It could be more or less So this is a two-sided case z = 4.99 The probability of the null hypothesis is less than 2P(Z≥4.99) = 2(1-1)=0!