Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample • 7.1 - Basic Properties of Confidence Intervals • 7.2 - Large-Sample Confidence Intervals for a Population Mean and Proportion • 7.3 - Intervals Based on a Normal Population Distribution • 7.4 - Confidence Intervals for the Variance and Standard Deviation of a Normal Pop Chapter 8 - Tests of Hypotheses Based on a Single Sample • 8.1 - Hypotheses and Test Procedures • 8.2 - Z-Tests for Hypotheses about a Population Mean • 8.3 - The One-Sample T-Test • 8.4 - Tests Concerning a Population Proportion • 8.5 - Further Aspects of Hypothesis Testing POPULATION Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? “Statistical Inference” via… “Hypothesis Testing” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. Population Distribution X H0: pop mean age = 25.4 (i.e., no change since 2010) x4 x1 x2 x3 x5 … etc… x400 The reasonableness of the normality assumption is empirically verifiable (e.g., histogram, Q-Q plot) and in fact formally testable from the sample data. If violated (e.g., skewed) or inconclusive (e.g., small sample size), then a transformation (e.g. logarithm) or “distribution-free” nonparametric tests should be used instead… Examples: Sign Test, Wilcoxon Signed Rank Test (= Mann-Whitney U Test) POPULATION Study Question: Has “Mean (i.e., average) Age at First Birth” of women in the U.S. changed since 2010 (25.4 yrs old)? “Statistical Inference” via… “Hypothesis Testing” Present Day: Assume “Mean Age at First Birth” follows a normal distribution (i.e., “bell curve”) in the population. Population Distribution X H0: pop mean age = 25.4 (i.e., no change since 2010) x4 x1 x2 x3 x5 … etc… x400 Sample size n partially depends on the power of the test, i.e., the desired probability of correctly rejecting a false null hypothesis (80% or more). Coming up next! Power and Sample Size • IF the null hypothesis H : μ = μ 0 • • • 0 is true, “Null Distribution” then we should expect a random sample mean to lie in its “acceptance region” with probability 1 – α, the “confidence level.” That is, P(Accept H0 | H0 is true) = 1 – α. Therefore, we should expect a random sample mean to lie in its “rejection region” with probability α, the “significance level.” That is, P(Reject H0 | H0 is true) = α. 1 /2 /2 H0: = 0 “Type 1 Error” Rejection Region Acceptance Region for H0 Rejection Region μ0 + zα/2 (σ / n) Power and Sample Size “Null Distribution” “Alternative Distribution” • IF the null hypothesis H : μ = μ is false, 0 0 then the “power” to correctly reject it in favor of a particular alternative HA: μ = μ1 is 1– 1 /2 /2 H0: = 0 Rejection Region Acceptance Region for H0 P(Reject H0 | H0 is false) = 1 – . Thus, P(Accept H0 | H0 is false) = . “Type 2 Error” HA: μ = μ1 Rejection Region μ0 + zα/2 (σ / n) μ1 – z (σ / n) Set them equal to each other, and solve for n… z /2 z | 1 0 | n , where 2 Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis value • HA: μ = μ1 Alternative Hypothesis specific value • significance level (or equivalently, confidence level 1 – ) • 1– power (or equivalently, Type 2 error rate ) Then the minimum required sample size is: N(0, 1) z /2 z | 1 0 | n , where 2 1 z Example: σ = 1.5 yrs, μ0 = 25.4 yrs, = .05 z.025 = 1.96 Suppose it is suspected that currently, μ1 = 26 yrs. Want 90% power of correctly rejecting H0 in favor of HA, if it is false 1 – = .90 = .10 z.10 = 1.28 qnorm(.9) = |26 – 25.4| / 1.5 = 0.4 2 1.96 + 1.28 n So… minimum sample size required is 65.61 0.4 n 66 Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis value • HA: μ = μ1 Alternative Hypothesis specific value • significance level (or equivalently, confidence level 1 – ) • 1– power (or equivalently, Type 2 error rate ) Then the minimum required sample size is: N(0, 1) z /2 z | 1 0 | n , where 2 1 z Example: σ = 1.5 yrs, μ0 = 25.4 yrs, = .05 z.025 = 1.96 Suppose it is suspected that currently, μ1 = 26 yrs. Want 95% 90% power of correctly rejecting H0 in favor of HA, if it is false 1.28 .90 = .05 .10 z.05 1 – = .95 = |26 – 25.4| / 1.5 = 0.4 .10 = 1.645 qnorm(.975) qnorm(.9) 2 2 1.645 1.96 + 1.28 65.61 81.225 So… minimum sample size required is n 0.4 0.4 n 66 82 Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis value • HA: μ = μ1 Alternative Hypothesis specific value • significance level (or equivalently, confidence level 1 – ) • 1– power (or equivalently, Type 2 error rate ) Then the minimum required sample size is: N(0, 1) z /2 z | 1 0 | n , where 2 1 Example: σ = 1.5 yrs, μ0 = 25.4 yrs, = .05 z.025 = 1.96 z Suppose it is suspected that currently, μ1 = 25.7 26 yrs. yrs. Want 95% power of correctly rejecting H0 in favor of HA, if it is false = |25.7 |26 – – 25.4| 25.4| / 1.5 / 1.5 == 0.4 0.2 1 – = .95 = .05 z.05 = 1.645 qnorm(.975) 2 1.96 + + 1.645 1.645 2 1.96 81.225 324.9 So… minimum sample size required is n 0.2 0.4 n 325 82 Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis value • HA: μ = μ1 Alternative Hypothesis specific value • significance level (or equivalently, confidence level 1 – ) • 1– power (or equivalently, Type 2 error rate ) Then the minimum required sample size is: N(0, 1) z /2 z | 1 0 | n , where 2 Example: σ = 1.5 yrs, μ0 = 25.4 yrs, = .05 z.025 = 1.96 1 z Suppose it is suspected that currently, μ1 = 25.7 yrs. With n = 400, how much power exists to correctly reject H0 in favor of HA, if it is false? Power = 1 – = P Z z /2 n P Z 1.96 0.2 400 P Z 2.04 = 0.9793, i.e., 98% Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis • HA: μ ≠ μ0 Alternative Hypothesis (2-sided) • significance level (or equivalently, confidence level 1 – ) • n sample size From this, we obtain… s n x1, x2,…, xn “standard error” s.e. (estimate) x s sample mean sample standard deviation …with which to test the null hypothesis (via CI, AR, p-value). In practice however, it is far more common that the true population standard deviation σ is unknown. So we must estimate it from the sample! Recall that s2 2 ( x x ) i n 1 SS df Given: • X ~ N(μ , σ ) Normally-distributed population random variable, with unknown mean, but known standard deviation • H0: μ = μ0 Null Hypothesis • HA: μ ≠ μ0 Alternative Hypothesis (2-sided) • significance level (or equivalently, confidence level 1 – ) • n sample size This introduces additional variability from one sample to another… PROBLEM??? From this, we obtain… s n x1, x2,…, xn Not if n is “large”…say, 30. “standard error” s.e. (estimate) x s But what if n < 30? T-test! sample mean sample standard deviation …with which to test the null hypothesis (via CI, AR, p-value). In practice however, it is far more common that the true population standard deviation σ is unknown. So we must estimate it from the sample! Recall that s2 2 ( x x ) i n 1 SS df … is actually a family of distributions, indexed by the degrees of freedom, labeled tdf. Z ~ N(0, 1) t10 tt3 2 t1 William S. Gossett (1876 - 1937) As the sample size n gets large, tdf converges to the standard normal distribution Z ~ N(0, 1). So the T-test is especially useful when n < 30. … is actually a family of distributions, indexed by the degrees of freedom, labeled tdf. Z ~ N(0, 1) t4 .025 William S. Gossett (1876 - 1937) 1.96 As the sample size n gets large, tdf converges to the standard normal distribution Z ~ N(0, 1). So the T-test is especially useful when n < 30. Lecture Notes Appendix… or… qt(.975, 4) qt(.025, 4, lower.tail = F) [1] 2.776445 … is actually a family of distributions, indexed by the degrees of freedom, labeled tdf. Z ~ N(0, 1) t4 .025 William S. Gossett (1876 - 1937) .025 1.96 2.776 Because any t-distribution has heavier tails than the Z-distribution, it follows that for the same right-tailed area value, t-score > z-score. If n is small, T-score > 2. … the “T-score" increases (from ≈ 2 to a max of 12.706 for a 95% confidence level) as n decreases larger margin of error less power to reject, even if a genuine statistically significant difference exists! If n is large, T-score ≈ 2. Given: X = Age at first birth ~ N(μ , σ ) • H0: μ = 25.4 yrs Null Hypothesis • HA: μ ≠ 25.4 yrs Alternative Hypothesis Previously… σ = 1.5 yrs, n = 400, x 25.6 yrs statistically significant at = .05 Now suppose that σ is unknown, and n < 30. Example: n = 16, x 25.9 yrs, s = 1.22 yrs • standard error (estimate) = • .025 critical value = t15, .025 s 1.22 yrs 0.305 yrs 16 n Lecture Notes Appendix… Given: X = Age at first birth ~ N(μ , σ ) • H0: μ = 25.4 yrs Null Hypothesis • HA: μ ≠ 25.4 yrs Alternative Hypothesis Previously… σ = 1.5 yrs, n = 400, x 25.6 yrs statistically significant at = .05 Now suppose that σ is unknown, and n < 30. Example: n = 16, x 25.9 yrs, s = 1.22 yrs • standard error (estimate) = s 1.22 yrs 0.305 yrs 16 n • .025 critical value = t15, .025 = 2.131 95% Confidence Interval = (25.9 – 0.65, 25.9 + 0.65) = (25.25, 26.55) yrs p-value = 2 P ( X 25.9) 2P(T15 1.639) Test Statistic: T15 25.9 - 25.4 0.305 95% margin of error = (2.131)(0.305 yrs) = 0.65 yrs Lecture Notes Appendix… Given: X = Age at first birth ~ N(μ , σ ) • H0: μ = 25.4 yrs Null Hypothesis • HA: μ ≠ 25.4 yrs Alternative Hypothesis Previously… σ = 1.5 yrs, n = 400, x 25.6 yrs statistically significant at = .05 Now suppose that σ is unknown, and n < 30. Example: n = 16, x 25.9 yrs, s = 1.22 yrs • standard error (estimate) = s 1.22 yrs 0.305 yrs 16 n • .025 critical value = t15, .025 = 2.131 95% Confidence Interval = (25.9 – 0.65, 25.9 + 0.65) = (25.25, 26.55) yrs p-value = 2 P ( X 25.9) 2 P(T15 1.639) = 2 (between .05 and .10) = between .10 and .20. 95% margin of error = (2.131)(0.305 yrs) = 0.65 yrs The 95% CI does contain the null value μ = 25.4. The p-value is between .10 and .20, i.e., > .05. (Note: The R command 2 * pt(1.639, 15, lower.tail = F) gives the exact p-value as .122.) Not statistically significant; small n gives low power! Edited R code: y = rnorm(16, 0, 1) z = (y - mean(y)) / sd(y) x = 25.9 + 1.22*z Generates a normally-distributed random sample of 16 age values… sort(round(x, 1)) 24.0 24.2 24.6 24.7 25.1 25.6 25.6 25.7 25.9 26.0 26.4 27.0 27.0 27.2 27.6 28.0 c(mean(x), sd(x)) [1] 25.90 1.22 … with sample mean = 25.9 and sample sd = 1.22. t.test(x, mu = 25.4) One Sample t-test data: x t = 1.6393, df = 15, p-value = 0.1219 alternative hypothesis: true mean is not equal to 25.4 95 percent confidence interval: 25.24991 26.55009 sample estimates: mean of x 25.9 See… http://pages.stat.wisc.edu/~ifischer/Intro_Stat/Lecture_Notes/6__Statistical_Inference/HYPOTHESIS_TESTING_SUMMARY.pdf