Download Probability Distributions Continuous Probability Distributions The

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Central limit theorem wikipedia , lookup

Transcript
Chapter 6
6-1
The Normal Distribution and
Other Continuous Distributions
Learning Objectives
Probability Distributions
Probability
Distributions
In this chapter, you learn:
The Normal Distribution
Ch. 5
The Standardized Normal Distribution
Evaluating the Normality Assumption
The Uniform Distribution
The Exponential Distribution
Discrete
Probability
Distributions
Continuous
Probability
Distributions
Binomial
Normal
Poisson
Uniform
Hypergeometric
Ch. 6
Exponential
1
2
Continuous Probability Distributions
A continuous random variable is a variable that can assume any
value on a continuum (can assume an uncountable number of
values)
thickness of an item
time required to complete a task
temperature of a solution
height, in inches
Values from interval of numbers
Absence of gaps
These can potentially take on any value, depending only on the
ability to measure accurately.
Continuous Probability Distribution
The Normal Distribution
‘Bell Shaped’
Symmetrical
Mean, Median and Mode are Equal
f(X)
Location is determined by the mean, µ
Spread is determined by the standard
deviation, σ
σ
X
µ
The random variable has an infinite
theoretical range: + ∞ to − ∞
Interquartile Range equals 1.33 σ
Online normal distributions – uniform to
normal
Distribution of continuous random variable
Mean
= Median
= Mode
Most Important Continuous Probability Distribution
The normal distribution
3
Basic Business Statistics, 10/e
4
© 2006 Prentice Hall, Inc.
Chapter 6
6-2
The (Standard) Normal Probability
Density Function
Many Normal Distributions
There are an Infinite Number of Normal Distributions
The formula for the normal
probability density function is
f(X) =
Where
The formula for the standardized
normal probability density function
is
2
1
e −(1/2)[(X −µ)/σ]
2πσ
f(Z) =
1
2π
e − (1/2)Z
2
e = the mathematical constant approximated by 2.71828
π = the mathematical constant approximated by 3.14159
µ = the population mean
σ = the population standard deviation
By varying the parameters µ and σ, we obtain different
normal distributions
X = any value of the continuous variable
Z = any value of the standardized normal distribution
Changing µ shifts the distribution left or right.
Changing σ increases or decreases the spread.
5
Any normal distribution (with any mean and standard deviation
combination) can be transformed into the standardized normal
distribution (Z)
Need to transform X units into Z units
The Standardized Normal
Distribution
Example
When X is normally distributed with a mean µ and a standard deviation σ, Z =
(X - µ)/σ follows a standardized (normalized) normal distribution with a mean
0 and a standard deviation 1 (always).
Values above the mean have positive Z-values, values below the mean have
negative Z-values
Z=
µ
µZ = 0
Basic Business Statistics, 10/e
X
If X is distributed normally with mean of 100
and standard deviation of 50, the Z value for
X = 200 is
Z=
σ
σZ =1
X ~ N ( µ ,σ 2 )
Z ~ N ( 0,1)
X −µ
σ
f(Z)
6
X − µ 200 − 100
=
= 2.0
σ
50
This says that X = 200 is two standard
deviations (2 increments of 50 units) above
the mean of 100.
Z
7
8
© 2006 Prentice Hall, Inc.
Chapter 6
6-3
Finding Normal Probabilities
Comparing X and Z units
Probability is the
Probability is measured
area under the
curve! under the curve
f(X)
100
0
200
2.0
X
Z
by the area
P (a ≤ X ≤ b)
= P (a < X < b)
(µ = 100, σ = 50)
(µ = 0, σ = 1)
(Note that the
probability of any
individual value is zero)
Note that the distribution is the same, only the
scale has changed. We can express the problem in
original units (X) or in standardized units (Z)
a
b
X
9
10
Probability as
Area Under the Curve
Empirical Rules
The total area under the curve is 1.0, and the curve is
symmetric, so half is above the mean, half is below
f(X) P( −∞ < X < µ) = 0.5
What can we say about the distribution of values
around the mean? There are some general rules:
f(X)
P(µ < X < ∞ ) = 0.5
σ
0.5
σ
0.5
µ
X
µ-1σ
P( −∞ < X < ∞) = 1.0
µ µ+1σ
68.26%
11
Basic Business Statistics, 10/e
µ ± 1σ encloses about
68% of X’s
X
12
© 2006 Prentice Hall, Inc.
Chapter 6
6-4
The Standardized Normal Table
The Empirical Rule
(continued)
µ ± 2σ covers about 95% of X’s
µ ± 3σ covers about 99.7% of X’s
2σ
3σ
2σ
x
µ
3σ
x
µ
95.44%
The Cumulative Standardized Normal table
in the textbook (Appendix table E.2) gives the
probability less than a desired value for Z
(i.e., from negative infinity to Z)
=NORMSDIST(2)
0.9772
Example:
P(Z < 2.00) = 0.9772
99.73%
0
2.00
Z
13
14
General Procedure for
Finding Probabilities
The Standardized Normal Table
(continued)
Z
The column gives the value of
Z to the second decimal point
To find P(a < X < b) when X is distributed normally:
0.00
Translate X-values to Z-values
0.01
Draw the normal curve for the problem in terms of X
0.02 …
Use the Standardized Normal Table
The row shows
the value of Z
to the first
decimal point
0.0
Suppose X is normal with mean 8.0 and standard deviation 5.0
0.1
Find P(X < 8.6)
.
.
.
2.0
.9772
2.0
P(Z < 2.00) = 0.9772
The value within the
table gives the
probability from Z = − ∞
up to the desired Z
value
X
8.0
8.6
15
Basic Business Statistics, 10/e
© 2006 Prentice Hall, Inc.
16
Chapter 6
6-5
Finding Normal Probabilities
Solution: Finding P(Z < 0.12)
(continued)
Suppose X is normal with mean 8.0 and standard deviation 5.0. Find
P(X < 8.6)
=NORMDIST(8.6,8,10,TRUE)
=NORMSDIST(0.12)
Z=
Standardized Normal Probability
Table (Portion)
X − µ 8.6 − 8.0
=
= 0.12
σ
5.0
Z
.01
.02
.5478
0.0 .5000 .5040 .5080
µ=0
σ=1
µ=8
σ = 10
.00
P(X < 8.6)
= P(Z < 0.12)
0.1 .5398 .5438 .5478
0.2 .5793 .5832 .5871
8 8.6
X
P(X < 8.6)
Z
Z
0 0.12
0.00
0.3 .6179 .6217 .6255
0.12
P(Z < 0.12)
17
More Examples of Normal
Distribution Using Excel
Upper Tail Probabilities
A set of final exam grades was found to be normally
distributed with a mean of 73 and a standard deviation of 8.
Suppose X is normal with mean 8.0 and
standard deviation 5.0.
Now Find P(X > 8.6)
What is the probability of getting a grade no higher than 91
on this exam? =NORMDIST(91,73,8,TRUE)
X ~ N ( 73,82 )
Mean
Standard Deviation
P ( X ≤ 91) = ?
18
σ =8
73
8
Probability for X <=
X Value
91
Z Value
2.25
P(X<=91)
0.9877756
X
µ = 73 91
0
X
8.0
Z
8.6
2.25
19
Basic Business Statistics, 10/e
20
© 2006 Prentice Hall, Inc.
Chapter 6
6-6
Probability Between
Two Values
Upper Tail Probabilities
(continued)
Suppose X is normal with mean 8.0 and standard deviation 5.0.
Now Find P(X > 8.6)…
=1-NORMDIST(8.6,8,10,TRUE) or =1-NORMSDSIT(0.12)
Suppose X is normal with mean 8.0 and standard deviation 5.0.
Find P(8 < X < 8.6)
=NORMDIST(8.6,8,5,TRUE)-0.5 or =NORMSDIST(0.12)-0.5
P(X > 8.6) = P(Z > 0.12) = 1.0 - P(Z ≤ 0.12)
Calculate Z-values:
= 1.0 - 0.5478 = 0.4522
0.5478
1.000
1.0 - 0.5478
= 0.4522
8.0 8.6
X
.00
.01
.02
22
Probabilities in the Lower Tail
P(8 < X < 8.6)
= P(0 < Z < 0.12)
= P(Z < 0.12) – P(Z ≤ 0)
= 0.5478 - .5000 = 0.0478
0.0 .5000 .5040 .5080
Z
21
Solution: Finding P(0 < Z < 0.12)
Standardized Normal Probability
Table (Portion)
X
0 0.12
= P(0 < Z < 0.12)
0.12
0.12
8 8.6
P(8 < X < 8.6)
0
0
Z
X −µ 8 −8
=
=0
σ
5
X − µ 8.6 − 8
Z=
= 0.12
=
σ
5
Z
Z
Z=
Suppose X is normal with mean 8.0 and
standard deviation 5.0.
Now Find P(7.4 < X < 8)
0.0478
0.5000
0.1 .5398 .5438 .5478
0.2 .5793 .5832 .5871
0.3 .6179 .6217 .6255
X
Z
8.0
0.00
7.4
0.12
23
Basic Business Statistics, 10/e
24
© 2006 Prentice Hall, Inc.
Chapter 6
6-7
More Examples of Normal
Distribution Using Excel
Probabilities in the Lower Tail
(continued)
P(7.4 < X < 8)
= P(-0.12 < Z < 0)
X ~ N ( 73,82 )
0.0478
= P(Z < 0) – P(Z ≤ -0.12)
= 0.5000 - 0.4522 = 0.0478
7.4 8.0
-0.12 0
P ( 65 ≤ X ≤ 89 ) = ?
Probability for a Range
From X Value
65
To X Value
89
Z Value for 65
-1
Z Value for 89
2
P(X<=65)
0.1587
P(X<=89)
0.9772
P(65<=X<=89)
0.8186
0.4522
The Normal distribution is
symmetric, so this probability
is the same as P(0 < Z < 0.12)
(continued)
What percentage of students scored between 65 and 89
inclusively? Mean=73 and Std=8
=NORMDIST(89,73,8,TRUE)-NORMDIST(65,73,8,TRUE)
Now Find P(7.4 < X < 8)…
=0.5-NORMDIST(7.4,8,5,TRUE) or =0.5-NORMSDIST(-0.12)
X
Z
X
65
µ = 73
89
-1 0
Z
2
25
Finding the X value for a
Known Probability
26
Finding the X value for a
Known Probability
(continued)
Given a probability value 0.975, to find the corresponding z value = 1.96?
1). Draw a Normal Curve
2). Identify the area on the normal curve
3). Use the Normal Table to find the answer
=NORMSINV(0.975)
Example:
Suppose X is normal with mean 8.0 and standard deviation 5.0.
Now find the X value so that only 20% of all values are below this X
=NORMINV(0.20,8,5)
=NORMSINV(-0.84)
Steps to find the X value for a known probability:
1. Find the Z value for the known probability
2. Convert to X units using the formula:
=NORMINV(p*,µ,σ) or =µ+NORMSINV(p*)*σ
0.2000
X = µ + Zσ
?
?
27
Basic Business Statistics, 10/e
8.0
0
X
Z
28
© 2006 Prentice Hall, Inc.
Chapter 6
6-8
Find the Z value for
20% in the Lower Tail
Finding the X value
1. Find the Z value for the known probability
2. Convert to X units using the formula:
Standardized Normal Probability 20% area in the lower
Table (Portion)
tail is consistent with a
Z
-0.9
…
.03
.04
.05
Z value of -0.84
= 8.0 + ( −0.84)5.0
… .1762 .1736 .1711
-0.8 … .2033 .2005 .1977
-0.7
X = µ + Zσ
= 3.80
0.2000
… .2327 .2296 .2266
X
Z
?
8.0
-0.84 0
So 20% of the values from a distribution
with mean 8.0 and standard deviation
5.0 are less than 3.80
29
More Examples of Normal
Distribution Using Excel
More Examples of Normal Distribution
Using Excel
(continued)
Only 5% of the students taking the test scored higher than what
grade? Mean=73 and STD=8
=NORMINV(0.95,73,8)
X ~ N ( 73,82 )
30
Online normal distribution calculator
The middle 50% of the students scored between what two
scores?
=NORMINV(0.25,73,8) and =NORMINV(0.75,73,8)
X ~ N ( 73,82 )
P ( X > ? ) = .05
P ( a ≤ X ≤ b ) = .50
Find X and Z Given Cum. Pctage.
Cumulative Percentage
25.00%
Z Value
-0.67449
X Value
67.60408
Find X and Z Given Cum. Pctage.
Cumulative Percentage
95.00%
Z Value
1.644853
X Value
86.15882
.25
.25
X
µ = 73 ? =86.16
0
Z
1.645
X
Find X and Z Given Cum. Pctage.
Cumulative Percentage
75.00%
Z Value
0.67449
X Value
78.39592
31
Basic Business Statistics, 10/e
(continued)
67.6 µ = 73 78.4
-0.67 0
Z
0.67
32
© 2006 Prentice Hall, Inc.
Chapter 6
6-9
Evaluating Normality
Assessing Normality
(continued)
Observe the distribution of the data set
Not all continuous random variables are normally
distributed
It is important to evaluate how well the data set is
approximated by a normal distribution
Construct charts or graphs
Do approximately 2/3 of the observations lie within
mean ±1 standard deviation?
Do approximately 4/5 or 80% of the observations lie
within mean ±1.28 standard deviations?
Do approximately 19/20 or 95% of the observations
lie within mean ±2 standard deviations?
For small- or moderate-sized data sets, do stem-and-leaf
display and box-and-whisker plot look symmetric?
For large data sets, does the histogram or polygon appear bellshaped?
Evaluate normal probability plot
Compute descriptive summary measures
Is the normal probability plot approximately linear
with positive slope?
Do the mean, median and mode have similar values?
Is the interquartile range approximately 1.33 σ?
Is the range approximately 6 σ?
33
The Normal Probability Plot
34
The Normal Probability Plot
(continued)
A normal probability plot for data
from a normal distribution will be
approximately linear:
Normal probability plot
Arrange data into ordered array
Find corresponding standardized normal quantile
values
X
Plot the pairs of points with observed data values on
the vertical axis and the standardized normal quantile
values on the horizontal axis
90
60
30
Evaluate the plot for evidence of linearity
-2
-1
0
35
Basic Business Statistics, 10/e
1
2
Z
36
© 2006 Prentice Hall, Inc.
Chapter 6
6-10
Normal Probability Plot
The Uniform Distribution
(continued)
Left-Skewed
Right-Skewed
X 90
X 90
60
60
30
The uniform distribution is a
probability distribution that has equal
probabilities for all possible
outcomes of the random variable
30
-2 -1 0
1 2 Z
-2 -1 0
1
2 Z
Rectangular
Nonlinear plots
indicate a deviation
from normality
X 90
60
Also called a rectangular distribution
30
-2 -1 0
1
2 Z
37
38
Properties of the
Uniform Distribution
The Uniform Distribution
(continued)
The Continuous Uniform Distribution:
1
b−a
if a ≤ X ≤ b
0
otherwise
The mean of a uniform distribution is
µ=
f(X) =
a+b
2
The standard deviation is
where
f(X) = value of the density function at any X value
a = minimum value of X
b = maximum value of X
σ=
(b - a) 2
12
39
Basic Business Statistics, 10/e
40
© 2006 Prentice Hall, Inc.
Chapter 6
6-11
Uniform Distribution Example
Uniform Distribution Example
(continued)
Example: Uniform probability distribution
over the range 2 ≤ X ≤ 6:
Example: Using the uniform probability
distribution to find P(3 ≤ X ≤ 5):
1
f(X) = 6 - 2 = 0.25 for 2 ≤ X ≤ 6
P(3 ≤ X ≤ 5) = (Base)(Height) = (2)(0.25) = 0.5
f(X)
f(X)
a+b 2+6
µ=
=
=4
2
2
0.25
2
6
X
σ=
(b - a) 2
=
12
0.25
(6 - 2) 2
= 1 . 1547
12
2
3
4
5
6
X
41
The Exponential Distribution
42
The Exponential Distribution
Often used to model the length of time
between two occurrences of an event (the
time between arrivals)
Defined by a single parameter, its mean λ
(lambda)
The probability that an arrival time is less than
some specified time X is
Examples:
P(arrival time < X) = 1 − e − λX
Time between trucks arriving at an unloading dock
Time between transactions at an ATM Machine
Time between phone calls to the main operator
where
e = mathematical constant approximated by 2.71828
λ = the population mean number of arrivals per unit
X = any value of the continuous variable where 0 < X <
43
Basic Business Statistics, 10/e
∞
44
© 2006 Prentice Hall, Inc.
Chapter 6
6-12
Exponential Distribution
Example
Normal Approximation to the
Binomial Distribution
Example: Customers arrive at the service counter at
the rate of 15 per hour. What is the probability that the
arrival time between consecutive customers is less
than three minutes?
The mean number of arrivals per hour is 15, so λ = 15
Three minutes is 0.05 hours
P(arrival time < .05) = 1 – e-λX = 1 – e-(15)(0.05) = 0.5276
So there is a 52.76% probability that the arrival time
between successive customers is less than three
minutes
The binomial distribution is a discrete
distribution, but the normal is continuous
To use the normal to approximate the binomial,
accuracy is improved if you use a correction for
continuity adjustment
Example:
X is discrete in a binomial distribution, so P(X = 4)
can be approximated with a continuous normal
distribution by finding
P(3.5 < X < 4.5)
45
Normal Approximation to the
Binomial Distribution
(continued)
The closer p is to 0.5, the better the normal
approximation to the binomial
The larger the sample size n, the better the
normal approximation to the binomial
General rule:
46
Normal Approximation to the
Binomial Distribution
(continued)
The mean and standard deviation of the
binomial distribution are
µ = np
σ = np(1 − p)
The normal distribution can be used to approximate
the binomial distribution if
Transform binomial to normal using the formula:
np ≥ 5
Z=
and
X −µ
X − np
=
σ
np(1 − p)
n(1 – p) ≥ 5
47
Basic Business Statistics, 10/e
48
© 2006 Prentice Hall, Inc.
Chapter 6
6-13
Using the Normal Approximation
to the Binomial Distribution
Chapter Summary
If n = 1000 and p = 0.2, what is P(X ≤ 180)?
Approximate P(X ≤ 180) using a continuity correction
adjustment:
P(X ≤ 180.5)
Transform to standardized normal:
Presented key continuous distributions
normal, uniform, exponential
Found probabilities using formulas and tables
Recognized when to apply different distributions
X − np
180.5 − (1000)(0.2 )
Z=
=
= −1.54
np(1 − p)
(1000)(0.2 )(1− 0.2)
Applied distributions to decision problems
So P(Z ≤ -1.54) = 0.0618
180.5
-1.54
Basic Business Statistics, 10/e
200
0
X
Z
49
50
© 2006 Prentice Hall, Inc.