Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Probability
Distributions
(Session 04)
SADC Course in Statistics
Learning Objectives
At the end of this session you will be able to:
• solve basic problems concerning realvalued probability distributions.
• distinguish between discrete and
continuous random variables (r.v.’s).
• explain what is meant by a probability
distribution.
• calculate the population mean and variance
of a given distribution.
To put your footer here go to View > Header and Footer
2
Session Contents
In this session you will
• be introduced to the theory of probability
distributions.
• be shown how to build a firm foundation of
the theory of probability distributions in
preparation for applications in statistical
inference (Module H2).
• strengthen the mathematical skills that are
required to deal correctly with probability
ideas.
To put your footer here go to View > Header and Footer
3
Random variables
• In the previous two sessions we dealt with
probabilities of events.
• In practice events of interest are those
generated by random variables.
• A random variable is a variable that
associates outcomes in the sample space
with numerical values.
To put your footer here go to View > Header and Footer
4
An example – birth of a baby
Girl
Line showing
numerical scale
X
Boy
Sample space
0
1
The figure above depicts a random variable
X defined as
X=0
if outcome is a boy
X=1
if outcome is a girl
To put your footer here go to View > Header and Footer
5
A second example:
Often the outcomes are actual measurements.
Thus, we could have:
a random variable Y which records
measurements of weights (of say maize cobs)
into numbers with kilograms as units.
Outcomes of any experiment can be recorded
as real numbers by defining an appropriate
random variable.
We do this because it is easier to work with
numbers.
To put your footer here go to View > Header and Footer
6
Types of random variables
• A random variable is said to be discrete if
the set of possible values is countable.
Examples of discrete random variables are
those that records events on gender, family
size, number of traffic offenses, ...
• A random variable is said to be continuous
if the set of possible values is not countable.
Examples of continuous random variables
are those that record events such as weight,
height, time, etc ...
To put your footer here go to View > Header and Footer
7
Continuous to discrete?
• Continuous random variables can be mapped
into discrete random variables by grouping.
• For example, age X is a continuous random
variable since it is a measure of time since
birth.
• We can define a discrete random variable Y as
Y = 1 if 0≤X<5.
= 2 if 5≤X<10
= 3 if 10≤X<15
= etc.
• You cannot convert a discrete random variable
into a continuous one.
To put your footer here go to View > Header and Footer
8
Probability distributions
• A probability distribution is a table, a
function or a graph that presents possible
outcomes of a trial, say E (e.g. throw of a
die), together with their corresponding
probabilities.
• Note that the outcome probabilities must
sum to 1 since occurrence of E results in
exactly one outcome.
To put your footer here go to View > Header and Footer
9
An example
• The following is an example of a probability
distribution for the gender of a new born
child:
Outcome
Values (x) of a
random variable X
P(x)
Male
0
0.5
Female
1
0.5
Total
1
To put your footer here go to View > Header and Footer
10
Probability mass/density function
A probability distribution can sometimes be
specified using a function f called a
probability (mass/density) function.
The function f must satisfy the following
conditions:
1. f (x)  0 for all x.
2.
 f (x)  1
allx
or
 f (x)dx  1.
To put your footer here go to View > Header and Footer
11
Points to note:
The function P(x) of the slide 10 is a
probability mass function since it satisfies
the two conditions above.
Point 1 of slide 11 satisfies the first law of
probability, as it must since P(x) represents
a probability.
Point 2 of slide 11 indicates that the sum is
used if the set of values x is countable;
otherwise the integral applies.
To put your footer here go to View > Header and Footer
12
Expected values
The weighted “centre” of a probability
distribution is called the expected value
written E(X).
More formally the expected value of a random
variable X is defined as:
E( X )   xf ( x ),
in the discrete case.
allx
E( X ) 
 xf ( x)dx,
in the continuous case.
E(X) is also called the population mean and
is usually denoted by  .
To put your footer here go to View > Header and Footer
13
Example (i)
• If f(x) is given by
x
f(x) = Prob(x)
0
0.5
1
0.5
Total
1
then E(X) = 0(0.5) + 1(0.5) = 0.5
To put your footer here go to View > Header and Footer
14
Example (ii)
Let f(x) = 2x, for 0  x  1
1
E ( X )   xf ( x)dx
0
1
3 1
x
  2 x dx  2
3
0
2
0
2
 .
3
To put your footer here go to View > Header and Footer
15
Moments
• The k-th moment of a random variable X is
defined as:
E ( X )   x f ( x),
k
k
in the discrete case.
x 0
E( X k ) 
k
x
 f ( x)dx, in the continuous case.
The moments of a distribution characterize
the shape of a distribution. The notation k
is often used to denote the k-th moment.
To put your footer here go to View > Header and Footer
16
Class exercise
Suppose a coin is tossed twice.
(a) Write down the possible values for the
random variable X defined as:
X = number of heads that occur
(b) Prepare a table showing the probability
distribution function of X
(c) Use this table to determine the
expected value of X
To put your footer here go to View > Header and Footer
17
Measures of spread
• The variance of a random variable X is
defined as
Var( X )  E ( X   )  E ( X )   .
2
2
2
• Notice that E(X2) is the second moment of X.
• The variance of X is also called the
population variance and is denoted by 2.
• The square root of the variance is called the
standard deviation of X. It is denoted by .
To put your footer here go to View > Header and Footer
18
Patterns for differing variances
  0.25
2
 1
2
Note that the bigger the variance,
the larger is the spread.
To put your footer here go to View > Header and Footer
19
Skewness and kurtosis
• If the probability distribution is not
symmetrical about the mean it is said to be
skew. The distribution has a positive
skewness if the tail of high values is longer
than the tail of low values, and negative
skewness if the reverse is true.
• Kurtosis is a measure of the peakness of a
probability distribution. It is usually used as a
comparison with the normal distribution (see
later sessions) since a kurtosis of more than 3
indicates that the distribution has a higher
peak than the normal distribution.
To put your footer here go to View > Header and Footer
20
Cumulative probability distribution
• In many applications we want to calculate
probabilities of the type P(X≤k) or P(X>k)
instead of P(X=k).
• The probabilities P(X≤k) for k = 0, 1, 2, ..
provide an example of what is called the
cumulative distribution of a random
variable X.
• Here, the random variable X is discrete.
To put your footer here go to View > Header and Footer
21
Some results
• P(X>k) = 1 – P(X ≤k). This is a direct result
of the probability result that
P(Ac) = 1 – P(A).
• Similar results can be obtained for
continuous random variables.
That is, if a < b then the event {X ≤ a} is a
sub-event of the event {X ≤ b}.
Hence P(X ≤ a) < P(X ≤ b).
To put your footer here go to View > Header and Footer
22
Definition of F(x)
• The cumulative distribution at x, denoted
F(x), is formally defined as:
x
F ( x)   f ( y),
y 0
in the discrete case for a
positive random variable.
x
F ( x) 
 f ( y)dy,
in the continuous case
By definition, cumulative distribution is an
increasing function having certain
properties. These are shown below.
To put your footer here go to View > Header and Footer
23
Results concerning F(x)
• F (– ) = 0.
• F (+ ) = 1. This says that the total area
under the probability density function is 1.
• F(a) < F(b) for a<b.
Thus F is an increasing function.
• P( a < X ≤ b) = F(b) - F(a).
• P(X = x) = 0, for every point x if X is a
continuous random variable.
To put your footer here go to View > Header and Footer
24
An example using F(x) - discrete
A discrete r.v. X, representing the number of girls
in families with 5 children, has the foll: distn:
X = No. of girls
P(X=x)
0
0.03125
1
0.15625
2
0.31250
3
0.31250
4
0.15625
5
0.03125
F(x)
Complete
the table
with
values of
F(x)
What is the probability of 4 children or less?
To put your footer here go to View > Header and Footer
25
An example using F(x) - continuous
A continuous random variable r.v. X, has
probability density function given by
x
f ( x )  e ,x  0
What is its cumulative distribution function?
x
Answer:
F( x )   e dy  1  e
y
x
0
To put your footer here go to View > Header and Footer
26
Practical work follows to
ensure learning objectives
are achieved…
To put your footer here go to View > Header and Footer
27