Download Lecture 5 - Statistics

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Statistics 400 - Lecture 5
 Last class: Finished 4.1-4.4 and started 4.5
 Today: Finish 4.5 and begin discrete random variables (5.1-5.5)
 Next Day: More discrete random variables (5.5-5.7) and begin
continuous R.V.’s (6.1-6.3)
 Assignment #2: 4.14, 4.24, 4.41, 4.61, 4.79, 5.13(a and c), 5.32,
5.68, 5.80
 Due in class Tuesday, October 2
Independent Events
 Two events are independent if:
P( A | B)  P( A)
 The intuitive meaning is that the outcome of event B does not
impact the probability of any outcome of event A
 Alternate form:
P( A and B)  P( A) P( B)
Example
 Flip a coin two times
 S=
 A={head observed on first toss}
 B={head observed on second toss}
 Are A and B independent?
Example
 Mendel used garden peas in experiments that showed inheritance
occurs randomly
 Seed color can be green or yellow
 {G,G}=Green otherwise pea is yellow
 Suppose each parent carries both the G and Y genes
 M ={Male contributes G}; F ={Female contributes G}
 Are M and F independent?
Example (Randomized Response Model)
 Can design survey using conditional probability to help get honest
answer for sensitive questions
 Want to estimate the probability someone cheats on taxes
 Questionnaire:
 1. Do you cheat on your taxes?
 2. Is the second hand on the clock between 12 and 3?

YES
NO
 Methodology: Sit alone, flip a coin and if the outcome is heads
answer question 1 otherwise answer question 2
More on Probability
 Will take a more formal look at describing random phenomenon
 A random variable, X, associates a numerical value to each
outcome of an experiment
 Will consider two types:
 Discrete random variables
 Continuous random variables
Discrete versus Continuous
 Discrete random variables have either a finite number of values or
infinitely many values that can be ordered in a sequence
 Continuous random variables take on all values in some interval(s)
Examples
 Discrete or continuous
 Number of people arriving in a supermarket
 Hair color of randomly selected people
 Weight lost from a diet program
 Random number between 0 and 4
Discrete Random Variables
 Describe chances of observing values for a discrete random variable
by probability distribution
 Probability distribution of a discrete random variable, X, is the list of
distinct numerical outcomes and associated probabilities
Value of X
Probability f(xi)
x1
f(x1)
x2
…
f(x2) …
xk
f(xk)
 If distribution is estimated from data, it is called the empirical
distribution
Properties
 f (x ) 
i
k
  f (x ) 
i 1
i
for each value xi of X
 Can display distribution using a probability histogram
 X-axis represents outcomes
 Y-axis is the probability of each outcome
 Use rectangles, centered at each value of X, to display probabilities
Example
 Probability distribution for number people in a randomly selected
household
X=# people
f(xi)
1
2
3
4
5
6
7
0.25 0.32 0.17 0.15 0.07 0.03
 Draw the probability histogram
Mean and Variance for Discrete
Random Variables
 Suppose have 1000 people in a population (500 male and 500
female) and average age of the males is 26 and average age of
females is 24
 What is the mean age in the population?
 Suppose have 1000 people in a population (900 male and 100
female) and average age of males is 26 and average age of females
is 24
 What is the mean age in the population?
 Mean must consider chance of each outcome
 Mean is not necessarily one of the possible outcomes
 Is a weighted average of the outcomes
 The mean (or expected value) of a discrete R.V., X, is denoted E(X)
k
E( X )   f ( x ) x
i 1
 Is also denoted as 
   E (X )
i
i
 Variance of a discrete R.V. weights the squared deviations from the
mean by the probabilities
  Var ( X )   ( x   ) f ( x )
k
2
i 1
2
i
i
 The standard deviation is
   (x  ) f (x )
k
i 1
2
i
i
Example
 Compute mean and variance of number of people in a household
Example (true story)
 People use expectation in real life
 Parking at Simon Fraser University (B.C., Canada) is $9.00 per day
 Fine for parking illegally is $10.00
 When parking illegally, get caught roughly half the time
 Should you pay the $9.00 or risk getting caught?
 Probability Model - is an assumed form of a distribution of a
random variable
Bernoulli Distribution
 Bernoulli distribution:
 Each trial has 2 outcomes (success or failure)
 Prob. of a success is same for each trial
 Prob. of a success is denoted as p
 Prob. of a failure, q, is
 Trials are independent
 If X is a Bernoulli random variable, its distribution is described by
f ( x)  P( X  x)  p x q1 x
 where X=0 (failure) or X=1 (success)
Example
 A backpacker has 3 emergency flares, each which light with
probability of 0.98.
 Find probability the first flare used will light
 Find probability that first 2 flares used both light
 Find probability that exactly 2 flares light
Mean and Standard Deviation
 Mean:
 Standard Deviation:
Related documents