Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Statistics 400 - Lecture 5 Last class: Finished 4.1-4.4 and started 4.5 Today: Finish 4.5 and begin discrete random variables (5.1-5.5) Next Day: More discrete random variables (5.5-5.7) and begin continuous R.V.’s (6.1-6.3) Assignment #2: 4.14, 4.24, 4.41, 4.61, 4.79, 5.13(a and c), 5.32, 5.68, 5.80 Due in class Tuesday, October 2 Independent Events Two events are independent if: P( A | B) P( A) The intuitive meaning is that the outcome of event B does not impact the probability of any outcome of event A Alternate form: P( A and B) P( A) P( B) Example Flip a coin two times S= A={head observed on first toss} B={head observed on second toss} Are A and B independent? Example Mendel used garden peas in experiments that showed inheritance occurs randomly Seed color can be green or yellow {G,G}=Green otherwise pea is yellow Suppose each parent carries both the G and Y genes M ={Male contributes G}; F ={Female contributes G} Are M and F independent? Example (Randomized Response Model) Can design survey using conditional probability to help get honest answer for sensitive questions Want to estimate the probability someone cheats on taxes Questionnaire: 1. Do you cheat on your taxes? 2. Is the second hand on the clock between 12 and 3? YES NO Methodology: Sit alone, flip a coin and if the outcome is heads answer question 1 otherwise answer question 2 More on Probability Will take a more formal look at describing random phenomenon A random variable, X, associates a numerical value to each outcome of an experiment Will consider two types: Discrete random variables Continuous random variables Discrete versus Continuous Discrete random variables have either a finite number of values or infinitely many values that can be ordered in a sequence Continuous random variables take on all values in some interval(s) Examples Discrete or continuous Number of people arriving in a supermarket Hair color of randomly selected people Weight lost from a diet program Random number between 0 and 4 Discrete Random Variables Describe chances of observing values for a discrete random variable by probability distribution Probability distribution of a discrete random variable, X, is the list of distinct numerical outcomes and associated probabilities Value of X Probability f(xi) x1 f(x1) x2 … f(x2) … xk f(xk) If distribution is estimated from data, it is called the empirical distribution Properties f (x ) i k f (x ) i 1 i for each value xi of X Can display distribution using a probability histogram X-axis represents outcomes Y-axis is the probability of each outcome Use rectangles, centered at each value of X, to display probabilities Example Probability distribution for number people in a randomly selected household X=# people f(xi) 1 2 3 4 5 6 7 0.25 0.32 0.17 0.15 0.07 0.03 Draw the probability histogram Mean and Variance for Discrete Random Variables Suppose have 1000 people in a population (500 male and 500 female) and average age of the males is 26 and average age of females is 24 What is the mean age in the population? Suppose have 1000 people in a population (900 male and 100 female) and average age of males is 26 and average age of females is 24 What is the mean age in the population? Mean must consider chance of each outcome Mean is not necessarily one of the possible outcomes Is a weighted average of the outcomes The mean (or expected value) of a discrete R.V., X, is denoted E(X) k E( X ) f ( x ) x i 1 Is also denoted as E (X ) i i Variance of a discrete R.V. weights the squared deviations from the mean by the probabilities Var ( X ) ( x ) f ( x ) k 2 i 1 2 i i The standard deviation is (x ) f (x ) k i 1 2 i i Example Compute mean and variance of number of people in a household Example (true story) People use expectation in real life Parking at Simon Fraser University (B.C., Canada) is $9.00 per day Fine for parking illegally is $10.00 When parking illegally, get caught roughly half the time Should you pay the $9.00 or risk getting caught? Probability Model - is an assumed form of a distribution of a random variable Bernoulli Distribution Bernoulli distribution: Each trial has 2 outcomes (success or failure) Prob. of a success is same for each trial Prob. of a success is denoted as p Prob. of a failure, q, is Trials are independent If X is a Bernoulli random variable, its distribution is described by f ( x) P( X x) p x q1 x where X=0 (failure) or X=1 (success) Example A backpacker has 3 emergency flares, each which light with probability of 0.98. Find probability the first flare used will light Find probability that first 2 flares used both light Find probability that exactly 2 flares light Mean and Standard Deviation Mean: Standard Deviation: