STAT Exam 2

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

For any particular interval we state that we are __ confident that the true parameter value lies between the lower and upper bounds of the confidence interval.

(1-a)*100%

the confidence level is denoted by

(1-a)/100% -> most of the time is 95% (a=0.05)

if the sampling distribution of p^ is normal, wee can use the z score formula :

(p^-p)/ square root(p(1-p)/n)

Example: Now suppose the economist didn't have this previous estimate. What size sample should be obtained if he wishes the estimate to be within 0.02 with 90% confidence?

.25(1.645/0.02)^2= 1691.27= 1692

find p(z<=-2.43)

0.0075

IQ scores have a bell shaped distribution with a mean of 100 and a standard deviation of 15. what is the probability that a randomly selected person has an IQ score between 75 and 90

0.2039

if there is no estimate for p^(1-p^) in the sample size equation use

0.25

in 2004 baseball season, inchiro Suzuki of the Seattle mariners set the record for the most hits in a season with a Toal of 262 hits. in the following probability distribution, the random variable X represents the number of hits inchiro obtained in a game. x 0. 1. 2. 3. 4. 5 p(x). 0.17. 0.33. 0.29 0.15 0.04 0.02 what is the probability that in a randomly selected game inchiro got more than 1 hit

0.50

for a normal distribution 95% of the observations fall between __ deviations from the mean

1.96

binomial distributions only have __ possible outcomes

2

the sample proportion will almost certainly fall between

3 standard deviations of the mean

a random variable with a standard normal distribution is typically denoted by

Z

normal probability plot

a graph that plots observed data versus normal scores

think of a population distribution as

a parameter that selects a subject at random from a population

The sampling distribution of 𝑝𝑝̂ is approximately normal for large samples. Thus, in confidence intervals for the population proportion 𝑝𝑝, the standard normal distribution is used to find the appropriate critical values. Critical values from a normal distribution are denoted 𝑧𝑧𝛼𝛼 2⁄ , which indicates they are the z-value with an area of 𝛼𝛼 2⁄ to the right. (a) Illustrate why the critical value for a 95% confidence interval for a population proportion p is 1.96. (b) Find the critical value for a 90% confidence interval for a population proportion p. (c) Find the critical value for a 99% confidence interval for a population proportion p.

a) 1-a=0.95 a=0.5 a/2= 0.025 -> in table is -1.96 and 1.96 and Z.025= 1.96 b) 1-a= 0.90 a=0.10 a/2= 0.05 ->-1.645 but Z.05 is 1.645 bc its to the right c) 1-a=0.99 a=0.01 a/2=0.005 ->2.575

A hot dog manufacturer claims that its new lean beef hot dogs have an average of 3.5g of fat per hot dog. The fat content of these hot dogs is known to be approximately normally distributed with a standard deviation of 0.5g. (a) What is the probability that one of these lean beef hot dogs has more than 4g of fat? (b) What is the probability that a randomly selected package of 8 lean beef hot dogs will have an average fat content greater than 4g?

a) 4-3.5/0.5= 1.00 -> 1-0.8413= 0.1587 b) 4-3.5/(.5/8) = 2.83 -> 1-0.9977 = 0.0023

the mean credit card debit for a US household is $7115 with a standard deviation of $2160. the mean is such a large value because of deeply indebted households. consider a random sample of 50 US households and let x bar represent sample mean and credit card debt a) describe the distribution of x bar b) what is the approximate probability that the mean credit card debt for a sample of 50 households is less than $6500

a) approximately normal with a mean of $7115 and a standard deviation of $305.47 (theta/ square root n = 2160/square root 50) b) x-mu/(theta/square root n)= 6500-7115/305.47 = -2.01 -> 0.0222

Example: According to the American Humane Society, 47% of American households own a dog. Let 𝑝𝑝̂ represent the proportion of households that own a dog in a random sample of 100 American households. (a) Describe the sampling distribution of 𝑝𝑝̂. (b) Draw a sketch of the sampling distribution of 𝑝𝑝̂. Label the horizontal axis with the mean and with the values that are one, two, and three standard deviations from the mean rounded to two decimal places. (c) What is the probability that 35 or fewer households in a random sample of 100 American households own a dog? (d) If you actually obtain a sample in which 35 of 100 randomly selected American households own a dog, would this cause you to suspect that the Humane Society's claim that 47% of American households own a dog is too high? Explain.

a) center= p= 0.47 shape: np(1-p)= 100*.47(1-0.47)= 24.91 >=10 so p is normal spread: square root (p(1-p)/n = .47(1-.47)/100)= 0.04991 b) draw normal distribution of values c) p^=35/100 = .35 z= p^-p/square root(p(1-p)/n) = .35-.47/0.04991 = -2.40 find z score in table-> 0.0082 d) yes because the probability calculated is a rare event, and having an observation out so far int eh tail makes us think its less than 47%

Example: In a random sample of 500 US adults, 255 answered "yes" to the question "Do you approve of the President's job performance?" The margin of error is reported to be 4%. (a) Identify the target parameter in this study. (b) Find the point estimate of the target parameter. (c) Construct the interval estimate for the target parameter. (d) Is this evidence that a majority of Americans approve of the President's performance?

a) p= true proportion of US adults that approve of presidents performance b) p^= x/n= 255/500= 0.51 c) p^ + E = .51+- 0.4 = (.47, .55) d) is p>=.5 -> no, the entire interval would need to be greater than .5 to claim majority

in a poll, 874 of 1016 candidates said they would vote for the president. a)find the point estimate of the proportion of all adults americans that would vote for the president b) find the 99% confidence interval

a) p^ = x/n = 874/1016= 0.86024 b) p^ +- 2.575(standard error) = 0.86024 +- 2.575(.08024*0.139/1016)

Example: Suppose that a particular candidate for public office (in a race with only two candidates) is in fact favored by 48% of the voting population in the district. (a) A polling organization takes a random sample of 625 voters and will use 𝑝𝑝̂, the sample proportion, to estimate p. What is the approximate probability that 𝑝𝑝̂ will be greater than 0.5, causing the polling organization to incorrectly predict the result of the upcoming election? (b) How would this probability change if the polling organization instead took samples of 2,500 voters? (c) Comment on how the sample size affects the likelihood of this polling organization incorrectly predicting the election results.

a) shape: np(1-p)= 156 >=10 center: p=.48 spread: 0.01998 z= .5-.48/0.01988= 1.00 z score in table is 0.8413 1-0.8413= 0.1587 b) theta = square root(0.48*.52/2500)= 0.01 z=0.50-0.48/0.01= 2.00 -> 1-0.9772= 0.0228 c) the distribution gets smaller so they are less likely to be incorrect

if sample data are taken from a populate that is normally distributed, a normal probability plot of the observed values versus the expected scores will be __

approximately linear

a bottling company uses a filling machine to fill plastic bottles with cola. the bottles are supposed to contain 300ml of cola. the contents vary according to normal distribution with mean=302 and standard deviation of 3. let x bar represent the contents of the bottles in a 6 pack. describe the sampling distribution

approximately normal with a mean of 302 and standard deviation of 1.22 -> 3/square root 6 = 1.22

the time required to get an oil change at a particular oil change facility is skewed right. records indicate that the mean time for an oil change at this facility is 11.4 minutes with a standard deviation of 3.2 minutes. let x bar represent the mean time for an oil change at this facility for a random sample of 50 cars. describe the sampling distribution of x bar -> normal or skewed, mean, standard deviation

approximately normal with mu xbar of 11,4 and theta xbar of 0.45 ->theta x bar = theta/square root n = 3.2/square root 50= 0.45 -> normal bc the sample size is greater than 30

point estimate

best guess for target parameter

For a binomial random variable X with n trials and probability p of success in each trial, the sampling distribution of the sample proportion of successes 𝑝𝑝̂ has the following characteristics:

center: the mean of the sampling distribution of 𝑝̂ is the probability of success p spread: theta=square root (p(1-p)/n) (standard error: as the sample size increases the spread decreases) shape: a sample size is large if np(1-p)>=10

every confidence interval has an associated __

confidence level

spread is described by the word

consistent

as sample size increases, the denominator increases so the standard error __

decreases

increasing sample size __ the width of confidence interval

decreases

round __ up even if its a small value

determining sample size for confidence intervals

a or alpha is the __ rate

error

confidence level contains the __ of confidence intervals that will contain __

expected proportion; the value of a target parameter if there is a large number of samples

if the population isn't normal, the sample size must be __ for x bar to be normal

greater than or equal to 30

the standard error of x bar describes

how much the sample mean varies from sample to sample

if a random sample if taken from a large population, the shape shows

if the population is normal, the sampling distribution of the mean will be normal regardless of size

increasing the confidence level __ the width of the confidence interval

increases

interval estimate

interval of numbers where the parameter is believed to fall

an. interval estimate is more useful than point estimate because

more likely to contain true parameter and lets us see the accuracy of the point estimate

population parameter mean symbol

mu

if a random sample if taken from a large population, the center shows

mu x = mu the mean of the sampling distribution is equal to the mean of the population

the margin of error is a __ of the sampling distribution

multiple of standard

The sample size required to obtain a (1 −𝛼)100% confidence interval for a population proportion p with a specified margin of error E is:

n= p^(1-p^)(Za/2 /E)^2

population parameter binomial symbol

p

(1-a)/100% confidence interval for p is:

p+-Za/2 * square root(p(1-p)/n)

sample statistic binomial symbol

p^

interval estimate formula

p^ +- E (E=margin of error)

a 95% confidence interval for p is :

p^ +-1.96*square root(p(1-p)/n) or the standard error formula point estimate +- critical value*standard error

An urban economist wishes to estimate the proportion of Americans who own their homes. A prior study found that 64.8% of Americans own their homes. What size sample should be obtained if he wishes the estimate to be within 0.02 with 90% confidence?

p^= 0.648 E=0.02 Za/2= 1.645 n= .648(0.352)(1.645/0.02)^2= 1543.08 = 1544

Example: A recent Gallup poll found that 493 of 1000 randomly selected U.S. adults believe it is the responsibility of the federal government to make sure all Americans have healthcare coverage. Based on these results, find a 95% confidence interval for the proportion of all U.S adults who believe it is the responsibility of the federal government to make sure all Americans have healthcare coverage.

p^= 493/1000 p= true proportion of all adults Americans that believe the gov is responsible for making sure all Americans have healthcare conditions: stand confidence interval: p^ +- Za/2 *(square root p^(1-p^)/n) = .493 +- 1.96(.493*.507/1000) = (.46, .52) we're 95% confident that the true proportion of adult Americans who believe its the responsibility of the government is between .46 and .52->about m95% of possible samples will result in a confidence interval that contains the true proportion

point estimate formula

p^= x/n= number of successes/sample size

suppose a random sample is taken where an individual either does or does not have a charcterisitic, the sample proportion p^ is:

p^=x/n= number of successes/ sample size

using x bar to estimate mu or using p^ to estimate p is an example of

point estiamtes

confidence interval formula

point estimate +/- margin of error

for quantitative data the target parameter is

population mean (mu)

the sampling distribution of the sample statistic is a

probability distribution for all possible values of the statistic computed from a sample of size

distiribution of the sample proportions __ data

qualitative

sample statistic standard deviation symbol

s

we verify the independence assumption by checking if

sample size n is no more than 5% of the population size N (n<=0.05N)

independence assumption violation

sample values must be independent of one another, so sampling is done without replacement

as the sample size increases what happens to the sampling distribution (shape, center, spread)

shape: normal center: mean stays consistent spread: decreases

we use normal probability plots for __ because __

small data sets; the shape of the histogram doesn't accurately represent that shape of the population

standard error formula

square root(p(1-p)/n)

the standard deviation of the sampling distribution of x bar is called

standard error

in a sample proportion, x is the number of

successes in n trials

for large data sets, a random variable x is approximately normally distributed provided the histogram is __ and __

symmetric and bell shaped

the margin of error measures

the accuracy y of the point estimate in estimating the parameter

the critical value is determined by

the desired confidence interval

normal score

the expected z-score of the data value, assuming that the distribution of the random variable is normal

the means of the samples tend to vary less than

the individual observations

central limit theorem

the larger the sample (30 or above), the the sampling distribution mean will be normal and close to the populate mean

For a (1 −a)100% confidence interval, the critical value corresponds to the number of standard errors within which __ of possible sample statistics are contained

the middle (1 −a)100%

confidence interval is an interval containing

the most plausible values for a parameter

critical value

the number of standard errors we add and subtract from the point estimate to achieve the desired confidence level (1.96 or Za/2)

for qualitative data the target parameter is

the population proportion of successes p

the sample proportion p^ is a statistic that estimates the

the population proportion p

target parameter

the unknown population parameter we want to estimate p

population parameter standard deviation symbol

theta

if you are asked to find the probability of an individual observation, divide __ in a z score formula

theta (x-mu/theta)

if a random sample if taken from a large population, the spread shows the formula

theta x = theta/ square root n or standard deviation of the population/square root sample size of population

if you are asked to find the probability of a sample mean, divide by __ in a z score formula

theta/square root n ((x bar - mu/theta)/square root n)

center is described by the word

unbiased

the larger the sample3r, the lower the __ of the sampling distribution

variability

statistical inferences allow us to make decisions and predictions about populations even if

we have data for few subjects

sample statistic mean symbol

x bar

IQ scores have a bell shaped distribution with a mean of 100 and a standard deviation of 15. what IQ score represents the 90th percentile

x= 100+ 1.28*15 = 119.2

conditions of a confidence interval

• The sample data come from a random sample or a randomized experiment. • The sample size is small relative to the population size (𝑛≤0.05𝑁). • The sample size n is large enough such that 𝑛𝑝̂(1 −𝑝̂) ≥10.

the mean of a probability distribution is denoted by __ and its standard deviation by__

𝜇 ; 𝜎

for a random variable of X, its mean is denoted by __ and its standard deviation by __

𝜇x ; 𝜎x

standard deviation of a discrete random variable formula

𝜎^2 x = sigma (x-𝜇x)^2*p(x) so the deviation is the square root of it

IQ scores have a bell shaped distribution with a mean of 100 and a standard deviation of 15. what proportion of the population has an IQ higher than 140

(z<=2.67) = 0.9962 1-0.9962= 0.0038 -> key word higher, to the right

any observation that lies __ may be considered unusual

+ or - 2 standard deviations away from the mean

find p(z<1.56)

0.9406 (table)

the standard normal distribution has a mean of __ and standard deviation of __

0;1

Find Z0.025

1- 0.025= 0.9750 find this value in the table tan which numbers it corresponds to 1.96

find the z scores that separate the middle 99% of the distribution from the areas in the tails

1-0.99= 0.01 0.01/2= 0.0050 0.0050 is the average of the 2 z scores its in between, so average to the 2 z scores -z.005= -2.575 z.005= 2.575

properties of probability distributions

1. areas under the curve correspond to probabilities 2. the probability for one particular outcome is always 0 (the probability that X=x is 0) 3. the interval containing all possible values has probability equal to 1, so the total area under the curve equals 1

ex: determine whether each is continuous or discrete and give possible values 1. the number of heads observed when flipping 3 coins 2. the height of a randomly selected college student 3. the amount of snowfall in clemson during the month of January 4. the number of football games clemson will win next season

1. discrete (x=0,1,2,3) 2. continuous (x>0) 3. continuous (x>=0) 4. discrete (x=0,1,2...15)

standard deviation of a discrete random variable x measures

the spread of a distribution and describes how far the random variable falls from the mean

formula converting z to x

x = mu + z*theta

find Z.7950

z= -0.82

random variable

a variable that assumes numerical values associated with the outcomes of a probability experiment, where only one numerical value is assigned to each outcome

continuous random variable X can take on

any value within a given interval

find p(IzI>1.67)

area to the left of -1.67 and the right of 1.67 are the same value p(z<-1.67) + p(z>1.67) = 0.0475x2= 0.0950

for a fixed p, as the number of trials n in a binomial experiment increases, the probability distribution of a random variable becomes __

bell shaped

find the z score such that the area to its left is 0.15

between 2 values so pick the one its closer to -1.04

a continuous random variable __ take on every possible value in an interval of numbers

can

a discrete random variable __ take on every possible value in an interval of numbers

cannot

mean and standard deviation are most common to describe __ and __

center and spread

if you measure to get the value of a random variable its __

continuous

expected value of x can be denoted as

e(x)

the mean of a discrete value of x is also called the

expected value of x

a continuous random variable

has an infinite number of possible values that are not countable

a discrete random variable

has either a finite number of values or at least a countable number of values

a continuous random variable is approximately normally distributed if

its relative frequency histogram has the shape of a normal curve

table v in appendix a gives the area to the __ of a specific z score

left

expected values are what we expect in the __ of observations

long run

Example: Find the mean 𝜇𝑋 of the random variable X = number of cars a randomly selected American household owns. X 0 1 2 3 4 5 P(X) 0.09 0.36 0.35 0.13 0.06 0.01

mean = 0(.09)+1(.36)+2(.35)+4(.06)+5(.01)= 1.74

to describe characteristics of a discrete probability distribution, we can use

mean and standard deviation

nCx=

n!/x!(n-x)!

is the expected value the most likely value

no

if ___>=10 then the binomial distribution is bell shaped

np(1-p)

binomial probabilities formula

p(x)=p(X-x)= nCx p^x (1-p)^n-x

summaries of populations are called

parameters

binomial distribution

specific type of discrete probability distribution that can be presented using a formula

a probability distribution can be in the form of

table, graph, or mathematical formula

the notation Za is the z score that

the area under the standard normal curve to the right is a

inclusive means

the first and last values are included

ex: consider the sample of flipping 3 coins S: {HHH,HHT,HTH,THH,THT,TTH,TTT} let x= the number of heads observed when flipping 3 coins. then x is a random variable. what are the possible values of x and what is the probability of each value?

S:{HHH,HHT,HTH,THH,THT,TTH,TTT} X: 3 2 2 1 2 1 1 0 x: 0. 1. 2. 3 p(x):1/8 3/8 3/8 1/8 ->this table represents the probability of the distribution of x

Example: Choose an American household at random and let the random variable X be the number of cars they own. Below is the probability distribution of X in table form. Note that while it is possible for a household to own more than five cars, this probability is so small that it is negligible. X 0 1 2 3 4 5 P(X) 0.09 0.36 0.35 0.13 0.06 0.01 (a) Verify that this is a legitimate discrete probability distribution. (b) What is the probability that a randomly selected American household owns 2 cars? (c) What is the probability that a randomly selected American household owns at least 1 car? (d) Your company builds houses with two-car garages. What percent of households have more cars than the garage can hold?

a) 0<=p(x)<=1 and sigma p(x) =1 b) p(X=2) = .35 c) p(x>=1)= 1-p(x=0)= 1-0.09= .91 d) p(X>2)= p(x>=3)= .13+.06+.01= 0.20

ex: drive through times have mean of 138.5 and a standard deviation of 29. a) what time would you recommend Wendy's advertise the maximum wait time before a free meal is awarded to 1% of patrons? b) the fastest 5% of cars to make it through the drive through will spend what amount of time waiting?

a) 1- 0.0100= 0.9900 -> z score is 2.33 x = 138.5 + (2.33)(29) = 206.07 b) x = 138.5 + (-1.645)(29) = 90.8

steps for finding a probability corresponding to a normal random variable

1) draw a normal curve and shade the desired area 2) convert the values of x to a z score using the z score formula 3)use the standard normal table to find the area to the left of the z score

ex: mean driver through time is 138.5 and the standard deviation is 29 seconds a) what is the probability that a randomly selected customer will get through the drive through in less than 100 seconds b) what is the proportion of cars that spend between 120 seconds and 180 seconds in the drive through

a) let x = the amount of time spent in the drive through z= (x-mu)/theta = 100-138.5/29= -1.33 -> table probability of -1.33 is 0.0918 b) find the z score for both values z= 180-138.5/29= 1.43 -> 0.2611 z= 120-138.5/29 = -0.64 ->0.9236 0.9236-0.2611= 0.6625

Example: You operate a restaurant. You read that a sample survey by the National Restaurant Association shows that 40% of adults are committed to eating healthy food when dining out. To help plan your menu, you decide to conduct a sample survey in your own area. You will use random digit dialing to contact a random sample of 20 households by telephone. If the national result holds in your area... (a) What is the expected number of respondents who seek healthy food when dining out? (b) What is the probability at most 2 respondents seek healthy food when dining out? (c) What is the probability at least 3 respondents seek healthy food when dining out?

a) let x= #wanting healthy food when dining out then x is binomial with n=20 p=0.4 b) p(x<=2)= p(x=0)+p(x=1)+p(x=2)= 20C0 (.4)^0 (0.6)^20+ 20C1(.4)^1(0.6)^19+20C2(0.4)^2(0.6)^18 = 0.0036 c) p(x>=3)= 1-p(x<=2)= 1-0.0036= 0.9964

__ can be negative but __ cannot be negative

values of x; probabilities

steps for finding the value of a normal random variable corresponding to a give probability

1) draw the normal curve and shade the given area 2)use the standard normal table to find the z score that corresponds to the shaded area 3)substitute the values of z, mu, and theta into the z score formula solved

find p(z>0.78)

1-p(z<0.78)= 1-0.7823= 0.2177

let p(x) denote the probability that a discrete random variable X equals x, or p(X=x). then:

1. 0<=p(x)<=1 2. sigma p(x)=1

characteristics of a binomial distribution

1. there are only 2 possible outcomes in each trial, we denote one by S(success) and the other by F(failure) 2. the experiment consists of a fixed number (n) of identical trials 3. the n trials are independent 4. the probability of S remains the same from trial to trial. this is denoted by p, and the probability of F is 1-p

IQ scores have a bell shaped distribution with a mean of 100 and a standard deviation of 15. consider a person with an IQ of 140, find the z score and interpret the value

2.67

how to do distributions on calculator

2nd, vars, binomial pdf or cdf

the area under the normal curve for any interval of values of the random variable represents __ or __

the proportion of the population with the characteristic; the probability that a randomly selected individual from the population will have the characteristic

normal tables give probabilities for

the standard normal distribution

in the graph of a discrete probability distribution, the horizontal axis corresponds to __ and the vertical axis represents __

the values of the random variable; the probability of each value

binomial standard deviaton formula

theta= square root of np(1-p)

in the graph of a discrete probability distribution, vertical lines are drawn to

to each values height that is the probability of that value (you can describe the shape of it)

Example: Determine which of the following probability experiments qualify as a binomial experiment. For those that are binomial experiments, define the binomial random variable X, identify the number of trials (n), identify the probability of success (p), and list the possible values of X. (a) You roll a 6-sided die 50 times. The number of ones observed is recorded. (b) You continue rolling a 6-sided die until you get a one. The total number of rolls is recorded. (c) You want to know what percent of married people believe that mothers of young children should not be employed outside the home. You plan to interview 50 people, and for the sake of convenience you decide to interview both the husband and the wife in 25 married couples. You record the number among the 50 persons interviewed that think mothers should not be employed. (d) You buy a ticket in your state's "Pick 3" lottery game every week and record the number of times in a year that you won a prize.

a) n=50 p=1/6 b) not binomial, the number of trials isn't fixed c) not binomial, the trials aren't independent bc the opinions of the husbands and wives are likely related d) n=52 p=0.01

consider a standardized test where the multiple choice questions have 5 answer choices, and 1 point is awarded for a correct answer but 1/4 of a point is deducted for an incorrect answer. a) give the probability distribution. b) what is the expected number of points when a student guesses the answer to a multiple choice question on a test

a) x: 1. -1/4 p(x): 1/5. 4/5 b) e(x)= 1(1/5)-1/4(4/5)=0

__ are terms that describe continuous variables

amount, much, and less

for greater than or equal to binomial outcomes, use __ on calculator

binomial cdf (at least __)

for individual binonmial outcomes, use __ on calculator

binomial pdf

is the distance a baseball travels in the air after being hit a discrete or a continuous random variable

continuous

to apply the standard normal table to a random variable x, we must first

convert the value of x to a z score

probability distribution of a discrete random variable x lists

the possible values of the radon variable and their corresponding probabilities

IQ scores have a bell shaped distribution with a mean of 100 and a standard deviation of 15. let x=IQ score. which of the following expressions is equivalent to the probability that a randomly selected person has an IQ score between 75 and 90 a) p(75<x<90) b) p(-1.67<x<-0.67) c) p(-1.67<=x<=-0.67) d) all of the above

d) all of the above the equal signs don't change anything

if you count to get the value of a random variable its __

discrete

is the number of runs scored in a baseball game discrete or continuous

discrete

to find p(a<=x<=b) ___

find the area under the curve between a and b

Example: According to a study done by Nick Wilson of Otago University Wellington, the probability a randomly selected individual will cover his or her mouth when sneezing is 0.73. Suppose you sit on a bench in a mall and observe people's habits as they sneeze. What is the probability that among 5 randomly observed individuals exactly 3 of them cover their mouth when sneezing?

let x= # of people covering their mouth then x is binomial with n=5 and p=0.73 p(x=3)= 5C3(.73)^3(.27)^2 = .28

the mean of the discrete random variable can be thought as the

mean outcome of the probability experiment if we repeated it multiple times

Example: Find the standard deviation of X = ideal number of children. X 0 1 2 3 4 P(X) 0.05 0.13 0.60 0.15 0.07

mean= 2.06 =(0-2.06)^2(.05)+(1-2.06)^2(.13)+(2-2.06)^2(.6)+(3-2.06)^2(.15)+(4-2.06)^2(0.7)= 0.75460 square root of 0.75460= 0.87

binomial mean formula

mu = np

a __ is symmetric, bell shaped, and characterized by mean and standard deviation

normal curve

__ are terms that describe discrete variables

number, fewer, and many

the cumulative probabilities on jmp shows

p(x<=_)

the p(x) column on jmp shows

p(x=_)

ex: when using jmp to dine the probability of 10 or more the formula is

p(x>=10)= 1-p(x<=9)

the formula for probability distribution curve is called a

probability density function

histogram curves portray __ of continuous random variables

probability distributions

the normal distribution is an important type of continuous distribution bc it provides a

reasonable approximation to the distribution of many variables

as the width of the intervals decreases, the histogram gradually approaches a

smooth curve

__ if you need the area to the right

subtract from 1

find p(-1.20<=z<=0.58)

subtract the area to the left of the upper value- the area to the left of lower value 0.7190-0.1151= 0.6039

__ if you need the area between two x values

subtract the smaller area from the larger area

binomial random variable is the number of

successes in n trials, denoted by X

Example: In the "Pick 3" lottery, a player chooses one of the 1000 three-digit numbers between 000 and 999. The lottery chooses a three-digit winning number at random. With a single bet of $1, the player wins $500 if their number is selected. Otherwise, the lottery takes the player's $1. (a) Construct the probability distribution for X = profit from one ticket. (b) What is the expected profit for a single ticket? (c) If you played the game 10 times, how much would you expect to lose?

win. lose a) x $499 -1 p(x) 1/1000. 999/10000 B) e(x)= 499*(1/1000) - 1(999/1000)= -$0.50 c) -$0.50(10) = -$5

According to a study done by Nick Wilson of Otago University Wellington, the probability a randomly selected individual will cover his or her mouth when sneezing is 0.73. Suppose you sit on a bench in a mall and observe people's habits as they sneeze. What is the mean and standard deviation that among 5 randomly observed individuals exactly 3 of them cover their mouth when sneezing?

x is binomial with n=5 and p=0.73 mu=np= 5(.73)= 3.65 theta= square root 5(.73)(.27)= .9927

Example: The possible values of a random variable X are 0, 1, 2, or 3. The probability distribution of X is defined by P(x) = 0.1x + 0.1 for x = 0, 1, 2, 3. Find the probability of each possible value of X, and verify that it is a legitimate discrete probability distribution.

x. 0. 1. 2. 3 p(x)= .1x + 1. .1 .2. .3. .4 ( .1(0) + .1 =.1) 0<=p(x)<=1 sigma p(x) = .1+.2+.3+.4 = 1

a normal random variable can be converted into a standard normal random variable by taking its

z score

formula for converting x to z

z= (x-mu)/theta

mean of a discrete random value formula

𝜇x= sigma (x * p(x)) ->x is the random variable and p(x) is the probability of observing the value x


Kaugnay na mga set ng pag-aaral

RealEstateU Course 2 Final Study

View Set

Biology 111 - Problem Set 9 - Genes, Proteins, and Gene Expression Part 1 (Exam 3)

View Set

Physiology Unit 1 - Practice Quiz

View Set

Conduction, Convection and Thermal Radiation: Weather

View Set