Statistics Final

Ace your homework & exams now with Quizwiz!

Population

Entire group of interest

As the population standard deviation increases, the margin of error:

Increases

"Test Statistic"

Z

Suppose 1063 people out of a random sample of 2054 adults report that they play video games. What is the standard error of pˆ?

.0110

The Z distribution is a special normal distribution whose mean is ______ and standard deviation is ________.

0,1

About 68% of observations fall within _____ of the mean.

1 standard deviation

Criteria that need to be met in order for Binomial Distribution

1) Defined n 2) Same chance of getting the outcome each time 3) 2 possible outcomes 4) Set time 5) Trials independent of each other

Steps for solving for normal distribution

1)Draw a picture of the X distribution. Mark off the mean, and the 3 standard deviations on each side of the mean. 2)Write down the probability you are trying to find, using probability notation. Shade it in on the picture. 3)Change your probability for X to a probability for Z using the Z formula 4)Use the Z table to find the corresponding probability for Z 5)Use this to find the probability you want for X. (Your drawing will help!)

About 95% of observations fall within _______ of the mean.

2 standard deviations

Suppose you want to estimate the percentage of all OSU students who will take classes this summer. You take a random sample of 200 OSU students and find that 50 of them will take classes this summer. What is the statistic in this problem?

25%

About 99.7% of observations fall within ______ of the mean.

3 standard deviations

Suppose the time to serve a customer (X) has a normal distribution with mean 5 minutes and standard deviation 2 minutes. You want to investigate the 5% of customer service times that were the longest. What cutoff point are you looking for?

95th percentile for X

What is a sampling distribution?

A sampling distribution is the set of all values of a statistic for all possible samples of size n from the population.

Parameter

A single number summarizing the population

Parameter

According to the population

Statistic

According to the sample

The population mean:

Can be smaller than, or larger than, or equal to the sample mean.

"Estimate this parameter"

Confidence interval

The normal distribution is:

Continuous

f you take a random sample of 50 M&M's and record the number of M&Ms of each color, you have a binomial distribution. T/F

False

Let's say we have a random sample of n=61 and are testing a two sided hypothesis. The calculated z value is .04, is this sufficient evidence to reject the null hypothesis? Why or why not?

False (Z is the test-statistic, not the p-value. If Z is small, your data is close to the claim in Ho, so not much evidence against it.

If you are not able to prove the alternative hypothesis, this means the null hypothesis is correct. T/F

False (just means you didn't have enough evidence against it)29. The smaller α

If someone claims the population mean is 5 and you believe it's greater than 5, you write the following hypotheses:

False (x-bar is about data; need μ's)

If you want to estimate the population mean, which technique do you use?

Find a confidence interval for μ

If X is ABOVE the mean, the Z score will be...

Positive

Scatterplots examine relationships between what type(s) of variables?

Quantitative

P-value less than Ho

Reject the null, our Ha was correct

Statistic

Single number summarizing a sample

Standard Deviation=

Square root of the variance

Square root of np(1-p)

Standard deviation, Sigma

A z-value tells you how many ________ the x-value is above or below the _______.

Standard deviations, Mean

Correlations words

Strong, moderate or weak Positive or negative Linear

Sample

Subset of the population

The shape of the normal distribution is:

Symmetric

A p-value can change if you take a new sample.

TRUE (each sample gives a different test statistic and hence a different p-value)

To get the standard deviation,

Take the square root of the variance

If X has a normal distribution, when does X-bar also have a normal distribution?

The Central Limit Theorem is not needed; n can be any size.

Let X be the number of professional golfers that said YES to your survey question. What is the name of the distribution for X?

The binomial distribution

The Central Limit Theorem is important in statistics because it says:

The distribution of X-bar is approximately normal, no matter what the distribution of X is, as long as n is large enough.

Suppose you take a random sample of size n and look at the average X-bar. As n increases, which of the following statements is true?

The mean of X-bar stays the same.

As N goes to infinity...

The more likely it is to narrow to the actual population mean

pˆ=

The proportion of yeses in one sample

Which of the following is not a characteristic of a binomial distribution?

There is a set of n trials b. Each trial results in more than one possible outcome. The trials are independent of each other. Probability of success p is the same from one trial to another.

In lecture we learned that the mean of all the sample means equals the mean of the population. Explain WHY this equality makes sense.

This equality makes sense because our population is all possible values of X and our sampling distribution is the set of all possible samples which includes all possible values of X, n times.

A 95% confidence interval is wider than a 90% confidence interval if all else remains the same.

True

A large p-value means you have little evidence against Ho. T/F

True

P(Z < -2.74) = P(Z > 2.74)

True

There is a 100% chance that your sample mean lies within your 95% confidence interval T/F

True

We use statistics to estimate population parameters, not the other way around.

True

The smaller α (alpha) gets, the harder it is to reject Ho. T/F

True (alpha is your cutoff value to reject Ho)

We may get different conclusions for the same problem if hypotheses are different. For example: conclusion for H0: μ=8 and HA: μ<8 might be different with conclusion for H0: μ=8 and HA: μ≠8.

True (not-equal-to Ha has a doubled p-value)

Why do we have to check BOTH conditions rather than just one, to be able to use the normal approximation? (Including an example with your explanation is OK.)

We need to check both conditions because if our probability is large, np may be greater than 10 whereas n(1-p) may not be greater than 10. However, if our probability is small, n(1-p) may be greater than 10 whereas np may not be greater than 10.

If Z = 0 then where is X compared to its mean?

When Z = 0, then X = the mean.

X-bar is normally distributed if....

X is normally distributed OR X is not normally distributed but n>30 because CLT

If you roll a die 100 times and find the average, and I roll a die 200 times and find the average, who is more likely to have an average that is greater than 5?

YOU, Because you roll fewer times, your results have a higher chance to be further away from the mean (3.5). The more times you roll the closer to 3.5 your results are likely to be.

Which of the following is a test statistic?

Z=-2.67 (correct answer)

z score formula

Z=X‒μ ----- Where X=value you are trying to find σ out about, μ=the mean, and σ=standard deviation

What two ways can we do statistical inference?

a. Estimate a population parameter using information from a sample b. Test a population parameter using information from a sample

How many we expect in Binomial Distribution=

np

The mean of a binomial probability distribution with n trials and probability p of success is:

np

You can use the normal distribution to approximate a binomial distribution when the following conditions are met:

np ≥ 10 and n(1-p) ≥ 10.

There are two conditions that you have to check before using the normal approximation when X is binomial.

np≥10 and n(1‒p)≥10

If you get a new sample, which of the following elements of a hypothesis test will change?

p-value

Correlation of a sample (r) will always be a number between

-1 and 1

Suppose your confidence interval for the percentage of all American families planning a vacation for the summer is 30% to 40%. Now suppose the media reported that 50% of American families go on vacation during the summer, would you agree or disagree with them, based on your data?

Disagree

Goal of statistical inference

Estimate a test or population parameter using info from a sample

A p-value in hypothesis testing means the same thing as the sample proportion.

FALSE

Suppose your p-value in a hypothesis test is .055. Using the standards from this class what do you conclude?

Fail to reject Ho (for us, we use .05 as our cutoff value to reject Ho. This doesn't quite make it.)

Suppose your p-value in a hypothesis test is .055. Using the standards from this class what do you conclude?

Fail to reject Ho (since the p-value .055 > significance level of .05.)

How does the sample size effect the standard error?

Inversely, sample size up means standard error down

If X has some unknown distribution. What do we know about the distribution of X-bar?

It has an approximate normal distribution if the sample size n is large enough.

The Central Limit Theorem is important in statistics because:

It says for n ≥ 30, and any distribution that's not normal, the sampling distribution of Xis approximately normal.

As n increases, what happens to μx?

It stays the same

We collected some data and wanted to know if the data came from a normal distribution. We made a normal probability plot and it showed a straight line. What does this tell us?

It tells our data comes from a normal distribution.

What does the CLT apply to?

Just shape

P-value more than Ho

Keep the null, we were wrong

The Z distribution has a table that displays what type of probabilities:

Less than

Let X̄ be the sample mean from X's distribution. The standard error of X̄ is:

Less than the standard deviation of X

Correlation is a measure of the strength and direction of what type of relationship between two quantitative variables? _

Linear

Correlation is affected by outliers. Explain why, briefly.

Looking at the formula for r, correlation is based on the mean of X, the mean of Y, the SD of X, and the SD of Y. All four of these items are affected by outliers, as we learned in Chapter 1.

If X is below the mean, the Z score will be...

Negative

Correlation has ____________ units.

No

Does x̄ =X̄

No

Does x-bar=mu sub x?

No, mu sub x=mu sub x-bar

Suppose you want to estimate the percentage of all American families planning a vacation for the summer. Your confidence interval is 30% to 40%. What was your value of p-hat ?

Not enough info


Related study sets

Environmental Science (BIO107)- Exam 2

View Set

CHAPTER 3 AP HUG NO STRESS FINAL

View Set

Extensors and Flexors of the Wrist and Hand

View Set

PrepU Chapter 9: Teaching and Counseling

View Set

Ch 9. Forming and Maintaining Personal Relationships.

View Set