"Must Pass" Quiz- Stat

¡Supera tus tareas y exámenes ahora con Quizwiz!

65. Describe your thought process when deciding upon the type of statistical test (or interval) to use in various problems: 1-sample t, 2-prop. z, chi-squared, g.o.f., etc.

1 sample t: do you know sigma? is there one sample? 2 sample t: do you know sigma? is there two samples? 1 prop z: is it a proportion? is there only one set of data? you must know sigma 2 prop z: is it a proportion? are there two sets of data? you must know sigma chi squared gof: is it one sample? are you looking for the expected value of said sample chi squared for independence: are you testing a relationship? do you have two categories? do you have observed values? chi squared for homogenity: are you testing likeness? do you have two different sets of things? do you have observed values?

88. There is a popular saying involving correlation (more generally, association) and causation. What is the saying, and what does it mean?

Correlation does not imply causation

14. How does one recognize lack of normality?

Easiest way is look for a pattern that is not straight in NQP. If you are a glutton for punishment (as on #4 from the Chap. 13-14 free response), you can use chi-squared g.o.f. to test for departures from expected bin counts. There are also several standard "canned" tests that are beyond the scope of AP Statistics.

9. In regression, what names are given to the x and y variables?

Explanatory, response.

54. Why is it usually a very bad idea to use the word probability in any sentence involving confidence intervals? Is it possible to make a true statement that combines these terms?

It is possible to write a true sentence using the words probability and confidence interval. However, it is also very easy to make an error along the way. That is why it is much better to say, "We are 95% confident that the true proportion of voters favoring Smedley is between 48% and 54%," not anything involving probability. Probability is a technical term meaning long-run relative frequency, and it cannot be haphazardly misused in the way laypeople misuse it. It would be correct to say, "If we repeatedly generated confidence intervals with samples of this size and with m.o.e. of 3%, then the probability that a future confidence interval will bracket the true proportion of voters favoring candidate Smedley is 95%; that is, 95% of the confidence intervals generated by this process will bracket the true value." However, you cannot make a probability statement about a confidence interval once it has been generated, because then you are not making a statement about the process (which is legitimate), but rather about this one-shot confidence interval. There is no "long run" in a one-shot confidence interval!

87. What is meant by the saying, "Statistical significance is not the same as practical significance"?

Just because an effect is not plausibly caused by chance alone does not mean that it is large enough to be of any real-world significance.

84. What do the letters SRS stand for, and what is an SRS?

Simple random sample; a sample in which every possible subset is equally likely to be selected.

4. What are the parameters of a uniform distribution? A normal distribution? A binomial distribution? A geometric distribution? A t distribution? A chi-squared distribution?

Uniform: min and max (also need to know whether distrib. is discrete or continuous) Normal: not specific, they are estimated parameters Binomial: n and p Geometric: p t: degrees of freedom chi-squared: degrees of freedom

50. critical value Define and describe how it is determined or computed.

the value corresponding to a given significance level. This cutoff value determines the boundary between those samples resulting in a test statistic that leads to rejecting the null hypothesis and those that lead to a decision not to reject the null hypothesis. If the calculated value from the statistical test is less than the critical value, then you fail to reject the null hypothesis. If the calculated statistic is outside of the critical value, then you reject the null hypothesis and are forced to accept the alternate hypothesis.

76. Give the "approved wording" for a conclusion to a statistical test that does not show significance.

"There is insufficient evidence that ..." It is a good idea to list the test statistic, n or df, and the P-value in parentheses. Be sure to phrase the conclusion in the context of the problem.

75. Give the "approved wording" for a conclusion to a statistical test that shows significance.

"There is strong evidence that ..." It is a good idea to list the test statistic, n or df, and the P-value in parentheses. Be sure to phrase the conclusion in the context of the problem.

77. Give the "approved wording" for a conclusion to a confidence interval problem.

"We are XX% confident that the true ... is between YY and ZZ." Be sure to phrase the "..." in the context of the problem, e.g., "true mean boiling point," "true difference in voter preference proportions," "true mean improvement in test scores," etc.

8. Describe how to find outliers (a) in a column of data; (b) in a regression setting.

(a) Easiest way is to make modified boxplot, then TRACE to see the points (use arrow keys). Outliers are more than 1.5IQR below Q1 or more than 1.5IQR above Q3. (b) No rule of thumb—just judge visually. Outliers have "large" residuals.

72. chi-squared g.o.f. (CSDELUXE) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - counts: frequency greater than 5 and must be units of counts

73. chi-squared 2-way (CSDELUXE or STAT TESTS C) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - counts: frequency greater than 5 and must be units of counts

68. 2-sample z (STAT TESTS 3, 9) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - large enough population: all p(n) greater than 10 and q(n) greater than 10

66. 1-sample z (STAT TESTS 1, 7) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - large enough population: p(n) greater than 10 q(n) greater than 10

70. 1-prop. z (STAT TESTS 5, A) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - large enough: p(n) greater than 10 q(n) greater than 10

67. 1-sample t (STAT TESTS 2, 8) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say - matched: or only one sample -differences are normal

71. 2-prop. z (STAT TESTS 6, B) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say -large enough population: p(n) greater than 10 q(n) greater than 10

69. 2-sample t (STAT TESTS 4, 0) Describe how you would check assumptions

- independence: population greater than 10% of sample - Random: should say -normal distributed: symmetric and unimodal

74. LSRL t-test (STAT TESTS E) Describe how you would check assumptions

- linear: points in regression should be spread out - independent: no points following something - constant variably: standard deviations should be similar - normal distribution: symmetric and unimodal

22. Describe a few interesting properties of the LSRL (least squares regression line).

1). The line minimizes the sum of squared differences between observed values (the y values) and predicted values (the ŷ values computed from the regression equation). 2). The regression line passes through the mean of the X values (x) and through the mean of the Y values (y). 3). The regression constant (b0) is equal to the y intercept of the regression line. 4). The regression coefficient (b1) is the average change in the dependent variable (Y) for a 1-unit change in the independent variable (X). It is the slope of the regression line.

90. Explain what is meant by double blinding, and why it is so important in clinical trials.

A double blind test is a scientific test in which neither test subjects nor administrators know who is in the control group and who is in the experimental group. The intent is to create an unbiased test environment, ensuring that the results of the testing are accurate and will stand up to analysis by other members of the scientific community.

1. What is a statistic? Give several examples.

A number computed from data. Examples: Sample mean, sample median, sample variance, sample standard deviation, quartiles, percentiles, t statistics, chi-squared statistics, f statistics, and skewness

2. What is a parameter? Give several examples.

A number that describes a population. Examples: Function parameters, value parameters, reference parameters, default parameters, variable parameters, and missing parameters

60. The AP formula sheet gives two versions of the s.e. for a 2-sample t situation (difference of ____________). Explain how to tell which one to use.

Always use the first one, never the second.

3. What alternate meaning does the word parameter have in other mathematical disciplines?

An "adjustable constant" that defines the nature of a mathematical model, much as a tuning knob or volume slider adjusts the output of a television or radio.

63. Define the term bias and give several examples of types of bias.

Bias = any situation in which the expected value of a statistic does not equal the parameter being estimated. Selection bias refers to a methodology that produces samples that are systematically different from the population in a way that causes a parameter to be systematically underestimated or overestimated. An SRS is not biased; although an SRS often fails to match the population, the differences are random differences, not systematic differences. "Systematic" means that there are methodological flaws that may become evident over a period of time, because the flaws are built into the design of the process. For example, if we try to poll the STA parent body on the question, "How many days per year does your son spend traveling?" we will get a statistic that is biased on the high side if we use an SRS of all parents. (That is because students with stepparents, who may well travel more than the average, will be more likely to have a parent chosen as part of the SRS.) If the SRS were based on students instead of parents, the question should be able to avoid selection bias. Common types of bias include selection bias (undercoverage or overcoverage), response bias (a.k.a. lying), nonresponse bias, voluntary response bias, hidden bias, experimenter bias, and wording of the question.

40. What does CLT stand for? State it correctly and in one of the many ways in which people misconstrue it.

Central limit theorem. CORRECT: Consider any population, not necessarily normal, having finite sigma. As n approaches infinity, the sampling distribution of x bar approaches N(u(sigma/square root of n)). WRONG: "Everything is normal." (Not true: Sampling distributions of s are certainly not normal. Geometric and x squared distributions are certainly not normal.) "Any sampling distribution of x bar is normal." (Not true: Sampling distributions of x bar approximately follow a t distribution if sigma is unknown.) "Sampling distribution of x bar is not normal unless n is large." (False: Sampling distribution of sample mean is normal, regardless of sample size, provided pop. is normal with known sigma.)

18. What name do we give to r2? What does r2 mean?

Coefficient of determination. Tells what portion of the variation in one variable can be explained by variation in the other. If r = .8, then 64% of the variation in y (or x) can be explained by variation in x (or y).

78. Describe how to transform an "interval format" C.I. into an "estimate t m.o.e." format.

Compute C.I. using TI-83. Then punch upper-lower, i.e., VARS 5 TEST I - VARS 5 TEST H, divide result by 2 and STO into M (for m.o.e.). Your can then write your C.I. as est. t M. Depending on the problem, "est." will be x bar, p hat, x bar1-x bar2, or p hat1-p hat2.

52. Explain the difference between confidence level and confidence interval.

Confidence level is a percentage, typically 95% or 99%. It is how confident you are that the true mean, proportion, or other statistic for which you are trying measure is between the two endpoints in the confidence interval. The numbers in the confidence interval are of the same units as the statistic you are trying to locate. For example, let's say you are trying to determine the mean age of Camaro drivers. Then you take a random sample, calculate the sample mean, then use that as the point estimate for the true population mean (which is unknown). You can construct many different confidence intervals from your gathered information. Let's say that you ended up sampling 50 drivers, and their mean age was 26. Then you decided to calculate some confidence intervals. The 95% confidence interval might be something like [24, 28]. In other words, you are 95% confident that the true mean age of Camaro drivers is between 24 and 28. The 99% confidence interval might be something like [22, 30]. In other words, you are 99% confident that the true mean age of Camaro drivers is between 22 and 30. The higher the level of confidence, the wider the interval. The lower the level of confidence, the shorter the interval. It's a trade off. Ideally, you would like to pinpoint the true value, that is, you would like to have a very short confidence interval, but you would also like to be very confident. This can be a difficult task. This is why the size of the sample becomes important. The larger the sample size, the smaller the standard error and the smaller the interval.

62. True or false: If there are two columns of data in an experiment, then the situation calls for use of 2-sample procedures. Explain your answer.

False: If there are matched pairs, you really have only one sample (namely, a column of differences).

98. Explain the following paradox: For a gambler to return from a casino as a winner is not rare, yet casinos are reliably profitable.

For the individual player who plays a few dozen hands of blackjack or pulls the arm on a slot machine a few dozen times, the sampling distribution of net outcomes is relatively short and wide, meaning that a good portion of the sampling distribution can spill into positive territory even though the mean is negative. This is why it is not rare for people to return home from Las Vegas as winners. (Fewer than half are this lucky, but since the lucky ones are usually the only ones who say anything, it is easy to get a false impression that winning is common.) From the casino's point of view, however, the sampling distribution for the vast numbers of dollars that are wagered in a day is completely different. The casino's sampling distribution is extremely tall and pointed, with a mean that virtually guarantees a profit as a fixed percentage of the amount wagered. (This percentage varies depending on whether the game is roulette, or blackjack, or slot machines, or whatever.)

24. Give several examples of "good" and "bad" residual plots and what they should be telling us.

Good: show you randomly dispersed difference between y and y (hat). Bad: show you a pattern within the graph of the residuals, indicating that the line of fit is not accurate and the data are most likely non linear.

7. Define IQR and describe how to find it.

IQR (interquartile range) = Q3 - Q1. Use STAT CALC 1 to get 5-number summary, then VARS 5 PTS 9 - VARS 5 PTS 7.

12. What special geometric meaning does standard deviation have in a normal distribution?

In a normal distribution, the distribution curve is bell-shaped, satisfies the 68-95-99.7 rule, and has inflection points.

99. Is gambling rational?

In general, no. On rare occasions, a game may have a positive expected value for the player, but nobody should ever wager more than he can afford. For example, if the lotto jackpot is large enough, spending a few dollars on tickets may be mathematically rational, but spending hundreds of dollars is not. The vast majority of games of chance are a waste of time and money.

79. Describe, in general terms, how the t statistic is calculated.

In testing the null hypothesis that the population mean is equal to a specified value μ0, one uses the statistic t= (x bar-mu)/(s/square root of n) where is the sample mean, s is the sample standard deviation of the sample and n is the sample size. The degrees of freedom used in this test is n − 1.

13. What is skewness? Give two examples of different ways to detect skewness.

Lack of symmetry. Right skewness means the central hump dribbles out to the right, forcing mean > median, since mean is less resistant to extreme values. Left skewness is the opposite, forcing mean < median. Easy ways to detect skewness involve looking at histogram, boxplot, or stemplot to see where the tail is longer. If you use NQP, trace dots from left to right; if they bend to left, plot shows left skewness, but if they bend to right, plot shows right skewness.

39. What does LOLN stand for? State it correctly and in one of the many ways in which people misconstrue it.

Law of large numbers. CORRECT: As n approaches infinity, p hat approaches p. (Sometimes stated as "x bar approaches mu as n approaches infinity.") WRONG: If p hat is less than p, then the proportion of successes will start to increase until we "catch up." (Or, if p hat is greater than p, the proportion of successes will start to decrease until we are "back down to the correct value.") These are both wrong, because what really happens is that the effect of any finite collection of observations becomes diluted as n approaches infinity. A coin has no memory, no desire to set things right, and no ability to iron out past discrepancies. Nevertheless, the proportion of heads—even if the coin is biased—will, over time, approach whatever the true probability is.

17. What name do we give to r? What does r mean? How do we compute r?

Linear correlation coefficient. Signed strength of linear pattern (-1 = pure negative linear association, 0 = no linear association, +1 = pure positive linear association.) Use STAT CALC 8 and make sure your Diagnostics are on (2nd CATALOG DiagnosticOn).

15. What is the most common type of regression?

Linear least-squares. (It is not sufficient to say linear, because the LSRL is not the only type of linear regression. For example, there is the median-median line, which is useful in some situations and which is more resistant than the LSRL.)

86. Explain marginal and conditional probabilities. With what data (quantitative or categorical) are marginal and conditional probabilities usually computed?

Marginal probabilities = fractions involving row or column totals divided by grand total. Conditional probabilities = fractions involving individual cells divided by a row or column total. Both are usually concerned with categorical data in 2-way tables.

10. What does MSE mean? Is it a synonym for variance?

Mean squared error = pop. variance (mean squared deviation from the mean). Sample variance is different, since denom. is n - 1 instead of n.

83. Is the binomial parameter p the same as the P-value of a test? What symbol is commonly used as an equivalent for 1 - p? Would the AP graders understand this without further explanation?

No; q; yes.

19. Is r affected by choice of units (e.g., mm, cm, inches, feet, light-years)? How about b0 and b1?

No; yes.

20. Is r affected by choice of which variable is x and which is y? How about b0 and b1?

No; yes.

96. Who coined the saying, "There are three kinds of lies: lies, d_____d lies, and statistics"?

Nobody knows. The statement is usually attributed to Mark Twain, although he himself credited it to Benjamin Disraeli.

82. Does the m.o.e. of a statistic depend on the size of the population? Explain briefly, giving an example if possible.

Not really. For example, the m.o.e. (at a 95% confidence level) of a 1300-person poll will be about 3 percentage points, regardless of whether the poll is taken in California or in Wyoming. You do not need a larger sample to get the same accuracy in California, even though the population of California is about 34 million, more than 60 times larger than that of Wyoming. Think of visiting the plant where M&M's are made. Imagine taking a scoop of M&M's out of a huge tub of M&M's. Your goal, let's say, is to estimate the proportion of blue M&M's that are in the tub. What affects the accuracy of your estimate? Clearly, your m.o.e. will be large if you take a small scoop, and your m.o.e. will be smaller if you take a really big scoop. However (and this where many people have trouble), the m.o.e. does not depend on the size of the tub. The m.o.e. depends only on how large a scoop you take (i.e., your sample size). The reason that m.o.e. does not depend on population size is that m.o.e. is always equal to s.e. multiplied by a critical value. The s.e. (and, for t distributions, the critical value as well) will depend on n (sample size). However, s.e. and critical value do not depend on N (population size), as long as the population is "large" relative to the sample. For small populations (e.g., the population of the Upper School), it is technically true that m.o.e. depends on population size. There is a formula known as the "finite population correction" that makes this relationship clear. However, this is not on the AP curriculum. For the common situation where someone is trying to estimate a parameter from a large population (e.g., the proportion of voters in a state who support Smedley), the size of the population simply does not matter. The only thing that matters in that case is the size of the SRS.

97. Explain how odds work. In particular, given a probability P(A) expressed as a fraction, explain how to compute the odds in favor of the event as well as the odds against the event. Explain why "casino odds" never equal the mathematical odds.

Odds in favor = ratio of favorable to unfavorable outcomes. Odds against = ratio of unfavorable to favorable outcomes. For example, if p = P(A) = 4/13, then the odds in favor of event A are 4 to 9, and the odds against A are 9 to 4. Because casinos are in the business of making money, they never quote payoff odds that equal the mathematical odds. For example, in roulette, there are 36 numbers you can bet on, half of which are red and half of which are black. In Las Vegas, there are also a zero and a double zero for which nobody is paid off; in other words, the house takes all the bets if the ball falls in slot 0 or 00. A successful bet on black pays off at 1:1 odds (i.e., net profit of $1 for each $1 placed at risk). However, since the player's probability of success is 18/38, the mathematical odds against are actually 20:18, which is slightly more than 1:1. The excess represents the casino's profit margin in the long run, as proved by the LOLN.

89. How does one prove causation?

Only a controlled experiment is considered convincing. In situations (e.g., smoking in humans) where it is not ethical to run a controlled experiment, various types of observational and correlative studies can suggest, but not prove, a cause-and-effect link.

41.*** In experiments, probability arises at the end in the form of a ____________ computed from the ____________ statistic. Describe the three ______ __ ___ ________ _______ and briefly describe how you would implement them when designing an experiment of possible interest to you personally.

P-value, test; principles of good experimental design; [add your personal description]

11. What does standard deviation measure? and how is it computed?

Population standard deviation and sample standard deviation are measures of data spread. Use STAT CALC 1 to compute, never the formula on AP formula sheet. Technically, u equals the square root of MSE (square root of pop. variance), and s equals the square root of sample variance.

27. What do the letters r.v. mean? Give two examples, one that is ____________ and another that is ____________ .

Random variable. 1). Discrete random variables 2). Continuous random variables

6. Define range and describe how to find it.

Range is a single number for the spread of values in a column of data: range = max - min. People who say things like "the range is from 28 to 75" are misusing the term in its statistical sense.

25. Tell whether the following regression-related terms are synonyms: ____________ outlier and ____________ observation. If not, why not?

Regression outlier and influential observation are not synonyms. A point can be a regression outlier (large residual), but if it is near the center of the x values, it is usually not influential. Similarly, a point can be influential (large effect on slope or r if removed) but have only a small residual, meaning the point is not an outlier. It is also possible for a point to be both influential and an outlier.

23. What is a residual? How does one make a residual plot? If a residual plot for a LSRL model has residuals on the y axis, what variable goes on the x axis?

Residual: an observable estimate of the unobservable statistical error (Examples: actual y - predicted y). Residual plot is scatterplot with RESID on y-axis and either the x or y variable on the x-axis. (It doesn't matter, since x and y are linearly related.) In beginning statistics courses, we usually make resid. plot with x on the x-axis and RESID on the y-axis, but there was at least one AP exam that had y values on the x-axis of the resid. plot. Don't let that bother you.

85. Which assumption is more important, normality (if applicable) or the assumption that data come from an SRS? Why?

SRS, since bias can invalidate the results quite easily. Normality of population is not an issue in large samples (courtesy of CLT), since normality of the sampling distribution rescues us.

21. How do we typically compute b0 and b1? What other ways are there?

STAT CALC 8, or with formulas 6 and 8 on first page of AP formula sheet. (Never use formula 5.)

37. Explain what a ____________ distribution is. Give three examples, using the three test statistics that we care most about in AP Statistics.

Sampling distribution of x bar or diff. of means: Follows z if sigma is known (rare), otherwise t. Sampling distribution of p hat: Really binomial, but almost normal if pop. is large, np 10, and nq 10. Sampling distribution of difference of proportions: Almost normal if pops. are large, n1p1 5, n1q1 5, n2p2 5, n2q2 5. Sampling distrib. of the summation of (observed-expected) divided by expected: Follows x squared, with df given either by (# of bins - 1) for g.o.f., or by (rows - 1)(cols. - 1) for 2-way tables.

16. Which is usually of greater interest, the LSRL slope or the LSRL y-intercept? Why?

Slope, since it estimates how many response units will increase (or decrease) for each additional explanatory unit. Intercept is less crucial, even meaningless in some contexts.

34. What is the purpose of a z score? Under what circumstances may one compute a z score? Describe how to compute it and what it means.

Standardized (dimensionless) representation of a data point, in s.d.'s. Can always be computed, even if data set is non-normal. Use formula z=(x-u)/o. Tells how many s.d.'s a data value is above or below the mean.

57. What is meant by statistical significance?

Statistical significance refers to the importance of the results of research studies; it can be used to form theories and serves for the improvement of practice. Significance in this context is often confused with importance in the sense that it is relevant data, when really it has more to do with how certain the results are that you can even ignore the null hypothesis (the hypothesis that is falsified by the statistical results).

36. Why do we care about probability? Is it merely of interest to casinos and misguided people who waste their money on state lotteries?

The aspect of probability that we care most about is sampling distributions. If we understand the sampling distribution of a statistic, we can determine how statistically significant a result is. Without this, we would never know whether experiments or clinical trials of new drugs were showing anything of value or were merely "flukes."

61. The AP formula sheet gives two versions of the s.e. for a 2-prop. z situation (difference of ____________). Explain how to tell which one to use.

The first one (unequal proportions) is for a 2-prop. z confidence interval, and the second one is usually for a 2-prop. z test. There are rare situations in which you would use the first formula for a 2-prop. z test. The rule is this: Look at what H0 is claiming. If H0 is claiming that the proportions differ by a constant (for example, p1 = .025 + p2), then you would use the first formula (unequal proportions). However, a much more common situation, shown in virtually all of the practice AP problems, involves H0 claiming that p1 = p2, and in such a case you would use the second formula (equal proportions). If all of this is too confusing, you would not go too far wrong if you simply always used the second formula for 2-prop. z tests, leaving the first formula only for 2-prop. z intervals. One additional note: In the second formula, you have to know how to estimate p: Take total # of successes divided by total # of subjects.

95. It has been said that 79.4% of all statistics are made up on the spot, that 5 out of every 3 Americans are weak at mathematics, that smoking is the leading cause of statistics, and that a statistician is someone who follows an unwarranted assumption to a foregone conclusion. Which of these flippant remarks is most unfair?

The last one. Statisticians are mostly from mathematical or scientific backgrounds, which means we are on a quest for truth. Our clients may mangle, misuse, and abuse our conclusions, but we try very hard not to do that ourselves.

53. Which is usually preferred: a one-tailed test or a two-tailed test? When should the decision be made regarding the type of test? What is the relevant question to consider in determining whether to use a one-tailed or two-tailed test?

Two-tailed, since if the experiment goes the wrong way (as sometimes occurs in science), there will still be the possibility of making an inference. All decisions regarding methodology are supposed to be made before any data-gathering occurs. (Otherwise, people could say that the methodology was tailored toward achieving a low P-value. In theory, the experiment should be repeatable, so that anyone following the same methodology would likely reach a similar conclusion.) The one-tailed/two-tailed decision should be based on the research question posed. If the researcher is wondering whether there is "a difference," direction unspecified, then plan for a two-tailed test. If the researcher is wondering whether treatment X increases hair strength, decreases yellowness of teeth, or whatever, then plan for a one-tailed test.

5. Describe how to recognize uniform, normal, binomial, geometric, t, and chi-squared distributions.

Uniform: flat line in relative frequency histogram Normal: classic continuous bell-shaped curve, satisfies 68-95-99.7 rule Binomial: discrete ("stairsteppy"); skew right if p < .5, skew left if p > .5, symmetric if p = q = .5 Geometric: discrete ("stairsteppy"), always skew right t: continuous, bell-shaped; virtually normal for large df, except with more "flab" in the tails Chi-squared: continuous, always skew right

93. Give several examples of ways in which people lie with statistics.

Using deceptive ("gee-whiz") graphs, changing the subject, confusing correlation with causation, using inappropriate averages (e.g., mean with highly skewed distributions), citing anecdotal data, using biased samples, concealing the wording of a survey question, computing absurd precision with qualitative data (e.g., "74% more beautiful skin!"), etc., etc.

56. Can Ha ever be proved? Why or why not?

We can sometimes gather overwhelming evidence that H0 can be rejected in favor of Ha. In the real world, even in a court of law, that is good enough. (Of course, in the world of mathematics, that is not considered a proof—one of the reasons that mathematicians and statisticians do not consider themselves to be equivalent.)

55. Can H0 ever be proved? Why or why not?

We cannot prove H0. All we can do is judge whether the evidence against it is "sufficient to reject" or "insufficient to reject."

94. Give several examples of questions you should always ask when hearing or reading a statistic for the first time.

Who says so? How do they know? Did somebody change the subject? Is the result credible? (For example, a claim that a child is kidnapped every 30 seconds in America is absurd, since that would be more than a million children per year.)

100. Is poker a game of chance?

Yes, but it is much more accurate to call poker a game of psychology and applied mathematics. Like pure games of chance, poker is an effective way to waste a great deal of time and money.

91. There are four types of employees at XYZ Corp., whom we will call pitchers, catchers, infielders, and outfielders for lack of a more creative idea. All categories of employees have recently had large cuts in their mean salaries, and yet total payroll costs have risen. Is such a thing possible? Explain.

Yes; perhaps many new employees have been hired.

92. There are four types of employees at XYZ Corp., whom we will call pitchers, catchers, infielders, and outfielders for lack of a more creative idea. All categories of employees have recently had large cuts in their mean salaries, and yet the overall mean salary per employee has risen. Is such a thing possible? Explain.

Yes; the relative mix of employee categories could be a lurking variable. Perhaps there are now proportionally more employees in the higher-paid job categories, so that the weighted average salary has increased even while each category has had cuts in mean salaries. This would be an example of Simpson's Paradox.

26. Interpret b0 and b1 for a layperson.

b0 = value of response if explanatory variable (x value) is set to 0 b1 = estimate of how many response units will increase (or decrease) for each additional explanatory unit For example, suppose that a clinical trial of a diet pill shows that the mean weight change after a year is 2 - 3x lbs., where x = daily dosage (# of pills). Then b0 = 2, since a person taking 0 pills can expect to gain 2 lbs. in a year, and b1 = -3, since each additional pill in the daily dosage is associated with a weight of about 3 lbs. less after a year.

81. Data from a small sample, from a person's own experience, or from a ____________ sample should usually be dismissed on the grounds that they are ____________ . However, data from large samples (for example, responses to on-line surveys or magazine subscriber surveys) are also often worthless. Why?

convenience, anecdotal; voluntary response bias

35. In probability theory, a Venn diagram showing no overlap indicates that two ____________ are ____________ ____________ . Is this term a synonym for ____________ ? If not, explain the difference.

events, mutually exclusive; independence; no; independence of A and B means P(A|B) = P(A), which is not at all the same as P(A same as B)=0.

49. sampling error Define and describe how it is determined or computed.

incurred when the statistical characteristics of a population are estimated from a subset, or sample, of that population. Since the sample does not include all members of the population, statistics on the sample, such as means and quantiles, generally differ from parameters on the entire population.

58. The purpose of ____________ statistics is to ___ ____________ ___ ____________ ____________ . (This is a much more difficult and sophisticated skill than descriptive statistics, in which we assume that any reasonably intelligent person should be able to read a table or a graph, compute s.d., add a LSRL trend line, etc. Be sure you explain this to people if they pooh-pooh your having spent a year studying statistics. There is much more to the subject than learning about means, modes, and medians!)

inferential, use statistics to estimate parameters

59. Describe each step in the PHA(S)TPC process.

problem: state the problem hypothesis: write the null and alternative hypothesis based on the problem assumptions: check all required assumptions for the test state: the formula for the test test statistic: find the respected t or z score based on the information you are given p vale: assert the p value for the test statistic you found conclusion: state your conclusion based on your p value compared to your alpha level

28. If X is a(n) ____________ , then u1 is calculated by ____________ and is known by two names: ____________ or ____________ .

r.v., E xipi, mean, expected value

29. If X is a(n) ____________ , then ____________ is calculated as probability-weighted MSE and is indicated by either of two possible notations: ____________ or ____________. The ____________ ____________ of ____________ equals s.d., denoted ____________ .

r.v., variance, Var(X), o2 1 , square root, Var(X), o 1

33. Describe how each of the following is affected by linear transformations: r, u , o , IQR, range.

r: no change u : affected by both translation and dilation (fancy way of saying that u(new) = lin. fcn. of u(old)) o : affected by dilation (i.e., multiplication by scalar) but not by translation (shift left or right) IQR: affected by dilation but not by translation range: affected by dilation but not by translation

38. What abbreviation is sometimes used for the s.d. of a statistic? Why does the AP generally avoid this term? Would they understand us if we used it?

s.e.; no idea; yes

32. The standard deviation of a ____________ multiple of X equals the ____________ times ____________ . Is this always true?

scalar (i.e., a constant), scalar, o 1 ; yes

44. alpha level Define and describe how it is determined or computed.

specifies the probability level for our evidence to be an unreasonable estimate. By unreasonable, we mean that the estimate should not have taken its particular value unless some non-chance factor(s) had operated to alter the nature of the sample such that it was no longer representative of the population of interest. The researcher has complete control over the value of this significance level.

30. The mean of a ____________ equals the ____________ of the ____________ . Is this always true? What about for differences?

sum, sum, means; yes; mean of difference equals difference of means

31. The variance of a ____________ equals the ____________ of the ____________ . Is this always true? What about for differences?

sum, sum, variances; true only for independent r.v.'s; variance of difference (assuming indep. r.v.'s) equals sum of variances Other consequences: s.d. of sum = square root of sum of variances (similar to Pythagorean Theorem), s.d. of difference = square root of sum of variances (same comment). Both are true only if the r.v.'s are independent.

80. Describe how to use the result of #79 to get a formula for the s.e. of b1 that is much simpler than the one given on the AP formula sheet.

t=observed-expected / standard error

51. m.o.e. Define and describe how it is determined or computed.

the amount of random sampling error from a survey. ME= critical value( s/ square root of population) or ( square root of p times q/ population)

46. P(Type II error) Define and describe how it is determined or computed.

the failure to reject a false null hypothesis

45. P(Type I error) Define and describe how it is determined or computed.

the incorrect rejection of a true null hypothesis

48. degrees of freedom Define and describe how it is determined or computed.

the number of values in the final calculation of a statistic that are free to vary

43. P-value Define and describe how it is determined or computed.

the probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true.[1] One often "rejects the null hypothesis" when the p-value is less than the predetermined significance level which is often 0.05[2][3] or 0.01, indicating that the observed result would be highly unlikely under the null hypothesis. Many common statistical tests, such as chi-squared tests or Student's t-test, produce test statistics which can be interpreted using p-values.

47. power Define and describe how it is determined or computed.

the probability that the test will reject the null hypothesis when the null hypothesis is false

42. test statistic Define and describe how it is determined or computed.

to quantify, within observed data, behaviours that would distinguish the null from the alternative hypothesis where such an alternative is prescribed, or that would characterise the null hypothesis if there is no explicitly stated alernative hypothesis.

64. It can be proved, after a page or so of messy algebra, that s2 is an unbiased estimator of o2. (Curiously, though, s is not an unbiased estimator of o.) Describe the two other unbiased estimators we learned about during the year.

x bar is an unbiased estimator of m ; i.e. E( x bar)= m p hat is an unbiased estimator of p; i.e. E( p hat)= p


Conjuntos de estudio relacionados

tax c239 ch 19 advanced tax questions 26-40

View Set

4 - PAP: Liability, Med Pay, and UM Coverage

View Set

series 66 Final exam incorrect answers

View Set

Stopping Distances and Reaction Time - Driver's Ed

View Set

Drivers ed chapter 8 and unit 2 review

View Set

13. Palabras con el sonido que, qui y con la letra c

View Set

Chapter 1 - The Accounting Profession

View Set