Combo with "Combo with "Ch. 10 Stats" and 3 others" and 1 other
According to Scarborough Research, more than 85% of working adults commute by car. Of all U.S. cities, Washington, D.C. and New York City have the longest commute times. A sample of 30 commuters in the Washington, D.C. area yielded the following commute times, in minutes (data set, x-bar=27.97 minutes, s=10.04 minutes). (1) Find a 90% confidence interval for the mean commute time of all commuters in Washington, D.C. (2) Interpret your answer from part (1).
(1) (24.855, 31.085) (2) We are 90% confident that the average commute time of all commuters in Washington, D.C. is between 24.855 minutes and 31.085 minutes.
During the late 1800s, Lake Wingra in Madison, Wisconsin, was frozen over an average of 124.9 days per year. A random sample of eight recent years provided the following data on numbers of days that the lake was frozen over: 103, 80, 79, 135, 134, 77, 80, 111. At the 5% significance level, do the data provide sufficient evidence to conclude that the average number of ice days is less now than in the late 1800s? Use the Wilcoxon signed rank test.
(1) H₀: µ = 124.9 H₁: µ < 124.9 (2) ∝ = 0.05 (3) Test Statistic (from constructed table): W = 3 (4) Critical value = 6 (5) Conclusion: Reject H₀ (6) Interpretation: At the 5% significance level, there is sufficient evidence to conclude that the average number of ice days is less than that in the late 1880s.
2 methods for determining whether to reject or not reject the null hypothesis
(1) compare the test statistic to the critical values; where the test statistic falls (rejection region or non-rejection region) (2) compare the p-value to ∝ If the p-value is low, H₀ must go! If the p-value ≤ ∝, reject H₀ If the p-value > ∝, do not reject H₀
The recommended dietary allowance (RDA) of iron for adult females under the age of 51 is 18 milligrams (mg) per day. A hypothesis test is to be performed to decide whether adult females under the age of 51 are, on average, getting less than the RDA of 18 mg of iron. (1) Determine the null hypothesis, (2) Determine the alternative hypothesis, (3) classify the hypothesis test as two tailed, left tailed, or right tailed.
(1) null hypothesis H₀: µ = 18 mg (2) alternative hypothesis H₁: µ < 18 mg (3) left tailed test
Pooled T-Test Assumptions
(1) simple random sample (2) normal population or large sample (3) independent samples (4) equal population standard deviations
A confidence interval for a population mean has a margin of error of 3.4. If the sample mean is 52.8, obtain the confidence interval.
(49.4, 56.2)
Find the critical value(s) for the following tests: (a) Right tail test with ∝ = 0.10 (b) Left tail test with ∝ = 0.01 (c) Two tail test with ∝ = 0.05
(a) 1.282 (b) -2.326 (c) +/- 1.96
2-mean confidence interval formula for independent samples (standard deviations not equal)
* - refers to: t (sub ∝/2), df = Δ
Calculation of critical values (for hypothesis test for one population when ϑ is known)
+/- Z (∝/2) - two tailed test - Z ∝ - left tailed test Z∝ - right tailed test
Determine the p-value. Z = 2.03 right-tailed test
0.0212
Determine the p-value. Z = -1.66 two-tailed test
0.0970
Determine the p-value. Z = -0.74 left-tailed test
0.2296
Determine the p-value. Z = 0.52 two-tailed test
0.6030
What are the 4 assumptions for parametric samples?
1) Interval scale (interval or ratio, not nominal or ordinal) 2) randomly drawn and independent samples 3) normal distribution of parent population 4) equal variances
How are the F and T-test similar? (4)
1) standard deviations on a curve 2) probability associated 3) alpha levels used 4) procedure is the same
ANOVA accounts for 3 different groups of variability:
1) total SS 2) between group SS 3) within group SS
What is the design concept for a Univariate?
2 IV, 2 or more levels and 1 DV
What is the appropriate z-value for a 99% confidence level?
2.576
Explain the meaning of the term hypothesis as used in inferential statistics.
A hypothesis is a statement that something is true.
one-tailed test
A hypothesis test that is either left-tailed or right-tailed.
How does a large test statistic relate to the area in the tail?
A large test statistic means that there is a smaller area in the tail.
What is a Main effect?
A main effect are those IV's that we've given a name. One for every IV you have in a Univariate.
Technically, what is a nonparametric method? In current statistical practice, how is that term used?
A nonparametric method is an inferential method not concerned with parameters (such as µ and ϑ). Common statistical practice is to refer to most methods that can be applied without assuming normality as nonparametric.
Explain the difference between a point estimate of a parameter and a confidence-interval estimate of a parameter.
A point estimate of a parameter consists of a single value with no indication of the accuracy of the estimate. A confidence interval consists of an interval of numbers obtained from a point estimate of the parameter together with a percentage that specifies how confident we are that the parameter lies in the interval.
What does ANOVA stand for?
Analysis of variance
In a Repeated Measures test, what does the BS stand for?
BS stands for "Between Subjects" Variation
In a Wilcoxon signed rank test, an observation equals µ₀ (the value given for the mean in the null hypothesis), that observation should be removed and the sample size reduced by 1. Why does that need to be done?
Because the D-value for such an observation equals 0, a sign cannot be attached to the rank of |D|
A fundamental principle of data analysis
Before performing a statistical-inference procedure, examine the sample data. If any of the conditions required for using the procedure appear to be violated, do not apply the procedure. Instead use a different, more appropriate procedure, or, if you are unsure of one, consult a statistician.
Find the area to the right of t=2 with df = 30.
Between 0.05 and 0.025
Does this pair comply with the rules for setting up hypotheses? If not explain why. H₀: µ = 123; H₁: µ < 123
Complies
Does this pair comply with the rules for setting up hypotheses? If not explain why. H₀: µ = 50; H₁: µ ≠ 50
Complies
Does this pair comply with the rules for setting up hypotheses? If not, explain why. H₀: µ = 15; H₁: µ = 15
Does not comply. H₁ must be stated as ≠ 15, <15, or > 15.
True or False: The confidence interval can be obtained if you know only the margin of error.
False
Which of the following alternative hypotheses would require using a two-tailed test?
Ha: u does NOT = 100
Possible conclusions for a hypothesis test
If the null hypotheses is rejected, we conclude that the alternative hypothesis is true. If the null hypothesis is not rejected, we conclude that the data do not provide sufficient evidence to support the alternative hypothesis.
two-tailed test
If the primary concern is deciding whether a population mean, µ, is different from a specified value µ₀, we express the alternative hypothesis as: H₁: µ ≠ µ₀
Would it be appropriate to use a t-interval for a sample size of 15? Explain.
It would not be appropriate because the assumption of a normal population or a large sample is not met. We know nothing of the population and the sample is small.
Sampling Distribution for x-bar₁ - x-bar₂
Mean µ₁ - µ₂ Std Deviation: See above Shape: normal distribution
Multivariate
More than 2 IV's and 2 or more levels for each IV and more than 1 DV
df total:
N-1 (N is total # of observations)
On the calculator (TI-84), how do you find the area to the right of a particular z-score?
NORMALCDF (Z-score, 1000, 0, 1)
Can you plot an F value in the Alternative?
No because we can only compare our test statistic to 0 (zero)
Can "F" every be a 2 tailed test?
No! The F can NEVER be a 2 tailed test.
Type I Error
Rejecting the null hypothesis when it is in fact true
Given: n=45 x-bar = 14.68 ϑ = 4.2 ∝ = .01 H₀: µ = 18 H₁: µ < 18 Reject or do not reject null hypothesis?
Test statistic: z = -5.3 Critical value: -2.326 Conclusion: -5.3 is in the rejection region Interpretation: There is sufficient evidence to conclude that ... the alternative hypothesis is true.
normal, small
The connection between the normal distribution and the t-distribution is that the t-distribution is often used for analyzing the mean of a population if the population has a ____ distribution (or fairly close to it). Its role is especially important if your data set is ___ or if you don't know the standard deviation of the population (which is often the case).
Explain why the margin of error determines the precision with which a sample mean estimates a population mean.
The length of a confidence interval, and thus the precision with which x-bar estimates µ, is determined by the margin of error.
True or False. The length of a confidence interval can be determined if you know only the margin of error.
True
True or False: The p-value is the smallest significance level for which the observed sample data result in rejection of the null hypothesis.
True
What is Variance?
Variance is MS
What assumption must be met in order to use the Wilcoxon signed rank test?
a symmetric distribution
critical values
the boundaries for the rejection/non-rejection regions
Effect variance:
"between group variance" we look to establish whether there is a real difference between groups.
The following data are airborne times for United Airlines flight 448 from Albuquerque to Denver on 10 randomly selected days: 57, 54, 55, 51, 56, 48, 52, 51, 59, 59 . (1) Compute and interpret a 90% confidence interval for the mean airborne time for flight 448. (2) Based on your interval in part (1), if flight 448 is scheduled to depart at 10 a.m., what would you recommend for the published arrival time? Explain.
(1) (52.07, 56.33) (2) Recommend an arrival time of 10:57 a.m., so that 0% of the flights would be late.
In the Wilcoxon signed rank test, how do you assign ranks if there is a tie (1) between 2 values of |D|, (2) between 3 values of |D|
(1) If there is a tie between two values of |D|, then average the ranks and assign that average to the two values (2) If there is a tie between three values of |D|, then assign the middle rank to all three values
Steps for Hypothesis Tests for One Population Mean when ϑ is known (6)
(1) State the null and alternative hypotheses (2) Decide on a value for ∝ (significance level) (3) Compute the test statistic Z (4) Find the critical values (5) Conclusion (6) Interpretation
Basic properties of t-curves
(1) The total area under a t-curve equals 1. (2) A t-curve extends indefinitely in both directions, approaching but never touching, the horizontal axis as it does so. (3) A t-curve is symmetric about 0. (4) As the number of degrees of freedom becomes larger, t-curves look increasingly like the standard normal curve.
Given: x-bar = 23 s = 4 n = 24 H₀: µ = 22, H₁: µ ≠ 22 ∝ = 0.05 (1) Use the one-mean t-test to perform the required hypothesis test about the mean, µ, of the population from which the sample was drawn. (2) Find (or estimate) the P-value and determine the strength of the evidence against the null hypothesis.
(1) t = 1.22; critical values = +/- 2.069 p-value = .233 (P > 0.20) Do not reject H₀. (2) Weak or none (P > 0.10)
With the following information, use the one-mean t-interval procedure to find a confidence interval for the mean of the population from with the sample was drawn: x-bar = 20 n = 36 s = 3 confidence level = 95%
(18.98, 21.02)
Studentized version of x-bar
...
What p-value(s) describe moderate evidence against the null hypothesis?
.05 < P ≤ .10
What is the appropriate t-value for the following confidence level and sample size: confidence level 90%, n=12
1.796
What is the appropriate t-value for the following confidence level and sample size: confidence level 99%, n=24
2.807
A confidence interval for a population mean has a margin of error of 3.4 Determine the length of the confidence interval.
6.8
Determine the strength of the evidence against the null hypothesis: (a) p = 0.06 (b) p = 0.35 (c) p = 0.027 (d) p = 0.004
(1) moderate (2) weak or none (3) strong (4) very strong
What are the 3 assumptions of the t-test?
(1) simple random sample (2) normal population or a large sample (3) ϑ unknown
Non-pooled T-Test Assumptions
(1) simple random sample (2) normal population or large sample (3) independent samples (4) unequal population standard deviations
Dr. Thomas Stanley of Georgia State University has surveyed millionaires since 1973. Among other information, Stanley obtains estimates for the mean age, µ, of all U.S. millionaires. Suppose that 36 U.S. millionaires are randomly selected (mean = 58.53). Determine a 95% confidence interval for the mean age, µ, of all U.S. millionaires. Assume that the standard deviation of ages of all U.S. millionaires is 13.0 years.
(54.3, 62.8)
A sample of 25 typists has an average typing speed of 85 wpm. Assume ó=19. Find a 99% confidence interval for the average speed of typists. Interpret your results.
(75.2, 94.8) We are 99% confident the average typing speed for typists is between 75.2 words per minute and 94.8 words per minute.
For which of the following p-values would the null hypothesis be rejected at a level of ∝ = 0.05: (a) .001 (b) .021 (c) .078 (d) .047 (e) .148
(a) reject (b) reject (c) do not reject (d) reject (e) do not reject
Using the test statistic formula for the z-score, determine the required critical value(s) for a left-tailed test with ∝ = 0.05.
-1.645
Determine the p-value. Z = -0.31 right-tailed test
0.6217
Using the test statistic formula for the z-score, determine the required critical value(s) for a right-tailed test with ∝ = 0.05.
1.645
What is the design concept for a Multivariate?
2 or more IV, 2 or more Levels, 1 or more DV.
Does this pair comply with the rules for setting up hypotheses? If not explain why. H₀: µ = 123; H₁: µ = 125
Does not comply. H₁ must use the same number as H₀ and cannot contain the equal sign.
Relate confidence and precision for a fixed sample size
For a fixed sample size, decreasing the confidence level improves the precision, and vice versa.
What is the non-parametric equivalent for one-way ANOVAs?
Kruskal-Wallis test (Mann Whitney used for post hoc) Uses rankings.
Univariate
More than 2 IV's and 2 or more levels for each of those IV's and only has 1 DV
Type II Error
Not rejecting the null hypothesis when it is in fact false.
What do "SS", "df", "MS" stand for?
SS=sum of squares, df=Degrees of Freedom, and MS=Means squared
What does it mean when a computer printout reads "p = 0.03"?
The Z obt. falls in the extreme 3% of the sampling distribution and the probability of a Type I error is 0.03
What value does 1 - ∝ give you?
The confidence level
What factor determines the precision with which x-bar estimates µ?
The length of the confidence interval, determined by the margin of error.
Which hypothesis is actually being tested in statistical hypothesis testing?
The null hypothesis
If you have a p-value of 0.0168 and a z-score of +/- 2.39, interpret the meaning of these values in context.
The probability of getting a z-score more extreme than +/- 2.39 is 0.0168.
What does alpha always represent in statistical hypothesis testing?
The relative size of the region of rejection
Define test statistic
The statistic used as a basis for deciding whether the null hypothesis should be rejected
The null hypothesis describes the
population parameters represented by the sample data if the predicted relationship DOES NOT exist
How do we get MS (mean square) to calculate the F value?
remember: F is MSbg/MSwg We find MS by dividing SS/df (sums of squares divided by the df for that MS value)
What mathematical signs are allowed in the alternative hypothesis?
≠, <, >
In a Singapore edition of Business Times, diamond pricing was explored. The price of a diamond is based on the diamond's weight, color, and clarity. A simple random sample of 18 one-half carat diamonds had the following prices, in dollars: 1676, 1995, 1442, 1876, 1995, 2032, 1718, 1988, 1826, 2071, 2071, 2234, 1947, 2108, 1983, 1941, 2146, 2316. (a) Apply the t-interval procedure to these data to find a 90% confidence interval for the mean price of all one-half carat diamonds. Interpret your result. (note: x-bar = $1964.7 and s = $206.5.)
($1880.07, $2049.37) We can be 90% confident that the mean diamond price for one-half carat diamonds is between $1880.07 and $2049.37.
Researchers randomly and independently selected 32 former prisoners diagnosed with chronic PTSD and 20 former prisoners that were diagnosed with PTSD after release from prison but had since recovered (remitted). The ages, in years, at arrest yielded the following summary statistics: Chronic: x-bar₁ = 25.8, s₁ = 9.2, n₁ = 32 Remitted: x-bar₂ = 22.1, s₂ = 5.7, n₂ = 20 Obtain a 90% confidence interval for the difference µ₁ - µ₂, between the mean ages at arrest of East German prisoners with chronic PTSD and remitted PTSD. Interpret your result.
(.23712, 7.1629) or rounded (.2, 7.2); df = Δ = +/- 1 We can be 90% confident that the difference between the mean ages at arrest of East German prisoners with chronic PTSD and remitted PTSD is between 0.2 and 7.2 years.
A random sample of 18 venture-capital investments in the fiber optics business sector yielded the following data, in millions of dollars (sum of the data is $113.97 million). (1) determine a 95% confidence interval for the mean amount, µ, of all venture-capital investments in the fiber optics business sector. Assume that the population standard deviation is $2.04 million. (2) Interpret your answer from part (1).
(1) $5.389 million to $7.274 million (2) We can be 95% confident that the mean amount of all venture-capital investments in the fiber optics business sector is somewhere between $5.389 million and $7.274 million.
Infants treated for pulmonary hypertension, called the PH group, were compared with those not so treated, called the control group. One of the characteristics measured was head circumference. The mean head circumference of the 10 infants in the PH group was 34.2 cm. (1) Assuming that head circumferences for infants treated for PH are normally distributed with standard deviation 2.1 cm, determine a 90% confidence interval for the mean head circumference of all such infants. (2) Obtain the margin of error, E, for the confidence interval you found in part (1). (3) Explain the meaning of E in this context in terms of the accuracy of the estimate. (4) Determine the sample size required to have a margin of error of 0.5 cm with a 95% confidence level.
(1) (33.108, 35.292) (2) E = 1.1 cm (3) We can be 90% confident that the error made in estimating µ by x-bar is at most 1.1 cm. (4) 68
For the following: Two-tailed test, n = 17, and t = -2.733, (1) Estimate the p-value from the Table (2) Based on your estimate from part (1), state at which significance levels the null hypothesis can be rejected, at which significance levels it cannot be rejected, and which significance levels it is not possible to decide.
(1) 0.01 < P < 0.02 (2) We can reject H₀ at any significance level of 0.02 or larger, and we cannot reject H₀ at any significance level of 0.01 or smaller. For significance levels between 0.01 and 0.02, the table is not sufficiently detailed to help us to decide whether to reject H₀.
For the following: Right tailed test, n=20, t=2.235, (1) Estimate the p-value from the Table (2) Based on your estimate from part (1), state at which significance levels the null hypothesis can be rejected, at which significance levels it cannot be rejected, and which significance levels it is not possible to decide.
(1) 0.01 < P < 0.025 (2) We can reject H₀ at any significance level of 0.025 or larger, and we cannot reject H₀ at any significance level of 0.01 or smaller. For significance levels between 0.01 and 0.025, the table is not sufficiently detailed to help us to decide whether to reject H₀.
A data set gives the additional sleep in hours obtained by a sample of 10 patients using laevohysocyamine hydrobromide (with xbar=2.33 hr, s=2.002 hr). (1) Obtain and interpret a 95% confidence interval for the additional sleep that would be obtained on average for all people using laevohysocyamine hydrobromide. (2) Was the drug effective in increasing sleep? Explain your answer.
(1) 0.90 hr to 3.76 hr We can be 95% confident that the additional sleep that would be obtained on average for all people using the drug is somewhere between 0.90 hr and 3.76 hr. (2) It appears so, because, based on the confidence interval, we can be 95% confident that the mean additional sleep is somewhere between 0.90 hr and 3.76 hr and that, in particular, the mean is positive.
A variable has a mean of 100 and a standard deviation of 16. 4 observations of this variable have a mean of 108 and a sample standard deviation of 12. Determine the observed value of the: (1) standardized version of x-bar (2) studentized value of x-bar
(1) 1 (2) 1.33
Steps for the Wilcoxon Signed Rank Test
(1) Determine null and alternative hypotheses (2) Determine value for ∝ (3) Construct the table to find W, the test statistic (4) Find critical value(s) (5) Number line; plot test statistic and critical value(s) (6) Conclusion: reject H₀ or do not reject H₀ based on where test statistic falls (7) Interpret results
The p-value for a hypothesis test is 0.083. For each of the following significance levels, decide whether the null hypothesis should be rejected: (1) ∝ = 0.05 (2) ∝ = 0.10 (3) ∝ = 0.06
(1) Do not reject (0.083 > 0.05) (2) Reject (0.083 ≤ 0.10) (3) Do not reject (0.083 > 0.06)
When to use the one-mean z-test
(1) For small samples - say of size less than 15 - the z-test should be used only when the variable under consideration is normally distributed or very close to being so. (2) For samples of moderate size - say, between 15 and 30 - the z-test can be used unless the data contain outliers or the variable under consideration is far from being normally distributed. (3) For large samples - say, of size 30 or more - the z-test can be used essentially without restriction. However, if outliers are present and their removal is not justified, you should perform the hypothesis test once with the outliers and once without them to see what effect the outliers have. If the conclusion is affected, use a different procedure or take another sample. (4) If outliers are present but their removal is justified and results in a data set for which the z-test is appropriate (as previously stated), the procedure can be used.
The Kelley Blue Book provides information on retail and trad-in values for used card and trucks. The retail value represents the price a dealer might charge after preparing the vehicle for sale. A 2003 Ford Mustang coupe has a 2006 Kelley Blue Book retail value of $12,850. We obtained the following asking prices, in dollars, for a sample of 2003 Ford Mustang coupes for sale in Phoenix, AZ: 13480, 12499, 12992, 11500, 12988, 10400, 12800, 12500, 12599, 12600. At the 10% significance level, do the data provide sufficient evidence to conclude that the mean asking price for 2003 Ford Mustang coupes in Phoenix is less than the 2006 Kelley Blue Book retail value? Use the Wilcoxon signed rank test.
(1) H₀ = 12,850 H₁ < 12,850 (2) ∝ = 0.10 (3) Test statistic: W₀ = 13 (4) Critical value = (10)(11)/2 - 41 (5) Since W < 14, reject H₀ (6) At the 10% significance level, there is sufficient evidence to conclude that the mean asking price for a 2003 Ford Mustang is less than the 2006 Kelly Blue Book value.
Cadmium, a heavy metal, is toxic to animals. Mushrooms, however, are able to absorb and accumulate cadmium at high concentrations. The Czech and Slovak governments have set a safety limit for cadmium in dry vegetables at 0.5 parts per million (ppm). A random sample of the edible mushroom Boletus pinicola with the resulting data: 0.24, 0.59, 0.62, 0.16, 0.77, 1.33, 0.92, 0.19, 0.33, 0.25, 0.59, 0.32 . At the 5% significance level, do the data provide sufficient evidence to conclude that the mean cadmium level in Boletus pinicola mushrooms is greater than the government's recommended limit of 0.5 ppm? Assume that the population standard deviation of cadmium levels in Boletus pinicola mushrooms is 0.37 ppm. (Note: The sum of the data is 6.31 ppm). USE THE P-VALUE APPROACH.
(1) H₀: µ = 0.5 ppm H₁: µ > 0.5 ppm (2) ∝ = 0.05 (3) test statistic: Z = 0.24 (4) p-value =0.4044 (calculator t-test) (5) conclusion: compare p-value to ∝: Is 0.4044 ≤ 0.05? No. Do not reject. (6) There is not sufficient evidence to conclude that the average cadmium level in the mushrooms is greater than .5 ppm.
According to the Bureau of Crime Statistics and Research of Australia, the mean length of imprisonment for motor-vehicle theft offenders in Australia is 16.7 months. You want to perform a hypothesis test to decide whether the mean length of imprisonment for motor-vehicle theft offenders in Sydney differs from the national mean in Australia. (1) Determine the null hypothesis, (2) Determine the alternative hypothesis, (3) classify the hypothesis test as two tailed, left tailed, or right tailed.
(1) H₀: µ = 16.7 months (2) H₁: µ ≠ 16.7 months (3) two tailed test
In 2002, the median age of U.S. residents was 35.7 years. A random sample of 10 U.S. residents taken this year yielded the following data, in years: 42, 45, 62, 49, 14, 39, 57, 11, 36, 26. At the 1% significance level, do the data provide sufficient evidence to conclude that the median age of today's U.S. residents has increased from the 2002 median age of 35.7 years? Use the Wilcoxon signed rank test.
(1) H₀: µ = 35.7 H₁: µ > 35.7 (2) ∝ = .01 (3) Test Statistic (from constructed table): W = 33 (4) Critical value: 50 (5) Conclusion: Do not reject H₀ (6) Interpretation: At the 1% significance level, there is not sufficient evidence to conclude that the median age of today's U.S. residents has increased from the 2002 median age of 35.7 years.
According to Communications Industry Forecast & Report, the average person watched 4.66 hours of television per day in 2002. A random sample of 20 people gave the number of hours of television watched per day for last year (x-bar = 4.835 hours, s = 2.291 hours). At the 10% significance level, do the data provide sufficient evidence to conclude that the amount of television watched per day last year by the average person differed from that in 2002? USE THE P-VALUE APPROACH.
(1) H₀: µ = 4.66 hours H₁: µ ≠ 4.66 hours (2) ∝ = 0.10 (3) Test statistic t = 3.416 (4) p-value: p=0.7364 (from calculator t-test) (5) Conclusion: compare p-value to ∝ Is 0.7364 ≤ 0.10? No. Do not reject. (6) There is not sufficient evidence that the average number of daily t.v. viewing hours has changed from that in 2002 (average of 4.66 hours daily).
The National Center for Test Statistics reports that the median birth weight of U.S. babies was 7.4 lb in 2002. A random sample of this year's births provided the following weights, in pounds: 8.6, 8.8, 7.4, 8.2, 5.3, 9.2, 13.8, 5.6, 7.8, 6.0, 5.7, 11.6, 9.2, 7.2. Con we conclude that this year's median birth weight differs from that in 2002? Use a significance level of 0.05.
(1) H₀: µ = 7.4 lb H₁: µ ≠ 7.4 lb (2) ∝ = 0.05 (3) Test statistic (from created table): W = 57.5 (4) Critical values - Right side 74, Left side 17 (5) Conclusion: Do not reject H₀ (6) Interpretation: There is not enough evidence to conclude that the median birth weight of U.S. babies is different than 7.4 lbs in 2002.
For the 2002 baseball season, the median baseball salary was determined to be $800,000. A random sample of 14 salaries was conducted with the following salaries in 2005 (in thousands) as follows: 316, 326, 331, 332, 335, 550, 750, 950, 1300, 3000, 4000, 6500, 8250, 13100.At the significance level of 0.05, is there enough evidence to conclude that the median baseball salary for the 2005 baseball season is more than $800,000? Use the Wilcoxon signed rank test.
(1) H₀: µ = 800 H₁: µ > 800 (2) ∝ = 0.05 (3) Test Statistic (Construct Table): W = 71 (4) Critical Value = 79 (5) Conclusion: Do not reject H₀ (6) Interpretation: At the 5% significance level, there is not sufficient evidence to conclude that the median baseball salary is greater than that in 2002 ($800,000).
A hot tub manufacturer advertises that with its heating equipment a temperature of 100 degrees F can be achieved in at most 15 minutes. A random sample of 20 tubs is selected and the time needed to reach 100 degrees is determined for each tub. The sample mean is 16 minutes with a standard deviation of 1 minute. Does this information cast doubt on the company's claim? Assume ∝ = 0.01.
(1) H₀: µ ≤ 15 H₁: µ > 15 (2) ∝ = 0.01 (3) Test statistic: t = 4.47 (4) P-value = .000013 (using t-test on calculator) (5) Compare p-value to ∝ .000013 ≤ 0.01 Reject H₀ (6) There is enough evidence to suggest the average time for a hot tub to reach 100 degrees F is more than 15 minutes (p=.000013, ∝ = 0.01).
An automobile manufacturer who wishes to advertise that one of its models achieves 30 mpg decides to carry out a fuel efficiency test. Six non-professional drivers are selected and each one drives a car from Phoenix to Los Angeles. The resulting fuel efficiencies in mpg are: 27.2, 29.3, 31.2, 28.4, 30.3, 29.6. Assuming that the fuel efficiency is normally distributed, do the data contradict the claim that the true average fuel efficiency is at least 30 mpg? Assume ∝ = 0.05.
(1) H₀: µ ≥ 30 mpg H₁: µ < 30 mpg (2) ∝ = 0.05 (3) test statistic t = -1.16 (4) p-value = .1493 (t-test on calculator) (5) compare p-value to ∝ .1493 ≤ 0.05 ? NO Do not reject H₀ (6) There is not enough evidence to conclude that the average fuel efficiency in mpg is less than 30 mpg.
Researchers obtained the following data on the number of acute postoperative days in the hospital using the dynamic and static systems: dynamic: Dynamic: x-bar₁ = 7.36, s₁ = 1.22, n₁ = 14 Static: x-bar₂ = 10.50, s₂ = 4.59, n₂ = 6 At the 5% significance level, do the data provide sufficient evidence to conclude that the mean number of acute postoperative days in the hospital is smaller with the dynamic system than with the static system?
(1) H₀: µ₁ = µ₂ H₁: µ₁ < µ₂ (2) ∝ = 0.05 (3) Test Statistic: t = -1.645 (4) Critical Value = -2.015, P-value: = 0.0781 (5) Conclusion: Do not reject H₀ (6) At the 5% significance level, there is not sufficient evidence to conclude that the mean number of acute postoperative days in the hospital is smaller with the dynamic system than with the static system.
Do children diagnosed with ADHD have smaller brains than children without this condition? Brain scans were completed for 152 children with ADHD and 139 children of similar age without ADHD. Summary values for total cerebral volume (in milliliters) are given below: With ADHD, n=152, x-bar = 1059.4, s = 117.5 Without ADHD, n=139, x-bar=1104.5, s = 111.3 Do these data provide evidence that the mean brain volume of children with ADHD is smaller than the mean for children without ADHD? Test the relevant hypotheses by using a 0.05 level of significance.
(1) H₀: µ₁ = µ₂ H₁: µ₁ < µ₂ (2) ∝ = 0.05 (3) Test Statistic: t = -3.3527 (4) P-value: .0004516 (5) Conclusion: Reject H₀ (6) Interpretation: There is sufficient evidence to conclude that the mean brain volume of children with ADHD is smaller than the mean for children without ADHD.
The U.S. Bureau of Prisons publishes data in Prison Statistics on the times served by prisoners released from federal institutions for the first time. Independent random samples of released prisoners in the fraud and firearms offense categories yielded the following information on time served, in months: Fraud: x-bar₁ = 10.12, s₁ = 4.90, n₁ = 10 Firearms: x-bar₂ = 18.78, s₂ = 4.64, n₂ = 10 At the 5% significance level, do the data provide sufficient evidence to conclude that the mean time served for fraud is less than that for firearms offenses?
(1) H₀: µ₁ = µ₂ H₁: µ₁ < µ₂ (2) ∝ = 0.05 (3) Test Statistic: t = -4.058 (4) Critical Value = -1.734, P-value: = .000369 (5) Conclusion: Reject H₀ (6) At the 5% significance level, there is sufficient evidence to conclude that the mean time served for fraud is less than that for firearms offenses.
An article investigated the driving behavior of teenagers by observing their vehicles as they left a high school parking lot and then again at a site approximately 1/2 mile from the school. For this test, use a 0.01 significance level. The following measurements represent the amount by which the speed limit was exceeded by male drivers and female drivers: Male: 1.3, 1.3, 0.9, 2.1, 0.7, 1.3, 5.0, 1.3, 0.6, 2.1 Fem: -0.2, 0.5, 1.1, 0.7, 1.1, 1.2, 0.1, 0.9, 0.5, 0.5 Doe these data provide convincing evidence support for the claim that, on average, male teenage drivers exceed the speed limit by more than do female teenage drivers?
(1) H₀: µ₁ = µ₂ H₁: µ₁ > µ₂ (2) ∝ = 0.01 (3) Test Statistic: t = 2.374 (4) P-value: .0181 (5) Conclusion: Do not reject H₀ (6) Interpretation: There is not sufficient evidence to conclude that the male teenage drivers exceed the speed limit by more than do female drivers.
L. Smith and D. Haukos examined the relationship of species richness and diversity to playa area and watershed disturbance. Independent random samples of 126 playa with cropland and 98 playa with grassland in Southern Great Plains yielded the following summary statistics for the number of native species: Cropland: x-bar₁ = 14.06, s₁ = 4.83, n₁ = 126 Wetland: x-bar₂ = 15.36, s₂ = 4.95, n₂ = 98 At the 5% significance level, do the data provide sufficient evidence to conclude that a difference exists in the mean number of native species in the two regions?
(1) H₀: µ₁ = µ₂ H₁: µ₁ ≠ µ₂ (2) ∝ = 0.05 (3) Test Statistic: t = -1.98 (4) Critical Value = -1.971, P-value: = 0.0493 (5) Conclusion: Reject H₀ (6) At the 5% significance level, there is sufficient evidence to conclude that there is a difference in the mean number of native species in the two regions.
Researchers randomly and independently selected 32 former prisoners diagnosed with chronic PTSD and 20 former prisoners that were diagnosed with PTSD after release from prison but had since recovered (remitted). The ages, in years, at arrest yielded the following summary statistics: Chronic: x-bar₁ = 25.8, s₁ = 9.2, n₁ = 32 Remitted: x-bar₂ = 22.1, s₂ = 5.7, n₂ = 20 At the 10% significance level, is there sufficient evidence to conclude that a difference exists in the mean age at arrest of East German prisoners with chronic PTSD and remitted PTSD?
(1) H₀: µ₁ = µ₂ H₁: µ₁ ≠ µ₂ (2) ∝ = 0.10 (3) Test Statistic: t = 1.79 (4) Critical Value = +/- 1.677, P-value: = 0.0794 (5) Conclusion: Reject H₀ (6) At the 10% significance level, there is sufficient evidence to conclude that there is a difference in the mean age of arrest of East German prisoners with chronic PTSD and remitted PTSD.
For the following: Left-tailed test, n=10, t = -3.381, (1) Estimate the p-value from the Table (2) Based on your estimate from part (1), state at which significance levels the null hypothesis can be rejected, at which significance levels it cannot be rejected, and which significance levels it is not possible to decide.
(1) P < 0.005 (2) We can reject H₀ at any significance level of 0.005 or larger. For significance levels smaller than 0.005, the table is not sufficiently detailed to help us to decide whether to reject H₀.
The method for computing the sample size required to obtain a confidence interval with a specified confidence level and margin of error - the number resulting from the formula should be rounded up to the nearest whole number. (1) Why do you want a whole number? (2) Why do you round up instead of down?
(1) The sample size cannot be a fraction. (2) The result (n) is the smallest value that will provide the required margin of error. If the number were rounded down, the sample size would not be sufficient to ensure the required margin of error.
Identify the two types of incorrect decisions in a hypothesis test. For each incorrect decision, what symbol is used to represent the probability of making that type of error?
(1) Type I - rejecting a true null hypothesis (symbol ∝) (2) Type II - not rejecting a false null hypothesis (symbol β)
State two reasons why including the p-value is prudent when you are reporting the results of a hypothesis test.
(1) it allows you to assess significance at any desired level (2) it permits you to evaluate the strength of the evidence against the null hypothesis
Given: safety limit set for cadmium in dry vegetables at 0.5 ppm. A hypothesis test is to be performed to decide whether the mean cadmium level in Bp mushrooms is greater than the government's recommended limit. (1) Determine the null hypotheses, (2) Determine the alternative hypothesis, (3) classify the hypothesis test as two tailed, left tailed, or right tailed.
(1) null hypothesis H₀: µ = .5 ppm (2) alternative hypothesis H₁: µ > .5 ppm (3) right tailed test
Suppose that you know that a variable is normally distributed on each of two populations. Further suppose that you want to perform a hypothesis test based on independent random samples to compare the two population means. In each case, decide whether you would use the pooled or nonpooled t-test. (1) You know the population standard deviations are equal. (2) You know that the population standard deviations are not equal. (3) The sample standard deviations are 23.6 and 25.2, and each sample size is 25. (4) The sample standard deviations are 23.6 and 59.2.
(1) pooled (2) nonpooled (3) pooled (4) nonpooled
Assumptions for the One-Mean t-Interval Procedure
(1) simple random sample (2) normal population or large sample (3) ϑ unknown
What are the assumptions required for using the z-interval procedure?
(1) simple random sample (2) normal population or large sample (≥30) (3) sigma (ϑ) known
Assumptions for the Wilcoxon Signed Rank Test
(1) simple random sample (2) symmetric distribution (triangular, uniform, symmetric bimodal)
Given: x-bar = 20 s = 4 n = 32 H₀: µ = 22, H₁: µ < 22 ∝ = 0.05 (1) Use the one-mean t-test to perform the required hypothesis test about the mean, µ, of the population from which the sample was drawn. (2) Find (or estimate) the P-value and determine the strength of the evidence against the null hypothesis.
(1) t = -2.82; critical value = -1.696 P-value = 0.004 (P < 0.05) Reject H₀. (2) Very strong (P < 0.01)
Given: x-bar = 24 s = 4 n = 15 H₀: µ = 22, H₁: µ > 22 ∝ = 0.05 (1) Use the one-mean t-test to perform the required hypothesis test about the mean, µ, of the population from which the sample was drawn. (2) Find (or estimate) the P-value and determine the strength of the evidence against the null hypothesis.
(1) t = 1.94; critical value = 1.761 p-value = 0.037 (0.025 < P < 0.05) Reject H₀. (2) Strong (0.01 < P < 0.05)
A sample of 36 soda cans was taken and the average number of ounces was found to be 12.4 oz. (assume ϑ=2). Find a 95% confidence interval for µ, the average number of oz in the can. Interpret your results.
(11.747, 13.053) We are 95% confident that the average number of ounces in a soda can is between 11.8 ounces and 13.1 ounces.
A tarantula has two body parts. The anterior part of the body is covered above by a shell, or carapace. A simple random sample of 15 of adult male Brazilian giant tawny red tarantulas provided the following data on carapace length, in millimeters (mm): 15.7, 19.2, 16.4, 18.3, 19.8, 16.8, 19.7, 18.1, 18.9, 17.6, 18.0, 18.5, 19.0, 20.9, 19.5. Find and interpret a 95.44% confidence interval for the mean carapace length of all adult male Brazilian giant tawny red tarantulas. The population standard deviation is 1.76 mm.
(17.52, 19.34) We can be 95.44% confident that the average carapace length of all adult male Brazilian giant tawny red tarantulas is between 17.52 mm and 19.34 mm.
The publication Amusement Business provides figures on the cost for a family of four to spend the day at one of America's Amusement parks. A random sample of 25 families of four that attended amusement parks yielded the following costs, rounded to the nearest dollar (see 8.94 p 391). Obtain and interpret a 95% confidence interval for the mean cost for a family of four to spend the day at an American amusement park. (Note: x-bar=$193.32, s=$26.73).
(182.29, 204.35) We are 95% confident that the average cost for a family of four to spend the day at an American amusement park will be between $182.29 and $204.35.
With the following information, use the one-mean t-interval procedure to find a confidence interval for the mean of the population from with the sample was drawn: x-bar = 30 n = 25 s = 4 confidence level = 90%
(28.63, 31.37)
Use the one-mean z-interval procedure to find a confidence interval for the mean of the population from which the sample was drawn: x-bar = 30 n = 25 ϑ = 4 confidence level = 90%
(28.7, 31.3)
An AP poll found that 38% of parents said they were unlikely to give permission for their kids to be vaccinated at school (sample of 1003 adults). The margin of sampling error is +- 3.1 percentage points for all adults. What is the confidence interval? What is the length of the confidence interval?
(34.9, 41.1) Length of confidence interval: 6.2
With the following information, use the one-mean t-interval procedure to find a confidence interval for the mean of the population from with the sample was drawn: x-bar =50 n = 16 s = 5 confidence level = 99%
(46.32, 53.68)
Use the one-mean z-interval procedure to find a confidence interval for the mean of the population from which the sample was drawn: x-bar = 50 n = 16 ϑ = 5 confidence level = 99%
(46.8, 53.2)
Following are the arterial blood pressures, in millimeters of mercury (mm Hg), for a random sample of 16 children of diabetic mothers: (x-bar = 85.99 mm Hg, s = 8.08 mm Hg) (a) Apply the t-interval procedure to find a 95% confidence interval for the mean arterial blood pressure of all children of diabetic mothers. Interpret your result.
(81.69, 90.29) We can be 95% confident that the mean arterial blood pressure of all children of diabetic mothers is somewhere between 81.69 and 90.29 mm Hg.
Test statistic formula for independent samples (2 population means, equal standard deviations)
(Y-bars should show as x-bars)
Abigail Camp Dimon found the mean shell length of 461 randomly selected specimens of N. trivittata to be 11.9 mm. (a) assuming that ϑ = 2.5 mm, obtain a 90% confidence interval for the mean length, µ, of all N. trivittata. (b) interpret your answer from part (a), (c) What properties should a normal probability plot of the data have for it to be permissible to apply the procedure that you used in part (a)? (d) find the margin of error E. (e) Explain the meaning of E as far as the accuracy of the estimate is concerned. (f) Determine the sample size required to have a margin of error of 0.1 mm and a 90% confidence level. (g) Find a 90% confidence interval for µ if a sample of the size determined in part (f) yields a mean of 12.0 mm.
(a) (11.71, 12.09) (b) We can be 90% confident that the mean length of N. trivittata is somewhere between 11.71 and 12.09 mm. (c) Since the sample size is very large, the distribution of sample means will be approximately normal regardless of the shape of the original distribution. (d) 0.19 (e) We can be 90% confident that the maximum error made in using x-bar to estimate µ is 0.19 mm. (f)1692 (g) (11.90, 12.10)
A variable of a population has a mean of 266 and a standard deviation of 16. Ten observations of this variable have a mean of 262.1 and a sample standard deviation of 20.4. Obtain the observed value of the (a) standardized version of x-bar, (b) studentized version of x-bar.
(a) -0.77 (b) -0.605
For a t-curve with df=21, find each t-value: (a) The t-value having area 0.10 to its right (b) t (sub 0.01), (c) The t-value having area 0.025 to its left (Hint: a t-curve is symmetric about 0), (d) The two t-values that divide the area under the curve into a middle 0.90 area and two outside areas of 0.05.
(a) 1.323 (b) 2.518 (c) -2.080 (d) +/- 1.721
For a t-curve with df=6, use Table IV to find each t-value: (a) t (sub 0.10), (b) t (sub 0.025), (c) t (sub 0.001)
(a) 1.440 (b) 2.447 (c) 3.143
A confidence interval for a population mean has length 20. (a) Determine the margin of error. (b) If the sample mean is 60, obtain the confidence interval.
(a) 10 (b) (50, 70)
Professor Thomas Stanley of Georgia State University has surveyed millionaires since 1973. Among other information, Professor Stanley obtains estimates for the mean age, µ, of all U.S. millionaires. Suppose that one year's study involved a simple random sample of 36 U.S. millionaires whose mean age was 58.53 years with a sample standard deviation of 13.36 years. (a) If, for next year's study, a confidence interval for µ is to have a margin of error of 2 years and a confidence level of 95%, determine the required sample size. (b) Why did you use the sample standard deviation, s=13.36, in place of ϑ in your solution to part (a)? Why is it permissible to do so?
(a) 172 (b) We used s in place of ϑ because ϑ was unknown. We can do this because the sample of size 36 is large enough to provide an estimate of ϑ and the variation is not likely to change much from one year to the next.
For a t-curve with df = 18, obtain the t-values of the following: (a) The t-value having area 0.025 to its right (b) t (sub 0.05) (c) The t-value having area 0.10 to its left (d) The two t-values that divide the area under the curve into a middle 0.99 area and two outside 0.005 areas.
(a) 2.101 (b) 1.734 (c) -1.330 (d) +/- 2.878
Fill in the blanks: (Use the Book tables) For a right-tailed test with n=15, ∝ = 0.01 and a value of the test statistic of t = 3.458, the test statistic lies to the right of ____(a)____. This means the true p-value is (choose one: larger/smaller) ____(b)____ than ____(c)____.
(a) 2.977 (b) smaller (c) 0.005
A confidence interval for a population mean has a margin of error of 10.7. (a) Obtain the length of the confidence interval. (b) If the mean of the sample is 75.2, determine the confidence interval.
(a) 21.4 (b) (64.5, 85.9)
Suppose that you plan to apply the one-mean z-interval procedure to obtain a 90% confidence interval for a population mean, µ. You know that ϑ = 12 and that you are going to use a sample of size 9. (a) What will be your margin of error? (b) What else do you need to know in order to obtain the confidence interval?
(a) 6.58 (b) To obtain the confidence interval, you also need to know x-bar.
Suppose that a simple random sample is taken from a normal population having a standard deviation of 10 for the purpose of obtaining a 95% confidence interval for the mean of the population. (a) If the sample size is 4, obtain the margin of error. (b) Repeat part (a) for a sample size of 16. (c) Can you guess the margin of error for a sample size of 64? Explain your reasoning.
(a) 9.80 (b) 4.90 (c) It appears that quadrupling the sample size will halve the margin of error. Therefore, increasing n from 16 to 64 will decrease the margin of error from 4.90 to 2.45.
When to use the one-mean z-interval procedure
(a) For small samples - say, of size less than 15 - the z-interval procedure should be used only when the variable under consideration is normally distributed or very close to being so. (b) For samples of moderate size - say, between 15 and 30 - the z-interval procedure can be used unless the data contain outliers or the variable under consideration is far from being normally distributed. (c) For large samples - say, of size 30 or more - the z-interval procedure can be used essentially without restriction. However, if outliers are present and their removal is not justified, you should compare the confidence intervals obtained with and without the outliers to see what effect the outliers have. If the effect is substantial, use a different procedure or take another sample. (d) If outliers are present but their removal is justified and results in a data set for which the z-interval procedure is appropriate (as previously stated), the procedure can be used.
According to an FBI document, the mean value lost to purse snatching was $332 in 2002. For last year, 12 randomly selected purse-snatching offenses yielded the following values lost, to the nearest dollar: 207, 237, 422, 226, 272, 205, 362, 348, 165, 266, 269, 430. (a) use a t-test with either the critical-value approach or the P-value approach to decide, at the 5% significance level, whether last year's mean value lost to purse snatching has decreased from the 2002 mean. The mean and standard deviation of the data are $284.1 and $86.9, respectively. (b) Perform the required hypothesis test, using the Wilcoxon signed-rank test.
(a) H₀: µ = 332, H₁: µ < 332 ∝ = 0.05 test statistic: t = -1.909 Critical value = -1.782 Conclusion: reject H₀ Interpretation: At the 5% significance level, the data provides sufficient evidence to conclude that the mean value lost because of purse snatching has decreased from the 2002 mean of $332.00 (b) Critical value: 12(13)/2 - 61 = 17 W = 17 Since the W-value is less than or equal to the critical value, we reject the null hypothesis.
A credit bureau analysis of undergraduate student credit records found that the average number of credit cards in an undergraduate's wallet was 4.09. It was also reported that in a random sample of 132 undergraduates, the sample mean number of credit cards carried was 2.6. Assume the population standard deviation was 1.2. Is there enough evidence that the mean number of credit cards that undergraduates report carrying is less than the credit bureau's figure of 4.09? (a) Hypotheses: (b) ∝: (c) Test statistic: (d) P-value: (e) Conclusion: (f) Summary:
(a) H₀: µ = 4.09 H₁: µ < 4.09 (b) ∝ = .05 (not given) (c) Test statistic: z₀ = -14.27 (d) P-value: ≈ 0 (less than 0.0001) (e) Conclusion: Reject H₀ (f) Interpretation: There is enough evidence to suggest that the average number of credit cards in an undergraduate's wallet is less than 4.09.
A hot tub manufacturer advertises that with its heating equipment, a temperature of 100° F can be achieved in at most 15 minutes. A random sample of 25 tubs is selected, and the time necessary to achieve a 100° F temperature is determined for each tub. The sample average time and sample standard deviation are 17.5 min and 2.2 min, respectively. Does this information cast doubt on the company's claim? Carry out a hypothesis test with a significance level of 0.05. (a) Hypotheses: (b) ∝: (c) Test statistic: (d) p-value: (e) Conclusion: (f) Summary:
(a) H₀: µ ≤ 15 minutes H₁: µ > 15 minutes (b) ∝ = 0.05 (c) Test statistic: t₀ = 5.68 (d) p-value: < .0001 (≈.00000374) (e) Conclusion: Reject H₀ (f) Summary: There is enough evidence to suggest the average time for a hot tub to reach 100° F is greater than 15 minutes.
A certain pen has been designed to that true average writing lifetime under controlled conditions is at least 10 hours. A random sample of 18 pens is selected, the writing lifetime of each is determined, and a normal probability plot of the resulting data support the use of a one-sample z-test. The relevant hypotheses are H₀: µ = 10, and H₁: µ < 10. Use the p-value method. (a) If z = -1.8 and ∝ = 0.05 is selected, what conclusion is appropriate? (b) If Z = -2.4 and ∝ = 0.10 is selected, what conclusion is appropriate? (c) If z = - 0.89, what conclusion is appropriate?
(a) It is appropriate to reject H₀. (b) It is appropriate to reject H₀. (c) Do not reject H₀ (weak or no evidence - P-value > 0.10)
Suppose that you intend to find a 95% confidence interval for a population mean by applying the one-mean z-interval procedure to a sample of size 100. (a) What would happen to the precision of the estimate if you used a sample of size 50 instead but kept the same confidence level of 0.95? (b) What would happen to the precision of the estimate if you changed the confidence level to 0.90 but kept the same sample size of 100?
(a) Reducing the sample size from 100 to 50 will reduce the precision of the estimate (result in a longer confidence interval). (b) Reducing the confidence level from .95 to .90 while maintaining the sample size will increase the precision of the estimate (result in a shorter confidence interval).
Researchers at the University of Washington and Harvard University analyzed records of breast cancer screening and diagnostic evaluations. Discussing the benefits and downsides of the screening process, the article states that, although the rate of false-positives is higher than previously thought, if radiologists were less aggressive in following up on suspicious tests, the rate of false-positives would fall but the rate of missed cancers would rise. Suppose that such a screening test is used to decide between a null hypothesis of H₀: no cancer is present, and an alternative hypothesis of H₁: Cancer is present. (a) Would a false-positive (thinking cancer is present when in fact it is not) be a Type I or Type II error? (b) Describe a Type I error in the context of this problem and discuss the (real-life) consequences of making a Type I error. (c) Describe a Type II error in the context of this problem and discuss the (real-life) consequences of making a Type II error.
(a) Type I (b) The Type I error would be a false positive (thinking cancer is present when there is none) and would likely result in following up by the radiologists. There would be perhaps unnecessary additional tests which would increase costs. (c) A Type II error would be thinking that cancer is not present when in fact it is (a false negative). Thinking that no cancer is present, there would likely be no follow-up. This is the most serious type of error in the scenario presented because there could be cancers that are missed and not treated.
Consider the null and alternative hypotheses: H₀: µ = 30 lb (mean has not increased) H₁: µ > 30 lb (mean has increased)d where µ is last year's mean cheese consumption for all Americans. Explain what each of the following would mean: (a) Type I error (b) Type II error (c) correct decision Now suppose that the results of carrying out the hypothesis test lead to nonrejection of the null hypothesis. Classify that decision by error type or as a correct decision if in fact last year's mean cheese consumption (d) has not increased from the 2001 mean of 30.0 lb (e) has increased from the 2001 mean of 30 lb.
(a) a Type I error would occur if, in fact, µ = 30 lb, but the results of the sampling lead to the conclusion that µ > 30 lb. (b) a Type II error would occur if, in fact, µ > 30 lb, but the results of the sampling fail to lead to that conclusion (c) A correct decision would occur if, in fact, µ = 30 lb and the results of the sampling do not lead to the rejection of that fact; if, in fact µ > 30 lb and the results of the sampling lead to that conclusion (d) If, in fact, last year's mean consumption of cheese for all Americans has not increased over the 2001 mean of 30.0 lb, and we do not reject the null hypothesis that µ = 30 lb, we made a correct decision (e) If, in fact, last year's mean consumption of cheese for all Americans has increased over the 2001 mean of 30.0 lb, and we fail to reject the null hypothesis that µ = 30 lb, we made a Type II error.
For the confidence interval (18.8, 48): (a) Determine the margin of error E. (b) For a 95% confidence level, explain the meaning of E in this context in terms of the accuracy of the measurement.
(a) margin of error = 14.6 (b) We are 95% confident that the maximum error made in using x-bar to estimate µ is 14.6.
Decide in the following situations whether the z-test is an appropriate method for conducting the hypothesis test for a population mean: (a) no outliers, distribution highly skewed, sample size 24, (b) no outliers, mildly skewed, sample size 70
(a) not appropriate (b) appropriate
Fill in the blanks: (a) For a two-tail test, the ____(a)_____ is the probability of observing a value of the test statistic t that is at least as large in magnitude as the value actually observed, which is the area under the t-curve that lies outside the interval from _(b)_______ to ________(c)______.
(a) p-value (b) - |t₀| (c) |t₀|
With the following hypothesis test: (a) identify the variable, (b) identify the two populations, (c) determine the null and alternative hypotheses, (d) classify the hypothesis test as two tailed, left tailed, or right tailed: Samples of adolescent offspring of diabetic mothers (ODM) and non-diabetic mothers (ONM) were taken and evaluated for potential differences in vital measurements, including blood pressure and glucose tolerance. A hypothesis test is to be performed to decide whether the mean systolic blood pressure of ODM adolescents exceeds that of ONM adolescents.
(a) systolic blood pressure (b) ODM adolescents and ONM adolescents (c) H₀: µ₁ = µ₂ H₁: µ₁ > µ₂ * µ₁ = mean systolic bp of ODM adolescents * µ₂ = mean systolic bp of ONM adolescents (d) right-tailed
Use the nonpooled t-test and the nonpooled t-interval procedure to conduct the required hypothesis test and obtain the specified confidence interval: x-bar₁ = 10 x-bar₂ = 12 s₁ = 2 s₂ = 5 n₁ = 15 n₂ = 15 (a) two tailed test, ∝ = 0.05 (b) 95% confidence interval
(a) t = -1.44 critical values = +/- 2.101 do not reject H₀ (b) (-4.92, 0.92)
Use the nonpooled t-test and the nonpooled t-interval procedure to conduct the required hypothesis test and obtain the specified confidence interval: x-bar₁ = 20 x-bar₂ = 24 s₁ = 6 s₂ = 2 n₁ = 20 n₂ = 15 (a) left-tailed test, ∝ = 0.05 (b) 90% confidence interval
(a) t = -2.78 critical value = -1.711 reject H₀ (b) (-6.46, -1.54)
Use the nonpooled t-test and the nonpooled t-interval procedure to conduct the required hypothesis test and obtain the specified confidence interval: x-bar₁ = 20 x-bar₂ = 18 s₁ = 4 s₂ = 5 n₁ = 10 n₂ = 15 (a) right-tailed test, ∝ = 0.05 (b) 90% confidence interval
(a) t = 1.11 critical value = 1.717 do not reject H₀ (b) (-1.10, 5.10)
Using the test statistic formula for the z-score, determine the required critical value(s) for a two-tailed test with ∝ = 0.05.
-1.96 and +1.96
Calculation formula for the test statistic (z-score or t-score)
...
Formula for the confidence interval for µ (t-interval procedure)
...
Standardized version of x-bar
...
What p-value(s) describe strong evidence against the null hypothesis?
.01 < P ≤ .05
What is the appropriate z-value for a 50% confidence level?
.674 (Hint: invnorm (.25))
Determine the p-value. Z = 1.16 left-tailed test
0.8770
One-way
1 IV that has more than 2 Levels and also has more than 1 DV
What is the design concept for a One Way ANOVA?
1 IV, more than 2 Levels and more than 1 DV
In a study of computer use, 1000 randomly selected Canadian internet users were asked how much time they spend using the Internet in a typical week. The mean of the 1000 resulting observations was 12.7 hours. Assume the population standard deviation is 5 hours. Carry out a hypothesis test with a significance level of 0.05 to decide if there is convincing evidence that the mean time spent using the Internet by Canadians is greater than 12.5 hours.
1. H₀: µ = 12.5 hours, H₁: µ > 12.5 hours 2. ∝ = 0.05 3. test statistic: z = 1.26 4. critical value: 1.645 5. conclusion: Do not reject the null hypothesis. 6. interpretation: There is not enough evidence to conclude that the average time spent using the Internet by Canadians is greater than 12.5 hours.
A credit bureau analysis of undergraduate student credit records found that the average number of credit cards in an undergraduate's wallet was 4.09. It was also reported that in a random sample of 132 undergraduates, the sample mean number of credit cards carried was 2.6. Assume the population standard deviation as 1.2. Is there enough evidence that the mean number of credit cards that undergraduates report carrying is less than the credit bureau figure of 4.09?
1. H₀: µ = 4.08, H₁: µ < 4.09 2. ∝ = 0.05 (not given) 3. Test statistic: z = -14.26 4. Critical value: -1.645 5. Conclusion: Reject the null hypothesis. 6. There is sufficient evidence to conclude that the average number of credit cards that undergraduates report is less than the credit bureau's figure of 4.09.
What is the appropriate z-value for a 95% confidence level?
1.96
What is the appropriate t-value for the following confidence level and sample size: confidence level 95%, n=17
2.120
A manufacturer of college textbooks is interested in estimating the strength of the bindings produced by a particular binding machine. Strength can be measured by recording the force required to pull the pages from the binding. If this force is measured in pounds, how many books should be tested to estimate with 95% confidence to within 0.1 lb, the average force required to break the binding? Assume ϑ is known to be 0.8 lb.
246
Which would result in a wider confidence interval? 90% confidence level or 95% confidence level?
95% confidence level would result in a wider confidence interval. Increasing the confidence level increases the length of the confidence interval.
If you obtained one thousand 95% confidence intervals for a population mean, µ, roughly how many of the intervals would actually contain µ?
950
What mathematical signs are allowed in the null hypothesis?
=, ≤, ≥
How does a small test statistic relate to the area in the tail?
A small test statistic means that there is a larger area in the tail.
What Does Nonparametric Statistics Mean?
A statistical method wherein the data is not required to fit a normal distribution. Nonparametric statistics uses data that is often ordinal, meaning it does not rely on numbers, but rather a ranking or order of sorts. For example, a survey conveying consumer preferences ranging from like to dislike would be considered ordinal data. Nonparametric statistics have gained appreciation due to their ease of use. As the need for parameters is relieved, the data becomes more applicable to a larger variety of tests. This type of statistics can be used without the mean, sample size, standard deviation, or the estimation of any other related parameters when none of that information is available. The main weakness of nonparametric tests is that they are less powerful than parametric tests. They are less likely to reject the null hypothesis when it is false. When the assumptions of parametric tests can be met, parametric tests should be used because they are the most powerful tests available.
What is the difference between z-test,t-test and F-test?
A z-test is used for testing the mean of a population versus a standard, or comparing the means of two populations, with large (n . 30) samples whether you know the population standard deviation or not. It is also used for testing the proportion of some characteristic versus a standard proportion, or comparing the proportions of two populations. Example:Comparing the average engineering salaries of men versus women. Example: Comparing the fraction defectives from 2 production lines. A t-test is used for testing the mean of one population against a standard or comparing the means of two populations if you do not know the populations' standard deviation and when you have a limited sample (n < 30). If you know the populations' standard deviation, you may use a z-test. Example:Measuring the average diameter of shafts from a certain machine when you have a small sample. An F-test is used to compare 2 populations' variances. The samples can be any size. It is the basis of ANOVA.
Suppose that you have obtained a sample with the intent of performing a particular statistical-inference procedure. What should you do before applying the procedure to the sample data? Why?
Before applying a particular statistical inference procedure, we should look at graphical displays of the sample data to see if there appear to be any violations of the conditions required for the use of the procedure.
What determines "z" in the confidence interval formula?
Changing the confidence level.
What are the Components of an Anova?
Charted on the far left there is the Between, Within, and Total. along the top there is the SS, df, MS and F
How do you know whether to use the pooled t-test or the non-pooled t-test for 2-mean independent samples?
Compare the standard deviations of the two samples. If one is nearly twice the other, use the non-pooled t-test. If they are fairly close, use the pooled t-test.
What is the difference between a z-test and a t-test in Comparing the Proportions of Binary Variables?
Comparing Two Proportions Now, you may ask yourself: "What if I conduct the t-test to compare means of two binary variables?" or "What is the advantage of comparing proportions over comparing means (t-test)?" The simple answer is no big difference in case of a large sample size. Only difference between comparing means and proportions comes from the computation of denominators in the formula. The difference becomes smaller as the sample size increases. If N is sufficiently large, the t probability distribution and the binomial distribution are approximated to the normal distribution. Z-tests use the normal distribution and T-tests use the T-distribution (obviously). A Z-test is appropriate when your test statistic (the thing you are asking a question about) is normally distributed. This is the case when you are handling a sample proportion where N*P > 10 and N*(1-P) > 10. It is also the case when you are handling a sample mean where N > 25. A T-test is appropriate when your test statistic has a T-distribution. This is the case when you are handling a sample mean where N < 25. EDIT: I should note something rather pointless but still true. Regardless of the size of your sample, you should ALWAYS use a Z-test for a sample mean when you know the standard deviation of your population. However, in real life, you will never know the standard deviation of your population; you only know the standard deviation of your sample.
Suppose a CEO of a company wants to determine whether the average amount of wasted time during an 8-hour day for employees at the company is less than 120 minutes. A random sample of 10 employees gave these results: 108, 131, 112, 113, 117, 113, 130, 105, 111, 128 Assume ϑ = 9. Do these data provide evidence that the mean wasted time for this company is less than 120 minutes?
Conclusion: Do not reject H₀ Interpretation: There is not enough evidence to conclude that the average amount of wasted time at the company is less than 120 minutes.
When calculating post-hoc tests, how do you change your alpha level?
Divide the alpha level by the number of comparisons you are doing to find the new probability you need to hit to make your comparison statistically significant. Online ANOVA program does this for you, but if you're doing by hand, need to divide.
Does this pair comply with the rules for setting up hypotheses? If not explain why. H₀: µ = 10; H₁: µ > 12
Does not comply. H₁ must use the same number as H₀, the null hypothesis.
True or False (and give a reason for your answer): If a 95% confidence interval for a population mean, µ, is from 33.8 to 39.0, the mean of the population must lie somewhere between 33.8 and 39.0
False. We are 95% confident that the mean lies in the interval from 33.8 to 39.0, but about 5% of the time, the procedure will produce an interval that does not contain the population mean. Therefore, we cannot say that the mean must lie in the interval.
Relation between Type I and Type II error probabilities
For a fixed sample size, the smaller we specify the significance level, ∝, the larger will be the probability, β, of not rejecting a false null hypothesis.
What is the relationship between Type I and Type II Error probabilities?
For a fixed sample size, the smaller we specify the significance level, ∝, the larger will be the probability, β, of not rejecting a false null hypothesis.
The Wilcoxon signed rank test can be used to perform a hypothesis test for a population median, η, as well as for the population mean. Why is that so?
For a symmetric distribution, the mean and median are equal.
Cadmium, a heavy metal, is toxic to animals. Mushrooms, however, are able to absorb and accumulate cadmium at high concentrations. The Czech and Slovak governments have set a safety limit for cadmium in dry vegetables at 0.5 parts per million (ppm). A random sample of the edible mushroom Boletus pinicola with the resulting data: 0.24, 0.59, 0.62, 0.16, 0.77, 1.33, 0.92, 0.19, 0.33, 0.25, 0.59, 0.32 . At the 5% significance level, do the data provide sufficient evidence to conclude that the mean cadmium level in Boletus pinicola mushrooms is greater than the government's recommended limit of 0.5 ppm? Assume that the population standard deviation of cadmium levels in Boletus pinicola mushrooms is 0.37 ppm. (Note: The sum of the data is 6.31 ppm).
Given: significance level 0.05 ϑ = 0.37 H₀: µ = 0.5 ppm H₁: µ > 0.5 ppm Test statistic: z=0.24 Critical value: 1.645 Conclusion: 0.24 is in the non-rejection region Interpretation: There is not enough evidence to conclude that the mean level of cadmium in Boletus pinicola mushrooms is greater than 0.5 ppm.
In a one-way ANOVA, what is our Ho for comparing 3 levels of a factor?
Ho: X1=X2=X3
The mean charitable contribution per household in the U.S. in 2000 is $1623. A researcher claims that the level of giving has changed since then. State the null and the alternative hypotheses.
H₀: µ = $1623 H₁: µ ≠ $1623
What do you do if you find a significant difference?
If differences exist, then post-hoc tests are conducted which make comparisons between groups. You need to find where the particular differences reside. (Tukey, bonferroni...t-tests)
Decision criterion for a hypothesis test using the p-value
If the P-value is less than or equal to the specified significance level, reject the null hypothesis; otherwise, do not reject the null hypothesis.
two-tailed test
If the primary concern is deciding whether a population mean, µ, is different from a specified value ₀, we express the alternative hypothesis as: H₁: µ ≠ µ₀ A hypothesis test whose alternative hypothesis has this form is called a two-tailed test.
right-tailed test
If the primary concern is deciding whether a population mean, µ, is greater than a specified value µ₀, we express the alternative hypothesis as: H₁: µ > µ₀
right-tailed test
If the primary concern is deciding whether a population mean, µ, is greater than a specified value µ₀, we express the alternative hypothesis as: H₁: µ > µ₀ A hypothesis test whose alternative hypothesis has this form is called a right-tailed test.
left-tailed test
If the primary concern is deciding whether a population mean, µ, is less than a specified value µ₀, we express the alternative hypothesis as: H₁: µ < µ₀
left-tailed test
If the primary concern is deciding whether a population mean, µ, is less than a specified value µ₀, we express the alternative hypothesis as: H₁: µ < µ₀ A hypothesis test whose alternative hypothesis has this form is called a left-tailed test.
What is meant by 'confidence level' and why is it different than 'confidence interval'?
In statistics, a confidence interval (CI) is a particular kind of interval estimate of a population parameter. Instead of estimating the parameter by a single value, an interval likely to include the parameter is given. Thus, confidence intervals are used to indicate the reliability of an estimate. How likely the interval is to contain the parameter is determined by the confidence level or confidence coefficient. Increasing the desired confidence level will widen the confidence interval. A confidence interval is always qualified by a particular confidence level, usually expressed as a percentage; thus one speaks of a "95% confidence interval". The end points of the confidence interval are referred to as confidence limits.
For a fixed confidence level, how does increasing the sample size affect precision?
Increasing the sample size improves the precision.
what is the p value difference between One-tailed and two-tailed tests
It is easier to reject the null with a one-tailed test than two-tailed test. A one-tailed test is used when we predict the direction of the difference in advance (e.g. one mean will be larger than the other). With that assumption, the probability of incorrectly rejecting the null is only calculated from one tail of the distribution. In standard testing, the probability is calculated from both tails. Thus, the p-value from a two-tailed test ($p_2$) is twice the p-value of a one-tailed test ($p_1$). It is rarely correct to perform a one-tailed test; usually we want to test whether any difference exists.
Why is the Repeated Measures Design more powerful of a design?
It is more powerful specifically because of the Reduction in the amount of Error. So if you are rejecting the Null, it's most likely because your rejection is caused by a True Effect.
A simple random sample of size 100 is taken from a population with unknown standard deviation. A normal probability plot of the data displays significant curvature but no outliers. Can you reasonably apply the t-interval procedure? Explain your answer.
It is reasonable to use the t-interval procedure since the sample size is large and for large degrees of freedom (99), the t-distribution is very similar to the standard normal distribution. Another way of expressing this is that the sampling distribution of x-bar is approximately normal when n is large, so the standardized and studentized versions of x-bar are essentially the same.
When the error value gets smaller, the rejections region becomes ____, giving the ______ Hypothesis more power, leading to a _____ chance of rejecting the _____hypothesis and a _____ F Value.
Larger, Alternative, Larger, Null, Larger.
In the normal distribution or t distribution, what parts of the distribution represents "low probabilities"? In other words, what areas correspond with low probability?
Less than 5 percent (occurs 5 times out of 100)
F score
MS between groups / MS within groups *when F is high and probability is low, we conclude that the null hypothesis is wrong (Ho no differnce among groups), and a difference exists somewhere.
What is MS?
MS is mean squared. This answers the question, "How much does the treatment affect each group?"
What is the Z obt. formula?
Mean (X) - u ____________ Ox ____ square root of N
A variable of two populations has a mean of 40 and a standard deviation of 12 for one of the populations and a mean of 40 and a standard deviation of 6 for the other population. For independent samples of sizes 9 and 4 respectively, find the mean and standard deviation of x-bar₁ - x-bar₂.
Mean: 0 Standard Deviation: 5
df within:
N-k (total # of observations - total # of groups)
On the calculator (TI-84), how do you find the area to the left of a particular z-score?
NORMALCDF (-1000, Z-score, 0, 1)
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 13 is taken from a population. A normal probability plot of the sample data shows no outliers but has significant curvature. The population standard deviation is unknown.
Neither procedure should be used.
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 20 is taken from a population. A normal probability plot of the sample data shows three outliers but is otherwise roughly linear. Removal of the outliers is questionable. The population standard deviation is unknown.
Neither procedure should be used.
Is it better to plot graphs with SD or SEM error bars?
Neither. If you want to show the variation in your data: If your goal is to compare means with a t test or ANOVA, or to show how closely our data come to the predictions of a model, you may be more interested in showing how precisely the data define the mean than in showing the variability. In this case, the best approach is to plot the 95% confidence interval of the mean (or perhaps a 90% or 99% confidence interval). What about the standard error of the mean (SEM)? Graphing the mean with an SEM error bars is a commonly used method to show how well you know the mean, The only advantage of SEM error bars are that they are shorter, but SEM error bars are harder to interpret than a confidence interval. Whatever error bars you choose to show, be sure to state your choice. Noticing whether or not the error bars overlap tells you less than you might guess. If you want to create persuasive propaganda: If your goal is to emphasize small and unimportant differences in your data, show your error bars as SEM, and hope that your readers think they are SD If our goal is to cover-up large differences, show the error bars as the standard deviations for the groups, and hope that your readers think they are a standard errors. This approach was advocated by Steve Simon in his excellent weblog. Of course he meant it as a joke. If you don't understand the joke, review the differences between SD and SEM.
Must the variable under consideration be normally distributed for you to use the z-interval procedure or t-interval procedure? Explain your answer.
No. The z-interval procedure can be used almost anytime with large samples because the sampling distribution of x-bar is approximately normal for large n. The same is true for the t-interval procedure because when n is large, the t-distribution is very similar to the normal distribution. However, when n is small, especially when n is 15 or less, the z-interval and t-interval procedures will not provide reliable estimates if the distribution of the underlying variable is not normal. For sample sizes in the range of 15 to 30, both procedures can be used if the data is roughly normal and has no outliers.
When you have an effect, or real effect and it is small, we are more likely to fail to reject ____which leads to a type ____ error?
Null, type II
What p-value(s) describe very strong evidence against the null hypothesis?
P ≤ .01
What p-value(s) describe weak or no evidence against the null hypothesis?
P-value > .10
Consider the following quantities: µ, ϑ, x-bar, s. Which are parameters and which are statistics? Which are fixed numbers and which are variables?
Parameters: µ and ϑ Statistics: x-bar and s (sample standard deviation) Parameters are fixed numbers and statistics are variables.
Which of the following accurately defines a Type I Error?
Rejecting the Null (h 0) when the null (H 0) is true
What calculator function is used to find a confidence interval when ó is known?
STAT TESTS Z-interval
What calculator function is sued to find a confidence interval when ϑ is unknown?
STATS - TESTS - T-Interval
What is the difference between standard deviation and standard error of means
STD - NOT for CI, but for area under curve: empirical rule or the 68-95-99.7 rule SE - makes CI also has 68-95-99.7 rule
Of the 4 pooled t-test assumptions (simple random sample, independent samples, normal population or large samples, equal population standard deviations), how important is each of these?
Simple random samples and independent samples are essential assumptions. Moderate violations of the normality assumption are permissible even for small or moderate size samples. Moderate violations of the equal standard deviations requirement are not serious provided the two sample sizes are roughly equal.
What does "pooling" refer to, in the context of a hypotheses test for two population means?
Since we cannot use the population standard deviation as a basis for calculating the test statistic in a hypothesis test for two population means (because ϑ is unknown), we use sample information to estimate ϑ, the unknown population standard deviation. We first have to estimate the unknown population variance, ϑ², and to do so, we take the two sample variances (s₁² and s₂²) and pool (combine into one standard deviation Sp) by weighting them according to sample size (actually degrees of freedom).
Why do you need to know the sampling distribution of the difference between two sample means in order to perform a hypothesis test to compare two population means?
So that you can determine whether the observed difference between the two sample means can be reasonably attributed to sampling error or whether that difference suggests that the null hypothesis of equal population means is false and the alternative hypothesis is true.
Obtaining critical values
Suppose that a hypothesis test is to be performed at the significance level, ∝. Then the critical value(s) must be chosen so that, if the null hypothesis is true, the probability is ∝ that the test statistic will fall in the rejection region.
Wilcoxon Signed-Rank Test versus the t-test
Suppose that you want to perform a hypothesis test for a population mean. When deciding between the t-test and the Wilcoxon signed-rank test, follow these guidelines: (1) If you are reasonably sure that the variable under consideration is normally distributed, use the t-test. (2) If you are not reasonably sure that the variable under consideration is normally distributed but are reasonably sure that it has a symmetric distribution, use the Wilcoxon signed-rank test.
Choosing between a pooled and nonpooled t-procedure
Suppose you want to use independent simple random samples to compare the means of two populations. To decide between a pooled t-procedure and a nonpooled t-procedure, follow these guidelines: If you are reasonably sure that the populations have nearly equal standard deviations, use a pooled t-procedure; otherwise, use a nonpooled t-procedure.
Basic logic of hypothesis testing
Take a random sample from the population. If the sample data are consistent with the null hypothesis, do not reject the null hypothesis; if the sample data are inconsistent with the null hypothesis (in the direction of the alternative hypothesis), reject the null hypothesis and conclude that the alternative hypothesis is true.
In May, 2002, the average cost of a private room in a nursing home was $168 per day. For August 2003, a random sample of 11 nursing homes yielded the following daily costs, in dollars, for a private room in a nursing home: 73, 159, 199, 182, 192, 208, 181, 129, 182, 282, 250. Use the Wilcoxon signed-rank test to decide at the 10% significance level whether the average cost for a private room in a nursing home in August 2003 exceeded that in May 2002.
Test Statistic: W = 48 Reject the null hypothesis.
What is the "between" and the "within"?
The "Between" is the treatment or the variation of the means. The "Within" is the Error. any variation not caused by the treatment
Of 95% and 99% confidence levels, which will result in the confidence interval's giving a more precise estimate of µ?
The 95% confidence level because decreasing the confidence level improves the precision.
What happens when the experimental results are nonsignificant?
The NULL hypothesis has NOT been rejected
When to use the Univariate
The Univariate is used when we want to know if there is an interaction. An Interaction of the levels of 1, will effect the levels of another.
The logic behind the Wilcoxon Signed-Rank Test
The Wilcoxon signed-rank test is based on the assumption that the variable under consideration has a symmetric distribution - one that can be divided into two pieces that are mirror images of each other - but does not require that its distribution be normal or have any other specific shape.
What are the differences between z-distribution and t-distribution?
The Z distribution is just a special case of the normal distribution, with an idealized mean of 0 and standard deviation of 1. This allows us to create a compact and useful table for all normal distributions -- a very important property before the days of fancy calculators and computers. The t distribution is similar to the Z distribution, but is sensitive to sample size and is used for small samples, or moderate size samples when the population standard deviation is unknown. It is little different from Z for large sample sizes.
t-distribution
The ______ is typically used to study the mean of a population, rather than to study the individuals within a population. In particular, it is used in many cases when you use data to estimate the population mean — for example, using the sample mean of 20 homes to estimate the average price of all the new homes in California. Or when you use data to test someone's claim about the population mean — for example, is it true that the mean price of all the new homes in California is $500,000?
t-distribution
The _______ can be thought of as a cousin of the The can be thought of as a cousin of the standard normal distribution — it looks similar in that it's centered at zero and has a basic bell-shape, but it's shorter and flatter around the center than the Z-distribution. Its standard deviation is proportionally larger compared to the Z, which is why you see the fatter tails on each side.
To determine whether the pipe welds in a nuclear power plant meet specifications, a random sample of welds is selected and tests are conducted on each weld in the sample. Weld strength is measured as the force required to break the weld. Suppose that the specifications state that the mean strength of welds should exceed 100 lb/ in². The inspection team decides to test H₀: µ = 100 versus H₁: µ > 100. Explain why this alternative hypothesis was chosen rather than µ < 100.
The alternative hypothesis was chosen because the mean strength of welds should be greater than 100, thus the alternative hypothesis H₁: µ > 100. The primary concern of the research is to decide whether the population mean is greater than the specified value (and meets specifications).
alternative hypothesis
The alternative to the null hypothesis
Identify the assumption for using the two-means z-test and the two-means z-interval procedure that renders those procedures generally impractical.
The assumption that ϑ is known (because population standard deviations are usually unknown).
When is a confidence interval exact? When is a confidence interval approximately correct?
The confidence interval is exact for normal populations and is approximately correct for large samples from non-normal populations.
Explain the difference in the formulas for the standardized and the studentized version of x-bar.
The denominator of the standardized version of x-bar (z-score) uses the population standard deviation, ϑ, whereas the denominator of the studentized version of x-bar (t-score) uses the sample standard deviation, s.
Two t-curves have degrees of freedom, 12 and 20, respectively. Which one more closely resembles the standard normal curve? Explain your answer.
The df=20 because as the number of degrees of freedom increases, t-curves look increasingly like the standard normal curve.
How do you find between group SS? What does SSbg mean?
The grand mean is subtracted from each group mean. Differences are squared and multiplied by the # of subjects in the group. *the variability of each group to the grand mean.
null hypothesis
The hypothesis to be tested
Margin of error, precision, and sample size
The length of a confidence interval for a population mean, µ, and therefore the precision with which x-bar estimates µ, is determined by the margin of error, E. For a fixed confidence level, increasing the sample size improves the precision, and vice versa.
In the Wilcoxon signed rank test, if an observation is equal to µ₀, what should happen to that observation?
The observation is to be tossed out and the sample size reduced accordingly.
Which of the following assumptions is common to all parametric statistics?
The population of dependent scores, which must be interval or ratio scores, forms a normal distribution
What happens to the probability of committing a Type I error if the level of significance is changed from a = 0.01 to a= 0.05?
The probability of committing a Type I error will increase
Define significance level
The probability of making a Type I error, that is, of rejecting a true null hypothesis.
What is the p-value of a hypothesis test?
The probability of observing a value of the test statistic as extreme or more extreme than that observed. By extreme we mean "far from what we would expect to observe if the null hypothesis is true."
hypothesis test
The problem in a hypothesis test is to decide whether the null hypothesis should be rejected in favor of the alternative hypothesis.
non-rejection region
The set of values for the test statistic that lead us not to reject H₀
rejection region
The set of values for the test statistic that lead us to reject H₀ (tail or tails of the distribution)
Define non-rejection region
The set of values for the test statistic that leads to non-rejection of the null hypothesis.
Define rejection region
The set of values for the test statistic that leads to rejection of the null hypothesis.
How does the distributions of the standard and studentized versions of x-bar differ?
The studentized version has more spread (wider).
In the following problem, decide whether applying the t-test to perform a hypothesis test for the population mean appears reasonable. The Florida State Center for Health Statistics reported that, for cardiovascular hospitalizations, the mean age of women is 71.9 years. At one hospital, a random sample of 20 of its female cardiovascular patients had the following ages, in years: 75.9, 78,2, 88.2, 58.9, 83.7, 76.1, 78.9, 97.6, 87.3, 52.8, 81.7, 65.8, 74.5, 56.4, 54.4, 86.4, 82.5, 53.8, 52.7, 72.4.
The t-test is not reasonable because the sample is small (20) and a histogram of the data shows that the data is not normally distributed.
point estimate
The value of a statistic used to estimate a parameter
Define critical values
The values of the test statistic that separate the rejection and non-rejection regions. A critical value is considered part of the rejection region.
Explain why there is more variation in the possible values of the studentized version of x-bar than in the possible values of the standardized version of x-bar.
The variation in the possible values of the standardized version of x-bar is due only to the variation in x-bar, while the variation in the studentized version results not only from the variation in x-bar, but also from the variation in the sample standard deviation.
What does the z or t test statistic tell us?
The z or t test statistic tells us how far x-bar is from µ in standard deviations (i.e. the number of standard deviations from the mean). It is the statistic used as a basis for deciding whether the null hypothesis should be rejected.
How important is the normality assumption for the z-interval procedure?
The z-interval procedure works well when the variable is normally distributed and reasonably well if the variable is not normally distributed and the sample size is small or moderate, provided the variable is not too far from being normally distributed.
What is the difference between a z-test and a chi-square test
The z-test for proportions for independent samples is mathematically equivalent to a chi-square test of the difference between two proportions. It would be easier to run and report as a chi-square that you can run in proc freq. A dependent samples situation is a McNemar's test that can also be run using SAS.
Which of the following is correct regarding statistical hypotheses (null and alternative)?
They describe the population parameters our sample data represent if there is or is not a predicted relationship
Describe the meaning of P-value of a hypothesis test
To obtain the P-value of a hypothesis test, we assume that the null hypothesis is true and compute the probability of observing a value of the test statistic as extreme as or more extreme than that observed. By extreme we mean "far from what we would expect to observe if the null hypothesis is true." We use the letter P to denote the P-value. The p-value is the area beyond the test statistic in either direction.
True or False. The margin of error can be determined if you know only the length of the confidence interval.
True
True or False: The confidence interval can be obtained if you know only the margin of error and the sample mean.
True
True or False: For a fixed sample size, decreasing the significance level of a hypothesis test results in an increase in the probability of making a Type II error.
True. For a fixed sample size, the smaller you specify the significance level ∝, the larger will be the probability β, of not rejecting a false null hypothesis.
True or False: If it is important not to reject a true null hypothesis, the hypothesis test should be performed at a small significance level.
True. The significance level ∝ of a hypothesis test is the probability of making a Type I error (rejecting a true null hypothesis). If this is important, the lower the probability of making such an error the better; thus you should use a small significance level.
Critical value(s) for independent samples (2 population means, equal standard deviations)
Two tailed: +/- t (sub ∝/2), df = n₁ + n₂ - 2 Left tailed : - t (sub ∝), df = n₁ + n₂ - 2 Right tailed: t (sub ∝), df = n₁ + n₂ - 2
Critical value(s) for independent samples (2 population means, standard deviations not equal)
Two tailed: +/- t (sub ∝/2), df = Δ Left tailed : - t (sub ∝), df = Δ Right tailed: t (sub ∝), df = Δ
When to use the Multivariate
Use the multivariate to measure what makes up the construct.
If the total is correct, what two components when added together will have the same total?
When Between and Within are added together they will add up to the Total if the Total was computed correctly.
When is a one-tailed test used?
When a relationship is predicted and the direction in which the scores will change is predicted
When is a two-tailed test used?
When a relationship is predicted without stating the direction in which the scores will change
When does sampling error occur?
When a sample statistic is not equal to the population parameter as a result of chance factors
degrees of freedom
When statisticians use the term t-distribution, they aren't talking about just one individual distribution. There is an entire family of specific t-distributions, depending on what sample size is being used to study the population mean. Each t-distribution is distinguished by what statisticians call its ________. In situations where you have one population and your sample size is n, the degrees of freedom for the corresponding t-distribution is n - 1. For example, a sample of size 10 uses a t-distribution with 10 - 1, or 9, degrees of freedom, denoted t9 (pronounced tee sub-nine). Situations involving two populations use different degrees of freedom.
When would you use the Multivariate ANOVA?
When studying a construct. Because you are studying multiple things that make up just one idea.
When does the p-value provide evidence against the null hypothesis?
When the p-value is less than or equal to the significance level, ∝
When can the z-test be used in statistical hypothesis testing?
When the raw score population's standard deviation is known
Does it matter if my dependent variable is normally distributed?
When you are doing a t-test or ANOVA, the assumption is that the distribution of the sample means are normally distributed. One way to guarantee this is for the distribution of the individual observations from the sample to be normal. However, even if the distribution of the individual observations is not normal, the distribution of the sample means will be normally distributed if your sample size is about 30 or larger. This is due to the "central limit theorem " that shows that even when a population is non-normally distributed, the distribution of the "sample means" will be normally distributed when the sample size is 30 or more.
When would you use the Univariate ANOVA?
When you want to identify an interaction between things. (remember the grass reacts differently with sun and water vs. shade and water)
Why is an ANOVA different than a t-test?
With an ANOVA, we have more than one level. For example: a one-way ANOVA with one independent variable, the modality, and 4 levels of the independent variable. Modalities: - US - Ice - Massage - No modality
How do you find the within group SS? What does SSwg mean?
Within each group, the group mean is subtracted from each data point. Differences are squared and all SS are summed. *the variability of each individual to the mean of their respective group.
Mathematically, with a repeated measures test, which one do you partial out. The Between, the Within, or The BS?
You partial out the BS. (NOTE: I like to think of the BS not so much as "between subjects" but more as all the extra "bull sh*t" that isn't needed in the testing.)
What happens when you find your F value?
Your F score you calculate must be greater than your critical F value (looked up in the table) in order to be statistically significant.
What happens when you Partial The BS out?
Your error size gets smaller.
example of a one-way anova:
comparison of 4 different modailities on 4 different groups of subjects.
Inferential statistics are used to
decide whether sample data represent a particular relationship in the population
standard normal (or Z-distribution)
is the most common normal distribution, with a mean of 0 and standard deviation of 1
df between:
k-1 (where k is total # of groups)
F value:
looked up in the table- (numerator, denominator df) (between, within df)
For the confidence interval (5.289, 7.274), obtain the margin of error by: (a) taking half the length of the confidence interval (b) using the formula for E
margin of error = .94
ANOVA for independent samples, One-Way ANOVA
only one independent variable (1 factor, various levels) samples are independent of each other (not repeated)
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 17 is taken from a population. A normal probability plot of the sample data is found to be very close to linear (straight line). The population standard deviation is unknown.
t-interval procedure
Type II Error probability
the probability of a Type II error, denoted β - a Type II error occurs if the test statistic falls in the non-rejection region when in fact the null hypothesis is false.
significance level
the probability of making a Type I error, that is, of rejecting a true null hypothesis (denoted ∝)
In using a sampling distribution of means for statistical hypothesis testing, the mean of the sampling distribution will always equal..
the u described by H0
test statistic
the z-score (or t-score) that determines if an average is unusual or not
Purpose of an ANOVA?
to be able to make comparisons among several samples and ask if a difference exists somewhere between any two samples.
Suppose that a population is known to have a mean of 80; if a researcher predicts the experimental treatment will produce an increase in the population mean, then the null hypothesis of this test would state
u < or = to 80
When choosing between one-tailed and two-tailed tests,
use a one-tailed test only if you have a convincing reason for predicting the direction
Which of the following represents a Type I Error? We say that something
works when it really doesn't
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 128 is taken from a population. A normal probability plot of the sample data shows no outliers but has significant curvature. The population standard deviation is known.
z-interval procedure
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 50 is taken from a population. A normal probability plot of the sample data is found to be roughly linear. The population standard deviation is known.
z-interval procedure
When you are partialling out the variation, what type of test are you using?
A Repeated Measures test. You are mathematically subtracting out the variance.
How do you find total SS?
A grand mean is formed, and the grand mean is subtracted from each individual score. Differences are squared & summed.
What is the F value?
F = ANOVA
Which of the following accurately defines a Type II error?
Failing to reject the null (H0) when it is false
How do you calculate the test statistic for the w-test?
From a constructed chart using (1) observations, (2) difference between observation and µ₀, (3) absolute values of values calculated in step 2, (4) ranking of the absolute values, (5) signing ranks according to sign in step 2, (6) w = sum of positive ranks.
Researchers have postulated that because of differences in diet, Japanese children have a lower mean blood cholesterol level than U.S. children do. Suppose that the mean level for U.S. children is known to be 170. Let µ represent the true mean blood cholesterol level for Japanese children. What hypothesis should the researchers test? Give the null and alternative hypotheses.
H₀: µ = 170 (the null hypothesis is the hypothesis to be tested) H₁: µ < 170
Federal law requires that a jar of peanut butter labeled 32 oz. must contain at least 32 oz. A consumer advocate feels that a certain manufacturer is shorting customers by underfilling jars so that the mean content is less than 32 oz. State the null and alternative hypotheses.
H₀: µ ≥ 32 H₁: µ < 32 left tailed test
The power of a statistical test is the probability of
Rejecting a false null (h 0)
Which of the following is correct regarding alternative hypotheses?
They describe the population parameters represented by the sample data if the predicted relationship exists
What is the difference between ordinal, interval and ratio variables? Why should I care?
When you are doing a t-test or ANOVA, the assumption is that the distribution of the sample means are normally distributed. One way to guarantee this is for the distribution of the individual observations from the sample to be normal. However, even if the distribution of the individual observations is not normal, the distribution of the sample means will be normally distributed if your sample size is about 30 or larger. This is due to the "central limit theorem " that shows that even when a population is non-normally distributed, the distribution of the "sample means" will be normally distributed when the sample size is 30 or more. Q: What is the difference between ordinal, interval and ratio variables? Why should I care? FAQ# 1089 A: A categorical variable, also called a nominal variable, is for mutual exclusive, but not ordered, categories. For example, your study might compare five different genotypes. You can code the five genotypes with numbers if you want, but the order is arbitrary and any calculations (for example, computing an average) would be meaningless. A ordinal variable, is one where the order matters but not the difference between values. For example, you might ask patients to express the amount of pain they are feeling on a scale of 1 to 10. A score of 7 means more pain that a score of 5, and that is more than a score of 3. But the difference between the 7 and the 5 may not be the same as that between 5 and 3. The values simply express an order. Another example would be movie ratings, from * to *****. A interval variable is a measurement where the difference between two values is meaningful. The difference between a temperature of 100 degrees and 90 degrees is the same difference as between 90 degrees and 80 degrees. A ratio variable, has all the properties of an interval variable, and also has a clear definition of 0.0. When the variable equals 0.0, there is none of that variable. Variables like height, weight, enzyme activity are ratio variables. Temperature, expressed in F or C, is not a ratio variable. A temperature of 0.0 on either of those scales does not mean 'no temperature'. However, temperature in degrees Kelvin in a ratio variable, as 0.0 degrees Kelvin really does mean 'no temperature'. Another counter example is pH. It is not a ratio variable, as pH=0 just means 1 molar of H+. and the definition of molar is fairly arbitrary. A pH of 0.0 does not mean 'no acidity' (quite the opposite!). When working with ratio variables, but not interval variables, you can look at the ratio of two measurements. A weight of 4 grams is twice a weight of 2 grams, because weight is a ratio variable. A temperature of 100 degrees C is not twice as hot as 50 degrees C, because temperature C is not a ratio variable. A pH of 3 is not twice as acidic as a pH of 6, because pH is not a ratio variable. Does it matter for data analysis? The concepts are mostly pretty obvious, but putting names on different kinds of variables can help prevent mistakes like taking the average of a group of zip (postal) codes, or taking the ratio of two pH values. Beyond that, I don't see how putting labels on the different kinds of variables really helps you plan your analyses or interpret the results. Note that the categories are not as clear cut as they sound. What kind of variable is color? In a psychological study of perception, different colors would be regarded as nominal. In a physics study, color is quantified by wavelength, so color would be considered a ratio variable. What about counts? If your dependent variable is the number of cells in a certain volume, what kind of variable is that. It has all the properties of a ratio variable, except it must be an integer. Is that a ratio variable or not? These questions just point out that the classification scheme appears to be more comprehensive than it is. Read more about these problems. Variables like pH and the logEC50 don't really fall into any of these categories.
Type I Error probability
the probability of a Type I error, denoted ∝, also called the significance level of the hypothesis test
General question ANOVA proposes:
whether the effect variance (between groups) is greater than the error variance (within groups). if this is true, we expect a difference between samples in the experiement.
Decide whether the appropriate method for obtaining the confidence interval is the z-interval procedure, the t-interval procedure, or neither: A random sample of size 25 is taken from a population. A normal probability plot of the sample data shows three outliers but is otherwise roughly linear. Checking reveals that the outliers are due to recording errors. The population standard deviation is known.
z-interval procedure
Error variance:
"within group variance" we look to establish whether there is a real difference within a group.
A snack food company produces a 454 g bag of pretzels and insists that the mean net weight of the bags is 454 g. As part of its program, the quality assurance department periodically performs a hypothesis test to decide whether the packaging machine is working properly, that is, to decide whether the mean net weight of all bags packaged is 454 g. (1) Determine the null hypothesis for the hypothesis test. (2) Determine the alternative hypothesis for the hypothesis test. (3) Classify the hypothesis test as two tailed, left tailed, or right tailed.
(1) H₀: µ = 454 g (2) H₁: µ ≠ 454 g (3) two tailed
According to Bride's Magazine, getting married these days can be expensive when all costs are included. A simple random sample of 20 recent U.S. weddings yielded data on wedding costs in dollars (sum of data is $526,538). (1) use the data to obtain a point estimate for the population mean wedding cost, µ, of all recent U.S. weddings. (2) Is your point estimate in part (1) likely to equal µ exactly? Explain your answer.
(1) $26,326.9 (2) No. It is unlikely that a sample mean (x-bar) will exactly equal the population mean, µ. Some sampling error is to be anticipated.
The FAA estimated with 90% confidence that the mean flight time from Albuquerque NM to Dallas TX to be between 99 minutes and 107.8 minutes. Assume n = 9 and ϑ = 8. (1) Find the margin of error. (2) Find how many times larger must a sample size be to halve the margin of error.
(1) 4.4 (2) 4 (36/9)
If you reject the null hypothesis, which of the following may occur?
A correct decision or a Type I error
Which of the following tests is considered to be more powerful? One or two-tailed?
A one-tailed test
What is meant by saying that a statistical procedure is "robust"?
A statistical procedure that works reasonably well even when one of its assumptions is violated (or moderately violated) is called a robust procedure relative to that assumption.
Compare t-curves to the standard normal curve as the number of degrees of freedom becomes larger.
As the number of degrees of freedom becomes larger, t-curves look increasingly like the standard normal curve.
Which of the following is correct regarding experimental hypotheses?
They describe the predicted relationship we may or may not find in an experiment
Suppose you have randomly selected high school students to take a course to improve their SAT scores... the mean of their scores is not significantly different from the population mean of SAT scores for those who didn't take course.. Which is best conclusion?
We have obtained no convincing evidence that the course affects SAT scores
Which of the following is true of any one-sample experiment?
We must know the population mean under some condition of the independent variable other than the one being tested
The population mean for elementary tenacity test (ETT) is 150, with ox = 25. A recent study of elementary school children participating in a school lunch program obtained the following ETT scores. Use two-tailed z-test and a = 0.05
Z crit = + or - 1.96, Z obt = 1.77. Fail to reject Ha
Suppose you conduct an experiment and the Zobt. is statistically NONsignificant at -0.65. How would you correctly report this?
Z obt. = -0.65; p > 0.05
Suppose you conduct an experiment and your Z obt is statistically significant at +3.45. How would you correctly report this result?
Zobt = +3.45; p < 0.05
The physical fitness test (National level for girls) shows the national avg. for 10-year old girls on "right-angle push-ups" to be u = 13. Principal Smyth hypothesizes that her girls at Strong School are significantly above the national mean. If the Ox = 6 and the scores for nine girls are as reported here, compute Zobt and determine if Principal Smyth is right. Use a = .05
Zobt = 1.5, so Principal is wrong
What are the properties of the t-distribution?
a) centered at 0 (like the standard normal distribution) b) symmetric/bell-shaped c) wider than the normal distribution d) area under curve = 1 e) uses degrees of freedom (n-1) to find t-values
Which of the following is the probability of avoiding a Type I error?
p = (1 - a)
Which of the following is correct regarding the probability of making a Type I error?
p = a
The key difference between parametric and nonparametric procedures is that parametric procedures
require that certain stringent assumptions be met
When experimental results are significant, this means that the _____ hypothesis has been _____.
the NULL hypothesis has been REJECTED when significant
For a one-tailed test where the predicted value of the sample mean is larger than the population mean and a = 0.05, Z crit is always equal to...
+1.645
Find the area to the right of t=1.771 with df=13.
.05
Given: safety limit set for cadmium in dry vegetables at 0.5 ppm. Random sample of 12 Bp mushrooms, data obtained sums to 6.31 ppm. Find and interpret a 99% confidence interval for the mean cadmium level of Bp mushrooms. Assume a population standard deviation of cadmium levels in Bp mushrooms of 0.37 ppm.
0.251 ppm to 0.801 ppm We can be 99% confident that the mean cadmium level of all Bp mushrooms is somewhere between 0.251 ppm and 0.801 ppm.
What is the appropriate z-value for an 80% confidence level?
1.282
What is the appropriate z-value for an 85% confidence level?
1.44 (Hint: invnorm (.075))
What is the appropriate z-value for a 90% confidence level?
1.645
According to an article, the mean duration of imprisonment for 32 patients with chronic PTSD was 33.4 months. Assuming that ϑ = 42 months, determine a 95% confidence interval for the mean duration of imprisonment, µ, of all East German political prisoners with chronic PTSD. Interpret your answer in words.
18.8 to 48.0 months We can be 95% confident that the mean duration of imprisonment, µ, of all East German political prisoners with chronic PTSD is somewhere between 18.8 and 48.0 months.
Assume that the population standard deviation is known. Is it reasonable to use the z-interval procedure to obtain a confidence interval for the population mean under each of the following circumstances: (1) the sample data contains no outliers, the variable under consideration is roughly normally distributed, and the sample size is 20, (2) the distribution of the variable under consideration is highly skewed and the sample size is 20, (3) the sample data contains no outliers, the sample size is 250, and the variable under consideration is far from being normally distributed.
(1) Reasonable, because of the roughly normal distribution, sample size need not be greater than 30 and outliers do not exist that might call into question the normality assumption. (2) Not reasonable, because sample size is too small. (3) Reasonable because of the large sample size.
Suppose that a random sample of 50 bottles of a particular brand of cough medicine is selected and the alcohol content of each bottle is determined. Let µ denote the average alcohol content for the population of all bottles of the brand under study. Suppose that the sample of 50 results in a 95% confidence interval for µ of (7.8, 9.4). (1) Would a 90% confidence interval have been narrower or wider than the given interval?
(1) The 90% confidence interval would have been narrower because decreasing the confidence level from 95% to 90% will decrease the confidence interval. The value of Z∝/2 for a 90% confidence level (1.28) is smaller than the Z∝/2 value for the 95% confidence level (1.64); therefore the confidence interval would be narrower.
What are the two ways to create a narrower confidence interval?
(1) decrease ϑ (2) increase n (sample size)
What are the two ways to create a wider confidence interval?
(1) increase ϑ (2) decrease n (sample size)
Two intervals (114.4, 115.6) and (114.1, 115.9) are confidence intervals for µ=true average resonance frequency (in hertz) for all tennis rackets of a certain type. (1) What is the value of the sample mean resonance frequency? (2) The confidence level for one of these intervals is 90% and for the other it is 99%. Which is which, and how can you tell?
(1) x-bar = 115 hertz (2) the 90% confidence interval is (114.4, 115.6) and the 99% confidence interval is (114.1, 115.9). The confidence interval is larger for the 99% confidence level than for the 90% confidence level.
Consumer Reports provides information on new automobile models - including price, mileage ratings, engine size, body size, and indicators of features. A simple random sample of 35 new models yields data on fuel tank capacity, in gallons (sum of data is 664.9 gallons). (1) Find a point estimate for the mean fuel tank capacity of all new automobile models. (2) Determine a 95.44% confidence interval for the mean fuel tank capacity of all new automobile models. Assume ϑ=3.50 gallons. (3) How would you decide whether fuel tank capacities for new automobile models are approximately normally distributed? (4) Must fuel tank capacities for new automobile models be exactly normally distributed for the confidence interval that you obtained in part (2) to be approximately correct? Explain your answer.
(1) x-bar = 18.997 (2) (17.81, 20.18) We can be 95.44% confident that the mean fuel tank capacity of all 2003 automobile models is somewhere between 17.81 and 20.18 gallons. (3) obtain a normal probability plot of the data. (4) No, because the sample size is large.
Use the one-mean z-interval procedure to find a confidence interval for the mean of the population from which the sample was drawn: x-bar = 20 n = 36 ϑ = 3 confidence level = 95%
(19.0, 21.0)
Five hundred randomly selected working adults living in Calgary, Canada were asked how long, in minutes, their typical daily commute was. The resulting sample mean commute time was 28.5 minutes. Construct and interpret a 90% confidence interval for the mean commute time of working adult Calgary residents. (Assume the population standard deviation is 24.2 minutes).
(26.72, 30.28) We are 90% confident that the average daily commute time for working adults living in Calgary, Canada, will be between 26.72 minutes and 30.28 minutes.
Consider the following statement: If the process of selecting a sample of size 50 and then computing the corresponding 95% confidence interval is repeated 100 times, 95 of the resulting interval will include µ. Is this statement correct? Why or why not?
** This statement is correct. In the long run, after computing the corresponding 95% confidence interval many times, 95 of the resulting confidence intervals will include µ.
For a two-tailed test where a = 0.05, z crit is always equal to
+ or - 1.96
Find the confidence level and ∝ for a 90% confidence interval.
Confidence level = .90 ∝ = .10
Find the confidence level and ∝ for a 99% confidence interval.
Confidence level = .99 ∝ = .01
For the same value of ∝, are t-values greater than or less than z-values?
For the same value of ∝, t-values will be greater than z-values.
What is the relationship between confidence (high or low) and the length of a confidence interval?
Higher confidence means a wider interval. Lower confidence means a narrower interval.
When statisticians report that results are nonsignificant, the results are
not too unlikely to accept as representing the same population
What does the null SAMPLING distribution describe?
The SAMPLING distribution when the null hypothesis is TRUE
Which of the following describe(s) the situation if the relationship you are testing in your experiment really exists?
The alternative hypothesis
What does ∝ signify?
The area in both tails. Usually .10 or less
32) Which of the following statements does not belong?
The change in the dependent scores was produced by significant sampling error
Which of the following statements is true when H 0 is NOT rejected?
The data do not provide sufficient evidence of a relationship in nature
What is the relationship between the margin of error and the length of the confidence interval?
The margin of error is half the length of the confidence interval.
Relate the precision with which x-bar estimates µ with the size of the margin of error.
The more precision with which x-bar estimates µ, the smaller the margin of error.
Which of the following is called the hypothesis of "no difference"?
The null hypothesis
What is the difference in assumptions between the one-mean t-test and the one-mean z-test?
The one mean z-test - ϑ is known The one mean t-test - ϑ unknown
When statisticians report the results from an experiment are significant, this means the results
are too unlikely to accept as sampling error
Which would result in a wider confidence interval? n=100 or n=400
n=100 would result in a wider confidence interval. Decreasing the sample size increases the length of the confidence interval.
In a one-tailed test, Z obt is significant only if it lies
in the tail of the distribution beyond Z crit and has the same sign as Z crit
If a sample mean is different from a particular population u, we can conclude that the sample mean probably represents some other population or that
the sample mean occurred as a result of sampling error