SB 15 Jackson

¡Supera tus tareas y exámenes ahora con Quizwiz!

Suppose n students are asked to indicate which of the following sports they like best: basketball, baseball, football, or soccer. It is believed that 20% prefer basketball, 10% prefer baseball, 40% prefer football, and 30% prefer soccer. How would expected frequencies be calculated?

For each sport, their expected percentage would be multiplied by n.

When doing a goodness-of-fit test to compare the distribution of sample data to a normal distribution, we must know the population mean and standard deviation to calculate the normal frequencies. Choose the statements that correctly describe df (degrees of freedom).

If we use the sample mean and standard deviation, then df = k - 3. If the mean is known and sample standard deviation is estimated, then df = k - 2. If the mean and standard deviation are known, then df = k - 1.

What happens to the chi-square distribution as the degrees of freedom of the sample becomes larger?

It approaches the normal distribution in shape.

Which of the following are characteristics of the goodness-of-fit test?

It can be used with nominal or ordinal data. It compares an observed distribution to an expected distribution. Goodness-of Fit tests use the chi-square distribution.

How do you determine the degrees of freedom for a chi-square distribution?

It is k - 1, where k is the number of categories of data.

Explain why the chi-square statistic is always positive.

It uses the square of the difference in the observed and expected frequencies, which must be at least 0.

For all chi-square tests, the null is rejected when the observed frequencies differ substantially from the expected frequencies. What values of chi-square statistics support rejecting the null?

Large

The goodness-of-fit test can compare a distribution of data to a normal distribution. Why is this important?

Many of the tests we have studied assume that the data is from a normal population.

In statistics, what do we mean when we say that a test is "non-parametric"?

No assumption is made regarding the shape of the population distribution.

For a chi-square test with more than two categories, what is the accepted limitation on the use of a chi-square test?

No more than 20% of the cells can have expected frequencies less than 5.

How many chi-square distributions are there?

One for each possible degree of freedom (1, 2, 3, etc.).

How do the goodness-of-fit tests for equal, unequal, and normal frequency distributions differ from one another?

Only in the table of expected frequencies.

What is the shape of the chi-square distribution when the number the degrees of freedom is small?

Positively skewed (a long "tail" on the right).

Which one of the following is a non-parametric test?

The Goodness-of-Fit test

In a data set with only two cells (two frequencies), what is the accepted limitation on the use of a chi-square test?

The expected frequency in each cell must be at least 5.

How is the normality assumption that underlies tests such as the test of two means, ANOVA, and regression related to goodness-of-fit tests?

The goodness-of-fit test can check to see if the normality assumption is met.

If the chi-square statistic has a p-value of 0.008 in a contingency table analysis, what would you conclude concerning the two variables (rows and columns)?

The two variables are related.

For all chi-square tests, the rejection region is located where?

The upper tail

What do we mean when we say that there is a family of chi-square distributions?

There is a chi-square distribution for 1 degree of freedom, one for 2 degrees, 3 degrees, etc. but they all have similar properties.

Suppose we have a contingency table that shows gender (female, male) by favorite movie type (comedy, drama, action, horror). What would the null hypothesis be for this contingency table analysis?

There is no relationship between gender and movie type.

When the chi-square statistic is used to test the variables in a contingency table (i.e. rows and columns) for independence, what is the null hypothesis?

There is no relationship between the two variables.

A chi-square test should not be used if too many cells have expected frequencies below 5. What is the reason for this?

Too much weight is given to categories with relatively low expected frequencies.

If the chi-square statistic has a p-value of 0.82 in a contingency table analysis, What would you conclude concerning the two variables (rows and columns)?

We cannot reject the null that the variables are not related.

A contingency table test of independence gives a chi-square value to the right of the cutoff value. What is the conclusion?

We reject H0. There is evidence for a relationship between the two variables.

If a contingency table has r rows and c columns, how do you calculate the degrees of freedom for the chi-square statistic?

df = (r - 1)(c - 1)

If a contingency table has r rows and c columns, how do you calculate the degrees of freedom for the chi-square statistic? Multiple choice question.

df = (r - 1)(c - 1)

If the sample mean and standard deviation are used as estimates of the population parameters in order to perform a chi-square goodness-of-fit test to check the normality assumption, what are the degrees of freedom for the Χ2 distribution?

df = k - 1 - 2

A sample consists of n observations which are sorted into k frequency bins. What is the degrees of freedom for a chi-square goodness-of-fit test?

k - 1

When calculating the chi-square statistic, the difference in the observed frequency for a particular category and its expected frequency is squared and then divided by what?

the expected frequency for the category

What will happen in a goodness-of-fit test if too many of the cells have expected frequencies lower that 5?

χ2 will be inflated (larger), making it more likely that we will reject H0.

Describe the range of possible values for the chi-square distribution.

0 < Χ2 <∞

Suppose we have a contingency table that shows gender (female, male) by favorite movie type (comedy, drama, action, horror). If the table summarizes a total of 120 people, there are 60 females, and 50 people preferred comedy, what would the expected frequency be for the female/comedy cell?

25

Suppose that 125 students are asked which of four t-shirt designs they like best. If you want to test the null that there is no difference in the proportion of students preferring each design, what would the degrees of freedom be?

3

Suppose that 125 students are asked which of four t-shirt designs they like best. If you want to test the null that there is no difference in the proportion of students preferring each design, what would the degrees of freedom be? Multiple choice question. 4

3

Suppose we have a contingency table that shows gender (female, male) by favorite movie type (comedy, drama, action, horror). What would the degrees of freedom be for this contingency table analysis?

3


Conjuntos de estudio relacionados

MGMT 309 Test 3: Chapter 13—Managing Human Resources in Organizations

View Set

Fractions Number Line & Area Model

View Set

Theory, research and evidence based practice

View Set

Chapter 6: Identity, Access, and Account Management | 6.1-6.10

View Set