BUSI CHAPTER 13 SMARTBOOK - BUSI 2305 - HW

¡Supera tus tareas y exámenes ahora con Quizwiz!

What is the term that is used for the proportion of the total variation in Y that is explained by the variation in X?

The Coefficient of Determination

What are we estimating when we use the "prediction interval" in conjunction with a regression line?

The value of Y for a given value of X.

For two variables the correlation coefficient is found to be nearly equal to zero. How would you describe the relationship between the two variables?

There is very little, if any, relationship between the variables.

The equation for the y-intercept of the regression line is: a = Y-bar- (b*X-bar). Match the variables to their descriptions.

a: the y-intercept b: the slope of the regression line Y-bar: mean of the dependent variable X-bar: mean of the independent variable

Which of the following is the equation for the slope of the regression line?

b = r (sY/sX)

The slope of the regression line is given by b = r (sY/sX). Match the variables to their description.

b: the slope sy: standard deviation of sample Y values sx: standard deviation of sample x values r: sample correlation coefficient

The formula for the correlation coefficient is r= Σ(X−X bar)(Y−Y bar)/[(n−1)sxsy] Match the variables to their description.

n: # of paired observations sx: standard deviation of x Y: one value of y Y bar: mean of the y-values

A scatter diagram in which the points move from the bottom left to the upper right would be characterized by what type of correlation coefficient?

positive

What symbol is used for the Pearson correlation coefficient, which shows the strength of the relation between two variables?

r

Which of the following symbolized the standard error of estimate?

sY⋅X

The equation for the standard error of estimate is: sy⋅x=Σ(y−y∧)2 / (√n−2) Match the variables to their descriptions.

sy*x: standard error of Y for given X y∧: estimated Y for a given X n-2: df: sample size minus 2 y: an observed value of Y

Which of the following is the test statistic for the correlation coefficient?

t = [√n−2] / [√1−r2]

The formula for the test of significance of the sample correlation is: t = r√n−2 / (1-r2). Match the variables to their description.

t: t-distribution test statistic. n-2: degrees of freedom r: sample correlation n: sample size

The equation for the test for the slope of a regression line is: t = (b−0)/sb. Match the variables and their description for this equation.

t: the test statistic n-2: degrees of freedom for t b: r(sY/sX) sb: standard error of the slope

How is the standard error of the estimate calculated from ANOVA information?

√MSE=SSE/(√n−2)

Assume you have obtained a regression model to predict the sales price of a house based on the house's square footage. If the standard error of the estimate was found to be $8,400, which of the following would be true?

95% of your predictions would be within $16,800 of the actual value

What is Correlation Analysis?

A group of techniques to measure the relationship between two variables.

A study of hours spent playing video games versus school grades found a correlation coefficient of r = -0.53. How would you describe this relationship?

A moderate negative correlation

An experiment of study times versus test scores found a correlation coefficient of r = 0.49. How would you describe this relationship?

A moderate positive correlation.

Which of the following is usually the first step in a correlation analysis?

Making a scatter diagram.

The correlation between wait time on a help line and customer satisfaction was found to be r = -0.85. What percentage of variation in customer satisfaction can be predicted by wait time using a regression line?

72% Reason: R²= -0.85² =0.72

How is the "prediction interval" used as a part of regression analysis?

It is used when predicting a particular value of Y for a given X.

If X is the amount a grocer spends on advertising and Y is his gross sales and the regression equation relating them is Y∧= 23044 + 10.4X, what is his predicted income if he spends $4000 on advertising?

$64,644

If X is the size (in square feet) of a home and Y is its sales price and the regression equation relating them is Y∧Y∧= $92,000 + 86X, what is the predicted sales price of a home when X=0? Assume that all homes used to build the model were between 1,800 and 2,500 square feet.

$92,000 Reason: X=0 means there is no home.

A line is drawn through the points on a scatter diagram. Which three of the following are not likely to be a least squares fit?

- Nearly all of the data points are below the line. - All of the data points are above the line. - The line passes through the largest and smallest data points.

What values can the correlation coefficient assume?

-1≤ r ≤ 1

In evaluating a regression equation, what does it mean if the standard error of estimate is small? Choose all that apply.

-The data is close to the regression line. -The predicted Y will have small error.

Which two of the following are statistics that regression analysis provides to evaluate the predictive ability of the regression equation?

-The standard error of the estimate. -The coefficient of determination.

In regression analysis it is assumed that for any given X the Y values are normally distributed (the "Normality" assumption). What else is assumed about these distributions? Choose all that apply.

-Their means lie on the regression line. -They are independent. -They have equal standard deviations.

Place the following steps in correlation analysis in the order that makes the most sense.

1. Make a scatter diagram 2. Calculate a correlation coefficient 3. Draw a least squares fit line

The correlation between the weight of an automobile and its gas mileage was found to be r = 0.77. What percentage of variation in mileage can be predicted from a car's weight using a regression line?

59% Reason: R²=0.77²=0.59

Assume you have obtained a regression model to predict the sales revenue based on marketing expenditures. If the standard error of the estimate was found to be $12,200, which of the following would be true?

68% of your predictions would be within $12,200 of the actual value.

What is the best definition of a regression equation?

An equation that expresses the linear relation between two variables. Reason: Y = a + bX

How is the "confidence interval" used as a part of regression analysis?

It is used when predicting the mean value of Y for a given X.

When testing a correlation, we use a null hypothesis about the population correlation. What question are we trying to answer with the test?

Could the sample correlation be r even though the population correlation is actually zero?

In order to properly apply regression analysis, what assumption must be made about the distribution of Y values?

For each X value, the Y values are normally distributed.

Which of the following is NOT an example of correlation analysis?

Hypothesis testing for equality of means.

A test for the slope of the regression line uses the hypotheses H0: β = 0, H1: β ≠ 0. What are we seeking to discover with this test?

If the regression line has predictive power for the dependent variable.

Which of these statements correctly describes the values that can be assumed by the correlation coefficient, r?

It can be any number from -1 to +1, inclusive.

What is the symbol for the coefficient of determination?

R2, equal to r squared.

How can you transform a non-linear relationship to better use correlation analysis?

Replace one or both variables with its log, square root, reciprocal, etc.

How is the coefficient of determination calculated from ANOVA information?

R²= SSR/ SS total = 1-(SSE/SS total)

Which of the following statistical distributions is used for the test for the slop of the regression equation

Student's t-distribution

In analyzing the strength of the relationship between two variables, what does the symbol r represent?

The Pearson correlation coefficient.

If the standard error of estimate for a regression line is large, what would you expect for the coefficient of determination?

The coefficient of determination should be small.

How do you calculate the coefficient of determination?

The coefficient of determination, R2, is the square of the correlation coefficient, r.

Why does it make sense that the prediction interval for Y would be wider than the confidence interval?

The confidence interval is for the mean of Y, and the prediction interval is for a single value.

What is the definition of the standard error of estimate?

The dispersion (scatter) of observed values around the line of regression for a given X.

The general form of the regression equation is written like this: [Y∧]= a + bX. Why is the dependent variable written as [Y∧] instead of just Y?

The hat is to emphasize that the equation estimates the Y-value for a given X

What are we estimating when we use the "confidence interval" in conjunction with a regression line?

The mean of the distribution of Y for a given X.

Which of the following is the correct null hypothesis for the test of a sample correlation?

The population correlation is zero. Reason: H0: ρ = 0, H1: ρ ≠ 0

Which do we expect to be larger, the confidence interval or the prediction interval, and why?

The prediction interval, because it is more accurate to predict a mean than to predict a single value.

What is the definition of the coefficient of determination?

The proportion of the total variation in Y that is explained by the variation in X.

A test for the slope of the regression line uses the hypotheses H0: β = 0, H1: β ≠ 0. If we reject H0 what have we demonstrated about the regression line?

The regression line has some power to predict the value of the dependent variable.

Which of the following is true regarding the standard error of the estimate?

The smaller the standard error of the estimate, the closer the points are to a straight line.

When a line is drawn on a scatter diagram using the least squares principal, what is the quantity that is minimized?

The sum of the squared difference between the line and the data points.

How can we use correlation analysis to explore non-linear relationships?

Transform one or both variables.

The general form of the regression equation is written like this: Y∧= a + bX. Match the variables to their description

Y∧: estimated Y value X: the independent variable a: the Y intercept b: slope of the line

Which of the following regression equations looks like it matches the scatter diagram below?

Y∧= 14.4 - 1.9X. Hint: contains negative slope. includes point (4,6)

Which of the following regression equations looks like it matches the scatter diagram below?

Y∧= 3.0 + 1.6X.

Which of the following is the formula for the correlation coefficient, r?

[Σ(X−Xbar)(Y−Ybar)] / [(n−1)sxsy]

Which of these is the equation for the y-intercept of the regression line?

a = Y-bar- (b*X-bar)


Conjuntos de estudio relacionados

Cognitive Psych. Chapter 4 (99-123) Memory

View Set

Tax Inefficiencies and Their Implications for Optimal Taxation

View Set

Study Sheet Test 1 Duel Credit Speech - Vivian Lee Ward

View Set

30 review questions on experience of loss, death, and grief

View Set

Chapter 31/Obstetrics and Neonatal Care/EMTB

View Set