eco 4k ch 5-8 other mc q

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

In practical econometric applications,

, it is better to assume that the errors might be heteroskedastic unless you have compelling reasons to believe otherwise.

The adjusted Upper R squared, or Upper R overbar squared, is given by:

. 1 minus StartFraction n minus 1 Over n minus k minus 1 EndFraction StartFraction SSR Over TSS EndFraction .

Consider the following regression equation: Upper Y Subscript i Baseline equals beta 0 plus beta 1 Upper X Subscript i Baseline plus mu Subscript i, where Upper X Subscript i, Upper Y Subscript i, beta 0, beta 1, and mu Subscript i denote the regressor, the regressand, the intercept coefficient, the slope coefficient, and the error term for the i Superscript th observation, respectively. Which of the following is the formula for calculating the heteroskedastic-robust estimator of the variance of ModifyingAbove beta with caret 1?

. ModifyingAbove sigma with caret Subscript ModifyingAbove beta with caret 1 Superscript 2 Baseline equals StartFraction 1 Over n EndFraction times StartStartFraction StartFraction 1 Over n minus 2 EndFraction Summation from i equals 1 to n left parenthesis Upper X Subscript i minus Upper X overbar Subscript i right parenthesis squared ModifyingAbove mu with caret Subscript i Superscript 2 OverOver left bracket StartFraction 1 Over n EndFraction Summation from i equals 1 to n left parenthesis Upper X Subscript i minus Upper X overbar Subscript i right parenthesis right bracket squared EndEndFraction .

Suppose the population regression is in the form: ln Upper Y Subscript i Baseline equals beta 0 plus beta 1 Upper X Subscript i Baseline plus u Subscript i, where beta 0, beta 1 and u Subscript i represent the intercept, the slope coefficient of Upper X and the error term for the i Superscript th observation. What is the expected value of Upper Y Subscript i given Upper X Subscript i (Upper E left parenthesis Upper Y Subscript i Baseline vertial line Upper X Subscript i Baseline right parenthesis)?

. Upper E left parenthesis Upper Y Subscript i Baseline vertial line Upper X Subscript i Baseline right parenthesis equals e Superscript beta 0 plus beta 1 Upper X Super Subscript i Baseline Upper E left parenthesis e Superscript u Super Subscript i Baseline vertial line Upper X Subscript i Baseline right parenthesis.

Consider a regression with two variables, in which Upper X Subscript 1 i is the variable of interest and Upper X Subscript 2 i is the control variable. Conditional mean independence requires:

. Upper E left parenthesis mu Subscript i vertical line Upper X Subscript 1 i comma Upper X Subscript 2 i Baseline right parenthesis equals Upper E left parenthesis mu Subscript i vertical line Upper X Subscript 2 i Baseline right parenthesis.

The following OLS assumption is most likely violated by omitted variables bias:

. Upper E left parenthesis mu Subscript i vertical line Upper X Subscript i right parenthesis equals 0.

Which of the following is the mathematical statement for the conditional mean independence assumption?

. Upper E left parenthesis u Subscript i Baseline | Upper X Subscript 1 i Baseline comma ... comma Upper X Subscript ki Baseline comma Upper W Subscript 1 i Baseline comma ... comma Upper W Subscript r i Baseline right parenthesis equals Upper E left parenthesis u Subscript i | Upper W Subscript 1 i Baseline comma ... comma Upper W Subscript ri Baseline right parenthesis.

Consider the polynomial regression model of degree r, Upper Y Subscript i equals beta 0 plus beta 1 Upper X Subscript i plus beta 2 Upper X Subscript i Superscript 2 Baseline plusbulletbulletbulletplus beta Subscript r Baseline Upper X Subscript i Superscript r plus mu Subscript i. According to the null hypothesis that the regression is linear and the alternative that is a polynomial of degree r corresponds to:

. Upper H 0 : beta 2 equals 0 comma beta 3 equals 0 comma ... comma beta Subscript r equals 0 vs. Upper H 1 : at least one beta Subscript j not equals 0 comma j equals 2 comma ... comma r.

Which of the following statements is true?

. Upper R overbar squared can be negative when the regressors, taken together, reduce the sum of squared residuals by such a small amount that this reduction fails to offset the factor StartFraction n minus 1 Over n minus k minus 1 EndFraction .

A study tests the effect of earning a Master's degree on the salaries of professionals. Suppose that the salaries of the professionals (Upper S Subscript i) are not dependent on any other variables. Let Upper D Subscript i be a variable which takes the value 0 if an individual has not earned a Master's degree, and a value 1 if they have earned a Master's degree. What would be the regression model that the researcher wants to test?

. Upper S Subscript i equals beta 0 plus beta 1 Upper D Subscript i plus u Subscript i, i equals 1 comma ... comma n.

Which of the following specifies a nonlinear regression that model this shape?

. Upper Y Subscript i Baseline equals beta 0 plus beta 1 Upper X Subscript i Baseline plus beta 2 Upper X Subscript i Superscript 2 Baseline plus u Subscript i.

Which of the following is the modified regression so that hypothesis testing can be carried out using the t-statistic?

. Upper Y Subscript i Baseline equals beta 0 plus gamma 1 Edu Subscript 1 i Baseline plus beta 2 Upper W Subscript 1 i Baseline plus u Subscript i comma where gamma 1 equals beta 1 minus beta 2 and Upper W Subscript i Baseline equals Upper E du Subscript 1 i Baseline plus Exp Subscript 2 i Baseline nbsp.

A polynomial regression model is specified as:

. Upper Y Subscript i equals beta 0 plus beta 1 Upper X Subscript i plus beta 2 Upper X Subscript i Superscript 2 Baseline plusbulletbulletbulletplus beta Subscript r Baseline Upper X Subscript i Superscript r plus mu Subscript i.

In the model 1 n left parenthesis Upper Y Subscript i right parenthesis equals beta 0 plus beta 1 Upper X Subscript i plus mu Subscript i, the elasticity of Upper E left parenthesis Upper Y vertical line Upper X right parenthesis with respect to X is:

. beta 1 Upper X.

Consider the regression TestScore Subscript i equals beta 0 plus beta 1 STRsmall Subscript i plus beta 2 STRlarge Subscript i plus u Subscript i What expected signs of beta 1 and beta 2 are consistent with the statement above?

. beta 1 > 0 and beta 2 < 0.

Which of the following statements is true for the components of the homoskedasticity-only t-statistic testing the hypothesis Upper H 0 : beta 1 equals beta Subscript 1 comma 0 vs. Upper H 1 : beta 1 not equals beta Subscript 1 comma 0?

. left parenthesis ModifyingAbove beta with caret 1 minus beta Subscript 1 comma 0 right parenthesis follows a standard normal distribution while ModifyingAbove sigma with tilde Subscript ModifyingAbove beta with caret 1 Superscript 2 follows a chi-squared distribution with nminus2 degrees of freedom.

A "Cobb-Douglas" production function relates production (Q) to factors of production, capital (K), labor (L), raw materials (M), and an error term u using the equation Upper Q equals lambda Upper K Superscript beta 1 Baseline Upper L Superscript beta 2 Baseline Upper M Superscript beta 3 Baseline e Superscript u, where lambda, beta 1, beta 2, and beta 3 are production parameters. Taking logarithms of both sides of the equation yields ln left parenthesis Upper Q right parenthesis equals beta 0 plus beta 1 ln left parenthesis Upper K right parenthesis plus beta 2 ln left parenthesis Upper L right parenthesis plus beta 3 ln left parenthesis Upper M right parenthesis plus u. Suppose that you thought that the value of beta 2 was not constant, but rather increased when K increased. Which of the following regression functions captures this dynamic relationship?

. ln left parenthesis Upper Q right parenthesis equals beta 0 plus beta 1 ln left parenthesis Upper K right parenthesis plus beta 2 ln left parenthesis Upper L right parenthesis plus beta 3 ln left parenthesis Upper M right parenthesis plus beta 4 left bracket ln left parenthesis Upper L right parenthesis times ln left parenthesis Upper K right parenthesis right bracket plus u.

Consider the following least squares specification between test scores and the student-teacher ratio: ModifyingAbove Test Score with caret equals 557.8 plus 36.42 1 n left parenthesis Income right parenthesis According to this equation, a 1% increase in income is associated with an increase in test scores of:

0.36 points.

Consider the following regression output where the dependent variable is test scores and the two explanatory variables are the student-teacher ratio and the percent of English learners: ModifyingAbove Test Score with caret equals 698.9 minus 1.10 times STR minus 0.650 times PctEL. You are told that the t-statistic on the student-teacher ratio coefficient is 2.56. The standard error therefore is approximately:

0.43. (1.1/2.56)

A researcher is interested in the effect on test scores of computer usage. Using school district data, she regresses district average test scores on the number of computers per student. What are possible sources of bias for ModifyingAbove beta 1 with caret, the estimated effect on tests scores of increasing the number of computers per student? For each source of bias below, determine whether ModifyingAbove beta 1 with caret will be biased up or down.

1. Average income per capita in the district. If this variable is omitted, it will likely produce a(an) upward bias of the estimated effect on tests scores of increasing the number of computers per student. 2. The availability of computerized adaptive learning tools in the district. If this variable is omitted, it will likely produce a(an) upward bias of the estimated effect on tests scores of increasing the number of computers per student. 3. The availability of computer-related leisure activities in the district. If this variable is omitted, it will likely produce a(an) downward bias of the estimated effect on tests scores of increasing the number of computers per student.

The critical value of Upper F Subscript 4 comma infinity at the 5% significance level is:

2.37

Assume that you had estimated the following quadratic regression model: ModifyingAbove Test Score with caret equals 607.3 plus 3.85 Income minus 0.0423 Income squared If income increased from 10 to 11 ($10,000 to $11,000), then the predicted effect on test scores would be:

2.96.

You have estimated the relationship between test scores and the student-teacher ratio under the assumption of homoskedasticity of the error terms. The regression output is as follows: ModifyingAbove Test Score with caret equals 698.9 minus 2.28 times STR, and the standard error on the slope is 0.48. The homoskedasticity-only "overall" regression F-statistic for the hypothesis that the regression Upper R squared is zero is approximately:

22.56

Consider the estimated equation from your textbook: ModifyingAbove Test Score with caret equals 698.9 minus 2.28 times STR comma Upper R squared equals 0.051 comma SER equals 18.6Test Score=698.9−2.28×STR, R2=0.051, SER=18.6 (10.4) (0.52) The t-statistic for the slope is approximately:

4.38.

In what fraction of the samples would the value beta 1 = 0 be included in the 95% confidence interval for beta 1?

95% of the confidence intervals would contain the value beta 1 = 0.

Suppose n and k denote the sample size and number of regressors respectively. What degrees-of-freedom adjustment is required while calculating the standard error of the regression (SER) from the sum of squared residuals (SSR)?

A division by nminuskminus1.

A "Cobb-Douglas" production function relates production (Q) to factors of production, capital (K), labor (L), raw materials (M), and an error term u using the equation Upper Q equals lambda Upper K Superscript beta 1 Baseline Upper L Superscript beta 2 Baseline Upper M Superscript beta 3 Baseline e Superscript u, where lambda, beta 1, beta 2, and beta 3 are production parameters. Suppose that you have data on production and the factors of production from a random sample of firms with the same Cobb-Douglas production function. Which of the following regression functions provides the most useful transformation to estimate the model?

A logarithmic regression function.

Which of the following statements describes a way of determining the degree of the polynomial in X which best models a nonlinear regression? Let r denote the highest power of X that is included in the regression.

A way is to check if the coefficients in the regression equation associated with the largest values of r are equal to zero.

Which of the following statements best describe how the Upper R squared and the adjusted Upper R squared (Upper R overbar squared) should be interpreted in practice? (Check all that apply.)

An increase in the Upper R squared or Upper R overbar squared does not necessarily mean that an added variable is statistically significant. A high Upper R squared or Upper R overbar squared does not mean that there is no omitted variable bias.

The F-statistic for omitting BDR and Age from the regression is F = 0.09. Are the coefficients on BDR and Age statistically different from zero at the 1% level?

Because 0.09 is less than the critical value, the coefficients are not jointly significant at the 1% level

Let Female be an indicator variable that is equal to 1 for females and 0 for males. A regression of the logarithm of earnings onto Female yields ModifyingAbove ln left parenthesis Earnings right parenthesis with caret equals 6.48 minus 0.44 Female comma SER equals 2.65 (0.01) (0.05) The estimated coefficient on Female is -0.44. Explain what this value means.

Both A and C are correct. ln(Earnings) for females are, on average, 0.44 lower than men's. Earnings for females are, on average, 44% lower than men's.

You have estimated a linear regression model relating Y to X. Your professor says, "I think that the relationship between Y and X is nonlinear." How would you test the adequacy of your linear regression? (Check all that apply)

Compare the fit between of linear regression to the non-linear regression model. If adding a quadratic term, you could test the hypothesis that the estimated coefficient of the quadratic term is significantly different from zero.

Which of the following describes how to test the null hypothesis that beta 1 equals 0 and beta 2 equals 0?

Compute the standard errors, the correlation between beta 1 and beta 2, the F-statistic, and the p-value associated with the F-statistic. Reject the null hypothesis if the p-value is less than some relevant significance level.

Consider the following multiple regression model Upper Y Subscript i equals beta 0 plus beta 1 Upper X Subscript 1 i Baseline plus beta 2 Upper X Subscript 2 i Baseline plus u Subscript i Which of the following describes how to test the null hypotheses that either beta 1 equals 0 or beta 2 equals 0?

Compute the standard errors, the t-statistic and the p-value for each, beta 1 and beta 2. Reject the null hypothesis if the p-value is less than some relevant significance level.

What are the limitations of the Gauss-Markov theorem? (Check all that apply.)

Even if the conditions of the theorem hold, it is possible that under some conditions, other estimators that are not linear or conditionally unbiased may be more efficient than OLS. If the error term is heteroskedastic, then the OLS estimator is not BLUE.

The coefficient on Female is nowminus0.28. Why has it changed from the first regression?

Female is correlated with the two new included variables.. MarketValue is important for explaining ln(Earnings). The first regression suffered from omitted variable bias. All of the above.

Consider the following multiple regression model Upper Y equals beta 0 plus beta 1 Upper X 1 plus beta 2 Upper X 2 plus u Which of the following explains why two perfectly collinear regressors cannot be included in a linear multiple regressions? (Check all that apply)

For the case of two regressors and homoskedasticity, it can be shown mathematically that the variance of the estimated coefficient ModifyingAbove beta 1 with caret goes to infinity as the correlation between Upper X 1 and Upper X 2 goes to one Intuitively, if one regressor is a linear function of another, OLS cannot identify the partial effect of one while holding the other constant.

Consider the following multiple regression model Upper Y equals beta 0 plus beta 1 Upper X 1 plus beta 2 Upper X 2 plus u Which of the following explains why it is difficult to estimate precisely the partial effect of Upper X 1, holding Upper X 2 constant, if Upper X 1 and Upper X 2 are highly correlated?

For the case of two regressors and homoskedasticity, it can be shown mathematically that the variance of the estimated coefficient ModifyingAbove beta 1 with caret goes to infinity as the correlation between Upper X 1 and Upper X 2 goes to one. Intuitively, estimating the partial effect of Upper X 1 holding Upper X 2 constant becomes difficult because after controlling for Upper X 2, there is little variation left to precisely estimate the partial effect of Upper X 1.

What is the effect on Upper R squared and SSR if the coefficient of the added regressor is exactly 0?

If the coefficient of the added regressor is exactly 0, the Upper R squared and SSR both do not change.

Suppose you have developed and estimated a base specification and a list of alternative specifications. Which of the following statements are true? (Check all that apply.)

If the estimates of the coefficients of interest are numerically similar across the alternative specifications, then this provides evidence that the estimates from your base specification are reliable. If the estimates of the coefficients of interest change substantially across specifications, this provides evidence that the original specification had omitted variable bias and so might your alternative specifications.

Which of the following statements describe what the Gauss-Markov theorem states?

If the three least square assumptions hold and if errors are homoskedastic, then the OLS estimator of a given population parameter is the most efficient linear conditionally unbiased estimator.

Suppose that the errors in a regression model are heteroskedastic.

In large samples, the probability that a confidence interval constructed as plus or minus1.96 homoskedasticity-only standard errors contains the true value of the coefficient will not be 95%.

Which of the following statements are true regarding the specifications given in the above table? (Check all that apply.)

In the 3 Superscript rd specification, F-statistic testing the hypothesis that the coefficients on Upper F squared and Upper F cubed are zero is rejected at the 1% significance level. The value of Upper R overbar squared in the 1 Superscript st specification suggests that the number of cows alone explains 42% of the variation in the production of milk.

Suppose the population multiple regression model is of the form: Upper Y Subscript i equals beta 0 Upper X Subscript 0 i Baseline plus beta 1 Upper X Subscript 1 i Baseline plus beta 2 Upper X Subscript 2 i Baseline plus u Subscript i comma where Upper X Subscript 0 iequals1, i = 1,....,n.

In the given model, beta 0 is the constant term and Upper X Subscript 0 i is the constant regressor.

How would you change the regression if you suspected that the effect of experience on earnings was different for men than for women?

Include interaction terms FemaletimesPotential experience and Femaletimes(Potential experience)squared.

Why is it impossible to compute OLS estimators in the presence of perfect multicollinearity?

It is impossible to compute OLS estimators in the presence of perfect multicollinearity because it produces division by 0.

Which of the following statements are true in describing the Bonferroni method of testing hypotheses on multiple coefficients? (Check all that apply.)

It modifies the "one-at-a-time" method so that it uses different critical values that ensure that its size equals its significance level. Its advantage is that it applies very generally.

Assume a father's weight is correlated with his years of eduction, but is not a determinant of the child's years of formal education. Which of the following statements describes the consequences of omitting the father's weight from the above regression?

It will not result in omitted variable bias because the omitted variable, weight, is not a determinant of the dependent variable.

In which of the following cases would the weighted least squares estimator (WLS) or the least absolute deviations estimator (LAD) be preferred to the OLS estimator? (Check all that apply.)

LAD is preferred to OLS if extreme outliers are not rare in the data. WLS is preffered to OLS if the errors are heteroskedastic.

Suppose you want to test the null hypothesis that beta 1 equals 0 and beta 2 equals 0. Is the result of the joint test implied by the result of the two separate tests?

No.

You are interested in beta 1, the causal effect of Upper X 1 on Y. Suppose that Upper X 1 and Upper X 2 are uncorrelated. You estimate beta 1 by regressing Upper X 1 (so that Upper X 2 is not included in the regression). Does this estimator suffer from omitted variable bias?

No.

Supposed you learned that Upper Y Subscript i and Upper X Subscript i were independent. Would you be surprised?

No, I wouldn't be surprised because the null hypothesis that beta 1 is zero was not rejected at the 5% significance level.

Which of the following statements correctly describes the omitted variable bias?

Omitted variable bias arises when the omitted variable is correlated with a regressor and is a determinant of the dependent variable.

Suppose the population regression function is given by: Upper Y Subscript i Baseline equals beta 0 plus beta 1 Upper X Subscript 1 i Baseline plus beta 2 Upper X Subscript 2 i Baseline plus ... plus beta Subscript k Upper X Subscript ki plus u Subscript i Baseline comma i equals 1 comma ... comma n comma where u Subscript i is the error term and beta 1 comma ... comma beta Subscript k, are the coefficients on the X's. Which of the following statements is not true about the omitted variable bias?

Omitted variable bias violates the assumption that Upper X Subscript 1 i Baseline comma ... comma Upper X Subscript ki are i.i.d.

In which of the following scenarios does perfect multicollinearity occur?

Perfect multicollinearity occurs when one of the regressors is a perfect linear function of the other regressors.

Is the coefficient on BDR statistically significantly different from zero?

Since the t-statistic < 1.96, the coefficient on BDR is not statistically significantly different from zero.

Imagine that you were told that the t-statistic for the slope coefficient of the regression line ModifyingAbove Test Score with caret equals 698.9 minus 2.28 times STR was 4.38. What are the units of measurement for the t-statistic?

Standard deviations.

Are the OLS estimators likely to be biased and inconsistent?

The OLS estimators are likely biased and inconsistent because there are omitted variables correlated with parking lot area per pupil that also explain test scores, such as ability.

Which of the following statements describes the mathematical implications of heteroskedasticity?

The OLS estimators remain unbiased, consistent, asymptotically normal, but do not necessarily have the least variance among all estimators that are linear in Upper Y 1 comma .... comma Upper Y Subscript n conditional on Upper X 1 comma ... comma Upper X Subscript n.

The SER is 2.65. Explain what this value means.

The error term has a standard deviation of 2.65 (measured in log-points).

The error term is homoskedastic if the variance of the conditional distribution of mu Subscript i given Upper X Subscript i is constant for i equals 1. ... n, and in particular does not depend on Upper X Subscript i.

If the error term is homoskedastic, the F-statistic can be written in terms of the improvement in the fit of the regression. How can we measure this improvement in fit? (Check all that apply.)

The improvement in the fit of the regression can be measured by the decrease in sum of squared residuals (SSR) The improvement in the fit of the regression can be measured by the increase in the regression Upper R squared.

Which of the following statements is correct?

The larger the correlation between Upper X 1 and Upper X 2, the larger the variance of ModifyingAbove beta with caret 1. Nevertheless, it is best include Upper X 2 in the regression if it is a determinant of Y.

Suppose that Upper Y Subscript i and Upper X Subscript i are independent and many samples of size n = 495 are drawn and regressions estimated. Suppose that you test the null hypothesis that beta 1 is zero at the 5% level and construct a 95% confidence interval for beta 1. In what fraction of the samples would the null hypothesis that beta 1 is zero at the 5% level be rejected?

The null hypothesis would be rejected in 5% of the samples.

Which of the following statements correctly describe the reasons behind the differences observed in the coefficients in the given specifications? (Check all that apply.)

The number of CCTV cameras installed in the district appears to be redundant. As reported in regression (4), adding it to regression (2) has a negligible effect on the estimated coefficients on LEOP and DTP or their standard errors. The significant rise in the coefficient on LEOP from the 1 Superscript st specification to the 4 Superscript th shows the presence of omitted variable bias in the 1 Superscript st specification.

Which of the following statements are true about the explanatory variables used in this study? (Check all that apply.)

The regression does not fall into the dummy variable trap due to the absence of the intercept term. The estimated coefficients will be jointly normally distributed.

Why are the answers to Scenario A and Scenario B different?

The regression is nonlinear in experience.

Suppose that crime rate is positively affected by the fraction of young males in the population, and that counties with high crime rates tend to hire more police. Use the following expression for omitted variable bias to determine whether the regression will likely overdash or underestimate the effect of police on the crime rate. ModifyingAbove beta with caret 1 right arrow Subscript p Baseline beta 1 plus rho Subscript Xu StartFraction sigma Subscript u Over sigma Subscript Upper X EndFraction

The regression will likely overestimate beta 1. That is, ModifyingAbove beta 1 with caret is likely to be larger than beta 1.

Why is the regressor West omitted from the regression? What would happen if it was included?

The regressor West is omitted to avoid perfect multicollinearity. If West is included, then the OLS estimator cannot be computed in this situation.

Which of the following economic relationships may exhibit a shape like this? (Check all that apply)

The relationship between income and fertility. The relationship between wage earnings and years of experience. The relationship between time spent studying for an exam and grade for such exam.

Using 143 observations, assume that you had estimated a simple regression function and that your estimate for the slope was 0.04, with a standard error of 0.01. You want to test whether or not the estimate is statistically significant. Which of the following decisions is the only correct one?

The slope is statistically significant since it is four standard errors away from zero.

Which of the following statements is NOT true about the Upper R overbar squared? (Check all that apply.)

The value of Upper R overbar squared increases when a new regressor is added to the regression equation. The value of Upper R overbar squared always lies between 0 and 1.

A researcher plans to study the causal effect of police crime using data from a random sample of U.S. counties. He plans to regress the county's crime rate on the (per capita) size of the country's police force. Why is this regression likely to suffer from omitted variable bias?

There are other important determinants of a country's crime rate, including demographic characteristics of the population, that if left out of the regression would bias the estimated partial effect of the (per capita) size of the county's police force.

A researcher tries to estimate the regression TestScore Subscript i equals beta 0 plus beta 1 STRsmall Subscript i plus beta 2 STRmoderate Subscript i Baseline plus beta 3 STRlarge Subscript i plus u Subscript i and finds that her computer crashes. Why?

There is perfect multicollinearity between the regressors.

Suppose that a set of control variables do not satisfy the conditional mean independence condition, Upper E left parenthesis u Subscript i Baseline vertial line Upper X Subscript i comma Upper W Subscript i right parenthesis equals Upper E left parenthesis u Subscript i vertial line Upper W Subscript i right parenthesis, where Upper X Subscript i denotes the variable or variables of interest and Upper W Subscript i denotes one or more control variables. If the OLS estimators are jointly normally distributed and each ModifyingAbove beta with caret Subscript j is distributed Upper N left parenthesis beta Subscript j Baseline comma sigma Subscript ModifyingAbove beta with caret Subscript j Superscript 2 Baseline right parenthesis, j equals 0,..., k, then which of the following statements holds true in this case?

There will remain omitted determinants of Y that are correlated with X, even after holding W constant, and the result is omitted variable bias.

Suppose the population regression is of the form: Upper Y Subscript i equals beta 0 plus beta 1 Upper X Subscript 1 i Baseline plus beta 2 Upper X Subscript 2 i Baseline plus u Subscript i.

The 95% confidence set for the coefficients beta 1 and beta 2 will be an ellipse which contains the pairs of values of beta 1 and beta 2 that cannot be rejected using the F-statistic at the 5% significance level.

In a multiple regression, the interaction term between its two independent variables Upper X 1 and Upper X 2 is their product Upper X 1timesUpper X 2. The coefficient on (Upper X 1timesUpper X 2) is the effect of a one-unit increase in the product of Upper X 1 and Upper X 2, above and beyond the sum of the individual effects of a unit increase in Upper X 1 alone and a unit increase in Upper X 2 alone.

This holds true whether Upper X 1 and divided by or Upper X 2 are continuous or binary.

Consider the following regression equation: Upper Y Subscript i equals beta 0 plus beta 1 Upper X Subscript i plus beta 2 left parenthesis Upper X Subscript i times Upper D Subscript i right parenthesis plus u Subscript i, where beta 0, beta 1, beta 2, and u Subscript i are the intercept, the slope coefficient on Upper X Subscript i, the coefficient on the interaction term which is the product of (Upper X Subscript i timesUpper D Subscript i), where Upper D Subscript i is the binary variable respectively.

This regression equation has a different slope and the same intercept for the two values of the binary variable.

If the errors are heteroskedastic, then:

WLS is BLUE if the conditional variance of the errors is known up to a constant factor of proportionality.

Suppose you are interested in investigating the wage gender gap using data on earnings of men and women. Which of the following models best serves this purpose?

Wage equals beta 0 plus beta 1 Female plus u, where Female (=1 if female) is an indicator variable and u the error term.

Consider the regression model Wage equals beta 0 plus beta 1 Female plus u Where Female (=1 if female) is an indicator variable and u the error term. Identify the dependent and independent variables in the regression model above.

Wage is the dependent variable, while Female is the independent variable.

Which of the following statements is true?

We would reject the null hypothesis if the sum of squared residuals (SSR) from the unrestricted regression is sufficiently smaller than that from the restricted regression.

Does this imply that age is an important determinant of earnings?

Yes, age is an important determinant of earnings because the low p-value implies that the coefficient on age is statistically significant at the 1% level.

All of the following are true with the exception of one condition:

a high Upper R squared or Upper R overbar squared always means that an added variable is statistically significant.

Consider the population regression of log earnings [Upper Y Subscript i, where Upper Y Subscript i = ln(Earningsi)] against two binary variables: whether a worker is married (Upper D Subscript 1 i, where Upper D Subscript 1 i = 1 if the ith person is married) and the worker's gender (Upper D Subscript 2 i, where Upper D Subscript 2 i = 1 if the ith person is female), and the product of the two binary variables Upper Y Subscript i equals beta 0 plus beta 1 Upper D Subscript 1 i Baseline plus beta 2 Upper D Subscript 2 i Baseline plus beta 3 left parenthesis Upper D Subscript 1 i Baseline times Upper D Subscript 2 i Baseline right parenthesis plus mu Subscript i. The interaction term:

allows the population effect on log earnings of being married to depend on gender.

Under the least squares assumptions for the multiple regression problem (zero conditional mean for the error term, all Upper X Subscript i and Upper Y Subscript i being i.i.d., all Upper X Subscript i and mu Subscript i having finite fourth moments, no perfect multicollinearity), the OLS estimators for the slopes and intercept:

are unbiased and consistent.

The interpretation of the slope coefficient in the model 1 n left parenthesis Upper Y Subscript i right parenthesis equals beta 0 plus beta 1 1 n left parenthesis Upper X Subscript i right parenthesis plus mu Subscript i is as follows:

a 1% change in X is associated with a beta 1% change in Y.

In the multiple regression model, the t-statistic for testing that the slope is significantly different from zero is calculated:

by dividing the estimate by its standard error.

If you had a two-regressor regression model, then omitting one variable that is relevant:

can result in a negative value for the coefficient of the included variable, even though the coefficient will have a significant positive effect on Y if the omitted variable were included.

The homoskedasticity-only F-statistic and the heteroskedasticity-robust F-statistic typically are:

different

A binary variable is often called a:

dummy variable.

When there are two coefficients, the resulting confidence sets are:

ellipses.

Consider the multiple regression model with two regressors X1 and X2, where both variables are determinants of the dependent variable. When omitting X2 from the regression, there will be omitted variable bias for ModifyingAbove beta with caret 1:

if Upper X 1 and Upper X 2 are correlated.

Imperfect multicollinearity:

implies that it will be difficult to estimate precisely one or more of the partial effects using the data at hand.

A nonlinear function:

is a function with a slope that is not constant.

The 95% confidence interval for beta 1β1 is the interval:

left parenthesis ModifyingAbove beta with caret 1 minus 1.96 SE left parenthesis ModifyingAbove beta with caret 1 right parenthesis comma ModifyingAbove beta with caret 1 plus 1.96 SE left parenthesis ModifyingAbove beta with caret 1 right parenthesis right parenthesis

Imperfect multicollinearity:

means that two or more of the regressors are highly correlated.

Consider the multiple regression model with two regressors Upper X 1 and Upper X 2, where both variables are determinants of the dependent variable. You first regress Y on Upper X 1 only and find no relationship. However, when regressing Y on Upper X 1 and Upper X 2, the slope coefficient ModifyingAbove beta with caret 1 changes by a large amount. This suggests that your first regression suffers from:

omitted variable bias.

The dummy variable trap is an example of:

perfect multicollinearity.

The best way to interpret polynomial regressions is to:

plot the estimated regression function and to calculate the estimated effect on Y associated with a change in X for one or more values of X.

If you wanted to test, using a 5% significance level, whether or not a specific slope coefficient is equal to one, then you should:

subtract 1 from the estimated coefficient, divide the difference by the standard error, and check if the resulting ratio is larger than 1.96.

Perfect multicollinearity can be rectified by modifying

the independent variables .

In the log-log model, the slope coefficient indicates:

the elasticity of Y with respect to X.

The t-statistic is calculated by dividing:

the estimator minus its hypothesized value by the standard error of the estimator.

If the errors are heteroskedastic,

then the t-statistic computed using homoskedasticity-only standard error does not have a standard normal distribution, even in large samples.

When testing a joint hypothesis, you should:

use the F-statistics and reject at least one of the hypotheses if the statistic exceeds the critical value.

Using the textbook example of 420 California school districts and the regression of test scores on the student-teacher ratio, you find that the standard error on the slope coefficient is 0.51 when using the heteroskedasticity-robust formula, while it is 0.48 when employing the homoskedasticity-only formula. When calculating the t-statistic, the recommended procedure is to:

use the heteroskedasticity-robust formula.

This formula illustrates that

we cannot obtain an unbiased estimate of Y by taking the exponential function of beta 0 plus beta 1 Upper X Subscript i.

In the multiple regression model, the adjusted Upper R squared, Upper R overbar squared:

will never be greater than the regression Upper R squaredR2.

Suppose you run a regression of test scores against parking lot area per pupil. Is the Upper R squared likely to be high or low?

High, because parking lot area is correlated with studentdashteacher ratio, with whether the school is in a suburb or a city, and possibly with district income.

Lot size is measured in square feet. Do you think that measuring lot size in thousands of square feet might be more appropriate?

Yes, because small differences in square footage between two houses is not likely to have a significant effect on differences in house prices.

Lot size is measured in square feet. Do you think that another scale might be more appropriate?

Yes, if the lot size were measured in thousands of square feet, the estimate coefficient would be 2 instead of 0.002, thus making the regression results easy to read and interpret.

You extract approximately 5,000 observations from the Current Population Survey (CPS) and estimate the following regression function: ModifyingAbove AHE with caret equals 3.32 minus 0.45 times Age comma Upper R squared equals 0.02 comma SER equals 8.66 (1.00) (0.04) where AHE is average hourly earnings, and Age is the individual's age. Given the specification, your 95% confidence interval for the effect of changing age by 5 years is approximately:

[$1.86, $2.64]. (MULTIPLY THE MEAN AND THE BOUNDS

You have collected data for the 50 U.S. states and estimated the following relationship between the change in the unemployment rate from the previous year ModifyingAbove left parenthesis triangle symbol ur right parenthesis with caret and the growth rate of the respective state real GDP (g Subscript y). The results are as follows: ModifyingAbove triangle symbol ur with caret equals 2.81 minus 0.23 times g Subscript y comma Upper R squared equals 0.36 comma SER equals 0.78 (0.12) (0.04) Assuming that the estimator has a normal distribution, the 95% confidence interval for the slope is approximately the interval:

[minus0.31, minus0.15].

Voir tous les ensembles d'études

eco 4k ch 5-8 other mc q

Ensembles d'études connexes

chapter 8

A&P Chapter 7 - The Axial Skeleton (lecture notes)

EXP 3204 - Chapter 14 Quiz

Chapter 8 biology

AP Microeconomics Unit 2 MCQ

BUS 195 EXAM 1 SHORT ANSWERS

Ecology

Ch. 15 Oncology: Nursing Management in Cancer Care

Chapter 9

Macro exam practice problems

LEC 13 Anatomy of Ruminant Head

IB Business Management Marketing 4.6 The Extended Marketing Mix - Seven Ps Model

tabc training exam

Connect and Protect: Networks and Network Security

Learning and Memory CH1-6

Biology 3.20 Quiz

ATI Pain and Inflammation (Exam 3)

MGT 3210 Midterm Exam Study Guide

Lifespan G&D Final Exam Review Unit 2

Pop-Up Manager