MAR5625 Exam 3

Ace your homework & exams now with Quizwiz!

____________________ is the appropriate test to use when comparing the means of three or more groups to see if they are significantly different from one another.

ANOVA

The independent samples t -test tests the differences between means from no more than four independent sampels or groups T/F

False

The limiting score serves as a benchmark in discriminant analysis. a. True b. False

False RATIONALE: The cutting score serves as a benchmark in discriminant analysis.

MANOVA predicts multiple continuous dependent variables with multiple continuous independent variables. a. True b. False

False RATIONALE: The independent variables are categorical.

Logistic regression is a multivariate technique involving prediction of a categorical, dichotomous dependent variable. a. True b. False

True

Logistic regression models can be compared with respect to their predictive power by taking the difference of their respective -2LLs. a. True b. False

True

MANCOVA is a variation of MANOVA that can include interval or ratio covariates. a. True b. False

True

Multivariate dependence techniques are variants of the general linear model (GLM) t/f

True

Multivariate dependence techniques are variants of the general linear model (GLM). a. True b. False

True

When the difference between two groupsi s measured, bivariate statistics are used. T/F

True

Wilks Lambda is the most commonly applied statistic for calculating MANOVA model results. a. True b. False

True

discriminant score provides a way of assigning observations to groups. a. True b. False

True

The null hypothesis when using the Z-test for the differences between the proportions of two groups is that ____________________.

π1 = π2

The statistical significance of a correlation can be tested using the t-test. a. True b. False

true

An overall estimate of variance overlap among independent variables is the ____________________.

variance inflation factor VIF

The F-test partitions total variance into ____________________ variance and ____________________ variance.

within-group; between-group

If the correlation between two variables is -.55, the coefficient of determination is approximately ____________________.

+0.30

Which value provides a common metric allowing regression results to be compared to one another, regardless of what the original scale range may have been. A standardized regression coeff(B) B X^2 (chisquare) C coeffecient of determinations (DT) D raw parameter estimates

A standardized regression coeff (B)

In using the t-test to compare the means of two groups, the alternative hypothesis is typically stated as __ a u1 <> u2 b u1 = u2 c u1-u2 = 2 d u1 + u2 = 1

A u1 <> u2

In comparing the difference of the means between two groups, the null hypothesis can also be staed as __ A u1- u2 = 0 B u1+u2 = 0 c u1xu2=0 d u1/u2 = 0

A u1-u2=0

A researcher hypothesizes that males and females differ with the respect to attitude toward sports sponsorships. To investigate this hypothesis that these two groups' attitudes differ,e the researcher will use a _. a bivariate test of differences b univariate test of differences c multivariate test of differnces d cluster analysis

A. bivariate test of differences.

If the regression equation is : Y = -4.2+3.6X, then the expected score for Y when X is 4 would be _______ A -18.6 B 10.2 C 18.6 D 20.2

B 10.2

Suppose that you are using a 9-point rating scale to compare men who have an annual income over $50,000 (Group 1) with men who have an annual income less than or equal to $50,000(group 2) with regard to their assessment of a new product. Tehre are forty men in Group 1 and they have a mean of 7 and a standard deviation of 2.5, while the 35 men in the group 2 have a mean of 5 and a standard deviation of 1.4. What is the approximate value of t using the t-test? A 3.43 B 4.19 C 5.64 D there is not enough info

B 4.19

In reression analysis, the symbol X is commonly used for the ___ variable, and the symbol Y is commonly used for the ___ variable. A depdendent ; moderating B independent ; dependent C dependent ; independent D independent ; moderating

B independent ; dependent

What is another form of the alternative hypothesis when testing the difference between the means of the two groups? A u1-u2=0 B u1-u2<>0 C u1-u2=2 D u1-u2<>2

B u1-u2<>0

In the regression equation Y = a + BX, a is the symbol for the A slope of the regression line B y intercept of the regression line. C Depedent variable D independent variable

B. y intercept is the muscle

The statistical significance of a regression model is determined using the ___ A t test B X^2 C f test D Z test

C - Ftest

In a brand awareness study, 25 of a group of 35 males identify the brand correctly and 15 of a group of 35 females identify this brand correctly. the chi square value for this study is approximately __ A 3.26 B 4.15 C 5.84 D 7.92

C 5.84

The equation, Y = a + BX, is the equation for the A coefficient of determination. B correlation coeffecient C least-squares regression line D F-test

C least-squares regression line

Which type of analysis involves three or more variables? a. univariate statistical analysis b. bivariate statistical analysis c. multivariate statistical analysis d. polyvariate statistical analysis

C multivariate statistical analysis

In the regression equation, Y = a + BX, B is the A y intercept B independent variable C slope of the regression line D dependent variable

C slope of the regression line

Multivariate dependence techniques are variants of the ____, which is a way of modeling some process based on how different variables cause fluctuations from the average dependent variable. a. ordinary linear model (OLM) b. weighted average model (WAM) c. general linear model (GLM) d. metric scaling model (MSM)

C. General Linear Model (GLM)

This formula: X^2 = SUM of (Oi-Ei)^2 / Ei A - Z test B - f test C X^2 Test (Chi Square) D alpha

C. X^2 Test (chi Square)

Consider this regression equation: Y = 24.35 - 14.2X. Here 24.35 is the ____ and -14.2 is the ____ A slope; y -intercept B independent variable; slope C dependent variable ; y-intercept D y-intercept ; slope

D: y-intercept ; Slope

The test for the statistical significance of the regression model is the ____________________ test.

F (test)

The statistical test used to determine whether there is more variability in the scores of one sample than in the scores of another sample is the ____________________.

F Test

Marketing metrics are qualitative benchmarks. a. True b. False

False

A correlation coefficient indicates the magnitude of the linear relationships but not the direction of that relationship. a. True b. False

False A correlation coefficient indicates both the magnitude of the linear relationship and the direction of that relationship.

In multiple regression dummy variables are those that have no effect on the dependent variable. T/F

False A dummy variable uses 0 and 1 to code the different levels of a dichotomous variable.

A pooled estimate of the standard error is a poorer estimate of the standard error than one based on the variance from either sample. T/F

False A pooled estimate of the standard error is a better estimate.

In a regression equation, the slope of the line (BETA) is the change in X that is due to a corresponding change of Y that is due to a corresponding change of one unit of X. T/F

False In a regression eq, the slope of the line (BETA) is the change in Y that is done to a corresponding change of one unit of X.

The degrees of freedom are calculated as df = n - 1 when using the t-test for comparing two means, where n = n1 + n2. T/F

False In a test of two means, degrees of freedom are calculated as df = n-k, where en = n1 +n2 and k = number of groups.

A Z-test for differences of proportions requires a sample size greater than 100. T/F

False It requires a sample size > 30

Measure of association is a general term that refers to causality. a. True b. False

False Measure of association is a general term that refers to a number of bivariate statistical techniques used to measure the strength of a relationship between two variables, and causality is not always hypothesized or determined

A correlation coefficient equal to -1.0 indicates an extremely weak relationship. a. True b. False

False RATIONALE: A correlation coefficient equal to -1.0 indicates a perfect negative relationship.

A negative relationship means that as one variable decreases in value, the other also decreases. a. True b. False

False RATIONALE: A negative relationship means that as one variable increases in value, the other decreases.

Discriminant analysis predicts an interval dependent variable based on a nonlinear combination of independent variables. a. True b. False

False RATIONALE: Discriminant analysis predicts a categorical dependent variable based on a linear combination of independent variables.

Interval and ratio scales are referred to as nonmetric scales. a. True b. False

False RATIONALE: Interval and ratio scales are referred to as metric scales

Logistic regression equations are estimated using OLS. a. True b. False

False RATIONALE: Logistic regression equations are not estimated using OLS because of the statistical distribution created by using logits as dependent variables

Logit is the inverse log of the odds of some occurrence. a. True b. False

False RATIONALE: Logit is the log of the odds of some occurrence

Standardized variables are generated by subtracting the mean of a variable from each observation and multiplying the result by the standard deviation of that variable. a. True b. False

False RATIONALE: Standardized variables are found generated by subtracting the mean of a variable from each observation and dividing the result by the standard deviation of that variable.

Standardized variables are often called G-scores. a. True b. False

False RATIONALE: Standardized variables are often called Z-scores.

The entropy statistic allows interpretation of the statistical significance of the relationship between each independent variable and the logit values representing the dependent variable. a. True b. False

False RATIONALE: The Wald statistic allows interpretation of the statistical significance of the relationship between each independent variable and the logit values representing the dependent variable.

The likelihood value provides a way of indicating the overall significance or predictive capabilities of multiple regression. a. True b. False

False RATIONALE: The likelihood value provides a way of indicating the overall significance or predictive capabilities of logistic regression.

The odds of success is the probability of success multiplied by the probability of failure. a. True b. False

False RATIONALE: The odds of success is the probability of success divided by the probability of failure.

The basic types of multivariate techniques are known as metric methods and nonmetric methods. a. True b. False

False RATIONALE: The two basic types of multivariate techniques are dependence methods and interdependence methods.

Nominal and ordinal scales are referred to as metric scales. a. True b. False

False RATIONALE: These are nonmetric scales

A Spearman correlation is more appropriate for interval and ratio data than is the Pearson product-moment correlation. a. True b. False

False Spearman correlation is more appropriate for ordinal level data

Correlation coefficients are sufficient to establish that a causal relationship exists between the two variables under study. a. True b. False

False Systematic covariation does not in and of itself establish causality. The relationship would also need to be nonspurious and that any hypothesized "cause" would have to occur before any subsequent effect

In most business research, teh estimate of 'a' in a regression equation is most important. T/F

False The estimate of B(Beta) is most important b/c the explanatory power of regression rests with this parameter. This is where the direction and strenght of the relationship between independent and dependent variable is explained.

The chi-square test requires that the xpected frequency in each cell of the contingency table be at least 30. T/F

False The expected frequency in each cell should be at least 5

In a correlation matrix, the main diagonal contains correlations of zero. a. True b. False

False The main diagonal consists of correlations of 1.00 because it is the correlation of a variable with itself

If r = -.88, this indicates a weak relationship between the two variables under study. a. True b. False

False This indicates a strong negative relationship

To use the chi-square test, both variables in a 2 x 2 contingency table must be measured on a ratio scale. T/F

False (A frequency count of data that nominally identify or categorically rank groups is acceptable).

If a researcher is interested in whether adult males purchase a product more frequently than adult females, univariate statistics would be used in the analysis of the data. T/F

False (This is bivariate)

If the purpose of the regression analysis is forecasting, then standardized regression estimates are most appropriate T/F

False. Raw parameter estimates are most appropriate for forecasting.

A covariance matrix contains the covariance for every pair of variables among a set of metric variables. True False

True

A discriminant score above the cutting score places observations in one group, while a discriminant score below the cutting score places them in another. a. True b. False

True

A scatter plot is a simple plot graphically depicting the corresponding values of variables onto one another in a Cartesian plane T/F

True

Control variables are predictor variables not involved in any causal assertion or hypothesis but are included to better understand the true effect of hypothesized causal variables on dependent variables. a. True b. False

True

Covariance is the extent to which a change in one variable corresponds systematically to a change in another. a. True b. False

True

Cross-tabulation tables typically provide an easy, inuitive way of understanding data T/F

True

In a regression equation, the beta coefficients indicate the effect on the dependent variable of a 1-unit increase in any of the independent variables T/F

True

In multiple regression, the dependent variable must be continuous and interval-scaled

True

In regression analysis, the equation of a straight line is Y = a + b X T/F

True

Multivariate statistical analysis permits the researcher to consider the effects of three or more variables at the same time. a. True b. False

True

One way to determine the relationship between X and Y is to simply visually draw the best-fit straight line through the points in the figure. a. True b. False

True

One way to test the significance of the relationship shown in a contingency tables is by means of the chi-square test. T/F

True

The chi-square test involves comparison of the observed frequencies of the groups with the expected frequencies of the groups. T/F

True

The coefficient of determination reflects the proportion of variance that can be explained by the regression line. a. True b. False

True

The exponential logistic coefficient is the antilog of the raw logistic regression parameter estimate. a. True b. False

True

The least-squares regression line minimizes the sum of the squared deviations of the actual values from the predicted values in the regression line. a. True b. False

True

The null hypothesis for an ANOVA test comparing the means of three groups is m1 = m2 = m3 T/F

True

The ordinary least-squares method of regression analysis is based on the logic of how much better a regression line can predict values of Y compared to simply using the mean as a prediction. True / False

True

The square of the correlation coefficient indicates the part of the total variance of Y that can be accounted for by X. a. True b. False

True

The symbol for the Pearson product-moment correlation coefficient is r. a. True b. False

True

The type of measurement scales used will determine which multivariate statistical techniques are appropriate for the data. a. True b. False

True

The variate is a mathematical way in which a set of variables can be represented with one equation. a. True b. False

True

To determine whether the discriminant analysis can be used as a good predictor, information provided in the "confusion matrix" is used. a. True b. False

True

____________________ may be used to assess problems with multicollinearity.

Variance Inflation Factors

If the coefficient of determination is approximately +0.38, the correlation coefficient might be ____. a. -0.62 b. -0.38 c. +0.23 d. + 0.38

a. -0.62

Which correlation coefficient indicates a perfect negative relationship? a. -1.0 b. -0.5 c. +0.5 d. +1.0

a. -1.0

Which entropy R2 value implies no predictive power? a. 0 b. -1.0 c. +0.5 d. +1.0

a. 0

If the coefficient of determination between X and Y is .28, approximately what percentage of the variance in Y can be explained by X? a. 28% b. 58% c. 78% d. 88%

a. 28%

Which activity is the first step in interpreting MANOVA or MANCOVA results? a. Examine the multivariate F. b. Examine the individual Univariate Model F tests. c. Examine the individual F tests. d. Interpret the effect by examining differences in the means.

a. Examine the multivariate F.

Which formula is the correct formula for a logit? a. Logiti = ln(probability of success/probability of failure) b. Logiti = ln(probability of success*probability of failure) c. Logiti = ln(probability of success/probability of failure)2 d. Logiti = ln(1/probability of failure)2

a. Logiti = ln(probability of success/probability of failure)

The standard form for reporting observed correlations among multiple variables is the ____. a. correlation matrix b. contingency table c. Pearson grid d. inverse table

a. correlation matrix

In discriminant analysis, a number that serves as a benchmark is a ____. a. cutting score b. discriminant function c. discriminant factor d. Wald statistic

a. cutting score

When a multivariate statistical technique is used to predict a dependent variable from several independent variables, the researcher is studying ____. a. dependence b. independence c. interdependence d. segmental

a. dependence

The two basic groups of multivariate techniques are ____. a. dependence methods and interdependence methods b. primary methods and secondary methods c. simple methods and complex methods d. partial methods and complete methods

a. dependence methods and interdependence methods

Jamal is analyzing data and his current focus centers on how strongly interrelated the independent variables in his model are. Jamal is concerned about ____. a. multicollinearity b. MANOVA c. degrees of freedom d. convergence

a. multicollinearity

Which effect occurs if multiple predictor variables are strongly correlated with each other? a. multicollinearity b. heteroskedasticity c. enodgeneity d. dependency

a. multicollinearity

Which correlation coefficient indicates a moderate negative relationship? a. -1.0 b. -0.5 c. +0.5 d. +1.0

b. -0.5

If there is no relationship between two variables, the correlation coefficient between them would be: a. -1.0 b. 0.0 c. +0.50 d. +1.0

b. 0.0

If the correlation between X and Y is -.72, approximately what percentage of the variance in Y can be explained by X? a. 28% b. 52% c. 72% d. 85%

b. 52%

Which method of analysis is a variation of MANOVA that can include interval or ratio covariates? a. ANOVA b. MANCOVA c. logit d. factor analysis

b. MANCOVA

The ____ is a measure obtained by squaring the correlation coefficient. a. t-statistic b. coefficient of determination c. F-ratio d. Pearson coefficient

b. coefficient of determination

Which type of analysis attempts to predict a categorical dependent variable? a. factor analysis b. discriminant analysis c. regression analysis d. linear analysis

b. discriminant analysis

In discriminant analysis, a linear combination of independent variables that explains group memberships is known as a(n) ____. a. regression equation b. discriminant function c. discriminant factor d. n-way ANOVA

b. discriminant function

A variable that has two distinct levels that are coded as either zero or one is called a(n) ____. a. regression variable b. dummy variable c. MANOVA variable d. ANOVA variable

b. dummy variable

The Pearson product-moment correlation requires that the data be measured on at least a ____ scale. a. nominal b. interval c. ordinal d. non-numeric

b. interval

Which analysis is portrayed by the equation: Y = bo + β1X1 + β2X2 + β3X3... + βnXn? a. simple regression b. multiple regression c. chi-square d. factor analysis

b. multiple regression

Nominal and ordinal scales are examples of ____ scales, while interval and ratio scales are examples of ____ scales. a. metric; co-metric b. nonmetric; metric c. nonmetric; advanced d. metric; continuous

b. nonmetric; metric

The probability of success divided by the probability of failure is the ____. a. logit b. odds of success c. Wald statistic d. -2LL

b. odds of success

The portion of dependent variable variance left over after the predictor variables are included in a model intended to represent Y is ____. a. white noise b. residual c. correlation d. covariance

b. residual

If the correlation between two variables is - 0.75, this means that there is a ____. a. weak positive relationship between the variables b. strong inverse relationship between the variables c. weak negative relationship between the variables d. strong positive relationship between the variables

b. strong inverse relationship between the variables

In logistic regression, the dependent variable takes on a value of one to indicate ____. a. non-instance b. success c. failure d. frequency

b. success

Which characteristic can be used to represent a set of variables with one mathematical equation? a. structuralism b. variate c. ANOVA d. exponentiation

b. variate

The sum of differences between the group mean and the grand mean summed over all groups for a given set of observations is called ____________________.

between-groups variance

If the correlation coefficient is - 0.62, the coefficient of determination is approximately ____. a. - 0.62 b. - 0.38 c. + 0.38 d. + 0.62

c. + 0.38

Which correlation coefficient indicates a moderate positive relationship? a. -1.0 b. -0.5 c. +0.5 d. +1.0

c. +0.5

If the regression equation is: Y = -4.2 + 3.6X, then the expected score for Y when X is 4 would be ____. a. -18.6 b. -10.2 c. +10.2 d. +18.6

c. +10.2

The Pearson product-moment correlation coefficient ranges between: a. zero and +1.0 b. -1.0 and zero c. -1.0 and +1.0 d. -2.0 and +2.0

c. -1.0 and +1.0

If the probability of success is 90 percent, what is the value of the logit? a. 0.000 b. 1.099 c. 2.197 d. 3.467

c. 2.197

If the coefficient of determination between X and Y is .72, approximately what percentage of the variance in Y can be explained by X? a. 22% b. 52% c. 72% d. 82%

c. 72%

If the coefficient of determination between X and Y is 0.85, approximately what percentage of the variance in Y can be explained by X? a. 52% b. 72% c. 85% d. 92%

c. 85%

The statistical significance of a regression model is determined using which test? a. t-test b. χ2 c. F-test d. Z-test

c. F-test

All of the following are examples of dependence methods of analysis EXCEPT ____. a. multiple regression analysis b. multiple discriminant analysis c. cluster analysis d. multivariate analysis of variance

c. cluster analysis

What is computed by the following formula? (-2LLnull - (-2LLmodel) / -2LLnull a. Wald statistic b. exponential logistic coefficient c. entropy R2 d. likelihood value

c. entropy R2

In logistic regression, the dependent variable takes on a value of zero to indicate ____. a. success b. occurrence c. failure d. frequency

c. failure

Multivariate dependence techniques are variants of the ____, which is a way of modeling some process based on how different variables cause fluctuations from the average dependent variable. a. ordinary linear model (OLM) b. weighted average model (WAM) c. general linear model (GLM) d. metric scaling model (MSM)

c. general linear model (GLM)

When the correlation between two variables is +.92, this means that as one variable ____, the other variable ____. a. decreases; increases b. increases; decreases c. increases; increases d. decreases; stays the same

c. increases; increases

A bivariate statistical technique that is used to measure the strength of the relationship between two variables is also called a ____. a. one-group t-test b. two-group t-test c. measure of association d. correlation matrix

c. measure of association

When a researcher is attempting to predict sales volume by using building permits, amount of advertising, and the income levels of residents, the researcher is most likely using ____. a. univariate analysis b. a chi-square analysis c. multiple regression analysis d. factor analysis

c. multiple regression analysis

Which type of analysis involves three or more variables? a. univariate statistical analysis b. bivariate statistical analysis c. multivariate statistical analysis d. polyvariate statistical analysis

c. multivariate statistical analysis

In the formula df = n - k, k represents the ____. a. number of observations b. degrees of freedom of the denominator c. number of independent variables d. sample size

c. number of independent variables

The statistical measure of the association between two variables that is a standardized representation of covariance is known as the ____________________ coefficient

correlation

The standard format for reporting the correlations between pairs of several variables is called the ____________________.

correlation matrix

The extent to which a change in one variable corresponds systematically to a change in another is referred to as ____________________.

covariance

Which of the following entropy R2 values implies the highest predictive power? a. -1.0 b. 0.0 c. +0.5 d. +0.8

d. +0.8

Which correlation coefficient indicates a perfect positive relationship? a. -1.0 b. -0.5 c. +0.5 d. +1.0

d. +1.0

Which entropy R2 value implies perfect predictive power? a. 0 b. -1.0 c. +0.5 d. +1.0

d. +1.0

In a correlation matrix, the correlations in the main diagonal are all equal to ____. a. -1.00 b. 0 c. +0.50 d. +1.00

d. +1.00

Consider the regression equation: Y = 98.3 +.35X1 + 22.3X2, the predicted value for Y when X1 = 3 and X2 = 5 is ____. a. 67.23 b. 98.34 c. 118.45 d. 210.85

d. 210.85

Which VIF value would cause a researcher to suspect multicollinearity? a. 0.0 b. 0.5 c. 1.0 d. 5.0

d. 5.0

If the probability of success is 90 percent, what are the odds of success? a. 0.10 b. 1.0 c. 3.0 d. 9.0

d. 9.0

A variable that is coded as either zero or one and that has two distinct levels is called a(n) ____. a. regression variable b. dummy variable c. MANOVA variable d. ANOVA variable

d. ANOVA variable

Which activity is the final step in interpreting MANOVA or MANCOVA results? a. Examine the multivariate F. b. Examine the individual Univariate Model F tests. c. Examine the individual F tests. d. Interpret the effect by examining differences in the means

d. Interpret the effect by examining differences in the means

If the analysis predicts several continuous dependent variables with several categorical independent variables, the appropriate statistical technique is ____. a. multiple regression b. multiple discriminant analysis c. conjoint analysis d. MANOVA

d. MANOVA

The most commonly applied statistic for calculating MANOVA results is the ____. a. Wald statistic b. Wilks beta c. logit d. Wilks lambda

d. Wilks lambda

Which of the following measures that part of the total variance of Y that we can account for by knowing the value of X? a. product-moment correlation b. correlation coefficient c. F-test d. coefficient of determination

d. coefficient of determination

If a bank wants to identify successful and unsuccessful credit risks for home mortgage loans, it should use ____. a. factor analysis b. multidimensional scaling c. MANOVA d. discriminant analysis

d. discriminant analysis

All of the following are dependence methods of analysis EXCEPT ____. a. structural equations modeling b. multiple regression analysis c. multiple discriminant analysis d. factor analysis

d. factor analysis

If the analysis contains only one dependent variable and that variable is interval, the appropriate statistical analysis is ____. a. multiple discriminant analysis b. conjoint analysis c. multivariate ANOVA d. multiple regression

d. multiple regression

Which regression estimation technique is based on the logic of how much better a regression line can predict values of Y compared to simply using the mean as a prediction for all observations, no matter what the value of X may be? a. visual estimation b. maximum likelihood c. coefficient of determination d. ordinary least squares (OLS)

d. ordinary least squares (OLS)

Which approach is most appropriate if the purpose of the regression analysis is forecasting? a. standardized regression coefficient (b) b. Y-intercept a c. coefficient of determination (R2) d. raw parameter estimates

d. raw parameter estimates

When there is a strong positive correlation between two variables, but, in fact, both variables are caused by a third variable, the relationship between the first two variables is said to be ____. a. strong and negative b. weak and negative c. inverse d. spurious

d. spurious

Which of the following is computed by most regression programs and provides an indication of how much multicollinearity exists among a set of independent variables? a. χ2 b. β c. collinear coefficient d. variance inflation factor (VIF)

d. variance inflation factor (VIF)

If the regression equation is: Y = 24.35 - 14.2X, then 24.35 is the ____, while -14.2 is the ____. a. slope; y-intercept b. independent variable; slope c. dependent variable; y-intercept d. y-intercept; slope

d. y-intercept; slope

The two types of multivariate techniques are ____________________ methods and ____________________ methods.

dependence, interdependence

If the researcher wants to classify objects into two mutually exclusive categories, the researcher should use ____________________ analysis.

discriminant

A(n) ____________________ variable has two distinct levels that are coded as 0 and 1

dummy

A(n) ____________________ variable has two distinct levels that are coded as 0 and 1.

dummy

An assessment of logistic regression predictive power that functions like R2 in multiple regression is the ____________________ R2.

entropy pseudo

The ____________________ logistic coefficient is the antilog of the raw logistic regression parameter estimate.

exponential

When the logistic regression equation predicts a result less than 0 (negative value), the predicted outcome is ____________________.

failure

If the number of coupons given out at shopping malls is used to predict the number of tickets that will be sold to a country and western band's performance at a local club, the number of coupons is the ____________________ variable.

independent

A major assumption of any GLM is that the residual terms are ____________________.

independent not dependent not related unrelated

The Pearson product-moment correlation requires at least ____________________-level data.

interval

As multicollinearity increases, the adjusted R2 becomes ____________________ than the unadjusted R2.

lower smaller less

Several Dummy variable can be included in a regression model. T/F

true

The Pearson product-moment correlation coefficient ranges between ____________________ and ____________________.

-1.0; +1.0

If the probability of failure is 50 percent, the value of the logit is ____________________.

0 / zero

If the probability of success is 50 percent, the odds of success are ____________________.

1

In a correlation matrix, the values in the main diagonal equal ____________________.

1.00

If the probability of success is 60 percent, the odds of success are ____________________.

1.5

In the regression equation, Y = a + BX,Y is the A depdendent variable B Slope C Independent variable D intercept

A dependent variable.

When the on-time performance of airlines is used to predict the number of customer complaints in a regression equation, on-time performance is the ___ variable and the number of customer complaints is the __ variable. A independent; dependent B dependent;dependent C dependent; independent D independent; predictor

A independent ; dependent

In using the Z-test for comparing two proportions, the null hypothesis is typically states as ___ A pi1 = pi2 B pi1 <>pi2 C pi1 x pi2 = 1 D pi1 - pi2 = 1

A pi1 = pi2

A study compares the means of two groups in which there are 45 males in group 1 and 37 females in Group 2. The degrees of freedom for this study when using the t-test for the difference between means is __ A 74 B 80 C 82 D 160

B - 80

How many degrees of freedom are there in a four-cell chi-square test? A - R+1 B - R-1 C (R-1)(C-1) D R(c-1)

C. (R-1)(C-1)

In regression, the standardized Y-intercept term is always 1 T/F

False. It is always 0.

____________________ regression is a multivariate technique involving prediction of a categorical, dichotomous dependent variable.

Logistic

____________________ in regression analysis refers to how strongly interrelated the independent variables in a model are

Multicollinearity

____________________ predicts several dependent variables by using several independent variables.

Multivariate analysis of variance MANOVA

-2LL serves as an overall indictor of the predictive power of a logistic regression model. a. True b. False

True

A correlation coefficient equal to +1.0 indicates a perfect positive relationship. a. True b. False

True

A correlation matrix is a standardized covariance matrix. a. True b. False

True

The Chi-square test tests the significance of the relationship shown in an R X C contingency table in which R stands for row and C stands for column T/F

True

The entropy R2 is also called the pseudo R2. a. True b. False

True

Which term refers to the absolute amount of association between two variables, determined by how a change in one variable corresponds systematically to a change in another? a. spurious association b. significance c. covariance d. standardized coefficient

c. covariance

The square of the correlation coefficient is called the ____________________.

coefficient of determination

Multiple regression analysis includes a single independent variable but several dependent variables

false. multiple regression analysis is an extension of simple regression analysis allowing ametric dependent variable to be predicted by multiple independent variables.

Multivariate dependence techniques are variants of the ____________________.

general linear model GLM

The ____________________ is the mean of a variable over all observations.

grand mean

When an analysis studies the effect of several independent variables on a single dependent variable that is intervalscaled, the analysis is called ____________________ analysis.

multiple regression

Statistical methods that permit the study of three or more variables at the same time are called ____________________ statistical analysis.

multivariate

In using the t-test to compare the difference between the means of two groups, the formula for determining the degrees of freedom is ____________________.

n1 + n2 - 2

A(n) ____________________ relationship exists when the value of one variable goes up while the value of the other variable goes down.

negative inverse

Covariation in which one variable increases while the other decreases indicates a(n) ____________________ relationship

negative (inverse) inverse negative

The formula for the chi-square test uses _____. A observed and expected frequencies B observed and expected percentages C the two sample means D the two sample standard deviations

observed and expected frequencies

An appropriate test for comparing the scores of two interval variables drawn from related populations is the ____________________.

paired-samples t-test

Underspecification leads to a pattern in ____________________

residuals unexplained errors

A correlation coefficient indicates both the ____________________ of the relationship between two variables by its absolute value and the ____________________ of this relationship by its sign.

size, direction strength, direction magnitude, direction

When the logistic regression equation predicts a result of greater than 0 (positive value), the predicted outcome is ____________________.

success

Multivariate statistical analysis permits the researcher to consider more than one dependent variable at the same time. T/F

true


Related study sets

Custom: immunity 101/8/3/20_practice

View Set

What Jo Did? Comprehension Questions

View Set

Chapter 55: Assessment of Integumentary Function

View Set