BANA 2 EXAM 1

What would be the coefficient of determination if the total sum of squares (SST) is 23.29 and the sum of squares due to regression (SSR) is 10.03?

.43
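
The arithmetic behind this answer can be sketched in a few lines of Python, assuming the usual definition of the coefficient of determination as SSR divided by SST. The function name is illustrative only.

```python
def coefficient_of_determination(ssr: float, sst: float) -> float:
    """Return R^2 = SSR / SST, the share of total variation explained."""
    if sst <= 0:
        raise ValueError("SST must be positive")
    return ssr / sst


# Values taken from the question above.
r_squared = coefficient_of_determination(ssr=10.03, sst=23.29)
print(round(r_squared, 2))  # 0.43
```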

The scatter chart below displays the residuals versus the dependent variable, x. Which of the following conclusions can be drawn from the scatter chart given below?

The residuals have an increasing variance as the dependent variable increases.

Use the attached data to create a scatter diagram to show the relationship between Market Capitalization and Profit. Add a trendline. The trendline generally indicates that there is

a positive relationship

The graph of the simple linear regression equation is a(n)

a straight line

In the file, MajorSalary, data have been collected from 111 College of Business graduates on their monthly starting salaries. Create a PivotTable. Which major has the greatest number of graduates?

accounting

Considering the results from Question 10, which if any of the independent variables are not significant?

comfort

Assessing the regression model on data other than the sample data that was used to generate the model is known as

cross-validation

A data visualization tool that updates in real time and gives multiple outputs is called

data dashboard

The Analytics in Action example in Chapter 3 concerned

data dashboard

Corporate-level managers use ______ to summarize sales by region, current inventory levels, and other company-wide metrics all in a single screen.

data dashboards

Chapter 3 focuses on

data visualization

Compare the results from Questions 10 and 13. Did the Standard Error increase, decrease or remain the same in Question 13?

decreased

Recreate the analysis of Question 10 dropping the independent variable Comfort. Compare the results. Did R Square increase, decrease or remain the same?

decreased

Data dashboards are a type of _________ analytics.

descriptive

A variable used to model the effect of categorical independent variables in a regression model which generally takes only the value zero or one is called

dummy variable
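
A minimal sketch of how a categorical variable might be coded as a dummy variable. The firm types and the helper name `to_dummy` are hypothetical and are not taken from any course data file.

```python
def to_dummy(values, category_for_one):
    """Code each value as 1 if it matches the chosen category, else 0."""
    return [1 if value == category_for_one else 0 for value in values]


# Hypothetical categorical data, for illustration only.
firm_types = ["industrial", "bank", "insurance", "industrial"]
industry = to_dummy(firm_types, "industrial")
print(industry)  # [1, 0, 0, 1]
```

The resulting 0/1 column can then be used like any other independent variable in a regression.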

The software package most commonly used for creating simple charts is

Excel

Considering the results from Question 2, the coefficient for Line Speed is significant at a 1% level.

false

Data-ink is the ink used in a table or chart that

is necessary to convey the meaning of the data to the audience

A disadvantage of stacked-column charts and stacked-bar charts is that

it can be difficult to perceive small differences in areas

In a business, the values indicating the business's current operating characteristics, such as its financial position, the inventory on hand, and customer service metrics, are typically known as

key performance indicators

DJ needs to display data over time. Which of the following charts should he use?

line chart

The following image is a

line chart

Chapter 7 focuses on

linear regression

The degree of correlation among independent variables in a regression model is called

multicollinearity
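
One common first screen for multicollinearity is the sample correlation between pairs of independent variables. The sketch below computes a Pearson correlation by hand. The two rating columns are made-up numbers, and a high pairwise correlation is only a warning sign, not a full diagnosis.

```python
from math import sqrt


def pearson_correlation(x, y):
    """Return the sample Pearson correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sum((a - mean_x) ** 2 for a in x)
    spread_y = sum((b - mean_y) ** 2 for b in y)
    return cross / sqrt(spread_x * spread_y)


# Hypothetical ratings for two independent variables.
comfort = [60, 70, 80, 90]
amenities = [55, 68, 79, 93]
print(pearson_correlation(comfort, amenities))
```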

Regression analysis involving one dependent variable and more than one independent variable is known as

multiple regression

Fitting a model too closely to sample data, resulting in a model that does not accurately reflect the population, is termed

overfitting

Making visual comparisons between categorical variables is difficult in a

pie chart

The Analytics in Action example in Chapter 7 concerned

predicting the effect of advertising

A forecast that helps direct police officers to areas where crimes are likely to occur based on past data is an example of

predictive analytics

_______________ analytics use techniques that take input data and yield a best course of action.

prescriptive

Considering the results in Question 2, what is the estimated coefficient for Line Speed? Round your answer to three decimal places.

-.148

A highway department is studying the relationship between traffic flow and speed during rush hour on Highway 193. The data in the file TrafficFlow were collected on Highway 193 during 100 recent rush hours. Develop a scatter chart for these data. Develop an estimated simple linear regression equation for the data. How much variation in the sample values of traffic flow is explained by this regression model? Enter your answer as a decimal rounded to three places.

.313

Develop an estimated quadratic regression equation for the data in the previous problem. How much variation in the sample values of traffic flow is explained by this regression model? Enter your answer rounded to three decimal places.

.343

Using the attached LineSpeed data, develop an estimated regression equation to predict the number of defective parts found given the line speed. How much of the variation in defective parts is explained by your model? Enter your answer as a decimal rounded to three decimal places.

.739

The attached data are the results of a survey on upscale accommodations. The data show the percentages of respondents who rated the locations as excellent or very good on Comfort, Amenities, In-House Dining and Overall. Develop an estimated regression equation to predict the Overall rating using Comfort, Amenities and In-House Dining. How much of the variation in the Overall rating is explained by your model? Enter your answer as a decimal rounded to three decimal places.

.750

Using the attached data, develop an estimated regression equation to predict the Total Points Earned based on the Hours Spent Studying. How much of the variation in Total Points Earned is explained by your model? Enter your answer as a decimal rounded to three decimal places.

.828

The attached file shows the number of U.S. locations for the top 20 U.S. franchises. Create a PivotTable and group the number of locations starting at zero, ending at 39,999 by 10,000. How many franchises are in the 0 to 9999 category?

13

Using a PivotTable and the MajorSalary data, determine the highest starting salary for Info Systems majors. Round your answer to two decimal places and do not include a dollar sign.

5030

What would be the value of the sum of squares due to regression (SSR) if the total sum of squares (SST) is 25.32 and the sum of squares due to error (SSE) is 6.89?

18.43
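
This answer follows from the decomposition SST = SSR + SSE. A short sketch of the rearrangement, with an illustrative function name:

```python
def ssr_from_sst_and_sse(sst: float, sse: float) -> float:
    """Rearrange SST = SSR + SSE to get SSR = SST - SSE."""
    return sst - sse


# Values taken from the question above.
ssr = ssr_from_sst_and_sse(sst=25.32, sse=6.89)
print(round(ssr, 2))  # 18.43
```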

Based on the assigned web article, in the Dodgers Front Office how many people have the term analyst or research in their titles?

21

Considering the results from Question 6, what is the test statistic associated with the coefficient for Hours Spent Studying? Round your answer to three decimal places.

27.2

Using a PivotTable and the MajorSalary data, determine the average starting salary of management majors. Round your answer to two decimal places and do not include a dollar sign.

3180

Never use a ________ chart when a __________ chart will suffice.

3D; 2D

A study investigated the relationship between audit delay (the length of time from a company's fiscal year-end to the date of the auditor's report) and variables that describe the client and the auditor. Some of the independent variables that were included in this study follow:
Industry: A dummy variable coded 1 if the firm was an industrial company or 0 if the firm was a bank, savings and loan, or insurance company.
Public: A dummy variable coded 1 if the company was traded on an organized exchange or over the counter; otherwise coded 0.
Quality: A measure of overall quality of internal controls, as judged by the auditor, on a 5-point scale ranging from "virtually none" (1) to "excellent" (5).
Finished: A measure ranging from 1 to 4, as judged by the auditor, where 1 indicates "all work performed subsequent to year-end" and 4 indicates "most work performed prior to year-end."
A sample of 40 companies provided the following data: Audit. Develop the estimated regression equation using all of the independent variables included in the data. Enter the value of the intercept rounded to three decimal places.

80.429

Considering the results from Question 6, use the model to predict the number of points a student would earn if the student studied 95 hours. Round your answer to one decimal place.

84.8

Use the attached data and Excel to create sparklines for revenue for each company and a heat map for revenue of the six companies. Which company exhibited the most consistent growth over the six months? For discussion, which tool helped you the most in answering this question and why?

Allen and Davis LLC

The company identified in Chapter 7, Analytics in Action, is

Alliance Data Systems

________________ is used to test the hypothesis that the values of the regression parameters β_0, β_1, β_2, ..., β_q are all zero.

An F test
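
The overall F statistic is commonly computed as the mean square due to regression divided by the mean square error, F = (SSR / q) / (SSE / (n - q - 1)). The sketch below assumes that form. SSR and SSE reuse the numbers from the SSR question in this set, while the sample size n = 30 and the number of independent variables q = 2 are hypothetical.

```python
def f_statistic(ssr: float, sse: float, n: int, q: int) -> float:
    """Return F = MSR / MSE for a regression with q independent variables."""
    msr = ssr / q            # mean square due to regression
    mse = sse / (n - q - 1)  # mean square error
    return msr / mse


# n and q are assumed values for illustration only.
print(f_statistic(ssr=18.43, sse=6.89, n=30, q=2))
```

A large F value relative to the F distribution with q and n - q - 1 degrees of freedom is evidence against the hypothesis that all slope parameters are zero.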

Rerun the analysis done in Question 3 without the independent variable identified in Question 4. Which independent variable is significant at the 1% level of significance?

Industry

Use the attached data to create a bubble chart. Expected rate of return is the horizontal axis, risk estimate is the vertical axis and size of the bubble is the capital investment. Identify whether each investment is on the efficient frontier. Any investment that has a smaller rate of return for the equivalent or higher risk than another project cannot be on the efficient frontier.

Investment 1: No
Investment 2: Yes
Investment 3: Yes
Investment 4: No
Investment 5: Yes
Investment 6: Yes

Use the attached data to create a stacked-column chart and a clustered column chart. Age category is on the horizontal axis for both charts. What can you say about the relationship of age and cell phone ownership, and which graph displays this better?

Older respondents are more likely to not own a cell phone. The clustered column chart displays this better.

Which one of the following statements is not true concerning PivotTables in Excel?

PivotTables can only be used if one variable is categorical and the other is quantitative data.

_______________ analytics are techniques that use models, constructed from past data, to predict the future or to ascertain the impact of one variable on another.

Predictive

Consider the result in Question 3. At a 5% level of significance, which variable is not significant?

Public

The charts that are helpful in making comparisons between categorical variables are

bar charts and column charts

In order to visualize three variables in a two-dimensional graph, we use a

bubble chart

The company identified in Chapter 3, Analytics in Action, is

Cincinnati Zoo and Botanical Garden

An alternative for a stacked column chart when comparing more than a couple of quantitative variables in each category is a

clustered column chart

The ___________ is a measure of the goodness of fit of the estimated regression equation. It can be interpreted as the proportion of the variability in the dependent variable y that is explained by the estimated regression equation.

coefficient of determination

Fields may be chosen to represent all of the following except ____________ in the body of a PivotTable.

filters

_____ merges maps and statistics to present data collected over different geographies.

geographic information system

An effective display of trend and magnitude is achieved by using a combination of a

heat map and sparklines

Compare the results from Questions 10 and 13. Did Adjusted R Square increase, decrease or remain the same in Question 13?

increased
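
Adjusted R Square can rise when a weak variable is dropped, even though R Square itself falls, because it penalizes the number of independent variables. A sketch using the standard formula 1 - (1 - R^2)(n - 1) / (n - q - 1). The sample size and both R Square values below are hypothetical and are not the results from Questions 10 and 13.

```python
def adjusted_r_squared(r_squared: float, n: int, q: int) -> float:
    """Return adjusted R^2 for n observations and q independent variables."""
    return 1 - (1 - r_squared) * (n - 1) / (n - q - 1)


# Hypothetical comparison: three variables versus two, with n = 20.
full_model = adjusted_r_squared(r_squared=0.750, n=20, q=3)
reduced_model = adjusted_r_squared(r_squared=0.748, n=20, q=2)
print(full_model, reduced_model)
print(reduced_model > full_model)  # True
```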

Deleting the grid lines in a table and the horizontal lines in a chart

increases the data-ink ratio

In a linear regression model, the variable (or variables) used for predicting or explaining values of the response variable are known as the ________________, denoted by x.

independent variable

_____________ refers to the scenario in which the relationship between the dependent variable and one independent variable is different at different values of a second independent variable.

interaction
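
Interaction is often modeled by adding a product term, as in y = b0 + b1*x1 + b2*x2 + b3*x1*x2. In that form the effect of x1 on y is b1 + b3*x2, so it changes with x2. The sketch below assumes this product-term model. All coefficient values are made up.

```python
def predict(x1, x2, b0, b1, b2, b3):
    """Predict y from a two-variable model with an x1*x2 interaction term."""
    return b0 + b1 * x1 + b2 * x2 + b3 * x1 * x2


def slope_wrt_x1(x2, b1, b3):
    """Return the change in y per unit of x1 at a given value of x2."""
    return b1 + b3 * x2


# Hypothetical coefficients: b1 = 2.0 and b3 = 0.5.
print(slope_wrt_x1(x2=0, b1=2.0, b3=0.5))  # 2.0
print(slope_wrt_x1(x2=4, b1=2.0, b3=0.5))  # 4.0
```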

Which of the following regression models is used to model a nonlinear relationship between the independent and dependent variables by including the independent variable and the square of the independent variable in the model?

quadratic regression model
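
A minimal sketch of fitting y = b0 + b1*x + b2*x^2 by least squares, using only the standard library. It builds the normal equations for the columns 1, x, and x^2 and solves them by Gaussian elimination. The data are synthetic, generated from a known quadratic, and the helper names are illustrative. In practice Excel or a statistics package would do this step.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augment the matrix so row operations apply to both sides.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back-substitute from the last row upward.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        known = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - known) / m[i][i]
    return x


def fit_quadratic(x, y):
    """Return least-squares estimates [b0, b1, b2] for y = b0 + b1*x + b2*x^2."""
    rows = [[1.0, xi, xi * xi] for xi in x]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    aty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(3)]
    return solve_linear_system(ata, aty)


# Synthetic data generated from y = 1 + 2x + 3x^2.
x = [0, 1, 2, 3, 4]
y = [1 + 2 * xi + 3 * xi * xi for xi in x]
b0, b1, b2 = fit_quadratic(x, y)
print(b0, b1, b2)
```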

In many cases, white space in a chart can improve

readability

The ratio of the amount of ink used in a table or chart that is necessary to convey information to the total amount of ink used in the table and chart is known as data-ink ratio. Using additional ink that is not necessary to convey information has what effect on the data-ink ratio?

reduces

The difference between the observed value of the dependent variable and the value predicted using the estimated regression equation is known as the

residual
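
A short sketch of computing residuals for a simple linear regression. The slope and intercept come from the usual least squares formulas, and the four data points are hypothetical.

```python
def fit_simple_linear_regression(x, y):
    """Return (b0, b1), the least squares intercept and slope."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1


def residuals(x, y, b0, b1):
    """Return observed y minus predicted y for each observation."""
    return [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]


# Hypothetical data for illustration.
x = [1, 2, 3, 4]
y = [2, 4, 5, 8]
b0, b1 = fit_simple_linear_regression(x, y)
errors = residuals(x, y, b0, b1)
print(b0, b1)
print(errors)
```

With an intercept in the model, least squares residuals sum to zero up to rounding, which is a quick sanity check on the fit.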

A _____________ is a graphical presentation of the relationship between two quantitative variables.

scatter chart

A regression analysis involving one independent variable and one dependent variable is referred to as a

simple linear regression

A line chart that has no axes but is used to provide information on overall trends for time series data is called a

sparkline

Tables should be used instead of charts when

the values being displayed have different units or very different magnitudes

__________ is the data set used to build the candidate models.

training set

A _____________ is a line that provides an approximation of the relationship between the variables.

trendline

Considering the results from Question 10, we can conclude that the model is significant.

true

Considering the results from Question 2, the coefficient for Line Speed is significant at a 5% level.

true

Considering the results from Question 6, the coefficient for Hours Spent Studying is significant at a 1% level.

true

_____________ refers to the data set used to compare model forecasts and ultimately pick a model for predicting values of the dependent variable.

validation set
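
The training set and validation set terms can be illustrated with a simple holdout split. The 25% validation fraction, the fixed seed, and the function name are assumptions made for this sketch, and other splitting schemes are equally valid.

```python
import random


def train_validation_split(rows, validation_fraction=0.25, seed=0):
    """Shuffle a copy of rows and return (training_set, validation_set)."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_validation = int(len(shuffled) * validation_fraction)
    return shuffled[n_validation:], shuffled[:n_validation]


# Hypothetical observations, identified here only by row number.
observations = list(range(100))
training_set, validation_set = train_validation_split(observations)
print(len(training_set), len(validation_set))  # 75 25
```

Candidate models would be built on the training set and then compared on the validation set, which the models have not seen.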

