ISDS 361B

Prediction of the value of the dependent variable outside the experimental region is called which of the following?

extrapolation

Fields may be chosen to represent all of the following except _____ in the body of a PivotTable.

filters

In a business, the values indicating the business's current operating characteristics, such as its financial position, the inventory on hand, and customer service metrics, are typically known as which of the following?

key performance indicators

Fitting a model too closely to sample data, resulting in a model that does not accurately reflect the population, is termed which of the following?

overfitting

Which of the following graphs cannot be used to display categorical data?

scatter chart

Which of the following is true regarding the coefficient of determination?

It is used to evaluate the goodness of fit.

In the graph of the simple linear regression equation, the parameter 𝛽0 represents which of the following of the true regression line?

the y-intercept

Which of the following is not an approach to making decisions?

guess and check

Deleting the grid lines in a table and the horizontal lines in a chart ______.

increases the data-ink ratio

Scores on Ms. Bond's test have a mean of 70 and a standard deviation of 15. Michelle has a score of 40. Convert Michelle's score to a z-score.

z = (x − mean) / standard deviation = (40 − 70) / 15 = −2
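
The same arithmetic can be checked with a few lines of Python. This is only a sketch; the function name `z_score` is an illustrative choice, not something from the course material.

```python
def z_score(x: float, mean: float, std_dev: float) -> float:
    """Return how many standard deviations x lies above (+) or below (-) the mean."""
    if std_dev <= 0:
        raise ValueError("std_dev must be positive")
    return (x - mean) / std_dev

# Michelle's score on Ms. Bond's test (values taken from the question above)
michelle = z_score(40, mean=70, std_dev=15)
print(michelle)  # -2.0
```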

Advanced analytics generally refers to which of the following?

predictive and prescriptive analytics

In the spectrum of business analytics, which is the most complex?

prescriptive

Which of the following analytical techniques helps us arrive at the best decision?

prescriptive analytics

In many cases, white space in a chart can improve which of the following?

readability

The College Board originally scaled SAT scores so that the scores for each section were approximately normally distributed with a mean of 500 and a standard deviation of 100. Assuming scores follow a bell-shaped distribution, use the empirical rule to find the percentage of students who scored less than 400.

16%

Suppose one year the College Board reported that the mean Math Level 2 SAT subject test score was 683 with a standard deviation of 98. Assuming scores follow a bell-shaped distribution, use the empirical rule to find the percentage of students who scored less than 487.

2.5%

The College Board originally scaled SAT scores so that the scores for each section were approximately normally distributed with a mean of 500 and a standard deviation of 100. Assuming scores follow a bell-shaped distribution, use the empirical rule to find the percentage of students who scored greater than 700.

2.5%
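
The three empirical-rule answers above follow the same pattern: take the share of data outside k standard deviations and split it evenly between the two tails. A minimal sketch, assuming the usual 68 / 95 / 99.7 figures and a hypothetical helper named `tail_percent`:

```python
# Empirical rule: approximate percent of data within k standard deviations of the mean
WITHIN = {1: 68.0, 2: 95.0, 3: 99.7}

def tail_percent(x: float, mean: float, std_dev: float) -> float:
    """Percent of values farther from the mean than x, on the same side as x.

    Only works when x sits exactly 1, 2, or 3 standard deviations from the mean.
    """
    k = abs(x - mean) / std_dev
    if k not in WITHIN:
        raise ValueError("x must be exactly 1, 2, or 3 standard deviations from the mean")
    return (100.0 - WITHIN[k]) / 2

below_400 = tail_percent(400, mean=500, std_dev=100)  # 1 SD below the mean
below_487 = tail_percent(487, mean=683, std_dev=98)   # 2 SDs below the mean
above_700 = tail_percent(700, mean=500, std_dev=100)  # 2 SDs above the mean
print(below_400, below_487, above_700)  # 16.0 2.5 2.5
```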

Never use a _____ chart when a _____ chart will suffice.

3-D; 2-D

For data having a bell-shaped distribution, approximately what percent of the data values will be within one standard deviation of the mean?

68

Scores on Ms. Bond's test have a mean of 74 and a standard deviation of 11. David has a score of 56 on Ms. Bond's test. Scores on Ms. Nash's test have a mean of 68 and a standard deviation of 6. Steven has a score of 56 on Ms. Nash's test. Which student has the higher standardized score?

David's standardized score is −1.64 and Steven's standardized score is −2.00. Therefore, David has the higher standardized score.
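
As a quick check of the comparison above, the two standardized scores can be computed directly. The variable names are illustrative only.

```python
def z_score(x: float, mean: float, std_dev: float) -> float:
    """Standardized score: distance from the mean in standard deviations."""
    return (x - mean) / std_dev

david = z_score(56, mean=74, std_dev=11)   # Ms. Bond's test
steven = z_score(56, mean=68, std_dev=6)   # Ms. Nash's test

print(round(david, 2), round(steven, 2))   # -1.64 -2.0
# The less negative z-score is the higher standardized score.
higher = "David" if david > steven else "Steven"
print(higher)  # David
```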

Which of the following refers to the technology that allows data, collected from sensors in all types of machines, to be sent over the Internet to repositories where it can be stored and analyzed?

Internet of Things (IoT)

The ratio of the amount of ink used in a table or chart that is necessary to convey information to the total amount of ink used in the table or chart is known as the data-ink ratio. Using additional ink that is not necessary to convey information has what effect on the data-ink ratio?

It reduces the data-ink ratio.

DJ needs to display data over time. Which of the following charts should he use?

Line chart

Which Excel command will return all modes when more than one mode exists?

MODE.MULT
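
For comparison outside Excel, Python's standard library has a function with similar behavior, `statistics.multimode`, which returns every mode rather than just one. A small sketch with made-up data:

```python
from statistics import multimode

# Hypothetical sample with two modes: 1 and 2 each appear twice
sample = [1, 1, 2, 2, 3]
modes = multimode(sample)
print(modes)  # [1, 2]
```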

Which of the following refers to a programming model used within Hadoop that performs the two major steps for which it is named: the map step and the reduce step?

MapReduce

To summarize and analyze data with both a crosstabulation and charting, Excel typically pairs which of the following?

PivotCharts with PivotTables

Which one of the following statements is not true concerning PivotTables in Excel?

PivotTables can be built using data arrayed in rows.

A chart that is recommended as an alternative to a pie chart is which of the following?

a bar chart

In order to visualize three variables in a two-dimensional graph, we use which of the following?

a bubble chart

An alternative for a stacked column chart when comparing more than a couple of quantitative variables in each category is which of the following?

a clustered column chart

A PivotChart, in few instances, is the same as which of the following?

a clustered-column chart

A data visualization tool that updates in real time and gives multiple outputs is called which of the following?

a data dashboard

A variable used to model the effect of a categorical independent variable in a regression model is known as which of the following?

a dummy variable

A variable used to model the effect of categorical independent variables in a regression model which generally takes only the value zero or one is called which of the following?

a dummy variable
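
A minimal sketch of how a categorical variable could be turned into 0/1 dummy variables. The region data, the `dummy_encode` helper, and the choice of baseline category are all assumptions made for illustration. A variable with k categories gets k − 1 dummies, with the baseline coded as all zeros.

```python
def dummy_encode(values, baseline):
    """Encode a categorical variable as 0/1 dummy variables, omitting the baseline level."""
    levels = sorted(set(values) - {baseline})
    return [{f"is_{level}": int(v == level) for level in levels} for v in values]

# Hypothetical categorical independent variable with three levels
regions = ["north", "south", "east", "north"]
encoded = dummy_encode(regions, baseline="north")
for row in encoded:
    print(row)
```

Each observation in the baseline category ("north" here) has every dummy equal to zero, so its effect is absorbed by the regression intercept.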

A two-dimensional graph representing the data using different shades of color to indicate magnitude is called which of the following?

a heat map

An effective display of trend and magnitude is achieved by using a combination of which of the following?

a heat map and sparklines

A time series plot is also known as which of the following?

a line chart

Which of the following is used for examining data with more than two variables and includes a different vertical axis for each variable?

a parallel-coordinates plot

Of the charts used to compare categorical data, making visual comparisons between categorical variables may be difficult in which of the following?

a pie chart

Which of the following is an interval estimate of an individual y-value, given values of the independent variables?

a prediction interval

Which of the following is a graphical presentation of the relationship between two quantitative variables?

a scatter chart

Which of the following is used to visualize sample data graphically and to draw preliminary conclusions about the possible relationship between the variables?

a scatter chart

A line chart that has no axes but is used to provide information on overall trends for time series data is called which of the following?

a sparkline

The graph of the simple linear regression equation is which of the following?

a straight line

Using multiple lines on a line chart or employing multiple charts is an alternative to which of the following?

a three-dimensional chart

Which of the following is useful for visualizing hierarchical data along multiple dimensions?

a treemap

Which of the following is a line that provides an approximation of the relationship between the variables?

a trendline

A manager of a fast food restaurant wants the drive-thru employee to ask every fifth customer if he or she is satisfied with the service. Who makes up the population?

all customers who use the drive-thru window of this fast food restaurant

A normally distributed error term with a mean of zero would do which of the following?

allow more accurate modeling

Which of the following is used to test the hypothesis that the values of the regression parameters 𝛽1, 𝛽2, ... 𝛽q are all zero?

an F-test

The charts that are helpful in making comparisons between categorical variables are which of the following?

bar charts and column charts

A better understanding of consumer behavior through analytics directly leads to which of the following?

better pricing strategies

The correlation coefficient will always take which of the following values?

between −1 and +1
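
To see why the value always falls in this range, it can help to compute the coefficient by hand. The sketch below uses the standard Pearson formula on made-up data; the function name `correlation` is illustrative.

```python
from math import sqrt

def correlation(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
r_up = correlation(xs, [2, 4, 5, 4, 5])      # positive but imperfect relationship
r_down = correlation(xs, [10, 8, 6, 4, 2])   # perfectly negative linear relationship
print(round(r_up, 4), round(r_down, 4))
```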

Assessing the regression model on data other than the sample data that was used to generate the model is known as which of the following?

cross-validation

Which of the following are visual methods of displaying data?

charts

Natalie needs to compare the number of employees by job title for the last five years. Which of the following charts should Natalie use?

clustered-column (bar) chart

Which of the following is a measure of the goodness of fit of the estimated regression equation? It can be interpreted as the proportion of the variability in the dependent variable y that is explained by the estimated regression equation.

coefficient of determination
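
One common way to compute it is 1 − SSE/SST, where SSE is the sum of squared errors and SST is the total sum of squares about the mean of y. A minimal sketch with made-up actual and predicted values; the predictions are assumed to come from a least squares fit:

```python
def r_squared(actual, predicted):
    """Coefficient of determination: share of the variability in y explained by the model."""
    mean_y = sum(actual) / len(actual)
    sse = sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted))
    sst = sum((y - mean_y) ** 2 for y in actual)
    return 1 - sse / sst

# Hypothetical data: observed y-values and predictions from y_hat = 2.2 + 0.6x, x = 1..5
actual = [2, 4, 5, 4, 5]
predicted = [2.8, 3.4, 4.0, 4.6, 5.2]
r2 = r_squared(actual, predicted)
print(round(r2, 2))  # 0.6
```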

A graphical presentation that uses vertical bars to display the magnitude of quantitative data is known as which of the following?

column chart

Which of the following are collected from several entities at the same point in time?

cross-sectional data

The data dashboard for a marketing manager may have KPIs related to which of the following?

current sales measures and sales by region

In a linear regression model, the variable that is being predicted or explained is known as which of the following? It is denoted by y and is often referred to as the response variable.

dependent variable

The U.S. Internal Revenue Service uses which of the following to identify patterns that distinguish questionable annual personal income tax filings?

data mining

The extraction of information on the number of shipments, how much was included in each shipment, the date each shipment was sent, and so on from the manufacturing plant's database exemplifies which of the following?

data queries

Optimization models can be used to do which of the following?

decide on how to invest cash received from insurance policies

When a decision maker is faced with several alternatives and an uncertain set of future events, s/he uses which of the following to develop an optimal strategy?

decision analysis

In order to manage an organization's human resource activities, such as hiring employees, tracking, and influencing employee retention, HR personnel use which of the following?

descriptive and predictive analytics

What is the process of removing variables from the analysis without losing crucial information?

dimension reduction

In the simple linear regression model, which of the following accounts for the variability in the dependent variable that cannot be explained by the linear relationship between the variables?

error term

Which of the following is the range of values of the independent variables in the data used to estimate the regression model?

experimental region

In which of the following are one or more variables identified and controlled or manipulated so that data can be obtained about how they influence the variable of interest identified first?

experimental study

Prediction of the mean value of the dependent variable y for values of the independent variables x1, x2, ..., xq that are outside the experimental range is called which of the following?

extrapolation

Bar charts use which of the following?

horizontal bars to display the magnitude of the quantitative variable

Tactical decisions are concerned with which of the following?

how the organization should achieve the goals and objectives set by its strategy

The process of making a conjecture about the value of a population parameter, collecting sample data that can be used to assess this conjecture, measuring the strength of the evidence against the conjecture that is provided by the sample, and using these results to draw a conclusion about the conjecture is known as which of the following?

hypothesis testing

Which of the following is the most critical step of the decision-making process?

identifying and defining the problem

Which of the following refers to the scenario in which the relationship between the dependent variable and one independent variable is different at different values of a second independent variable?

interaction

Data-ink is the ink used in a table or chart that _____.

is necessary to convey the meaning of the data to the audience

A disadvantage of stacked-column charts and stacked-bar charts is that _____.

it can be difficult to perceive small differences in areas

The prespecified value of the independent variable at which its relationship with the dependent variable changes in a piecewise linear regression model is referred to as which of the following?

knot

The best way to differentiate chart elements is by using which of the following?

labels

Data sets commonly include observations with missing values for one or more variables. In some cases missing data naturally occur; these are called which of the following?

legitimately missing data

The degree of correlation among independent variables in a regression model is called which of the following?

multicollinearity

Which of the following refers to the degree of correlation among independent variables in a regression model?

multicollinearity

Regression analysis involving one dependent variable and more than one independent variable is known as which of the following?

multiple regression

Which of the following are necessary to be determined to define the classes for a frequency distribution with quantitative data?

number of nonoverlapping bins, width of each bin, and bin limits
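
The three choices can be illustrated with a short sketch. It assumes equal-width bins with width (largest value − smallest value) / number of bins, and it places the largest value in the last bin; the data and the `frequency_distribution` helper are made up for illustration.

```python
def frequency_distribution(values, n_bins):
    """Return (bin limits, counts) for n_bins equal-width, nonoverlapping bins."""
    low, high = min(values), max(values)
    width = (high - low) / n_bins
    counts = [0] * n_bins
    for v in values:
        # Clamp so the largest value falls in the last bin instead of a new one
        index = min(int((v - low) / width), n_bins - 1)
        counts[index] += 1
    limits = [(low + i * width, low + (i + 1) * width) for i in range(n_bins)]
    return limits, counts

data = [12, 15, 17, 20, 22, 25, 28, 30, 31, 32]
limits, counts = frequency_distribution(data, n_bins=4)
print(limits)  # [(12.0, 17.0), (17.0, 22.0), (22.0, 27.0), (27.0, 32.0)]
print(counts)  # [2, 2, 2, 4]
```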

A light bulb manufacturer uses descriptive analytics to do which of the following?

present supply chain to managers visually

Which of the following regression models is used to model a nonlinear relationship between the independent and dependent variables by including the independent variable and the square of the independent variable in the model?

quadratic regression model

We create multiple dashboards _____.

so that each dashboard can be viewed on a single screen

When working with large spreadsheets with many rows of data, it can be helpful to do what with the data to better find, view, or manage subsets of data?

sort and filter

To avoid problems in interpreting the differences in color in a heat map, which of the following can be added?

sparklines

Susan would like to create a graph to display the number of males and females in her class who got an A, B, C, D, and F on the last test. Which of the following graphs could she use?

stacked-column chart

The process of making estimates and drawing conclusions about one or more characteristics of a population through analysis of sample data drawn from the population is known as which of the following?

statistical inference

The decisions concerning an organization's goals and future plans are called which of the following?

strategic decisions

Which of the following relationships would have a negative correlation coefficient?

supply and demand

If the covariance between two variables is near 0, it implies which of the following?

that the variables are not linearly related

Which of the following is an indication of how frequently interval estimates based on samples of the same size taken from the same population using identical sampling techniques will contain the true value of the parameter we are estimating?

the confidence level

Which of the following merges maps and statistics to present data collected over different geographies?

the geographic information system

In a linear regression model, the variable (or variables) used for predicting or explaining values of the response variable are known as which of the following? They are denoted by x.

the independent variable(s)

Which of the following is a procedure for using sample data to find the estimated regression equation?

the least squares method

A useful chart for displaying multiple variables is which of the following?

the scatter chart matrix

In a simple linear regression analysis, the quantity that gives the amount by which the dependent variable changes for a unit change in the independent variable is called which of the following?

the slope of the regression line

In a simple linear regression model, y = 𝛽0 + 𝛽1x + 𝜀, the parameter 𝛽1 represents which of the following?

the slope of the true regression line

The least squares regression line minimizes the sum of which of the following?

the squared differences between actual and predicted y-values
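
For simple linear regression, the least squares estimates have a closed form: the slope is the sum of cross-deviations divided by the sum of squared x-deviations, and the intercept makes the line pass through the point of means. A minimal sketch on made-up data (the names `b0` and `b1` stand for the estimates of 𝛽0 and 𝛽1):

```python
def least_squares(xs, ys):
    """Return (b0, b1), the intercept and slope that minimize the sum of squared errors."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return b0, b1

# Hypothetical sample data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
b0, b1 = least_squares(xs, ys)

# Sum of squared differences between actual and predicted y-values (SSE)
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
print(round(b0, 2), round(b1, 2), round(sse, 2))  # 2.2 0.6 2.4
```

Any other choice of intercept and slope gives a larger SSE on this sample.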

Which of the following is a measure of the error that results from using the estimated regression equation to predict the values of the dependent variable in the sample?

the sum of squares due to error (SSE)

Tables should be used instead of charts when _____.

the values being displayed have different units or very different magnitudes

The goal regarding using an appropriate number of bins is to show which of the following?

the variation in the data

Utility theory is the study of the _____ or relative desirability of a particular outcome that reflects the decision maker's attitude toward a collection of factors, such as profit, loss, and risk.

total worth

Which of the following is the data set used to build the candidate models?

training set

Which of the following refers to the data set used to compare model forecasts and ultimately pick a model for predicting values of the dependent variable?

validation set
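
A minimal sketch of splitting observations into a training set and a validation set. The 30% validation share, the fixed seed, and the helper name `train_validation_split` are assumptions for illustration, not values from the course.

```python
import random

def train_validation_split(rows, validation_share=0.3, seed=0):
    """Shuffle a copy of rows, then hold out a share of them as the validation set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    n_validation = round(len(shuffled) * validation_share)
    return shuffled[n_validation:], shuffled[:n_validation]

observations = list(range(10))  # stand-in for ten rows of data
training_set, validation_set = train_validation_split(observations)
print(len(training_set), len(validation_set))  # 7 3
```

Candidate models would be built on `training_set` only, then compared on `validation_set`.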

When the mean value of the dependent variable is independent of variation in the independent variable, the slope of the regression line is which of the following?

zero

The population parameters that describe the y-intercept and slope of the line relating y and x, respectively, are which of the following?

𝛽0 and 𝛽1

