6276 Study Guide
A better understanding of consumer behavior through analytics directly leads to _____.
better pricing strategies
Which of the following graphs provides information on outliers and IQR of a data set?
Box plot
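A minimal sketch of the quantities a box plot summarizes, assuming the common 1.5 x IQR rule for flagging outliers and the "inclusive" quartile method (other quartile conventions give slightly different values). The function name `iqr_outliers` and the sample data are made up for illustration.

```python
import statistics

def iqr_outliers(values):
    """Return Q1, Q3, the IQR, and points outside the 1.5 * IQR fences."""
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < lower_fence or v > upper_fence]
    return q1, q3, iqr, outliers

# Hypothetical data set with one unusually large value.
scores = [2, 4, 5, 7, 8, 9, 11, 40]
q1, q3, iqr, outliers = iqr_outliers(scores)
print(q1, q3, iqr, outliers)
```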
Which of the following best exemplifies big data?
Cellphone owners around the world generate vast amounts of data by calling, texting, tweeting, and browsing the Web on a daily basis.
Natalie needs to compare the number of employees by job title for the last five years. Which of the following charts should Natalie use?
Clustered-column (bar) chart
Which one of the following is used in predictive analytics?
Linear regression
A data visualization tool that updates in real time and gives multiple outputs is called _____.
a data dashboard
Complete linkage can be used to measure the distance between _____ in cluster analysis.
clusters
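As a rough illustration, complete linkage takes the distance between two clusters to be the largest distance between any pair of observations, one from each cluster. The sketch below assumes Euclidean distance between observations; the function name and the sample points are hypothetical.

```python
import math

def complete_linkage(cluster_a, cluster_b):
    """Largest pairwise Euclidean distance between members of two clusters."""
    return max(math.dist(p, q) for p in cluster_a for q in cluster_b)

# Two made-up clusters on a line.
cluster_a = [(0, 0), (1, 0)]
cluster_b = [(4, 0), (5, 0)]
print(complete_linkage(cluster_a, cluster_b))  # farthest pair is (0, 0) and (5, 0)
```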
A collection of text documents to be analyzed is called a _____.
corpus
The data dashboard for a marketing manager may have KPIs related to _____.
current sales measures and sales by region
A tree diagram used to illustrate the sequence of nested clusters produced by hierarchical clustering is known as a _____.
dendrogram
The _____ the lift ratio, the _____ the association rule.
higher; stronger
DJ needs to display data over time. Which of the following charts should he use?
Line chart
The process of converting a word to its stem, or root word, is referred to as _____.
stemming
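A deliberately naive sketch of the idea: strip a few common English suffixes so that related word forms map to the same stem. This is an assumption-laden toy, not a real stemming algorithm such as Porter's; the suffix list and the minimum stem length of 3 are arbitrary choices for illustration.

```python
# Toy suffix list, checked in order; real stemmers use far richer rules.
SUFFIXES = ("ing", "ed", "es", "s")

def naive_stem(word):
    """Strip the first matching suffix, keeping at least 3 characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

for w in ["jump", "jumps", "jumped", "jumping"]:
    print(w, "->", naive_stem(w))
```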
A _____ is a graphical summary of data previously summarized in a frequency distribution.
histogram
_____ is the most critical step of the decision-making process.
Identifying and defining the problem
Which of the following sources of big data is not publicly available?
Medical records
Susan would like to create a graph to display the number of males and females in her class who got an A, B, C, D, and F on the last test. Which of the following graphs could she use?
Stacked-column chart
An analysis of items frequently co-occurring in transactions is known as _____.
market basket analysis
A popular measure for weighting terms based on frequency and uniqueness is _____.
term frequency times inverse document frequency
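One way to sketch this measure, assuming term frequency is the raw count of the term in a document and inverse document frequency is log(N / df), where N is the number of documents and df is the number of documents containing the term. Other variants (smoothing, normalized counts) exist; the function name and the tiny corpus are hypothetical.

```python
import math

def tf_idf(term, document, corpus):
    """Raw term count in `document` times log(N / document frequency)."""
    tf = document.count(term)
    df = sum(1 for doc in corpus if term in doc)
    if tf == 0 or df == 0:
        return 0.0
    return tf * math.log(len(corpus) / df)

# Made-up corpus of already tokenized documents.
corpus = [
    ["the", "service", "was", "great", "great"],
    ["the", "service", "was", "slow"],
    ["the", "price", "was", "great"],
]
print(tf_idf("the", corpus[0], corpus))    # appears in every document, so 0.0
print(tf_idf("slow", corpus[1], corpus))   # appears in only one document
print(tf_idf("great", corpus[0], corpus))  # frequent here, but not unique
```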
Euclidean distance can be used to calculate the dissimilarity between two observations. Let u = (25, $350) correspond to a 25-year-old customer who spent $350 at Store A in the previous fiscal year. Let v = (53, $420) correspond to a 53-year-old customer who spent $420 at Store A in the previous fiscal year. Calculate the dissimilarity between these two observations using Euclidean distance.
75.39
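The arithmetic behind this answer can be checked with a short sketch that uses the vectors u = (25, 350) and v = (53, 420) exactly as written in the question. The helper name `euclidean` is an assumption for illustration.

```python
import math

def euclidean(u, v):
    """Square root of the summed squared differences between coordinates."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

u = (25, 350)  # age, dollars spent
v = (53, 420)

# (53 - 25)^2 + (420 - 350)^2 = 784 + 4900 = 5684
distance = euclidean(u, v)
print(round(distance, 2))  # 75.39
```

Note that the dollar difference dominates the result here, which is why variables are often standardized before computing distances.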
Which statement is true of an association rule?
It is ultimately judged on how actionable it is and how well it explains the relationship between item sets.
A _____ decision involves higher-level issues and is concerned with the overall direction of the organization, defining the overarching goals and aspirations for the organization's future.
strategic
_____ refers to the number of times a collection of items occurs together in a transaction data set.
Support count
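A small sketch of how a support count might be computed, assuming each transaction is stored as a Python set of item names. The function name and the transactions are invented for illustration.

```python
def support_count(itemset, transactions):
    """Number of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)

# Hypothetical transaction data.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]
print(support_count({"bread", "milk"}, transactions))  # 3
```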
Data dashboards are a type of _____ analytics.
descriptive
The strength of the association rule is known as _____ and is calculated as the ratio of the confidence of an association rule to the benchmark confidence.
lift
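A sketch of this ratio for a rule "if antecedent, then consequent", assuming confidence is support(antecedent and consequent) / support(antecedent) and benchmark confidence is support(consequent) / total transactions. The function names and data are hypothetical.

```python
def count(itemset, transactions):
    """Number of transactions containing every item in `itemset`."""
    return sum(1 for t in transactions if set(itemset) <= t)

def lift_ratio(antecedent, consequent, transactions):
    """Confidence of the rule divided by its benchmark confidence."""
    both = set(antecedent) | set(consequent)
    confidence = count(both, transactions) / count(antecedent, transactions)
    benchmark = count(consequent, transactions) / len(transactions)
    return confidence / benchmark

# Made-up transactions.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]
# Rule: bread -> milk. Confidence = 3/4, benchmark confidence = 4/5.
rule_lift = lift_ratio({"bread"}, {"milk"}, baskets)
print(rule_lift)
```

A lift ratio above 1 suggests the antecedent makes the consequent more likely than it is in general; in this toy data the ratio is below 1, so the rule would not be considered useful.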
k-means clustering is the process of _____.
organizing observations into distinct groups based on a measure of similarity
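A minimal sketch of the usual k-means iteration (assign each observation to its nearest centroid, then recompute each centroid as the mean of its members), assuming Euclidean distance and caller-supplied starting centroids so the result is deterministic. The function name, the data, and the starting centroids are all made up for illustration.

```python
import math

def k_means(points, centroids, max_iter=100):
    """Alternate assignment and centroid update until labels stop changing."""
    centroids = [tuple(c) for c in centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for each point.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids

points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
labels, centers = k_means(points, centroids=[(1, 1), (8, 8)])
print(labels)   # [0, 0, 0, 1, 1, 1]
print(centers)
```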
We create multiple dashboards _____.
so that each dashboard can be viewed on a single screen
The decisions concerning an organization's goals and future plans are called _____.
strategic decisions
The process of dividing text into separate terms is referred to as _____.
tokenization
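A simple sketch of this step, assuming terms are lowercase runs of letters, digits, and apostrophes. Real text-mining tools apply more careful rules; the regular expression and the sample sentence here are illustrative choices only.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into word-like terms."""
    return re.findall(r"[a-z0-9']+", text.lower())

tokens = tokenize("The service was great, but the wait wasn't.")
print(tokens)
```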
A visual representation of a document or set of documents in which the size of the word is proportional to the frequency with which the word appears is called a _____.
word cloud