accounting analytics exam 2
__________ mark the split between one class and another. - decision trees - identified questions - decision boundaries -linear classifiers
decision boundaries
_______mark the split between one class and another. -decision trees -decision boundaries -identifying questions -linear classifiers
decision boundaries
________ are the kind of visualizations that present findings to an audience. declarative static exploratory interactive
declarative visualizations
________ are the product of wanting to present findings to an audience. declarative static exploratory interactive
declarative visualizations
Diagnostic analytics include all of the following except: -similarity matching -clustering -profiling -all of the above
-all of the above
Which approach attempts to discover associations between individuals based on transactions involving them? -classification -regression -similarity matching -co-occurence grouping
-co-occurence grouping
Which of the following best describes a dependent variable? -output -input -application -operation
-output
In general, the simpler the model, the greater the chance of: -overfitting the data -under-fitting the data -pruning the data -the need to reduce the amount of data considered
-under-fitting the data
Which chart is best described as useful for when quartiles, median, and outliers are required for analysis and insights? Scatter plots Box and whisker plots Line chart Pie chart
Box and whisker plots
Which of the following is not an example of discrete data? Points earned in a basketball game Points earned in a hockey game Points earned in a diving match Points earned in a soccer game
Points earned in a diving match
Line charts are not recommended for what type of data? Normalized data Qualitative data Continuous data Trend lines
Qualitative data
What is the most appropriate chart when showing a relationship between two variables? Scatter chart (scatter plot) Bar chart Pie graph Histogram
Scatter chart (scatter plot)
Which of the following is not one of the four main questions to consider when creating your data scale and increments? How much data should be displayed in the visualization? Should outliers be displayed? What scale should be used to display the data? Which colors should be displayed?
Which colors should be displayed?
When working with a predictive model, under fitting the data is most likely caused by ________. -an overly complex model -an overly simple model -over pruning the data -a lack of data reduction
an overly simple model
An observation about the frequency of leading digits in many real-life sets of numerical data is called: -leading digits hypothesis -moores law -benford's law -clustering
benfords law
Which approach to data analytics attempts to assign each unit in population into small set of classes where it belongs? -classification -regression -similarity matching -co-occurance grouping
classification
Which approach to data analytics attempts to divide individuals into groups in a useful or meaningful way? -clustering -data reduction -similarity matching -co-occurence grouping
clustering
________ is data that is represented by whole numbers. Continuous data Discrete data Interval data Ordinal data
discrete
While overfitting data could lead to an error rate of 0, it is unlikely that you would be able to _____ your results. -define -specify -generalize -articulate
generalize
The following are typical examples of nominal data except: Hair color Eye color Gender Height
height
Which of the following best describes an independent variable? -output -input -application -operation
input
Results using the Fahrenheit scale would be best described as an example of: Nominal data Ordinal data Ratio data Interval data
interval
The Fahrenheit scale of temperature measurement would best be described as an example of: interval data. discrete data. nominal data. continuous data.
interval data.
The following charts are frequently considered for depicting qualitative data except: Line chart Pie chart Bar chart Stacked bar chart
line
Which approach to data analytics attempts to forecast a relationship between two data items? -link prediction -regression -similarity matching -co-occurence grouping
link prediction
Which approach to data analytics attempts to forecast a relationship between two data items? -regression -link prediction -similarity matching -co-occurance grouping
link prediction
Which approach to data analytics attempts to predict a relationship between two data items? -classification -link prediction -similarity matching -co-occurance grouping
link prediction
Polar bears, brown bears, black bears, and panda bears would be best described as an example of: nominal ordinal ratio interval
nominal
Red, Yellow, and Blue would be best described as an example of: nominal ordinal structured test
nominal
__________ data would be considered the least sophisticated type of data. Ratio Interval Ordinal Nominal
nominal
When considering the color theme for a visualization, which is the best color theme for a color blind audience? orange/blue scale red/green scale gray scale red/blue scale
orange/blue scale
1st place, 2nd place, and 3rd place would be best described as an example of: Nominal data Ordinal data Structured data Interval data
ordinal
Letter grades of A, B, and C would be best described as an example of: Nominal data Ordinal data Ratio data Interval data
ordinal
Gold, silver, and bronze medals would be examples of: nominal data. ordinal data. structured data. test data.
ordinal data
In general, the more complex the model, the greater the chance of ________. -overfitting the data -under-fitting the data -pruning the data -the need to reduce the amount of data considered
overfitting the data
The following charts are frequently considered for depicting quantitative data except: Scatter plots Box plots Line chart Pie chart
pie chart
which approach attempts to characterize the behavior of an individual group or pop. by generating summary statistics? -classification -regression -profiling -link prediction
profiling
__________ data would be considered the most sophisticated type of data. Ratio Interval Ordinal Nominal
ratio
Which approach to data analytics attempts to predict, for each unit, the numerical value of some variable? -classification -regression -similarity matching -link prediction
regression
Which of the following is not a typical example of nominal data? Gender SAT scores Hair color Ethnic group
sat scores
Chart for identifying the correlation between two variables or for identifying a trend line or line of best fit? Scatter plots Box and whisker plots Line chart Pie chart
scatter
Which approach to data analytics attempts to identify similar individuals based on data known about them? -classification -clustering -similarity matching -co-occurence grouping
similarity matching
data organized and reside in a fixed field with record or file are generally contained in a relational db or spreadsheet -training -unstructured -structured -test
structured data
The following are typical examples of ratio data except: Currency Temperature Length Weight
temperature
__________ is a set of data used to assess the degree and strength of a predicted relationship. -training data -unstructured data -structured data -test data
test data
Justin Zobel suggests that Revising your writing requires you "be egoless," suggesting that it is _ you need to please. yourself the reader (or your audience) the customer your boss
the reader (or your audience)
_____ are existing data that have been manually evaluated and assigned a class and _________are existing data used to eval. model. -test data; training data -training data; test data -structured data; unstructured data -unstructured data; structured data
training data; test data
The chart below is an example of a ________.
word cloud