ISM 4402 midterm exam ch 2
According to Eckerson (2006), a well-known expert on BI dashboards, what are the three layers of information of a dashboard?
1. Monitoring. Graphical, abstracted data to monitor key performance metrics. 2. Analysis. Summarized dimensional data to analyze the root cause of problems. 3. Management. Detailed operational data that identify what actions to take to resolve a problem.
Report
A(n) ________ is a communication artifact, concerning business matters, prepared with the specific intention of relaying information in a presentable form.
Mobile platforms such as the iPhone are supported by these products. It is easier to spot useful patterns and trends in the data. ***They explore massive amounts of data in hours, not days.*** There is less demand on IT departments for reports.
Benefits of the latest visual analytics tools, such as SAS Visual Analytics, include all of the following EXCEPT
Describe categorical and nominal data.
Categorical data represent the labels of multiple classes used to divide a variable into specific groups. Examples of categorical variables include race, sex, age group, and educational level. Nominal data contain simple codes assigned to objects as labels; the codes themselves are not measurements. For example, the variable marital status can be generally categorized as (1) single, (2) married, and (3) divorced.
the visual cube level.
Dashboards can be presented at all the following levels EXCEPT
Screen
Dashboards present visual displays of important information that are consolidated and arranged on a single ________.
False
Dashboards provide visual displays of important information that is consolidated and arranged across several screens to maintain data order.
True
Data accessibility means that the data are easily and readily obtainable.
False
Data is the contextualization of information, that is, information set in context.
True
Data is the main ingredient for any BI, data science, and business analytics initiative.
False
Data source reliability means that data are correct and are a good match for the analytics problem.
True
Descriptive statistics is all about describing the sample data on hand.
rapid
Due to the ________ expansion of information technology coupled with the need for improved competitiveness in business, there has been an increase in the use of computing power to produce unified reports that join different views of the enterprise in one place.
False
Visual analytics is aimed at answering, "What is happening?" and is usually associated with business analytics.
True
Google Maps has set new standards for data visualization with its intuitive Web mapping software.
Describe the difference between simple and multiple regression.
If the regression equation is built between one response variable and one explanatory variable, then it is called simple regression. Multiple regression is the extension of simple regression where the explanatory variables are more than one.
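A minimal Python sketch of the distinction, using only the standard library. The data (`ad_spend`, `price`, `sales`) are made up for illustration, and the helper names are hypothetical. Simple regression fits one explanatory variable in closed form; multiple regression fits several by solving the normal equations.

```python
from statistics import mean


def simple_regression(x, y):
    """Fit y = b0 + b1*x by ordinary least squares (one explanatory variable)."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = sxy / sxx
    return y_bar - b1 * x_bar, b1


def solve(a, b):
    """Solve the linear system a * coef = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def multiple_regression(rows, y):
    """Fit y = b0 + b1*x1 + ... + bk*xk via the normal equations (X'X) b = X'y."""
    design = [[1.0] + list(r) for r in rows]  # leading 1.0 is the intercept column
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)


# Made-up data, constructed so that sales = 20 + 2*ad_spend - 1*price exactly.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
price = [9.0, 8.0, 9.0, 7.0, 8.0]
sales = [13.0, 16.0, 17.0, 21.0, 22.0]

b0, b1 = simple_regression(ad_spend, sales)  # one explanatory variable
coefs = multiple_regression(list(zip(ad_spend, price)), sales)  # two of them
print("simple:", b0, b1)
print("multiple:", coefs)
```

The simple fit attributes all variation in sales to ad spend alone, so its slope differs from the ad-spend coefficient of the multiple fit, which also accounts for price.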
False
In the Dallas Cowboys case study, the focus was on using data analytics to decide which players would play every week.
Drill-down/drill-through
Information dashboards enable ________ operations that allow the users to view underlying data sources and obtain more detail.
True
Interval data are variables that can be measured on interval scales.
balanced scorecard-type reports.
Kaplan and Norton developed a report that presents an integrated view of success in the organization called
Internal results
Key performance indicators (KPIs) are metrics typically used to measure
List and describe the three major categories of business reports.
• Metric management reports. Many organizations manage business performance through outcome-oriented metrics. For external groups, these are service-level agreements (SLAs). For internal management, they are key performance indicators (KPIs). • Dashboard-type reports. This report presents a range of different performance indicators on one page, like a dashboard in a car. Typically, there is a set of predefined reports with static elements and fixed structure, but customization of the dashboard is allowed through widgets, views, and set targets for various metrics. • Balanced scorecard-type reports. This is a method developed by Kaplan and Norton that attempts to present an integrated view of success in an organization. In addition to financial performance, balanced scorecard-type reports also include customer, business process, and learning and growth perspectives.
False
Nominal data represent the labels of multiple classes used to divide a variable into specific groups.
True
One of SiriusXM's challenges was tracking potential customers when cars were sold.
True
Predictive algorithms generally require a flat file with a target variable, so making data analytics ready for prediction means that data sets must be transformed into a flat-file format and made ready for ingestion into those predictive algorithms.
True
Structured data is what data mining algorithms use and can be classified as categorical or numeric.
new forms of computation of business logic.
The Internet emerged as a new medium for visualization and brought all the following EXCEPT
False
The data storage component of a business reporting system builds the various reports and hosts them for, or disseminates them to, users. It also provides notification, annotation, collaboration, and other services.
Describe the difference between descriptive and inferential statistics.
The main difference between descriptive and inferential statistics is the data used in these methods: descriptive statistics is all about describing the sample data on hand, whereas inferential statistics is about drawing inferences or conclusions about the characteristics of the population.
What are the most important assumptions in linear regression?
The most important assumptions in linear regression are linearity, independence of errors, normality of errors, constant variance of errors, and the absence of multicollinearity among the explanatory variables.
True
There are basic chart types and specialized chart types. A Gantt chart is a specialized chart type.
arithmetic mean
This measure of central tendency is the sum of all the values/observations divided by the number of observations in the data set.
standard deviation
This measure of dispersion is calculated by simply taking the square root of the variance.
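A short sketch of the arithmetic mean and the standard deviation using Python's standard `statistics` module. The `values` list is made-up illustrative data. Note the assumption about which variance is meant: `variance` divides by n - 1 (sample), while `pvariance` divides by n (population).

```python
from math import sqrt
from statistics import pstdev, pvariance, stdev, variance

# Made-up observations for illustration.
values = [4, 8, 15, 16, 23, 42]

# Arithmetic mean: sum of all observations divided by the number of observations.
arithmetic_mean = sum(values) / len(values)

# Standard deviation: square root of the variance.
sample_sd = sqrt(variance(values))       # variance with an n - 1 denominator
population_sd = sqrt(pvariance(values))  # variance with an n denominator

print(arithmetic_mean, sample_sd, population_sd)
```

The library shortcuts `stdev` and `pstdev` return the same two values directly.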
box-and-whiskers plot
This plot is a graphical illustration of several descriptive statistics about a given data set.
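The descriptive statistics that a box-and-whiskers plot typically summarizes can be computed as below. This is a sketch under two stated assumptions: quartiles use the `inclusive` method of `statistics.quantiles` (other tools may interpolate differently), and outliers are flagged with the common 1.5 × IQR convention. The data are made up.

```python
from statistics import median, quantiles

# Made-up observations for illustration.
values = [4, 8, 15, 16, 23, 42]

# Quartiles: the box spans Q1 to Q3, with a line at the median (Q2).
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
iqr = q3 - q1  # interquartile range, the height of the box

# Assumed convention: points beyond 1.5 * IQR from the box are drawn as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [v for v in values if v < lower_fence or v > upper_fence]

# Whiskers extend to the most extreme observations that are not outliers.
inside = [v for v in values if lower_fence <= v <= upper_fence]
whisker_low, whisker_high = min(inside), max(inside)

print(whisker_low, q1, q2, q3, whisker_high, outliers)
```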
Correlation
This technique makes no a priori assumption of whether one variable is dependent on the other(s) and is not concerned with the relationship between variables; instead it gives an estimate on the degree of association between the variables.
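One common estimate of the degree of association is the Pearson correlation coefficient, sketched below in plain Python. Choosing Pearson's r is an assumption here (the card does not name a specific coefficient), and the `hours` and `scores` data are made up.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    x_bar = sum(x) / len(x)
    y_bar = sum(y) / len(y)
    # Sum of cross-deviations, scaled by each variable's spread.
    cov = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    spread_x = sqrt(sum((xi - x_bar) ** 2 for xi in x))
    spread_y = sqrt(sum((yi - y_bar) ** 2 for yi in y))
    return cov / (spread_x * spread_y)


# Made-up data for illustration.
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 70]

r = pearson_r(hours, scores)
print(r)  # a value between -1 and +1
```

Swapping the arguments gives the same value, which reflects that correlation does not treat either variable as the dependent one.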
List the five most common functions of business reports.
• To ensure that all departments are functioning properly • To provide information • To provide the results of an analysis • To persuade others to act • To create an organizational memory (as part of a knowledge management system)
False
To respond to its market challenges, SiriusXM decided to focus on manufacturing efficiency.
True
In the FEMA case study, the BureauNet software was the primary reason behind the increased speed and relevance of the reports FEMA employees received.
2
Typical charts, graphs, and other visual elements used in visualization-based applications usually involve ________ dimensions.
predictive
Visual analytics is widely regarded as the combination of visualization and ________ analytics.
True
Visualization differs from traditional charts and graphs in complexity of data sets and use of multiple dimensions and measures.
False
When telling a story during a presentation, it is best to avoid describing hurdles that your character must overcome, to avoid souring the mood.
normality
When validating the assumptions of a regression, ________ assumes that the errors of the response variable are normally distributed.
Linearity
When validating the assumptions of a regression, ________ assumes that the relationship between the response variable and the explanatory variables is linear.
Data richness
Which characteristic of data means that all the required data elements are included in the data set?
data granularity
Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data?
Bubble chart
Which kind of chart is described as an enhanced version of a scatter plot?
graphic artwork
Which of the following is LEAST related to data/information visualization?
Why is it happening?
Which type of question does visual analytics seek to answer?
Geographic map
Which type of visualization tool can be very helpful when a data set contains location data?
Pie chart
Which type of visualization tool can be very helpful when the intention is to show relative proportions of dollars per department allocated by a university administration?
Metadata
With a dashboard, information on the sources of the data being presented and on the quality and currency of the underlying data provides contextual ________ for users.
monitoring
With dashboards, the layer of information that uses graphical, abstracted data to keep tabs on key performance metrics is the ________ layer.
Maps
________ are typically used together with other charts and graphs, as opposed to by themselves, and show postal codes, country names, etc.
Gantt
________ charts are a special case of horizontal bar charts that are used to portray project timelines, project tasks/activity durations, and overlap among the tasks/activities.
Bar
________ charts are effective when you have nominal data or numerical data that splits nicely into different categories so you can quickly see comparative results and trends within your data.
Bar
________ charts are useful in displaying nominal data or numerical data that splits nicely into different categories so you can quickly see comparative results and trends.
PERT
________ charts or network diagrams show precedence relationships among the project activities/tasks.
Metric
________ management reports are used to manage business performance through outcome-oriented metrics in many organizations.
Scatter
________ plots are often used to explore the relationship between two or three variables (in 2-D or 3-D visuals).
Logistic
________ regression is a very popular, statistically sound, probability-based classification algorithm that employs supervised learning.
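A sketch of how a fitted logistic regression model turns inputs into a probability and then a class label. The intercept and weights are hypothetical, hand-picked values (a real model would learn them from labeled training data, which is the supervised-learning part), and the 0.5 threshold is an assumed default.

```python
from math import exp


def sigmoid(z):
    """Map any real-valued score to a probability between 0 and 1."""
    return 1.0 / (1.0 + exp(-z))


def predict_probability(features, intercept, weights):
    """Probability of the positive class for one observation."""
    score = intercept + sum(w * x for w, x in zip(weights, features))
    return sigmoid(score)


def classify(features, intercept, weights, threshold=0.5):
    """Assign class 1 when the predicted probability reaches the threshold."""
    return 1 if predict_probability(features, intercept, weights) >= threshold else 0


# Hypothetical coefficients, not learned from any real data set.
intercept = -4.0
weights = [0.8, 1.5]

print(predict_probability([5.0, 1.0], intercept, weights))
print(classify([1.0, 0.0], intercept, weights), classify([5.0, 1.0], intercept, weights))
```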
Time
________ series forecasting is the use of mathematical modeling to predict future values of the variable of interest based on previously observed values.
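As one very simple illustration of predicting future values from previously observed ones, the sketch below uses a moving average. This is only one of many time series techniques, chosen here for brevity. The function names and the `monthly_sales` data are made up.

```python
def moving_average_forecast(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if window < 1 or window > len(history):
        raise ValueError("window must be between 1 and len(history)")
    return sum(history[-window:]) / window


def forecast_ahead(history, steps, window=3):
    """Forecast several periods ahead by feeding each forecast back into the series."""
    series = list(history)
    forecasts = []
    for _ in range(steps):
        next_value = moving_average_forecast(series, window)
        forecasts.append(next_value)
        series.append(next_value)
    return forecasts


# Made-up observations for illustration.
monthly_sales = [100, 110, 120, 130, 140, 150]

next_month = moving_average_forecast(monthly_sales)
next_two = forecast_ahead(monthly_sales, steps=2)
print(next_month, next_two)
```

Note that a moving average lags behind a steadily rising series like this one, which is why methods that model trend and seasonality are often preferred in practice.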
Inferential
________ statistics is about drawing conclusions about the characteristics of the population.
When you tell a story in a presentation, all of the following are true EXCEPT
a story should make sense and order out of a lot of background noise. ***a well-told story should have no need for subsequent discussion.*** stories and their lessons should be easy to remember. the outcome and reasons for it should be clear at the end of your story.
What is the fundamental challenge of dashboard design?
ensuring that the required information is shown clearly on a single screen
What is the management feature of a dashboard?
operational data that identify what actions to take to resolve a problem
Contextual metadata for a dashboard includes all the following EXCEPT
whether any high-value transactions that would skew the overall trends were rejected as a part of the loading process. ***which operating system is running the dashboard server software. *** whether the dashboard is presenting "fresh" or "stale" information. when the data warehouse was last refreshed.
List five types of specialized charts and graphs.
• Histograms • Gantt charts • PERT charts • Geographic maps • Bullets • Heat maps • Highlight tables • Tree maps