Chapter 3 Data Visualization
To summarize and analyze data with both a crosstabulation and charting, Excel typically pairs
PivotCharts with PitTables
To avoid problems in interpreting the differences in color in a heat map, we can add:
Sparklines
Bar charts use
horizontal bars to display the magnitude of the quantitative variable.
Deleting the grid lines in a table and the horizontal lines in a chart
increases the data-ink ration.
The data dashboard for a marketing manager may have KPIs related to
current sales measures and sales by region
Data-ink is the ink used in a table or chart that
is necessary to convey the meaning of the data to the audience.
Treemap
is used for visualizing hierarchical data along multiple dimensions.
We create multiple dashboards
so that each dashboard can be viewed on a single screen.
Tables should be used instead of charts when
the values being displayed have different units or very different magnitudes.
Tables should be use when:
(1) The reader needs to refer to specific numerical values. (2) The reader needs to make precise comparisons between different values and not just relative comparisons. (3) The values being displayed have different units or very different magnitudes.
Scatter Chart
Is a graphical presentation of the relationship between two quantitative variables.
Scatter Chart
Is a graphical representation of the relationship between two quantitative variables.
Trendline
Is a line that provides an approximation of the relationship between the variables.
The geographic Information System (GIS)
Is a system that merges map and statistics to present data collected over different geographies.
A Line Chart for time series
Is often called a time series plot.
PParallel-Coordinates Plot
Is used for examining data with more than two variables, and it includes a different vertical axis for each variable.
The Recommended PivotTable Tool
Is useful for quickly creating commonly used PivotTables for a data set.
PivotTable
An interactive crosstabulation created in EXCEL
Line Charts
Are known as time series plots
Line Charts
Are very useful for time series data collected over a period of time (minutes, hours, days, years, etc). The data points are connected by a line.
Charts or Graphs
Are visual methods for displaying data.
Charts
Are visual methods of displaying data
A chart that is recommended as an alternative to a pie chart
Bar Chart
In order to visualize three variables in a two-dimensional graph, we use:
Bubble Chart
An alternative for a stacked column chart when comparing more than a couple of quantitative variables in each category is
Clustered Column Chart
A PivotChart, in few instances, is the same as
Clustered-column chart
Data Visualization
Is very helpful for identifying data errors and for reducing the size of your data set by highlighting important relationships, trends and conveying your analysis to others.
Chart Filter button
Is very useful for performing additional data analysis
A disadvantage of stacked-column chart and stacked-bar chart is that
It can be difficult to perceive small differences in areas.
The best way to differentiate chart elements is using
Labels
Column Charts
Graphic representation of data. Column charts display vertical bars going across the chart horizontally, with the values axis being displayed on the left side of the chart.
A two-dimensional graph representing the data using different shades of color to indicate magnitude is called
Heat Map
An effective display of trend and magnitude is achieved by using a combination of
Heat Map and Sparklines
Key Performance Indicators (KPIs)
In business, the values indicating the business's current operating characteristics, such as its financial position, the inventory on hand, and customer service metrics, are typically known as:
Making visual comparisons between categorical variables is difficult in a
Pie Chart
Never use a ________ chart when a __________ chart will suffice.
3-D; 2-D
Chart Elements button
A button that enables you to add, remove, or change chart elements such as the title, legend, gridlines, and data labels.
Chart Filter button
A button that enables you to change which data displays in the chart.
Chart Styles button
A button that enables you to set a style and color scheme for your chart.
Sparkline
A line chart that has no axes but is used to provide information on overall trends for time series data
Crosstabulation
A table that describes data of two variables. A tabular summary of data for two variables. The clases of one variable are represented by the rows; the clases for the other variable are represented by the columns.
Sparkline
A tiny chart in the background of a cell that gives a visual trend summary alongside your data; makes a pattern more obvious.
Quantity Ratings
Example of Categorical Data
Meal Prices
Example of Quantitative Data
To generate a scatter chart matrix we use
Excel XLMiner
Using additional ink that is not necessary to convey information has the effect of ____________ on the data-ink ratio
Reduce
Fields may be chosen to represent all of the following in the body of a PivotTable
Rows, values, and columns
A useful chart for displaying multiple variables is the
Scatter Chart Matrix
Scatter Charts are often referred as :
Scatter Plots or Scatter Diagrams
Bar Charts and Column Charts
The charts that are helpful in making comparisons between categorical variables.
Data-ink Ratio
The ration of the amount of ink used in a table or chart that necessary to convey information to the total amount of ink used in the table and chart. Ink used that is not necessary to convey information reduces the data-ink ration.
Excel
The software package most commonly used for creating simple charts.
Pivot Tables in Excel
They are also known as crosstabulation tables. They summarize data for two variables. They are interactive.
Line Charts
They connect the points in the charts. Are very useful for time series data collected over a period of time (minutes, hours, days, years, etc)
Using multiple lines on a line chart or employing multiple charts is an alternative to
Three-dimensional chart
Bar Charts
Used when data is divided into categories (discrete data) The horizontal bars display the magnitud of quantitative data. The bars are separated to show different categories
Data Dashboard
a data visualization tool that updates in real time and gives multiple output
Column Bar
a graphical presentation that uses vertical bars to display the magnitude of quantitative data
PivotTable
are interactive, and they may be used to display statistics other than a simple count of items.
In many cases, white space in chart can improve
readability.