Visualizing data
5 phases of design process
-empathize -define -ideate -prototype -test
Channels will vary in terms of how effective they are at communicating data based on three elements:
. Accuracy - Are the channels helpful in accurately estimating the values being represented? Popout - How easy is it to distinguish certain values from others? Grouping - How good is a channel at communicating groups that exist in the data?
Story (concept):
: Story allows you to share your data in meaningful and interesting ways. Without a story, your visualization is informative, but not really inspiring.
clustering
A collection of data points with similar or different values. This is best represented through a distribution graph.
Static visualization
A data visualization that does not change over time unless it is edited
are you measuring changes over time?
A line chart is usually adequate for plotting trends over time. However, when the changes are larger, a bar chart is the better option.
Graphs and charts should use a diverging color palette to show contrast between elements.
Color contrast
Visuals and their organization should align with audience expectations and cultural conventions
Conventions and expectations
dynamic visualizations
Data visualizations that are interactive or change over time
What is involved with designing with an accessibility mindset?
Designing with an accessibility mindset involves thinking about your audience ahead of time; focusing on simple, easy to understand visuals; and creating alternative ways for your audience to access and interact with your data.
Filters in Tableau can be used for which of the following tasks?
Filters in Tableau can be used to limit information, customize information, or highlight a data point
hird slide: Data/analysis
First, It really is possible to tell your data story in a single slide if you summarize the key things about your data and analysis. You may have supporting slides with additional data or information in an appendix at the end of the presentation.
Fourth slide: Recommendations
If you have been telling your story well in the previous slides, the recommendations will be obvious to your audience. This is when you might get a lot of questions about how your data supports your recommendations. Be ready to communicate how your data backs up your conclusion or recommendations in different ways. Having multiple words to state the same thing also helps if someone is having difficulty with one particular explanation.
are there multiple data sets?
If you have data that has one, continuous, numerical variable, then a histogram or density plot are the best methods of plotting your categorical data. Depending on your type of data, a bar chart can even be appropriate in this case.
does your data only have one numerical value
If you have data that has one, continuous, numerical variable, then a histogram or density plot are the best methods of plotting your categorical data. Depending on your type of data, a bar chart can even be appropriate in this case.
In Tableau, tiled items can be layered over other objects. T/F
In Tableau ONLY, floating items can be layered over other objects
Here is an example of a 30-minute agenda:
Introductions (4 minutes) Project overview and goals (5 minutes) Data and analysis (10 minutes) Recommendations (3 minutes) Actionable steps (3 minutes) Questions (5 minutes)
They use the _____ tool on the Marks shelf to display the population of each country on the map.
Label
What tool do you use to select the area on the map representing Central America?
Lasso
means that you can build dashboards, reports, and views connected to automatically updated data.
Live data
Titles, axes, and annotations should use as few labels as it takes to make sense. Having too many labels makes your graph or chart too busy. It takes up too much space and prevents the labels from being shown clearly.
Minimal labels
A data analyst is working with the World Happiness data in Tableau. To get a better view of Moldova, they use the _____ tool.
Pan
First slide: Agenda
Provide a high-level bulleted list of the topics you will cover and the amount of time you will spend on each.
example of a purpose statement:
Service center consolidation is an important cost savings initiative. The aim of this project was to determine the impact of service center consolidation on customer response times.
involves providing screenshots or snapshots in presentations or building dashboards using snapshots of data.
Static data
A data analyst uses the Color tool in Tableau to apply a color scheme to a data visualization. Why do they make sure the color scheme has contrast?
The data analyst makes sure the color scheme has contrast in order to make the visualization accessible for people with color vision deficiencies.
Which of the following are elements for effective visuals? Select all that apply.
The elements for effective visuals are clear meaning, sophisticated use of contrast, and refined execution.
Pre-attentive attributes
The elements of a data visualization that an audience recognizes automatically without conscious effort
Goal (function):
The goal of your data visualization makes the data useful and usable. This is what you are trying to achieve with your visualization. Without a goal, your visualization might still be informative, but can't generate actionable insights.
Information (data)
The information or data that you are trying to convey is a key building block for your data visualization. Without information or data, you cannot communicate your findings successfully.
Data composition
The process of combining the individual parts in a visualization and displaying them together as a whole
When creating a presentation to share with stakeholders, what is the purpose of a framework?
The purpose of a framework is to create logical connections that tie back to the business task. It also gives your audience context about your data and helps you focus on the most important information.
Visual form (metaphor):
The visual form element is what gives your data visualization structure and makes it beautiful. Without visual form, your data is not visualized yet.
relativity
These are observations considered in relation or in proportion to something else. You have probably seen examples of relativity data in a pie chart.
ranking
This is a position in a scale of achievement or status. Data that requires ranking is best represented by a column chart.
change
This is a trend or instance of observations that become different over time. A great way to measure change in data is through a line or column chart.
Second slide: Purpose
This slide summarizes the purpose of the project and why it is important to the business for your audience.
Why do data analysts use alternative text to make their data visualizations more accessible?
To provide a textual alternative to non-text content
3 of trifecta check up questions
What is the practical question? What does the data say? What does the visual say?
do relationships between the data need to be shown?
When you have two variables for one set of data, it is important to point out how one affects the other. Variables that pair well together are best plotted on a scatter plot. However, if there are too many data points, the relationship between variables can be obscured so a heat map can be a better representation in that case.
Histogram
a chart that shows how often data values fall into a certain range
time series chart
a graphical representation showing change of a variable over time
headlines
a line of words printed in large letters at the top of the visualization to communicate what data is being presented.
Design thinking
a process used to solve complex problems in a user- centric way
Kaiser fungs junk charts trifecta checkup
a useful set of questions that can help consumers of data visualization critique what they are consuming and determine how effective it is.
heatmap
also use color to compare categories in a data set. They are mainly used to show relationships between two variables and use a system of color-coding to represent different values.
Channels
are visual aspects or variables that represent characteristics of the data. Channels are basically marks that have been used to visualize data
Marks are
basic visual objects like points, lines, and shapes. Every mark can be broken down into four qualities:
hue
color
refined execution
deep attention to detail
Track other spending that doesn't neatly fit into the set categories
discretionary spending
distribution graph
displays the spread of various outcomes in a dataset.
diverging palettes
displays two ranges of values using color intensity to show the magnitude of the number and the actual color to show the range the number is from
3 points of data story telling
engaging your auidence creating compelling visuals telling an interesting story about your data.
What icon do you click to hide a visualization?
eye
A data visualization should be clear, effective, and convincing enough to be absorbed in five seconds or less.
five second rule
Line graphs
help audience understand shifts or changes in your data
maps
help organize data geographically
legend (key)
identifies the meaning of various elements in a data visualization.
Correlation
in statistics is the measure of the degree to which two variables move in relationship to each other. An example of correlation is the idea that "As the temperature goes up, ice cream sales also go up."
the four elements of effective data visualization are
information(data) the story(concept) the goal (function) the visual form (metaphor)
how bright or dull a color is
intensity
decision tree
is a decision-making tool that allows you, the data analyst, to make decisions based on key questions that you can ask yourself. Each question in the visualization decision tree will help you make a decision about critical features for your visualization.
Which element of design can add visual form to your data and help build the structure for your visualization?
lines
it is a If one variable goes up and the other variable goes down, it is a
negative/ inverse correlation
If one variable goes up and the other variable stays about the same, there is
no correlation
Causation
occurs when an action directly leads to an outcome
Fifth slide: Call to action
ometimes the call to action can be combined with the recommendations slide. If there are multiple actions or activities recommended, a separate slide is best.
the four qualities of marks
position - where the mark is in relation to other marks size-how big, small, tall , or long the mark is shape- the shape of the mark color- the color of the mark
if one variable goes up and the other variable also goes up, it is a
positive correlation
spotlighting
scanning through data to quickly identify the most important insights.
correlation chart
show relationship among data
scatterplots
show relationships between different variables. Scatter plots are typically used for two variables for a set of data, although additional variables can be displayed.
pie charts
shows how much each part of something makes up the whole.
subtitles
supports the headline by adding more context and description
A data analyst is choosing their dashboard layout. They want the layout to automatically resize itself based on the dashboard size. They should use a tiled layout. (t/f)
true
Directly labeling a data visualization helps viewers identify data more efficiently. Legends are often less effective because they are positioned away from the data.
true
Bar Graphs
use size contrast to compare 2 or more values.
the color's lightness or darkness
value
causation
when an action directly leads to an outcome