Quantitative Survey Methods Ch 2
__________ Diagram is used to show the relationship between two quantitative variables
Scatter
_____________ diagrams and __________ are useful in exploring the relationship between two variables
Scatter diagrams and trendlines
______-__-_____ _____ Chart is used to compare two variables
Side-by-Side Bar
_________ _____ Chart is used to compare the relative frequency or Percent frequency of two categorical variables
Stacked Bar
_____-___-_____ Display is used to show both the rank order and shape of the distribution for quantitative data
Stem-and-Leaf
The three steps necessary to define the classes for a frequency distribution with quantitative data are:
Step 1 - Determine the number of non-overlapping classes. Step 2 -Determine the width of each class. Step 3 - Determine the class limit
a __________ is used to approximate the relationship of data in a scatter diagram
Trendline
The appropriate values for the class limits depend on the level of __________ of the data.
accuracy
in quality control, _____ charts are used to identify the most important causes of problems
bar
A _____ _______ is a graphical display for depicting qualitative data.
bar chart
On one axis (usually the horizontal axis), we specify the labels that are used for each of the classes. A frequency, relative frequency, or percent frequency scale can be used for the other axis (usually the vertical axis)
bar chart
Using a bar of fixed width drawn above each class label, we extend the height appropriately. The bars are separated to emphasize the fact that each class is separate.
bar chart
Displays used to show the distribution of data:
bar chart, pie chart, dot plot, histogram, stem-and-leaf display
A ________ ____________ is a widely used data visualization tool. It provides timely, summary information that is easy to read, understand, and interpret. Some additional guidelines include . . . •Minimize the need for screen scrolling. •Avoid unnecessary use of color or 3D. •Use borders between charts to improve readability
data dashboard
One of the simplest graphical summaries of data is a _____ ________. A horizontal axis shows the range of data values. Then each data value is represented by a dot placed above the axis
dot plot
The Pareto diagram is named for its founder, Vilfredo Pareto, an Italian ______________.
economist
A ____________ __________ is a tabular summary of data showing the number (frequency) of observations in each of several non-overlapping categories or classes
frequency distribution
The objective of _________ _________ is to provide insights about the data that cannot be quickly obtained by looking only at the original data
frequency distribution
in most cases, a ____________ display is more useful than a table for recognizing patterns and trends
graphical
Unlike a bar graph, a __________ has no natural separation between rectangles of adjacent classes.
histogram
Another common graphical display of quantitative data is a ______________. The variable of interest is placed on the horizontal axis. A rectangle is drawn above each class interval with its height corresponding to the interval's frequency, relative frequency, or percent frequency.
histogram
a stem-and-leaf display is similar to a _________ on its side, but it has the advantage of showing the actual data values. The first digits of each data item are arranged to the left of a vertical line. To the right of the vertical line we record the last digit for each item in rank order.
histogram
a data dashboard organizes and presents _____ ____________ ______________ used to monitor an organization or process.
key performance indicators (KPIs)
Each digit on a stem is a _______
leaf
The ________ class limit identifies the smallest possible data value assigned to the class.
lower
The class _________ is the value halfway between the lower and upper class limits.
midpoint
The last entry in a cumulative frequency distribution always equals the total number of __________________.
observations
An _________-_____ class requires only a lower class limit or an upper class limit.
open-end
A ___________ frequency distribution is a tabular summary of a set of data showing the percent frequency for each class.
percent
The ________ frequency of a class is the relative frequency multiplied by 100
percent
Cumulative _________ ________________ distribution - shows the percentage of items with values less than or equal to the upper limit of each class
percent frequency
The ____ ______ is a commonly used graphical display for presenting relative frequency and percent frequency distributions for categorical data. First draw a circle; then use the relative frequencies to subdivide the circle into sectors that correspond to the relative frequency for each class
pie chart
Cumulative __________ ___________ distribution- shows the proportion of items with values less than or equal to the upper limit of each class.
relative frequency
The _________ frequency of a class is the fraction or proportion of the total number of data items belonging to a class
relative frequency
A _________ frequency distribution is a tabular summary of a set of data showing the relative frequency for each class
relative frequency distribution
A _________ diagram is a graphical presentation of the relationship between two quantitative variables. One variable is shown on the horizontal axis and the other variable is shown on the vertical axis. The general pattern of the plotted points suggests the overall relationship between the variables. A trendline provides an approximation of the relationship
scatter
a ______-___-______ ____ chart is a graphical display for depicting multiple bar charts on the same display. Each cluster of bars represents one value of the first variable. Each bar within a cluster represents one value of the second variable.
side-by-side bar chart
A ___________ _____ chart is another way to display and compare two variables on the same display. It is a bar chart in which each bar is broken into rectangular segments of a different color. If percentage frequencies are displayed, all bars will be of the same height (or length), extending to the 100% mark
stacked bar chart
Each line (row) in a stem-and-leaf display is referred to as a ________.
stem
A _______-____-________ display shows both the rank order and shape of the distribution of the data.
stem-and-leaf
In practice, the number of classes and the appropriate class width are determined by _______ and __________. Once a possible number of classes is chosen, the appropriate class width is found. The process can be repeated for a different number of classes. Ultimately, the analyst uses judgment to determine the combination of the number of classes and class width that provides the best frequency distribution for summarizing the data
trial and error
The _________ class limit identifies the largest possible data value assigned to the class.
upper
Guidelines for Determining the Number of Classes
Use between 5 and 20 classes. Data sets with a larger number of elements usually require a larger number of classes. Smaller data sets usually require fewer classes. The goal is to use enough classes to show the variation in the data, but not so many classes that some contain only a few data items
Guidelines for Determining the Width of Each Class
Use classes of equal width. Approximate Class Width = (Largest data value− Smallest data value) / Number of classes Making the classes the same width reduces the chance of inappropriate interpretation
Approximate Class Width =
(Largest data value− Smallest data value) / Number of classes
relative frequency of a class =
(frequency of the class)/ n (divided by)
Displays used to show relationships:
-scatter diagram -trendline
Displays used to make comparisons:
-side by side bar chart -stacked bar chart
The last entry in a cumulative relative frequency distribution always equals _________
1.00
The last entry in a cumulative percent frequency distribution always equals ______
100
_____ Chart is used to show the frequency distribution or relative frequency distribution for categorical data
Bar
________ _________ must be chosen so that each data item belongs to one and only one class.
Class limits
____________ can be used when: • one variable is categorical and the other is quantitative, • both variables are categorical, or • both variables are quantitative. • The left and top margin labels define the classes for the two variables
Crosstabulation
_____________________ is a method for summarizing the data for two variables.
Crosstabulation
_____________ ____________ distribution- shows the number of items with values less than or equal to the upper limit of each class
Cumulative frequency
_______ ____________ describes the use of graphical displays to summarize and present information about a data set. The goal is to communicate as effectively and clearly as possible the key information about the data.
Data visualization
_____ ______ is used to show the distribution for quantitative data over the entire range of the data
Dot Plot
____________ is used to show the frequency distribution for quantitative data over a set of class intervals
Histogram
When the bars are arranged in descending order of height from left to right (with the most frequently occurring cause appearing first) the bar chart is called a ________ diagram
Pareto
______ Chart is used to show the relative frequency or percent frequency for categorical data
Pie
