CRM Stats CH 2
make a graph
1. graph, 2. chart builder, 3. drag variable
Bar Chart
A data visualization method that uses rectangular bars of equal width to represent categories or values being compared. The length of each bar is proportional to the magnitude of the category or value it represents based on quantity of observations
Negative Association
A relationship between two continuous measures such that changes in values trend in the opposite direction
Positive Association
A relationship between two continuous measures such that changes in values trend in the same direction
Bivariate Analysis
Analysis between two variables
H-Spread
The difference between the upper and lower hinge values of a box plot
exhaustive
When grouping data from a continuous measure into categories, the categories must be mutually exclusive and ____________?
Q1
adjacent to hinge extension is the lower quartile
scatter plot
association between 2 measures; like the histogram it is diagnostic and needs a pattern
Identify the most appropriate data visualization technique for displaying a variable used to measure a prison's security level, categorized as minimum, medium, and maximum?
bar chart
Identify the most appropriate data visualization technique for displaying a variable used to measure the type of attorney a criminal defendant had at trial, categorized as public defender or privately council?
bar chart
Identify the most appropriate data visualization technique for displaying a variable used to measure victim's race, categorized as White, Black or African American, American Indian or Alaska Native, Asian, and other to measure?
bar chart
Identify the most appropriate data visualization technique for displaying a variable used to measure the number of court authorized wire taps issued over the previous 10 years?
box plot
pie chart usage
categorical data, nominal or ordinal measure, but not too many categories
bar chart usage
categorical, nominal or ordinal data; more categories
exhaustive
collects every category
histogram usage
continuum - something counting and there are no spaces; ex age (counting); diagnostic purposes
box and whisker plots
count data so interval or ratio measures
median
cuts data in half
cumulative distribution
data reduction technique used to present a running total of frequencies or percentages
missing observations
data without valid observations
hinges
edges of the box
asterisks
extreme scores
trend line
feature can be added to a scatterplot to help visualize the direction and strength of a linear trend
f
frequencies
Identify the most appropriate data visualization technique for displaying a variable used to measure the amount of money that a police department collects each year from drug asset forfeitures (in USD)?
histogram
Identify the most appropriate data visualization technique for displaying a variable used to measure the national crime rate?
histogram
What 2 graphs correspond to numerical categorization?
histogram and box plot
Q2/Q3
interquartile range (50%)
%
percentages
What 2 graphs correspond to nominal categorization?
pie and bar chart
Identify the most appropriate data visualization technique for displaying a variable used to measure the sentences received by convicted defendants, categorized as jail, prison, probation, fine, and other?
pie chart
valid observations
refers to data whose observations can be accurately recorded in a variable's attribute
interquartile range
refers to the middle 50% of a distribution, visualized on a box plot
Identify the most appropriate data visualization technique for displaying a suspected relationship between a variable used to measure the amount of money that a police department collects each year from drug asset forfeitures (in USD) and the number of full-time sworn officers employed at those police departments?
scatterplot
Identify the most appropriate data visualization technique for displaying whether there might be a linear association between variable used to measure the national crime rate and a variable used to measure the national poverty rate?
scatterplot
hold shift
select all cells within a range (A1:E1)
mutually exclusive
separate requirements for each category
whiskers
show range or the span of a box plot (lines of the curve/skew)
n
subset of sample (ex: males) (italicized)
percentage distribution
summarize the number of times a variable's attribute is observed in a data set and is presented in an output table as a percentage of the entire distribution
frequency distribution
technique involves summarizing the distribution of raw counts of a variable's attribute in an output table
data reduction
the process of organizing large amounts of data to make meaning conclusions
hold control
to select 2 or more objects press and hold this, then click each object
Q4
top quartile
N
total sample size
scatterplot
used to assess whether the relationship between two continuous measures is linear
adjacent
used to describe the ends (horizontal line) of the whiskers displayed on a box plot
histogram
used to display a data distribution for a single continuous measure and is constructed similarly to the way a bar chart is constructed, except there aren't spaces between the bars
box chart
used to display categorical data, displaying bars associated with each of a variable's attributes that extend outward relative to the number of times the attribute is observed
box plot
used to display interval- or ratio-level measures and shows the spread of a distribution based on that distribution's quartiles
pie chart
uses a circular-shaped graphic divided into different-sized slices that represent each of a variable's attributes
adjacent
whiskers