Analyzing Data Vocabulary
Range
The difference between the highest and lowest values in a data set (highest minus lowest)
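As a quick illustration of the range, here is a minimal Python sketch. The list `scores` is made-up sample data, not taken from this vocabulary list.

```python
# Made-up sample data for illustration only.
scores = [72, 85, 90, 68, 95]

# Range = highest value minus lowest value.
data_range = max(scores) - min(scores)
print(data_range)  # 27
```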
Mode
Most frequently occurring number in a data set
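A small Python sketch of finding the mode. The helper name `modes` is hypothetical, and it returns a list because a data set can have more than one value tied for most frequent.

```python
from collections import Counter

def modes(values):
    """Return every value tied for the highest frequency, in sorted order."""
    counts = Counter(values)            # value -> how many times it occurs
    highest = max(counts.values())      # the largest frequency in the data
    return sorted(v for v, c in counts.items() if c == highest)

print(modes([2, 3, 3, 5, 7, 3, 5]))  # [3]
print(modes([1, 1, 2, 2, 3]))        # [1, 2] (two modes)
```

This sketch assumes the list is not empty.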
Univariate
One-variable data.
Bivariate
Two-variable data.
Standard Deviation
A computed measure of how much scores vary around the mean score.
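One way to compute it is sketched below in Python. This assumes the population form of the standard deviation (divide by the number of values); the sample form divides by one less than the number of values. The function name and data are made up for illustration.

```python
import math

def population_std_dev(values):
    """Square root of the average squared distance from the mean."""
    mean = sum(values) / len(values)
    squared_deviations = [(v - mean) ** 2 for v in values]
    return math.sqrt(sum(squared_deviations) / len(values))

scores = [2, 4, 4, 4, 5, 5, 7, 9]   # mean is 5
print(population_std_dev(scores))   # 2.0
```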
Outlier
A data point far away from the other data points
Box Plot
A diagram that shows the five-number summary of a distribution. (The five-number summary consists of the lowest value, lower quartile, median, upper quartile, and highest value.)
Histogram
A graph of vertical bars representing the frequency distribution of a set of data.
Scatterplot
A graph on a coordinate plane where two corresponding data values are paired to make an ordered pair, and each ordered pair is plotted as a point.
Dot Plot
A graph that summarizes data by the number of dots above each data value on the horizontal axis
Percentile
A number that tells us what percent of the data values lie at or below a certain level
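The "at or below" idea can be sketched in Python as follows. The function name `percentile_rank` and the data are hypothetical, and other textbooks and software use slightly different percentile conventions.

```python
def percentile_rank(values, level):
    """Percent of data values that lie at or below the given level."""
    at_or_below = sum(1 for v in values if v <= level)
    return 100 * at_or_below / len(values)

scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]
print(percentile_rank(scores, 80))  # 60.0 (6 of the 10 scores are at or below 80)
```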
Relative Frequency
A percent (or proportion) formed by dividing a frequency (count) by the total count
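A minimal Python sketch of count divided by total count. The helper name and the list of colors are made up for illustration.

```python
from collections import Counter

def relative_frequencies(values):
    """Map each distinct value to its count divided by the total count."""
    counts = Counter(values)
    total = len(values)
    return {value: count / total for value, count in counts.items()}

colors = ["red", "blue", "red", "green", "red", "blue", "red", "blue"]
for value, freq in relative_frequencies(colors).items():
    print(f"{value}: {freq:.1%}")   # red: 50.0%, blue: 37.5%, green: 12.5%
```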
Line of best fit
A straight line drawn to best describe a scatter plot that appears to have a linear relationship
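A line of best fit is often found by least-squares linear regression. The sketch below assumes that method; the function name and the data (hours studied and test scores) are invented for illustration.

```python
def least_squares_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sum of squared x deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x   # the line passes through (mean_x, mean_y)
    return slope, intercept

hours = [1, 2, 3, 4, 5]
scores = [52, 60, 63, 71, 74]
slope, intercept = least_squares_line(hours, scores)
print(slope, intercept)  # 5.5 47.5
```

The sketch assumes the x-values are not all identical, since that would make the slope undefined.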
Causal
Implying a cause-and-effect relationship
5 number summary
Minimum, Q1, Median, Q3, Maximum
Residual
The "left-overs": the difference between each actual data value and the value predicted by the line of best fit (actual minus predicted).
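A short Python sketch of computing residuals. The data, the slope 5.5, and the intercept 47.5 are made-up values standing in for a line of best fit that has already been found.

```python
def residuals(xs, ys, slope, intercept):
    """Actual y minus the y predicted by the line, for each data point."""
    return [y - (slope * x + intercept) for x, y in zip(xs, ys)]

hours = [1, 2, 3, 4, 5]
scores = [52, 60, 63, 71, 74]

# Assumed line of best fit for this made-up data: y = 5.5x + 47.5
print(residuals(hours, scores, 5.5, 47.5))  # [-1.0, 1.5, -1.0, 1.5, -1.0]
```

A positive residual means the point sits above the line; a negative residual means it sits below.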
Correlation coefficient
The value given during linear regression that measures the strength and direction of the linear model. The closer to 1 or -1, the stronger the linear model. The closer to 0, the weaker the linear model.
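The sketch below shows one way this value could be computed by hand in Python, assuming the usual Pearson formula. The function name and data are hypothetical.

```python
import math

def correlation_coefficient(xs, ys):
    """Pearson correlation coefficient r for paired data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

xs = [1, 2, 3, 4]
print(correlation_coefficient(xs, [2, 4, 6, 8]))  # perfectly linear, rising: r is 1
print(correlation_coefficient(xs, [8, 6, 4, 2]))  # perfectly linear, falling: r is -1
print(correlation_coefficient(xs, [3, 1, 4, 2]))  # weak linear pattern: r is near 0
```

The sketch assumes neither variable is constant, since that would mean dividing by zero.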
Uniform Distribution
Distribution where all of the values occur with about the same frequency, so the graph looks roughly flat
Symmetrical Distribution
Distribution where the left and right sides are roughly mirror images of each other, often lower on each side and higher in the middle
Skewed
Distribution where most of the data is to the right with a longer tail to the left (negatively skewed) or to the left with a longer tail to the right (positively skewed)
Mean
The average of all the numbers in a data set: the sum of the values divided by the number of values
Interquartile Range
The difference between the upper (Q3) and lower (Q1) quartiles.
1st Quartile
The number in the middle of the bottom half of the data set
Median
The number in the middle of the data set when the data is put in order from least to greatest (with an even number of values, the mean of the two middle numbers)
3rd Quartile
The number in the middle of the top half of the data set
Quartile
The three numbers (Q1, the median, and Q3) that separate an ordered data set into four equal parts
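The quartile definitions above can be sketched in Python as follows. This assumes the "median of each half" method, where the overall median is left out of both halves when the number of values is odd; calculators and software sometimes use other conventions and give slightly different quartiles. The function names and data are made up.

```python
def median(ordered):
    """Middle value of an already-sorted list (mean of the two middle values if even)."""
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum)."""
    ordered = sorted(values)
    n = len(ordered)
    bottom_half = ordered[: n // 2]        # values below the median position
    top_half = ordered[(n + 1) // 2 :]     # values above the median position
    return (ordered[0], median(bottom_half), median(ordered),
            median(top_half), ordered[-1])

data = [3, 7, 8, 5, 12, 14, 21, 13, 18]
minimum, q1, med, q3, maximum = five_number_summary(data)
print(minimum, q1, med, q3, maximum)   # 3 6.0 12 16.0 21
print(q3 - q1)                         # interquartile range: 10.0
```

The sketch assumes the data set has at least two values.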