MANA EXAM 2
Big Data
A massive volume of both structured and unstructured data that are often difficult to manage, process, and analyze using traditional data processing tools.
contingency table
A table that shows frequencies for two categorical variables, x and y, where each cell represents a mutually exclusive combination of the pair of x and y observations.
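Example (a minimal sketch, not from the course material): one way to tally a contingency table in plain Python. The observations and the helper name `contingency_table` are hypothetical, invented only for illustration.

```python
from collections import Counter

# Hypothetical paired observations (x = region, y = product); invented for illustration.
observations = [
    ("East", "A"), ("East", "B"), ("West", "A"),
    ("West", "A"), ("East", "A"), ("West", "B"),
]

def contingency_table(pairs):
    """Count how often each mutually exclusive (x, y) combination occurs."""
    counts = Counter(pairs)
    rows = sorted({x for x, _ in pairs})
    cols = sorted({y for _, y in pairs})
    # Each cell holds the frequency of one (x, y) combination.
    return {r: {c: counts[(r, c)] for c in cols} for r in rows}

table = contingency_table(observations)
for row, cells in table.items():
    print(row, cells)
```

Every observation lands in exactly one cell, so the cell frequencies add up to the number of observations.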
stacked column chart
Graph of a contingency table; depicts more than one categorical variable and allows for the comparison of composition within each category.
Volume
One of the V's describing big data; an immense amount of data is compiled from a single source or a wide range of sources.
Variety
One of the V's describing big data; data come in all types, forms, and granularity.
Velocity
One of the V's describing big data; data from a variety of sources get generated at a rapid speed.
Value
One of the V's describing big data; information derived from big data should have value.
Veracity
One of the V's describing big data; refers to the credibility and quality of data.
histogram (for a numerical variable)
A series of rectangles where the width and height of each rectangle represent the interval width and frequency (or relative frequency) of the respective interval.
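Example (a minimal sketch, not from the course material): the frequencies behind a histogram can be computed by counting observations per interval. The data, the helper name `frequency_distribution`, and the choice of intervals that include the lower limit but not the upper limit are all assumptions made for illustration; some textbooks use the opposite convention.

```python
# Hypothetical numerical data; invented for illustration.
values = [3, 7, 8, 12, 13, 14, 18, 21, 22, 27]

def frequency_distribution(data, start, width, n_intervals):
    """Count observations in equal-width intervals [lower, upper)."""
    freqs = [0] * n_intervals
    for v in data:
        idx = int((v - start) // width)
        if 0 <= idx < n_intervals:  # ignore values outside the covered range
            freqs[idx] += 1
    return [(start + i * width, start + (i + 1) * width, f)
            for i, f in enumerate(freqs)]

dist = frequency_distribution(values, start=0, width=10, n_intervals=3)
for lower, upper, freq in dist:
    rel = freq / len(values)  # relative frequency of the interval
    # The run of '#' plays the role of the rectangle's height.
    print(f"[{lower}, {upper}): {'#' * freq} freq={freq} rel={rel:.2f}")
```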
polygon (for a numerical variable)
Connects a series of neighboring points where each point represents the midpoint of a particular interval and its associated frequency or relative frequency.
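Example (a minimal sketch, not from the course material): the points a polygon connects can be derived from a frequency distribution. The intervals and the helper name `polygon_points` are hypothetical. The sketch leaves out the zero-frequency end points that some textbooks add to close the polygon on the horizontal axis.

```python
# Hypothetical frequency distribution: (lower limit, upper limit, frequency).
intervals = [(0, 10, 3), (10, 20, 4), (20, 30, 3)]

def polygon_points(dist, relative=False):
    """Return (midpoint, frequency or relative frequency) for each interval."""
    total = sum(f for _, _, f in dist)
    points = []
    for lower, upper, f in dist:
        midpoint = (lower + upper) / 2
        height = f / total if relative else f
        points.append((midpoint, height))
    return points

points = polygon_points(intervals)
relative_points = polygon_points(intervals, relative=True)
print(points)
```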
ogive
Connects a series of neighboring points where each point represents the upper limit of a particular interval and its associated cumulative frequency or cumulative relative frequency.
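Example (a minimal sketch, not from the course material): the points an ogive connects pair each interval's upper limit with a running total of the frequencies. The intervals and the helper name `ogive_points` are hypothetical.

```python
from itertools import accumulate

# Hypothetical frequency distribution: (lower limit, upper limit, frequency).
intervals = [(0, 10, 3), (10, 20, 4), (20, 30, 3)]

def ogive_points(dist, relative=False):
    """Return (upper limit, cumulative frequency or cumulative relative frequency)."""
    total = sum(f for _, _, f in dist)
    cumulative = accumulate(f for _, _, f in dist)  # running total of frequencies
    points = []
    for (_, upper, _), cum in zip(dist, cumulative):
        points.append((upper, cum / total if relative else cum))
    return points

cumulative_points = ogive_points(intervals)
cumulative_relative_points = ogive_points(intervals, relative=True)
print(cumulative_points)
```

The last point always reaches the total number of observations, or 1 when relative frequencies are used.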