Module 5 Big Data Exam - STUDY SET.jtc3896
Which of the following might be considered a metric in datasets? Choose the best answer available below.
Temperature
Imagine that a cop uses a radar gun to catch speeding cars, but that the radar gun is completely unpredictable. This faulty radar gun does one of the following, with no predictability: (1) determines the car's correct speed, (2) provides no reading at all, or (3) shows an incorrect speed.
This data is neither useful nor useable
When performing a Cluster Analysis on a dataset, what main aim does this visualization technique have?
To determine unique areas where data points are concentrated
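One way to picture "areas where data points are concentrated" is a tiny k-means pass over one-dimensional data. The sketch below is only an illustration: the function name `kmeans_1d`, the sample points, and the starting centers are all hypothetical, and k-means is just one of several clustering methods the card could be referring to.

```python
def kmeans_1d(points, centers, iterations=10):
    """Group 1-D points around the nearest center, then move each center
    to the mean of its group. Repeats for a fixed number of iterations."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            # Index of the center closest to this point.
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters

# Hypothetical data with two obvious concentrations, near 1 and near 9.
points = [1.0, 1.2, 0.8, 9.0, 9.5, 8.5]
centers, clusters = kmeans_1d(points, centers=[0.0, 10.0])
```

With this data the two centers settle near 1 and 9, which are the two areas where the points bunch together.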
As seen in the image shown below, what is the name of the data visualization that uses colors to show frequency in data?
Heatmap
As seen in this image, what is the name of the data visualization that can be used to show the frequency of dimensions that appear in datasets? These are the words used in the last 24 inaugural speeches, with the size of each word correlated with how often the word was used in the speeches.
Automated Summarization
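Sizing words by how often they appear, as the card above describes, starts with a word-frequency count. A minimal sketch, assuming plain English text: the function name `word_frequencies` and the sample sentence are hypothetical and are not taken from any real speech.

```python
from collections import Counter
import re

def word_frequencies(text):
    """Count how often each word appears, ignoring case and punctuation."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)

# Hypothetical sample text, invented for illustration.
sample = "We the people. The people decide, and the people act."
freq = word_frequencies(sample)

# The most frequent words would be drawn largest in the visualization.
top_words = freq.most_common(2)
```

A real pipeline would usually also drop very common words such as "the" before sizing anything.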
A local child psychologist is administering a test to local children to measure their development. An ad on the local television news asked interested parents of children in the age range of 1-9 years old to participate. The following chart summarized her findings: She noticed that no children in the age range of 4-5 participated in the study. Using statistical techniques, however, she was able to determine that a 4-year-old child should score around _____ on the test.
20
What is the best definition of the term viz?
A colorful and modern way of referring to a data visualization
Which of the following might be considered a dimension in datasets? Choose the best answer available below.
Country
In order to help make data useful, we sometimes have to engage in data scrubbing. Which is not an example of that process in action?
Erasing data that does not fit your hypothesis
Which of the following is not a file type that Big Data most often comes in?
GIF
In the image shown below, what is the name of the technique that can be used to prevent clumping of many data points when using a scatter plot to visualize?
Jitter
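Jitter works by nudging each point by a small random amount so that identical values no longer sit on top of each other. A minimal sketch using only the standard library: the function name `jitter`, the `amount` parameter, and the ratings data are hypothetical choices made for illustration.

```python
import random

def jitter(values, amount=0.2, seed=None):
    """Return a copy of values with a small uniform random offset
    in the range [-amount, amount] added to each one."""
    rng = random.Random(seed)
    return [v + rng.uniform(-amount, amount) for v in values]

# Hypothetical survey ratings: many points share exactly the same value,
# so they would overlap completely on a scatter plot.
ratings = [1, 1, 1, 2, 2, 3, 3, 3, 3]
jittered = jitter(ratings, amount=0.2, seed=42)
```

The offset should stay small relative to the spacing of the data so that the plot spreads the points out without misrepresenting their values.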
A local child psychologist is administering a test to local children to measure their development. An ad on the local television news asked interested parents of children in the age range of 1-9 years old to participate. The following chart summarized her findings: She noticed that no children in the age range of 4-5 participated in the study. Using statistical techniques, however, she was able to determine that a 4-year-old child should score around _____ on the test. What statistical analysis was the psychologist performing?
Regression
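The estimate for the missing age group can be sketched as a simple least-squares line fit. The chart from the card is not available here, so the ages and scores below are invented and chosen to be exactly linear so the result is easy to check; the function name `fit_line` is also hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: no observations for ages 4 and 5, as in the card.
ages = [1, 2, 3, 6, 7, 8, 9]
scores = [5, 10, 15, 30, 35, 40, 45]

slope, intercept = fit_line(ages, scores)

# Use the fitted line to estimate the score for an age with no data.
predicted_age_4 = slope * 4 + intercept  # close to 20 for this invented data
```

Real test scores would not fall on a perfect line, so the prediction would be an estimate with some error rather than an exact value.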
Choose the most accurate sample search string in order to satisfy the following criteria: (1) the subject is popular music of 1964; (2) the following term should be included exactly: Billboard Top Ten; (3) it should be a dataset; (4) no results from kaggle.com should be displayed.
csv music "Billboard Top Ten" -kaggle.com
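The parts of the search string above can be assembled programmatically: plain keywords, a quoted phrase for the exact match, and a leading minus sign for exclusions. This is a sketch only. The helper `build_search_query` and its parameters are hypothetical, and it mirrors the card's answer rather than any particular search engine's full syntax.

```python
def build_search_query(keywords, exact_phrase=None, excluded=()):
    """Join keywords, an optional quoted exact phrase, and
    minus-prefixed exclusions into one search string."""
    parts = list(keywords)
    if exact_phrase:
        # Double quotes ask the search engine for the exact phrase.
        parts.append(f'"{exact_phrase}"')
    # A leading minus sign asks the search engine to leave a term out.
    parts.extend(f"-{term}" for term in excluded)
    return " ".join(parts)

query = build_search_query(
    ["csv", "music"],            # "csv" targets dataset files
    exact_phrase="Billboard Top Ten",
    excluded=["kaggle.com"],
)
```

Here `query` reproduces the string given as the answer on the card.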
Which of the following is not a defining characteristic of Big Data? Choose the best answer available below.
Subject (volume, velocity, and variety are defining characteristics)