Week 2
Two doctors look at the exact same image of a brain scan. The image is inconclusive, yet one doctor sees evidence of an abnormality in the brain. The other doctor sees a healthy brain. This is an example of sampling bias. True or False?
False. This is an example of observer bias, which is the tendency for different people to observe the same thing differently.
Which of the following are types of data bias often encountered in data analytics? Select all that apply. Interpretation bias; Observer bias; Confirmation bias; Educational bias
Interpretation bias; Observer bias; Confirmation bias
A data analyst is analyzing sales data for the newest version of a product. They use third-party data about an older version of the product. For what reasons is this inappropriate for their analysis? Select all that apply. The data is biased; The data is not current; The data is not accurate; The data is not original
The data is not original; The data is not current
To determine if a data source is cited, you should ask which of the following questions? Select all that apply. Who created this dataset? Is the data relevant to the problem I'm trying to solve? Is this dataset from a credible organization? Has this dataset been properly cleaned?
Who created this dataset? Is this dataset from a credible organization?
Which of the following are examples of sampling bias? Select all that apply. A national election poll only interviews people with college degrees. A survey of high-school-age students does not include homeschooled students. A clinical study includes three times more men than women. An online marketing analytics firm stores data in a spreadsheet.
A national election poll only interviews people with college degrees. A survey of high-school-age students does not include homeschooled students. A clinical study includes three times more men than women.
Which of the following are usually good data sources? Select all that apply. Governmental agency data; Vetted public datasets; Social media sites; Academic papers
Academic papers; Governmental agency data; Vetted public datasets
What is data privacy? Providing free access, usage, and sharing of data; Preserving a data subject's information and activity for all data transactions; Applying well-founded standards of right and wrong that dictate how data is collected, shared, and used; Searching for or interpreting supporting information
Preserving a data subject's information and activity for all data transactions
Interoperability is key to open data's success. Which of the following is an example of interoperability? A company restricts the use of a database to its own employees; Different databases use common formats and terminology; A website charges a fee to access a database; An analyst removes all personally identifiable information from a database
Different databases use common formats and terminology
Which of the following are qualities of a bad data source? Select all that apply. The data source is out of date and irrelevant; The data source solely relies on third-party information; The data source is not cited or vetted; The data source is not missing any important information
The data source is out of date and irrelevant; The data source solely relies on third-party information; The data source is not cited or vetted
Ownership is a key issue in data ethics. Who owns data? The organization that invests time and money collecting, processing, and analyzing the data; The law enforcement agencies that enforce data protection laws; The individual who originally generates the data; The government that passes data-protection legislation
The individual who originally generates the data
Data anonymization applies to both text and images. True or False?
True
In data ethics, consent gives an individual the right to know the answers to which of the following questions? Select all that apply. Why is my data being collected? How long will my data be stored? Why am I being forced to share my data? How will my data be used?
Why is my data being collected? How long will my data be stored? How will my data be used?