Unit 8-Digital Information Assessment
Which of the following is the largest amount of data?
3 petabytes
Computers are often used to search through large sets of data to find useful patterns in the data. Which of the following tasks is NOT an example where searching for patterns is needed to produce useful information?
A high school analyzing student grades to identify the students with the top ten highest grade point averages.
A large company wants to store a backup copy of all emails for the past five years and allow every computer in the building to access it. Which of these systems should the company use?
An online storage network
How does Cloud Computing help us analyze Big Data?
By making vast computing resources available on demand
Biologists often attach tracking collars to wild animals. For each animal, the following geolocation data is collected at frequent intervals. The time The date The location of the animal Which of the following questions about a particular animal could NOT be answered using only the data collected from the tracking collars?
Do the movement patterns of the animal vary according to the weather?
What type of application program allows you to store, manipulate, and analyze data in organized workbooks for home and business tasks?
Electronic spreadsheet
Facebook needs to store, manipulate and present data related to members, their friends, member activities, messages, advertisements and lot more. It uses an electronic spreadsheet to organize the data.
False
Weaknesses of the database approach include reduced data redundancy, improved data integrity, shared data, easier access, and reduced development time.
False
What major company is on the cutting edge of Big Data?
A large data set contains information about all students majoring in computer science in colleges across the United States. The data set contains the following information about each student. The student's gender The state in which the student attends college The student's grade point average on a 4.0 scale Which of the following questions could be answered by analyzing only information in the data set?
How many states have a higher percentage of female computer science majors than male computer science majors attending college in that state?
Privacy can be a concern when dealing with large data sets. Consider that the following information is stored in a company's database for each customer: Customer name Social security number Last item purchased Address Product category preference Average purchase amount (in dollars) Internal customer reference number The company wants to make part of this raw data available to its employees for further analysis, but the company requires that the data is anonymous and cannot be linked to a specific customer. Which of the following combinations of data could employees be provided access to without endangering the anonymity of an individual customer?
Last item purchased Product category preference Average purchase amount (in dollars)
Which compression technique allows the exact original data to be reconstructed.
Lossless
Which type of compression reduces file size by decreasing the level of detail of the image?
Lossy.
What is referred to as Data Mining?
Methods of pattern recognition within Big Data stores
Is there a minimum size needed for data to be considered Big Data?
No, it refers to any large size which exceeds the capacity of a given domain to easily store and process it
The table below shows the time a computer system takes to complete a specified task on the customer data of different-sized companies. Based on the information in the table, which of the following tasks is likely to take the longest amount of time when scaled up for a very large company of 100,000 customers?
Sorting data
Which of the following is a language used to create and maintain professional, high-performance corporate databases?
Structured Query Language (SQL)
A database is a collection of data organized in a manner that allows access, retrieval, and use of that data.
True
An online telephone directory would use a database to store data pertaining to people, phone numbers, or other contact details.
True
Big Data is going to steal white collar professional jobs.
True
Business and government measure Big Data.
True
Machine learning is a branch of artifical intelligence.
True
Most video compression algorithms and codecs combine spatial image compression and temporal motion compensation.
True
Proper use of image compression can make a huge difference in the appearance and size of your website image files.
True
The recommendation engines, like the ones used by Netflix and Amazon, are algorithms based on your purchase history of similar items.
True
Your electricity service provider is an example of an entity using a database to manage billing, client related issues, or to handle fault data.
True
A search engine has a trend-tracking feature that provides information on how popular a search term is. The data can be filtered by geographic region, data, and category. Categories include arts and entertainment, computers and electronics, games, news, people and society, shopping, sports, and travel. Which of the following questions is LEAST likely to be answerable using the trends feature?
What is the cost of a certain electronics product?
Which of the following illustrate the concept of data persistence?
When an item is returned to a store, the original debit transaction is matched with a credit transaction rather than being simply deleted.
A retailer that sells footwear maintains a single database containing records with the following information about each item for sale in the retailer's store. Item identification number Footwear type (sneakers, boots, sandals, etc.) Selling price (in dollars) Size Color Quantity available Using only the database, which of the following can be determined?
Which items listed in the database are not currently in the store.
Is the World Wide Web used to share Big Data?
Yes, the World Wide Web made it really quick to download and upload enormous files
When you do a Google search, what are you actually querying?
a database
Machine learning includes: voice recognition self-driving cars identifying breast cancer all items listed are examples of machine learning
all items listed are examples of machine learning
Big data involves
facts, framework, and algorithms
Who invented Big data?
humans
The characteristics of Big Data include
volume, variety, velocity, variability