Computer Principles

Which of the following are examples of how people give up some of their privacy in order to gain something in return (utility)? Select two answers:

Customers sign up for "rewards" programs at different grocery stores so that they can get discounts on items throughout the store. People enable GPS on their phones so that apps can locate nearby stores, restaurants, and hotels.

Both small and big businesses can benefit from using big data in their organization. Which of the following is true about how businesses could use big data to their advantage?

All of these

Assume that when a user texts another person, the phone company keeps track not only of who the sender and receiver of the message were, but also of the content of the text being sent. Which of the following could fall under the category of metadata?

All of these

Which of the following can be used to extract structured information from unstructured data?

All of these

Students are using data collected by a non-profit organization to try to convince the school board that their school should be year-round with several week-long breaks, as opposed to the usual 9 months on and 3 months off. The information collected by this organization was as follows: - The location of the school (city and country). - The number of students at the school. - Whether it was year-round or had the normal 3-month summer break. - Scores on standardized tests (AP, SAT, ACT, etc.). - The student handbook of rules and regulations. - Results from a survey of teachers and students about happiness level and motivation level. They decided to make an infographic to display the data they have analyzed. Which of the following would be the best information to put on their infographic to convince the school board to change the schedule? Select two answers:

An association rule showing links between motivation and happiness levels and the type of schooling students were receiving. A regression analysis of standardized test scores comparing the two different types of schooling.

Consider the following relational database that contains the following census data. Which of the following would be the result if a user were to query this database for any "City" with a population between 100,000 and 1,000,000?

Anaheim, Austin, Charlotte, Tempe
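
The census table from the original question is not preserved here, so the range query can only be sketched under assumptions. The following minimal example uses Python's built-in sqlite3 module with a hypothetical `census` table; the city names and populations are made up for illustration and are not real census figures.

```python
import sqlite3

def cities_in_range(rows, low, high):
    """Return city names whose population lies in [low, high], sorted by name."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE census (city TEXT, population INTEGER)")
    conn.executemany("INSERT INTO census VALUES (?, ?)", rows)
    # BETWEEN is inclusive at both ends in SQL.
    cursor = conn.execute(
        "SELECT city FROM census "
        "WHERE population BETWEEN ? AND ? ORDER BY city",
        (low, high),
    )
    names = [row[0] for row in cursor]
    conn.close()
    return names

# Hypothetical rows (assumption): placeholder cities, not real census data.
SAMPLE_ROWS = [
    ("Smallville", 40_000),
    ("Midtown", 250_000),
    ("Rivertown", 900_000),
    ("Megacity", 3_500_000),
]

print(cities_in_range(SAMPLE_ROWS, 100_000, 1_000_000))
```

Only the rows whose population falls inside the requested range are returned, which is the same filtering the card's answer reflects.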

There was a large study conducted on a random sample of 500 students from the UK, South Africa and Australia. The graph displays a comparison of each student's height and age. Four data points are represented by a star on the graph. Upon further inspection, it was discovered that the 4 students had rare medical conditions.

Anomaly detection
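
One simple way to flag such points, sketched here under assumptions, is a z-score test: a value is treated as anomalous when it lies more than a chosen number of standard deviations from the mean. The heights and the threshold of 2.0 below are hypothetical, and the study in the question compared height against age, so a real analysis would more likely look at distance from the height-age trend than at height alone.

```python
from statistics import mean, pstdev

def find_outliers(values, threshold=2.0):
    """Return the values lying more than `threshold` standard deviations from the mean."""
    center = mean(values)
    spread = pstdev(values)
    if spread == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - center) / spread > threshold]

# Hypothetical heights in centimeters (assumption): one value is far from the rest.
HEIGHTS_CM = [150, 152, 149, 151, 153, 150, 148, 152, 151, 190]

print(find_outliers(HEIGHTS_CM))
```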

After reviewing the service records at a car dealership, the CEO (chief executive officer) discovered that people who schedule service for a transmission fluid exchange and a differential fluid exchange also typically schedule an oil change. This conclusion is reached using what data analysis technique?

Association rule mining
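
A rule such as "transmission and differential fluid exchange implies oil change" is usually judged by its support (how often all the items appear together) and its confidence (how often the consequent appears when the antecedent does). The sketch below computes both for a single rule; the service records and item names are hypothetical, and a full mining algorithm would also search for candidate rules rather than checking one.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence) for the rule antecedent -> consequent."""
    antecedent, consequent = set(antecedent), set(consequent)
    both = antecedent | consequent
    n_antecedent = sum(1 for t in transactions if antecedent <= t)
    n_both = sum(1 for t in transactions if both <= t)
    support = n_both / len(transactions)
    confidence = n_both / n_antecedent if n_antecedent else 0.0
    return support, confidence

# Hypothetical service records (assumption): one set of services per visit.
SERVICE_RECORDS = [
    {"transmission", "differential", "oil"},
    {"transmission", "differential", "oil"},
    {"transmission", "differential"},
    {"oil"},
    {"oil", "tires"},
]

support, confidence = rule_metrics(
    SERVICE_RECORDS, {"transmission", "differential"}, {"oil"}
)
print(f"support={support:.2f}, confidence={confidence:.2f}")
```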

The highest word frequency method, the TF*IDF method and the topic sentence concatenation method are all techniques that will produce a nice summary of commonly used words in a large group of text. This can then be displayed in a nice visual format like a word cloud. All of these methods are forms of what type of data mining strategy?

Automated summarization
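
Of the methods named in the question, TF*IDF is the easiest to sketch: a word scores highly in a document when it is frequent there but rare across the collection. The version below uses one common formulation (term frequency times the natural log of the inverse document frequency) and naive whitespace tokenization; the sample sentences are hypothetical, and other TF*IDF variants weight or smooth the terms differently.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {word: score} dict per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: how many documents contain each word.
    doc_freq = Counter(word for tokens in tokenized for word in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        scores.append({
            word: (count / len(tokens)) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores

# Hypothetical documents (assumption), chosen so "the" appears in all of them.
DOCS = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang",
]

scores = tf_idf(DOCS)
print(scores[0])
```

Words that occur in every document, such as "the" here, score zero, which is why TF*IDF tends to surface the distinctive words of each text.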

There is a popular site, similar to Netflix and Hulu, where you are allowed to stream different television shows and movies over the internet. There is a vast assortment of shows and movies in their database that users can choose from. In order to find the show or movie of choice, the most popular way is to locate it by genre of movie or by the television station that the show is on. Each month, this site adds new movies and TV shows to their database and assigns them to already existing categories. This process of adding new movies best describes which data analysis technique?

Classification

Which of the following are examples of unstructured data? Select two answers:

Closed-circuit security footage of a bank lobby. Digital Image scans of store receipts.

A snack company is starting an advertising campaign for its new line of tortilla chips. Rather than target specific demographic groups in its commercials, the company has decided to perform market research to determine common characteristics of patrons who prefer their chips. This is an example of what type of data mining strategy? Select one:

Cluster Analysis
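
Cluster analysis groups records by similarity without predefined labels. A minimal k-means sketch is shown below as one possible technique; the question does not say which clustering method the company would use, and the customer data (age, bags bought per month) and starting centroids are hypothetical.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coords) / len(points) for coords in zip(*points))

def k_means(points, centroids, iterations=10):
    """Assign points to the nearest centroid, then recompute centroids, repeatedly."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for point in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: squared_distance(point, centroids[i]),
            )
            clusters[nearest].append(point)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            mean_point(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical customers (assumption): (age, bags of chips bought per month).
CUSTOMERS = [(22, 1), (25, 2), (24, 1), (51, 6), (55, 7), (53, 6)]

centroids, clusters = k_means(CUSTOMERS, [(20, 0), (60, 8)])
print(clusters)
```

With this toy data the two groups separate cleanly by age; real survey data would need feature scaling and a more careful choice of the number of clusters.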

A teacher noticed that many students in their classes did not do very well on homework and classwork assignments but still seemed to do about the same as other students on the final exam. To check this claim, a statistics teacher helped them create the following graph from a data set that compares a student's final exam grade (FE_Grade) to their marking period average (Average) in the class. What the teacher observed and what the graph shows depict what data analysis technique?

Cluster analysis

Which Big Data analysis technique involves the examination of previously collected data sets in an attempt to discover patterns and other knowledge hidden within the data?

Data Mining

Which of the following is true when it comes to usefulness and usability of a data set? Select two answers:

Data can be useful but not usable. Data can be usable but not useful.

Social media sites like Facebook, Twitter and Instagram collect large amounts of data from their users every single day. Even after a user decides to leave the social media world and delete their account, the database still maintains a record of the user. This phenomenon is known to computer scientists as what?

Data persistence

A student in a history class is creating an infographic about the Civil War. He includes information on where battles took place, how long the battles lasted (on average), how many soldiers were involved on both sides, etc. What type of statistical analysis is this student using? Select one:

Descriptive Analytics

Biologists often attach tracking collars to wild animals. For each animal, the following geolocation data is collected at frequent intervals. - time - date - location of the animal Which of the following questions about a particular animal could NOT be answered using only the data collected from the tracking collars?

Do the movement patterns of the animal vary according to the weather?

By using your own data, search engines and other sites try to make your web experience more personalized. However, by doing this, certain information is being hidden from you. This process of choosing to show you certain information over other information puts you in what?

Filter bubble

A hospital that has its own pharmacy keeps track of the following information: - Date prescription is filled - Patient name - Room number - Medication prescribed - Cost of medication At the end of the week, all of the data is summarized into a database that is accessible by the hospital's financial analysts and can be sorted by any column in ascending or descending order. Below is a portion of this database: Which of the following cannot be determined from the information in the database?

How many patients were in the hospital on a given day.

A large data set contains information on all registered Republicans in the United States. The following information is recorded: - Name - Age - Gender - Home address - Whether they voted or not in the 2016 presidential election Which of the following questions could not be answered based solely on the information in this data set?

How many registered Republicans voted for Gary Johnson in the 2016 presidential election.

Sarah works part-time as a babysitter for a number of families in her neighborhood. In order to coordinate all of her babysitting jobs, she has created a website that parents can access to check her availability and reserve a night when Sarah can watch their kids. Using the website, parents can see Sarah's schedule, including which nights she is booked as well as comments and ratings from other parents indicating their level of satisfaction with her services for recent babysitting jobs. In order to do this, the website stores an online database of all of her babysitting appointments, including information on each job as well as personal details about the parents and their children. For each job, the database stores the location (i.e., family's home address), date and time, and the rating or comments that Sarah received for her work. For the parent information, Sarah's database stores the parents' names, home address, and contact information (home/work/cell phone numbers, email address, etc.). For the children, Sarah's database stores their names, ages, list of allergies, list of medications, and interests/hobbies. Because of the potentially sensitive and personal nature of the data that is stored in Sarah's database, which of the following factors does she NOT need to be concerned with from a security standpoint?

If parents provide only partial information when reserving a babysitting job (e.g., they do not enter data for one or more of the items stored in the database), the entire database record will be insecure and vulnerable to attackers.

Which of the following is NOT a benefit of making digital information and scientific databases openly available across the internet?

Inaccurate and misleading data can be more easily disseminated to scientific researchers.

Which of the following tasks best shows an example of using searching and sorting techniques on big data in order to find a useful pattern?

Keeping track of all employees' email use to see how many personal or work-related emails are sent during work time to check for productivity.

A popular restaurant likes to keep track of what food their patrons are ordering so that they can be better informed about what items they need to order in preparation for the next week. What would be the best way for the restaurant to organize this data with that end goal in mind? Select one:

List what meat, vegetable and fruit each customer ordered (steak - corn - apples, fish - peas - peaches, etc).

Two parents are trying to figure out how tall their child might be by using a formula that was created by studying large numbers of parents and children. This formula takes into account the heights of the parents along with other key factors. This is an example of what kind of analytics?

Predictive

A number of parents have volunteered their children to participate in a developmental study administered by a local child psychologist. The following chart summarizes the results of the psychologist's assessments. Which of the following statistical techniques can the psychologist use to determine the developmental score of a typical 4-year-old child despite the fact that no 4-year-old children participated in the study?

Regression
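
Regression fits a trend to the observed ages and scores, and the fitted line can then be evaluated at an age that was never measured. The chart from the original question is not preserved, so the sketch below uses hypothetical data and ordinary least squares with a single predictor; the psychologist's actual data may not be linear.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical study data (assumption): no 4-year-olds were assessed.
AGES = [2, 3, 5, 6]
SCORES = [20, 30, 50, 60]

slope, intercept = fit_line(AGES, SCORES)
predicted_score_at_4 = slope * 4 + intercept
print(predicted_score_at_4)
```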

A company wants to organize its data in a relational database, similar to a Google Fusion Table, so that the data can be easily narrowed according to certain categories. This is done using what?

SQL (Structured Query Language) filters

What is the conversion of data formatted for human use to a more easily used format that can be used by automated computer processes?

Screen Scraping

There are programs designed so that when hard copies of things such as receipts, business cards and recipes are scanned into a computer, items from those paper copies are represented electronically. In order for the program to understand the difference between a zip code and a phone number, or between the price of an item and the total bill for the shopping trip, certain rules must be put into place. If the program were trying to identify which part of a receipt is the date of purchase, which of the following probably would not be included in the rules?

Search for the numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 anywhere on the receipt.
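
The reason that rule fails is that bare numbers from 1 to 12 appear all over a receipt (quantities, prices, store numbers). A more useful rule looks for the whole shape of a date. The sketch below assumes, hypothetically, that dates are printed as month/day/year with slashes; real receipts use many other formats, so this pattern is illustrative only.

```python
import re

# Month 1-12, day 1-31, then a 4-digit or 2-digit year, separated by slashes.
DATE_PATTERN = re.compile(
    r"\b(0?[1-9]|1[0-2])/(0?[1-9]|[12]\d|3[01])/(\d{4}|\d{2})\b"
)

def find_purchase_date(receipt_text):
    """Return the first date-shaped substring, or None if there is none."""
    match = DATE_PATTERN.search(receipt_text)
    return match.group(0) if match else None

# Hypothetical receipt lines (assumption).
print(find_purchase_date("DATE 03/14/2019 TOTAL 12.50"))
print(find_purchase_date("STORE 12 ITEMS 3 TOTAL 12.50"))
```

The second line contains the numbers 12 and 3 but no date, and the pattern correctly reports nothing for it.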

Central High School recently won the girls' volleyball championship, and the team has been rewarded by the school with $5,000 to purchase merchandise. The coach wants to surprise the players with the merchandise at the banquet, so instead of asking them what sizes they want, he is trying to figure that out based on the information in the volleyball program. The following information is listed in the program for each player: - Name - Age - Grade - Height - Weight - Jersey number - Position What would be the best way for the coach to use this information to order sizes that work for the majority of the team?

Sort the data by height and weight and order smaller sizes for the girls that are shorter and weigh less and order larger sizes for the girls that are taller and weigh more.

The table below shows the time a computer system takes to complete a specified task on the customer data of different-sized companies. Based on the information in the table, which of the following tasks is likely to take the longest amount of time when scaled up for a very large company of approximately 100,000 customers?

Sorting data

The World Wide Web is full of unstructured data. Search engines like Google, Bing and Yahoo have been doing a good job of allowing users to search by key term in order to quickly locate links to websites about that particular topic. In order to do this, these search engines use what tool in order to help index and find these results?

Spiderbots

Different kinds of data analytics have differing levels of utility and confidence. In general, as the utility level increases, what happens with the confidence levels?

The confidence level decreases.

Which of the following are examples of structured data?

The glossary and index in the back of a textbook. An address book filled with family members' names and addresses.

Temporal scan thermometers are popular tools for taking a baby's temperature. They use sensors that, when slowly moved across the forehead, provide a temperature reading in either degrees Celsius or Fahrenheit. Which of the following relates to this type of thermometer's usefulness rather than its usability?

The thermometer (when used properly) is accurate to within 0.2 degrees.

A certain social media Web site allows users to post messages and to comment on other messages that have been posted. When a user posts a message, the message itself is considered data. In addition to the data, the site stores the following metadata. - time the message was posted - name of the user who posted the message - names of any users who comment on the message and the times the comments were made For which of the following goals would it be more useful to analyze the data instead of the metadata? Select one:

To determine the topics that many users are posting about.

Which of the following is a risk of obtaining information through the use of crowdsourcing? Select one:

Unless independently verified, the results of crowdsourcing may be inaccurate.

An infographic displays the relative frequencies of the 100 most common emoji used in text messaging for each of the last 12 months. Which of the following conclusions cannot be drawn from such a representation of emoji usage?

You can determine the average age of emoji users based on emoji use

