Comp sci unit 2 chapter 2

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

what is google trends? an advertisement tool a visualization tool a scientific database website browser database all of the above

all of the above

which of the following is where google trends comes from? real time data non-real-time data random sample of searches from last 7 days random sample of google search data from as far back as 2004 all of the above

all of the above

what type of chart is good for displaying nominal comparisons or ranking relationships? bubble chart scatter plot area chart heat map pie chart

bubble chart

what is quantitative data? numerical data that has a finite number of possible values data that can be sorted according to group or category data that is measured and has a value within a range data that can be counted or measured; all values are numerical none of the above

data that can be counted or measured; all values are numerical

what is correlation data? tracks change in values of a consistent metric over time simple comparison of the quantitative values of subcategories data with two or more variables that may demonstrate a positive or negative relationship subset of data compared to the larger whole distribution around a central value

data with two or more variables that may demonstrate a positive or negative relationship

what is the usefulness of patterns that are founding in popular topics? none helps to plan for the worst-case scenario empowers citizens and web users helps to identify, understand, and predict all of the above

helps to identify, understand, and predict

which chart is useful for showing the progression of values over time? bubble chart scatter plot line chart bar chart pie chart

line chart

which of the following is the best example of a nominal comparison? amusement park tickets sold on a rainy day vs. a regular day monthly sales number of visitors to various websites historic weather patterns, ranked from hottest months to coldest percentage of consumers purchasing specific products

number of visitors to various websites

When a topic is quickly growing in popularity it is said to be: becoming popular trending pop culture tweeting liked

trending

The digital divide is about how...

...people's access to computing and the Internet differs based on socioeconomic or geographic characteristics.

Aggregation

A computation in which rows from a data set are grouped together and used to compute a single value of more significant meaning or measurement.

README

A document providing background information about a dataset.

Hypothesis

A proposed explanation for some phenomenon used as the basis for further investigation.

Summary table

A table of aggregate information about a dataset (e.g., the average, sum, count of some values).

summary tables

A table that summarizes information about some larger dataset. It typically consists of performing computations like sums, averages, and counts on higher level groupings of information. The intent is to summarize lots of data into a form that is more useful, and easier to "see"

CSV

Abbreviation of "comma-separated values," this is a widely-used format for storing data.

This question refers to the same data from the High School Survey about college plans from the previous question. Amara decides to make a visualization of a portion of the responses showing only a few states and a few areas of study. She wants to make an effective visualization that shows for comparison: Students' average likelihood of attending college in-state broken down by which state they live in and what they plan to major in. For example, in Illinois (IL) on average students who want to study economics are very likely to say they want to attend college in-state. Amara makes four different visualizations shown below (marked A, B, C, D). According to good principles of visualization, and for what Amara wants to show, which one of these would be considered the best visual representation? Chart A (Line Chart) Chart B (Vertical Bar Chart) Chart C (Stacked Line Chart) Chart D (Stacked Vertical Bar Chart)

Chart B (Vertical Bar Chart)

Which of the following is the most accurate statement about cleaning and filtering data?

Filtering and cleaning data is necessary to ensure that data is in a form that is better for computers to process

The Chart below from Google Trends shows the prevelance of some search terms in the United States between 2004 and the present. Which of the following is the most accurate statement of what this chart is showing. Since sometime around 2009, red has become the favorite color of more people Generally speaking, since 2009 more people use "red" in their search terms more than they use "blue", "yellow", "green", or "purple" The general decline in the search term "yellow" might be due to the decline of searches for yellow taxis, as car sharing services have become more popular Generally speaking, the volume of internet searches is increasing over time because the number of people using the internet is also increasing.

Generally speaking, since 2009 more people use "red" in their search terms more than they use "blue", "yellow", "green", or "purple"

which population has the greatest internet usage through mobile phones/ connectors (all cell owners)? White Black Hispanic Native Americans all are equitable

Hispanic

The AP CS Principles framework contains the following statement: 7.1.1G Search trends are predictors. Which of the following is the most accurate statement about using search trends as predictors of future events? Search trends are imperfect predictors of future events that fully represent society at large. Search trends are accurate and reliable predictors of future events that fully represent society at large. Search trends are imperfect predictors of future events that may not fully represent society at large. Search trends are accurate and reliable predictors of future events that may not fully represent society at large.

Search trends are imperfect predictors of future events that may not fully represent society at large.

This question refers to the same data from the High School Survey about college plans from the previous question. Amara plans to use the survey data to create a visualization and short write up about students' plans for college, but first she wants to learn more about how the survey was conducted. Of the following things she might learn about the survey, which are the most likely sources of bias in the results based how it was collected? Choose two answers. She learns that the survey administrators only asked a representative sample of students, rather than every student in each state. She learns that responses were collected only by mobile app. She learns that the survey was only available to students who scored at the top 10% on the PSAT. She learns the survey was available to complete in both digital and paper form.

She learns that responses were collected only by mobile app. She learns that the survey was only available to students who scored at the top 10% on the PSAT.

A programmer is writing a system that is intended to be able to store large amounts of personal data. As the programmer develops the data system, which of the following is LEAST likely to impact the programmer's choices in designing the structure of the system? Maintaining privacy of the information stored in the data set. Scalability of the system. Structuring the metadata of the information for analysis. The frequency of a particular item occurring in a data set

The frequency of a particular item occurring in a data set.

Raw data

The original data as it was collected.

pivot table

The tool used by most spreadsheet programs to create a summary table.

A certain social media Web site allows users to post messages and to comment on other messages that have been posted. When a user posts a message, the message itself is considered data. In addition to the data, the site stores the following meta data. The time the message was posted The name of the user who posted the message The names of any users who comment on the message and the times the comments were made For which of the following goals would it be more useful to analyze the data instead of the metadata? To determine the users who post messages most frequently To determine the time of day that the site is most active To determine the topics that many users are posting about To determine which posts from a particular user have received the greatest number of comments

To determine the topics that many users are posting about

A bakery collects data on sales. Each sales record includes the date of the sale and some metadata about the items that were part of the sale. The data includes: the names of the items sold, the types of items sold, the number of each item sold, and the price of each item sold. Which of the following CANNOT be determined from the bakery's data set? The total income from sales the bakery received in the past month. Which customer most frequently purchases bread. The item bought in the highest quantity in the past week. Days when certain items sell the most.

Which customer most frequently purchases bread.

The next 3 questions all refer to data collected in a hypothetical survey of high school seniors, and a student, Amara, who is working with this data. The survey of high school seniors asked: What state do you live in? How likely are you to attend college in your home state? (on a scale of 1-5, 5 meaning "very likely") What do you plan to study? Amara is tasked with cleaning the data to prepare it for further analysis. Which of the following would be the least appropriate modifications to make to the data to prepare it for further analysis? Translate all states into their two-letter state code Group similar areas of study into a single area of study. For example: grouping Applied Mathematics and Mathematics together into "Mathematics" Round up all non-integer values for "Likelihood of staying in state" Removing the entire row with home state "adsfas" and recomputing

Round up all non-integer values for "Likelihood of staying in state"

Which of the following statements are true about pivot tables?

Pivot tables are used to quickly perform aggregate computations and groupings on a set of raw data Pivot tables are used to generate a summarized view of a large dataset which is helpful for gaining insight

which household, by race/ethnicity, has the greatest use of internet? White Asian Hispanic Black Native American

Asian

which population has the greatest internet usage through mobile phones/connectors (smartphone owners)? White Black Hispanic Native Americans all are equitable

Black

Consider the following numbers given in Binary (BIN), Decimal (DEC), and Hexadecimal (HEX) representations: BIN: 1110 DEC: 13 HEX: F Which of the following lists the numbers in order from least to greatest? BIN: 1110, DEC: 13, HEX: F DEC: 13, BIN: 1110, HEX: F DEC: 13, HEX: F, BIN: 1110 HEX: F DEC: 13, BIN: 1110

DEC: 13, BIN: 1110, HEX: F

Biologists often attach tracking collars to wild animals. For each animal, the following geolocation data is collected at frequent intervals. The time The date The location of the animal Which of the following questions about a particular animal could NOT be answered using only the data collected from the tracking collars? Approximately how many miles did the animal travel in one week? Does the animal travel in groups with other tracked animals? Do the movement patterns of the animal vary according to the weather? In what geographic locations does the animal typically travel?

Do the movement patterns of the animal vary according to the weather?


Ensembles d'études connexes

Custom:PN VATI Medical Surgical Re-evaluation Assessment

View Set

Chapter 4 - Test Scores and What They Mean

View Set

Macro Econ Chapter 12 Practice Quiz

View Set

WEEK 2 GERIATRIC SYNDROMES - FRAILTY & TCM & PALLIATIVE/HOSPICE CARE INTRO CH 24

View Set

Final Exam, Lab Manual Questions, Health Assessment

View Set