Chapter 8
Why is analyzing data with computers important? (A) To identify patterns that humans cannot see (B) To increase the viability of server farms (C) To verify existing solutions to problems (D) To test due diligence
A—Analyzing data allows us to identify patterns that could help solve problems or identify new possibilities that people likely could not process.
What is information about the author of a document considered to be? (A) Metadata (B) Content (C) Context (D) Mididata
A—Information about the author of a document is metadata.
Which data compression technique provides the most compression? (A) Lossy (B) Lossless (C) Filtering (D) Classification
A—Lossy data compression provides the most compression.
What is one way in which number systems are abstract? (A) The same amount can be represented by different number representations. (B) A number can only be represented by one number system. (C) Symbols can not be used to add, subtract, multiply, or divide them in their abstract form. (D) They use constants.
A—Number systems are abstract because the same amount can be represented by different number representations.
Being able to add or remove resources to store large data sets is called (A) scalability (B) filtering (C) efficiency (D) routing
A—scalability is adding or removing resources to store and process large datasets.
What Is 21410 in binary? (A) 11010100 (B) 11010110 (C) 11010111 (D) 01101011
B—21410is 11010110 in binary.
Your program is comparing temperatures to determine how many patients have a fever. Your selection statement's condition is not working correctly when the variable patient_temp is 98.6. What could be one reason why? IF (patient temp = 98.6) (A) The Format Is incorrect for this test. (B) A round-off error occurred. (C) An overflow error occurred. (D)The test condition is invalid.
B—A round-off error occurred. If The variable patient_temp contained 98.6, it could be stored in memory imprecisely as 98.5555559. The selection statement is then comparing 98.5555559 to 98.6, so the condition is false, when it is expected to be true.
Which of the following best describes abstraction? (A) Adding complexity so the concept can apply to more uses (B) Simplifying complexity to make the concept more general (C) Combining procedures to make a new one (D) A set of instructions to do a task
B—Abstraction means simplifying and taking away details to make something more general and flexible.
What Is the amount of data compression an algorithm can produce reliant upon? (A) No repeating parts of the file being compressed (B) Several patterns in the data (C) A large file size (D) A small file size
B—Algorithms can provide a larger percentage of compression when there is repetition in the data such as sections with repeating patterns. The entire pattern can be represented by one character or symbol in the compressed result.
A company purchases a large block of data from a social media site. Ifthey want to analyze the data to learn more about potential customers, what techniques should they use? (A) Simulations to test different hypotheses about what data could be present (B) Data analysis to identify patterns and relationships in the data for further analysis (C) Maximization to get the highest return on their purchase of the data (D)Data processing to use the data with existing company software to see if it will run on their systems or if new ones will need to be developed
B—Data analysis is the transformation of data to identify patterns and connections. Companies can then use the data to identify business opportunities to take advantage of threats to avoid.
Metadata is used to (A) provide updates to the data (B) help find and organize data (C) brand the data (D) sort the data
B—Metadata helps organize and tag data so that it can more easily be found.
A magnet school wants to advertise its students' success taking AP exams to prospective families. What's the best method to share the summarized data? (A) Post an image of student results on social media sites. (B) Create an interactive pie chart that can drill down to topics and overall scores posted on the school's website. (C) Write a report for a marketing pamphlet. (D) Send an email to families with middle-school-age children.
B—Prospective families will likely check the school's website for additional information. Therefore, an interactive pie chart can provide high-level information and drill down to more information when sections of the pie are selected. Social media accounts may not reach all families, and a marketing pamphlet cannot provide as much information or the drill down ability. Sending a mass email may be flagged as spam and families may not open an email from an unexpected source.
What order should these beintoo from smallest to largest? 1. Binary—01110111 2. Decimal 111
B—The binary number 01110111 converts to 119 in decimal. The number is smallest, then the first binary number in #1, then the second binary number in #3.
A school district wants to analyze the data about their high school graduates who attend local community colleges. What information can the school district obtain from the following data that the school has available? Student name Number of students in the graduating class by high school. High school graduation year Student self-reported plans for after high school Total numbers from local community colleges of enrolled students and their high school Total number of local community colleges students enrolled in each degree program (A) Average number of in-state students who enrolled in a local community college (B) If The Number of local graduates who enroll in a local community college is increasing, decreasing, or stable (C) Number of local high school students who graduate from a local community college (D)Popular degree programs in the community colleges
B—With the high school graduation year along with the community college enrollment numbers plus the students' high schools, the school district can determine if the number of their high school graduates who enroll in local community colleges is increasing, decreasing, or stable over time.
Which number type is stored imprecisely in memory? (A) Integers (B) Numbers with decimals (C) Both (D) Neither
B—lntegers, or whole numbers, are stored precisely. Real numbers with a decimal and fractional part are stored imprecisely in computer memory.
When does an overflow error occur? (A) When the computer runs out of memory to store program instructions (B) When the flow of binary digits reaches a broken pathway and cannot arrive at its destination (C) When an integer requires more bits to represent it than the programming language provides (D) When the numerator in a calculation is larger than the denominator
C—An overflow error occurs when a number needs more bits to hold it than the programming language provides. Think of it like a three-position odometer reaching 999. At the next mile, it should read 1 ,OOO, but it cannot store a number that large. Instead, it rolls over to 1 , which is an invalid representation ofthe mileage.
Data compression algorithms are used when the data (A) needs to be shared with a large number of people (B) is used for cryptography to keep data secure (C) is too large to send in a timely manner (D) needs to be senta large physical distance away
C—Compression techniques are used when data is too large to send in ways such as an attachment to an e-mail or when it would take too long to send it in its original, expanded form.
What is the number system used by computers? (A) Base 10 (decimal) (B) Base 8 (octal) (C) Base 2 (binary) (D) Base 16 (hexadecimal)
C—Computers use the binary or base 2 number system consisting of Os and Is.
How can an organization begin the process of analyzing data? (A) By following an iterative development process (B) By establishing measurements the data should show (C) By developing hypotheses and questions to test (D) By checking to see if the data matches previously collected data
C—Developing hypotheses and questions and testing these with the data helps gain insight and knowledge about it.
How many more bits are available if you go from a 32-bit computer to a 64-bit machine? (A) Twice as many (B) 32 more (C) 232 more (D) 322 more
C—The computer increases from 232 to 264 bits. Therefore, the increase is 232 bits.
If analyzing data indicates a company should only hire people with a college degree because they stay at the company longer, what is this a potential indication of? (A) Good data management practices leading to good hiring practices (B) Frequency analysis to identify commonalities in the data (C) Bias in collecting the data (D) Data assessment and inquiry of hiring practices
C—This represents bias in that if a company historically mainly hired people with college degrees, then the data being analyzed would show that they stayed with the company longer because of higher numbers overall.
Which topic needs the use of programs to analyze the data to identify insights? (A) The average number of students who drive to school each day (B) The record of wins and losses for all sports teams under their current coaches compared to prior years' win/loss records (C) The number of library books in a school district that need to be replaced each year (D) The standardized test scores for current students compared to test scores across the country for the past decade
D—The test scores compared to all other scores for the prior decade will be a large dataset. Searching for correlations between current student data and prior data is a task that needs programs to massage the data. The other options will have exact numbers on manageable-sized datasets that a person could identify manually or by using a local computer for processing.
An example of metadata about sea turtle nests could be (A) number of eggs in the nest (B) location of the nest (C) number of incubation days (D) tracking number assigned to the nest
D—Data about the data would be the tracking number for the nest. The other values are details about the nest itself.
Why is cleaning data important? (A) It ensures incomplete data does not hide or skew results. (B) It removes bad data. (C) It repairs incomplete data. (D) All of the above.
D—Data needs to be cleaned to remove or repair corrupt or incomplete data to ensure valid data is used for research and analysis.
What Is a reason to perform additional research on correlations found through data analysis? (A) There May not be an actual cause and effect relationship between the correlation variables. (B) A single source may not provide enough data for a conclusion. (C) To understand the relationship between the variables. (D) All of the above.
D—Each of these is a reason to perform additional research and analysis to validate the correlation.
When is sampling needed? (A) Sampling is used to store analog data. (B) Sampling is used to approximate real-world data. (C) Sampling is used when converting from digital data to analog data. (D) Sampling is used when converting from analog data to digital data.
D—Samplingis used to convert analog data to digital by taking samples of the analog data at intervals to create a digital representation of those values.
Given a table lunchroom leftovers, what can be determined from the data? Date Total Meals Total Meals Left Over 1/29/18 800 25 1/30/18 750 5 1/31/18 800 42 (A) Most popular items (B) Days of field trips when classes missed lunch (C) Days with high absenteeism (D)Amount of wasted budget dollars
D—While answers B and C hint at the cause, you cannot tell for sure from the table data. You can take the number of meals left over and multiply it by the cost of each meal to determine the budget dollars lost.
The Letter"M" is represented by 01001101 in binary. What Is this in decimal? (A) 414 (B) 76 (C) 77 (D) 1101
c—010011012 = 77 (01001101)2 = (Ox 28) + (1 x 27) + 26) + (Ox 25) + (1 x 24) + (1 x 23) + (Ox 22) x 20) (77)10