CSC220 Chapter 4-6
Which of the following is required to create a traditional data warehouse but NOT a data lake? a. Extract Transform Load process b. raw data c. data storage d. OLTP system and/or other data source(s)
a. Extract Transform Load process
_____ is an approach concerned with the efficient and environmentally responsible design, manufacture, operation, and disposal of IS-related products. a. Green computing b. Grid computing c. Cloud computing d. Utility computing
a. Green computing
_____ is an open-source operating system whose source code is freely available to everyone. a. Linux b. Mac OS X c. Chrome d. UNIX
a. Linux
Raw facts such as a social security number or catalog item number for a shirt are known as _____. a. data b. knowledge c. information d. entities
a. data
The component of a computer that provides the CPU with a working storage area for program instructions and data is called the __________. a. main memory b. input/output device c. bus d. processor
a. main memory
A collection of characteristics that belong to a single person, place, or thing for which data is maintained is a(n) _____. a. record b. file c. attribute d. character
a. record
To identify and make predictions about various alternative scenarios, a manager would use _______. a. simulation techniques b. descriptive analytics c. optimization techniques d. visual analytics
a. simulation techniques
_____ is used to explore large amounts of data for hidden patterns to predict future trends. a. Data governance b. Data mining c. Regression analysis d. A genetic algorithm
b. Data mining
_____ is the use of a collection of computers, often owned by many people or different organizations, to work in a coordinated manner to solve a common problem. a. Cloud computing b. Grid computing c. Parallel computing d. Web computing
b. Grid computing
The graphical representation that summarizes the steps a consumer takes in making the decision to buy your product and become a customer is called _____. a. a word cloud b. a conversion funnel c. a scatter diagram d. a pivot chart
b. a conversion funnel
New cars come with onboard computer systems that control antilock brakes, air bag deployment, fuel injection, etc. They run operating system software known as ____. a. a multi-user operating system b. an embedded operating system c. an enterprise operating system d. mobile application software
b. an embedded operating system
Currently, though many different programming languages are used, most software is developed using _____. a. a compiler b. an integrated development environment c. a debugger d. personal application software
b. an integrated development environment
Jesse's team is organizing the data in their company's relational database so that all data is stored in only one place and only related data is stored in any given table. What is this process called? a. conforming to ACID properties b. data normalization c. projecting d. selecting and joining
b. data normalization
Julie is a non-IS employee who is responsible for, among other tasks, creating and maintaining consistent reference data and master data definitions and for analyzing data for quality. Coworkers consult with Julie to find out what data to use to answer a business question. Julie is a _____. a. data analyst b. data steward c. database administrator d. data owner
b. data steward
With _____, the database is stored on a service provider's servers and accessed by the client over a network, typically the Internet. a. Internet access b. database as a service c. software as a service d. Oracle
b. database as a service
A group of programs used to access and manage a database as well as provide an interface between the database and its users and other application programs is called a _____. a. database b. database management system c. data model d. data definition system
b. database management system
What criteria are used by the Uptime Institute to classify data centers into four tiers? a. quality of fire protection systems, physical security systems, and HVAC system b. expected annual downtime, fault tolerance, and power outage protection c. local climate, risk of natural disasters, and power usage effectiveness d. number of customers, reliability of power source, and quality of equipment
b. expected annual downtime, fault tolerance, and power outage protection
A type of memory whose contents are not lost if the power is turned off or interrupted is said to be _____. a. unarbitrary b. nonvolatile c. inaccessible d. nonadjacent
b. nonvolatile
CPU clock speed is the predetermined rate at which the processor _____. a. processes instructions b. produces a series of electronic pulses c. produces a number of files d. loads memory pages
b. produces a series of electronic pulses
The class of computer systems used by multiple concurrent users offers businesses the potential to increase their processing capability to handle more users, more data, or more transactions in a given period, which is known as _____. a. backward compatibility b. scalability c. quantum computing d. portability
b. scalability
One key difference between a relational database and a NoSQL database is _____. a. where data comes from b. the way data storage and retrieval are modeled c. which vendors provide applicable software tools d. the data backup and recovery procedure
b. the way data storage and retrieval are modeled
3D printing is ________. a. an emerging technology with few practical uses b. used to make solid objects from filaments or powder c. used more in homes than in commercial settings d. an extremely high-resolution form of 2D printing
b. used to make solid objects from filaments or powder
Haley's employer has asked her to review a database containing thousands of social media posts about their company's products and extract the data the executive team needs to make decisions about these products and their marketing. In terms of the characteristics of big data, Haley is focusing on ________. a. volume b. value c. velocity d. veracity
b. value
An information system that operates in the _____ sphere of influence supports teamwork between two or more people who work together to achieve a common goal, regardless of where those team members live. a. personal b. workgroup c. enterprise d. social
b. workgroup
_____ is the term used to describe enormous and complex data collections that traditional data management software, hardware, and analysis processes are incapable of handling. a. Data warehouse b. Data mart c. Big data d. Knowledge base
c. Big data
_____ is a special-purpose programming language for accessing and manipulating data stored in a relational database. a. Query by Example (QBE) b. Access c. Structured Query Language (SQL) d. Java
c. Structured Query Language (SQL)
Mollie's company is having problems with their data integrity because multiple database users sometimes access and alter a record at the same time. A database management system could eliminate these problems by implementing _____. a. a logical access path b. data definition languages c. concurrency control d. a schema
c. concurrency control
A _____ is a climate- and access-controlled building or a set of buildings that houses the computer hardware that delivers an organization's data and information services. a. data mart b. data warehouse c. data center d. data mine
c. data center
What process detects and then corrects or deletes "bad data"? a. data recovery b. data enhancement c. data cleansing d. data validation
c. data cleansing
A _____ is a subset of a data warehouse that is used by small- and medium-sized businesses and departments within large companies to support decision making. a. data dictionary b. data model c. data mart d. data mine
c. data mart
Barry's job responsibilities include helping maintain a large database that holds business information from over a dozen source systems, covering all aspects of his company's processes, products, and customers. This database contains not only enterprise data but also data from other organizations. Barry works with a(n) _____. a. data mart b. data lake c. data warehouse d. in-memory database
c. data warehouse
A database system that stores the entire database in random access memory is known as a(n) _____. a. relational database b. HDFS database c. in-memory database d. NoSQL database
c. in-memory database
When rules and relationships are set up to organize raw facts, creating value beyond that of those individual facts, this produces _____. a. data b. knowledge c. information d. entities
c. information
The _____ is the heart of the operating system and controls its most critical processes. a. user interface b. register c. kernel d. cache
c. kernel
The simultaneous execution of two or more instructions at once by a computer is known as _____. a. parallel processing b. grid computing c. multiprocessing d. massive computing
c. multiprocessing
A(n) _____ is a characteristic or set of characteristics in a record that uniquely identifies the record. a. attribute b. entity c. primary key d. data item
c. primary key
Mark creates a new relational database table that includes only five of the seven columns in the existing products table. This action is known as ____. a. selecting b. joining c. projecting d. linking
c. projecting
A _____ is a computer employed by many users to perform a specific task, such as running network or Internet applications. a. slim client b. nettop c. server d. mainframe
c. server
When a business wishes to move away from hosting its own applications, a solution that offers many advantages is to use ______. a. proprietary software b. virtualization c. software as a service d. muti-boot options
c. software as a service
The purpose of business intelligence is to _____. a. provide access to novel tools to end users b. reduce the cost of data processing c. support improved decision making d. improve employee morale
c. support improved decision making
Donna is a member of a team trying to select the best type of database for a business problem. If the database must handle a variety of data, she would like to store the data on a group of servers, and the data structures must be very flexible, what would you suggest? a. Either a NoSQL or a relational database would be a good fit. b. Neither a NoSQL nor a relational database would be a good fit. c. A relational database would likely be a better fit than a NoSQL database. d. A NoSQL database would likely be a better fit than a relational database.
d. A NoSQL database would likely be a better fit than a relational database.
___ is the last phase of the six-phase CRISP-DM method. a. Evaluation b. Modeling c. Business understanding d. Deployment
d. Deployment
_____ is an example of a popular general-purpose software suite for personal computer users. a. Google Chrome b. Snow Leopard c. Mozilla Firefox d. Microsoft Office
d. Microsoft Office
An operating system with _____ capabilities allows a user to run more than one program concurrently. a. networking b. hardware independence c. memory management d. Multitasking
d. Multitasking
_____ are the most powerful computers with the fastest processing speed and highest performance. a. Blade servers b. Workstations c. Mainframe computers d. Supercomputers
d. Supercomputers
Which of the following is the LEAST essential characteristic for success as a data scientist? a. strong business acumen b. a deep understanding of analytics c. a healthy appreciation of the limitations of their data d. business leadership skills
d. business leadership skills
To ensure reliability and integrity, SQL databases conform to four specific properties. Which of the following is NOT one of those four properties? a. atomicity b. durability c. isolation d. currency
d. currency
What determines the size of words in a word cloud? a. length of the word or phrase b. difficulty in pronouncing the word or phrase c. whether the word is a noun or verb d. frequency of occurrence of the word in source documents
d. frequency of occurrence of the word in source documents
A(n) _____ device provides data and instructions to the computer and receives results from it. a. back-side b. expansion c. internal d. input/output
d. input/output
For the ____ operation, it is required that the the two tables have a common data attribute. a. delete b. select c. project d. join
d. join
Helen is 72 years old and is a retired school teacher on a fixed income. She would like to buy a new computer so that she can communicate via email, follow friends and family on social media, and occasionally access recipes and gardening tips from the Web. An important thing to consider is that Helen has arthritis in her hands, making it difficult for her to work with small buttons and gadgets. So, of the following options, her best choice is probably a(n) ________. a. smartphone b. e-book c. desktop d. nettop
d. nettop
Completing an instruction involves two phases—instruction and execution—which are each broken down into two steps for a total of four steps. Which of the following is NOT one of the four steps? a. fetch instruction b. store results c. execute instruction d. process data
d. process data
All of the following are examples of activities performed by an operating system EXCEPT ________. a. managing system memory b. controlling common computer hardware functions c. managing files d. providing word processing capabilities to users
d. providing word processing capabilities to users
Suppose you wish to run two different operating systems on one server. You can accomplish this by using _______. a. a multiprocessor operating system b. an embedded system c. the system utilities d. virtualization software
d. virtualization software