CIS 330 Final Quiz
Which of the following advances in information systems contributed to the emergence of data warehousing? Increase in viruses and other computer threats. Improvements in monitor technologies. Advances in middleware products that enabled enterprise database connectivity across heterogeneous platforms. The invention of the iPad.
Advances in middleware products that enabled enterprise database connectivity across heterogeneous platforms.
Which of the following factors drive the need for data warehousing? Businesses need an integrated view of company information. Informational data must be kept together with operational data. Data warehouses generally have better security. Reduce virus and Trojan horse threats.
Businesses need an integrated view of company information.
The analysis of summarized data to support decision making is called: operational processing. informational processing. artificial intelligence. data scrubbing
Informational processing
Converting data from the format of its source to the format of its destination is called: data transformation. data loading. data scrubbing. data storage.
data transformation
Data governance can be defined as: a means to slow down the speed of data. high-level organizational groups and processes that oversee data stewardship. a government task force for defining data quality. a means to increase the speed of data.
high-level organizational groups and processes that oversee data stewardship.
Informational and operational data differ in all of the following ways EXCEPT: level of detail. normalization level. scope of data. data quality
level of detail
Descriptive, predictive, and ________ are the three main types of analytics. adaptive comparative prescriptive decisive
prescriptive
Informational systems are designed for all of the following EXCEPT: running a business in real time. supporting decision making. complex queries. data mining.
running a business in real time
Quality data can be defined as being: unique. inaccurate. historical. precise.
unique
Operational and informational systems are generally separated because of which of the following factors? A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications. A properly designed data warehouse decreases value to data. A separate data warehouse increases contention for resources. Only operational systems allow SQL statements.
A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications.
The Hadoop framework consists of the ________ algorithm to solve large scale problems. MapSystem MapReduce MapCluster MapComponent
MapReduce
The process of transforming data from a detailed to a summary level is called: extracting. updating. joining. aggregating.
aggregating
At a basic level, analytics refers to: collecting data. conducting a needs analysis. analysis and interpretation of data. normalizing data.
analysis and interpretation of data
All of the following are tasks of data cleansing EXCEPT: decoding data to make them understandable for data warehousing applications. adding time stamps to distinguish values for the same attribute over time. generating primary keys for each row of a table. creating foreign keys.
creating foreign keys
Conformance means that: data have been transformed. data are stored, exchanged or presented in a format that is specified by its metadata. data are stored in a way to expedite retrieval. data is a harbinger.
data are stored, exchanged or presented in a format that is specified by its metadata.
When we consider data in the data warehouse to be time-variant, we mean: that the time of storage varies. data in the warehouse contain a time dimension so that they may be used to study trends and changes. that there is a time delay between when data are posted and when we report on the data. that time is relative.
data in the warehouse contain a time dimension so that they may be used to study trends and changes.
All of the following are ways to consolidate data EXCEPT:__ application integration. data rollup and integration. business process integration. user interaction integration
data rollup and integration
A technique using pattern recognition to upgrade the quality of raw data is called: data scrounging. data scrubbing. data gouging. data analysis.
data scrubbing
Allowing users to dive deeper into the view of data with online analytical processing (OLAP) is an important part of: predictive analytics. descriptive analytics. prescriptive analytics. comparative analytics.
descriptive analytics
When online analytical processing (OLAP) studies last year's sales, this represents: predictive analytics. descriptive analytics. prescriptive analytics. comparative analytics.
descriptive analytics
Which of the following organizational trends does not encourage the need for data warehousing? Multiple, nonsynchronized systems Focus on customer relationship management Downsizing Focus on supplier relationship management
downsizing
The development of the relational data model did not contribute to the emergence of data warehousing. True False
false
When multiple systems in an organization are synchronized, the need for data warehousing increases. True False
false
The best place to improve data entry across all applications is: in the users. in the level of organizational commitment. in the database definitions. in the data entry operators.
in the database definition
The process of combining data from various sources into a single table or view is called: extracting. updating. selecting. joining.
joining
It is true that in an HDFS cluster the DataNodes are the: large number of slaves. single master servers. language libraries. business intelligences.
large number of slaves
Big Data includes: large volumes of data with many different data types that are processed at very high speeds. large volumes of data entry with a single data type processed at very high speeds. large volumes of entity relationship diagrams (ERD) with many different data types that are processed at very high speeds. large volumes of entity relationship diagrams (ERD) with a single data type processed at very high speeds.
large volumes of data with many different data types that are processed at very high speeds.
Data federation is a technique which: creates an integrated database from several separate databases. creates a distributed database. provides a virtual view of integrated data without actually creating one centralized database. provides a real-time update of shared data.
provides a virtual view of integrated data without actually creating one centralized database.
Event-driven propagation: provides a means to duplicate data for events. pushes data to duplicate sites as an event occurs. pulls duplicate data from redundant sites. triggers a virus.
pushes data to duplicate sites as an event occurs.
Data quality ROI stands for: return on installation. risk of incarceration. rough outline inclusion. rate of installation.
risk of incarceration
One characteristic of quality data which pertains to the expectation for the time between when data are expected and when they are available for use is: currency. consistency. referential integrity. timeliness.
timeliness
Advances in computer hardware, particularly the emergence of affordable mass storage and parallel computer architectures, was one of the key advances that led to the emergence of data warehousing. True False
true
Informational systems are designed to support decision making based on historical point-in-time and prediction data. True False
true
The need for data warehousing in an organization is driven by its need for an integrated view of high-quality data. True False
true
The three 'v's commonly associated with big data include: viewable, volume, and variety. volume, variety, and velocity. verified, variety, and velocity. vigilant, viewable, and verified.
volume, variety, and velocity
Including data capture controls (i.e., dropdown lists) helps reduce ________ deteriorated data problems. external data source inconsistent metadata data entry lack of organizational commitment
data entry
The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop. relational database management system DBBMS Java data management
data management
It is true that in an HDFS cluster the NameNode is the: large number of slaves. single master server. language library. business intelligence.
single master server
One simple task of a data quality audit is to: interview all users. statistically profile all files. load all data into a data warehouse. establish quality metrics.
statistically profile all files
The characteristic that indicates that a data warehouse is organized around key high-level entities of the enterprise is: subject-oriented. integrated. time-variant. nonvolatile.
subject oriented
Which of the following is a basic method for single field transformation? Table lookup Cross-linking entities Cross-linking attributes Field-to-field communication
table lookup
External data sources present problems for data quality because: data are not always available. there is a lack of control over data quality. there are poor data capture controls. data are unformatted.
there is a lack of control over data quality