data midterm chapter 3
What is the definition of a data warehouse (DW) in simple terms?
A pool of data produced to support decision making; it is also a repository of current and historical data of potential interest to managers throughout the organization.
How does the use of cloud computing affect the scalability of a data warehouse?
Hardware resources are dynamically allocated as use increases.
________ (also called in-database analytics) refers to the integration of the algorithmic extent of data analytics into the data warehouse.
In-database processing
The ________ Model, also known as the EDW approach, is a top-down approach to data warehouse development.
Inmon
The ________ Model, also known as the data mart approach, is a "plan big, build small" approach. A data mart is a subject-oriented or department-oriented data warehouse. It is a scaled-down version of a data warehouse that focuses on the requests of a specific department, such as marketing or sales.
Kimball
Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information?
ROLAP
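A minimal sketch of the ROLAP idea, assuming a hypothetical `sales` table held in an in-memory SQLite database: every summary is computed by a SQL query at request time, and no cube or precomputed aggregate is stored. The table, column names, and figures are illustrative only.

```python
import sqlite3

# Hypothetical relational table standing in for warehouse data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("East", "Widget", 100.0), ("East", "Gadget", 50.0), ("West", "Widget", 70.0)],
)

def total_by(column):
    """Aggregate at query time; nothing is precomputed or stored."""
    if column not in {"region", "product"}:  # whitelist the column name
        raise ValueError(column)
    rows = conn.execute(
        f"SELECT {column}, SUM(amount) FROM sales GROUP BY {column} ORDER BY {column}"
    )
    return dict(rows.fetchall())

by_region = total_by("region")
by_product = total_by("product")
```

A MOLAP-style system would instead store such totals in a multidimensional structure ahead of time and read them back on request.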
________, or "The Extended ASP Model," is a creative way of deploying information system applications where the provider licenses its applications to customers for use as a service on demand (usually over the Internet).
SaaS (software as a service)
Examples of infrastructure
a) columnar b) real-time DW c) data management technologies and practices d) data warehouse appliances (all-in-one DW solutions) e) in-memory storage technology (moving the data into memory for faster processing) f) new database management systems g) advanced analytics
Examples of sourcing
a) web, social media and Big Data b) open source software c) SaaS (software as a service) d) cloud computing
The three main types of data warehouses are data marts, operational ________, and enterprise data warehouses.
data stores
The role responsible for successful administration and management of a data warehouse is the ________, who should be familiar with high-performance software, hardware, and networking technologies, and also possesses solid business insight.
data warehouse administrator (DWA)
The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT
decision makers have unfettered access to all data in the warehouse.
Performing extensive ________ to move data to the data warehouse may be a sign of poorly managed data and a fundamental lack of a coherent data management strategy.
extraction, transformation, and load (ETL)
The ________ data warehouse architecture involves integrating disparate systems and analytical resources from multiple sources to meet changing needs or business conditions.
federated
Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses?
federated architecture
All of the following statements about metadata are true EXCEPT
for most organizations, data warehouse metadata are an unnecessary expense.
All of the following are benefits of hosted data warehouses EXCEPT
greater control of data.
A(n) ________ architecture is used to build a scalable and maintainable infrastructure that includes a centralized data warehouse and several dependent data marts.
hub-and-spoke
Which data warehouse architecture uses a normalized relational warehouse that feeds multiple data marts?
hub-and-spoke data warehouse architecture
Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses?
improved customer service
Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
independent data mart
____________ refers to architectural (hardware and software) enhancements.
infrastructure
Data ________ comprises data access, data federation, and change capture.
integration
All of the following are true about in-database processing technology EXCEPT
it is the same as in-memory storage technology.
Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses?
large numbers of users, including operational staffs
Real-time data warehousing, also known as active data warehousing (ADW), is the process of _______________ via the data warehouse as they become available.
loading and providing data
________ describe the structure and meaning of the data, contributing to their effective use.
metadata
Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests?
parallel processing
In ________ oriented data warehousing, operational databases are tuned to handle transactions that update the database.
product
With ________ data flows, managers can view the current state of their businesses and quickly identify problems.
real-time
More data, coming in faster and requiring immediate conversion into decisions, means that organizations are confronting the need for _____________.
real-time data warehousing (RDW)
Most data warehouses are built using ________ database management systems to control and manage the data.
relational
A data mart is a subset of a data warehouse, typically consisting of a ___________ whereas a data warehouse _____________ across an entire enterprise.
single subject area or department; combines databases
_________ refers to mechanisms for acquisition of data from diverse and dispersed sources.
sourcing
Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is
speed of data transfer
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?
star schema
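A small illustration of a star schema, using an in-memory SQLite database: one fact table holds the measures and foreign keys, and each dimension table joins only to the fact table, never to another dimension. All table names, column names, and rows here are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_date    (date_id INTEGER PRIMARY KEY, year INTEGER, quarter TEXT);
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE dim_store   (store_id INTEGER PRIMARY KEY, city TEXT);
-- The fact table sits at the center and references every dimension.
CREATE TABLE fact_sales (
    date_id    INTEGER REFERENCES dim_date(date_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    store_id   INTEGER REFERENCES dim_store(store_id),
    amount     REAL
);
INSERT INTO dim_date    VALUES (1, 2023, 'Q1'), (2, 2023, 'Q2');
INSERT INTO dim_product VALUES (1, 'Widget'), (2, 'Gadget');
INSERT INTO dim_store   VALUES (1, 'Austin'), (2, 'Boston');
INSERT INTO fact_sales  VALUES (1, 1, 1, 100.0), (1, 2, 2, 40.0), (2, 1, 2, 70.0);
""")

# A typical query joins the fact table to one or more dimensions.
sales_by_city = conn.execute("""
    SELECT s.city, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_store AS s ON s.store_id = f.store_id
    GROUP BY s.city
    ORDER BY s.city
""").fetchall()
```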
Online ________ is arguably the most commonly used data analysis technique in data warehouses.
analytical processing
Metadata are data about data. Metadata describe the ________ and some meaning about data, thereby contributing to their effective or ineffective use.
structure of
Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are
subject-oriented and nonvolatile.
A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a
three tier architecture.
Online ________ is a term used for a transaction system that is primarily responsible for capturing and storing data related to day-to-day business functions such as ERP, CRM, SCM, and point of sale.
transaction processing
In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated?
transformation
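A rough sketch of where aggregation sits in an ETL flow, in plain Python. The `extract`, `transform`, and `load` functions and the sample rows are hypothetical stand-ins, not any particular tool's API; the point is that the roll-up to one row per store and day happens in the transformation step, before loading.

```python
from collections import defaultdict

def extract():
    # Stand-in for reading transaction-level rows from a source system.
    return [
        {"store": "A", "day": "2024-01-01", "amount": 10.0},
        {"store": "A", "day": "2024-01-01", "amount": 5.0},
        {"store": "B", "day": "2024-01-01", "amount": 7.5},
    ]

def transform(rows):
    # Aggregate transactions into one total per (store, day).
    totals = defaultdict(float)
    for row in rows:
        totals[(row["store"], row["day"])] += row["amount"]
    return [
        {"store": store, "day": day, "total": total}
        for (store, day), total in sorted(totals.items())
    ]

def load(rows, warehouse):
    # Stand-in for writing to the warehouse; here it is just a list.
    warehouse.extend(rows)

warehouse = []
load(transform(extract()), warehouse)
```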
The most common information system architectures that can be used for data warehousing are _______ and _________ but sometimes there is simply ________.
two-tier; three-tier architectures; one tier
major components of the data warehousing process
∙ Data sources ∙ ETL process: data extraction, transformation, and loading ∙ EDW and metadata ∙ Data marts (if desired) ∙ Middleware tools: enable access to the DW, e.g., data mining, OLAP, reporting tools, and data visualization tools.
4 characteristics of data warehousing.
∙ Subject oriented ∙ Integrated ∙ Time variant (time series) ∙ Nonvolatile
________ is a mechanism that integrates application functionality and shares functionality (rather than data) across systems, thereby enabling flexibility and reuse.
Enterprise application integration (EAI)
________ is a mechanism for pulling data from source systems to satisfy a request for information. It is an evolving tool space that promises real-time data integration from a variety of sources, such as relational databases, Web services, and multidimensional databases.
Enterprise information integration (EII)
In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected?
cleanse
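One way to picture the cleanse step, as a hedged sketch: correctable anomalies (stray whitespace, inconsistent case) are fixed, and rows that cannot be corrected are set aside. The rules and field names below are assumptions chosen for illustration, not a standard rule set.

```python
def cleanse(rows):
    """Return (clean_rows, rejected_rows) under illustrative rules."""
    clean, rejected = [], []
    for row in rows:
        # Correctable anomaly: normalize whitespace and case of the state code.
        state = (row.get("state") or "").strip().upper()
        amount = row.get("amount")
        # Uncorrectable anomalies: bad state code, missing or negative amount.
        if len(state) != 2 or amount is None or amount < 0:
            rejected.append(row)
            continue
        clean.append({"state": state, "amount": float(amount)})
    return clean, rejected

raw = [
    {"state": " tx ", "amount": 20},
    {"state": "CA", "amount": -5},
    {"state": None, "amount": 12},
]
clean, rejected = cleanse(raw)
```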
The primary purpose of metadata should be to provide context to the reported data; that is, it provides enriching information that leads to ______________.
creation of knowledge
________ modeling is a retrieval-based system that supports high-volume query access.
dimensional
When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is
drill down
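Drill down can be sketched as regrouping the same data one level deeper in a hierarchy. The example below assumes a hypothetical year-then-quarter time hierarchy and made-up sales figures; `summarize` is an illustrative helper, not an OLAP product's function.

```python
from collections import defaultdict

# Hypothetical rows: (year, quarter, amount).
SALES = [("2024", "Q1", 100), ("2024", "Q1", 50), ("2024", "Q2", 70), ("2023", "Q4", 30)]

def summarize(rows, depth):
    """Total amount grouped by the first `depth` levels of the time hierarchy."""
    totals = defaultdict(int)
    for *levels, amount in rows:
        totals[tuple(levels[:depth])] += amount
    return dict(totals)

# Summarized view: one total per year.
by_year = summarize(SALES, 1)

# Drill down into 2024: the same total broken out by quarter.
drilled = {key: value for key, value in summarize(SALES, 2).items() if key[0] == "2024"}
```

Going the other way, from quarters back to the yearly total, is the roll-up operation.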
In the Michigan State Agencies case, the approach used was a(n) ________ one, instead of developing separate BI/DW platforms for each business area or state agency.
enterprise
Which approach to data warehouse integration focuses more on sharing process functionality than data across systems?
enterprise application integration