Chapter 7
Data warehouse use
The retrieval of the _____ in the _________ - Indirect o Via the front-end (BI) applications - Direct o Via the DBMS o Via the OLAP (BI) tools
Enterprise-wide
The term refers to the fact that the data warehouse provides an organization-wide view of the analytically useful information it contains
Historical
The term refers to the larger time horizon in the data warehouse than in the operational databases
Data warehouse front-end (BI) applications
Used to provide access to the data warehouse for users who are engaging in indirect use
Retrieval of analytical information
-A data warehouse is developed for this, and it is not meant for direct data entry by the users. -The only functionality available to the users of the data warehouse is _________ -The data in the data warehouse is not subject to changes. -The data in the data warehouse is referred to as non-volatile, static, or read-only
Detailed and/or summarized data
-A data warehouse, depending on its purpose, may include ______ or _____ or both -A data warehouse that contains the ___ at the finest level of ____ is the most powerful
Developing front-end (BI) applications
-Designing and creating _____ for indirect use by the ____-users -Included in most data warehousing systems and are often referred to as business intelligence (BI) applications -Contain interfaces (such as forms and reports) accessible via a navigation mechanism (such as a menu)
Dependent data mart
-Does not have its own source systems -The data comes from the data warehouse
Source systems
-In the context of data warehousing, _______ are operational databases and other operational data repositories (in other words, any sets of data used for operational purposes) that provide analytically useful information for the data warehouse's subjects of analysis -Every operational data store that is used as a ________ for the data warehouse has two purposes: — The original operational purpose — As a ________ for the data warehouse -__________ can include external data ______
Data warehouse components
-Source systems -Extraction-transformation-load (ETL) infrastructure -Data warehouse -Front-end applications
Independent data mart
-Stand-alone _______, created in the same fashion as the data warehouse -Independent ________ has its own source systems and ETL infrastructure
Data warehouse
-The _________ is sometimes referred to as the target system, to indicate the fact that it is a destination for the _______ from the source systems -A typical __________ periodically retrieves selected analytically useful ______ from the operational _____ sources
Data mart
A ______ store based on the same principles as a _____ warehouse, but with a more limited scope
2) Operational queries typically process: A) Larger amounts of data than analytical queries B) Smaller amounts of data than analytical queries C) Equal amounts of data as analytical queries D) Infinite amounts of data, incomparable to analytical queries
B
4) Operational queries are typically issued: A) By fewer users than analytical queries B) By more users than analytical queries C) By an equal number of users as analytical queries D) By no users if the corporation has users issuing analytical queries
B
6) Which of the following is NOT true: A) Data marts typically have fewer data sources than data warehouses B) Data marts typically have broader focus than data warehouses C) Data marts are typically not as big as data warehouses D) Data marts typically have shorter implementation time than data warehouses
B
1) Data time-horizon is: A) Typically longer in operational systems than in analytical systems B) Typically equal in operational systems and in analytical systems C) Typically shorter in operational systems than in analytical systems D) Typically nonexistent in operational systems and in analytical systems
C
5) The purpose of the source system is: A) Original operational purpose only B) As a source system for the data warehouse only C) Both the original operational purpose and as a source system for the data warehouse D) As a write-only source system
C
Analytical information
Collected and used in support of __________ tasks
Operational information (transactional information)
Collected and used in support of day to day needs in businesses and other organizations
8) Data warehouse requirements can be refined and/or expanded following: A) Data warehouse use B) Data warehouse deployment C) Creating ETL infrastructure D) All of the above
D
9) Implicit changes of data warehouse requirements are permitted during: A) Data warehouse use B) Data warehouse deployment C) Creating ETL infrastructure D) Implicit changes of requirements are not permitted
D
A typical organization
Maintains and utilizes a number of operational data sources.
Data warehouse administration and maintenance
Performing activities that support the data warehouse end user, including dealing with technical issues, such as: -Providing security for the information contained in the data warehouse -Ensuring sufficient hard-drive space for the data warehouse content -Implementing the backup and recovery procedures
Data warehouse deployment
Releasing the _______ and its front-end (BI) applications for use by the end users
Requirements collection, definition, and visualization 2
The COLLECTED requirements should be clearly DEFINED and stated in a written document, and then VISUALIZED as a conceptual data model
Creating ETL infrastructure
• Creating necessary procedures and code for: o Automatic extraction of relevant data from the operational data sources o Transformation of the extracted data, so that its quality is assured and its structure conforms to the structure of the modeled and implemented data warehouse o The seamless load of the transformed data into the data warehouse — Due to the amount of details that have to be considered, is often the most time- and resource-consuming part of the data warehouse development process
Two main reasons for the creation of a data warehouse as a separate analytical database
-The performance of operational day-to-day tasks involving data use can be severely diminished if such tasks have to compete for computing resources with analytical queries -It is often impossible to structure a database which can be used in an efficient manner for both operational and analytical purposes
Time variant
-The term refers to the fact that a data warehouse contains slices or snapshots of data from different periods of _____ across its _______ horizon -With the data slices, the user can create reports for various periods of ______ within the _____ horizon
3) Operational queries are typically issued: A) More often than analytical queries B) Less often than analytical queries C) Equally as often as analytical queries D) Never in a corporation that has users issuing analytical queries
A
7) Which of the following is NOT a part of the data warehouse requirements step? A) Requirements collection B) Requirements definition C) Conceptual modeling D) Logical modeling
D
Requirements collection, definition, and visualization
Results in the ________ specifying the desired capabilities and functionalities of the future data warehouse -The _________ are based on the analytical needs that can be met by the data in the internal data source systems and available external data sources -The _________ are collected through interviewing various stakeholders of the data warehouse -In addition to interviews, additional methods for eliciting ________ from the stakeholders can be used
Structured repository
-The data warehouse is a database containing analytically useful information -Any database is this with its _______ represented in its metadata
Data warehouse
A structured repository of integrated, subject-oriented, enterprise-wide, historical, and time-variant _____. The purpose of it is the retrieval of analytical information. It can store detailed and/or summarized ____.
Operational data sources
Include the databases and other ____ repositories which are used to support the organization's day-to-day _______
Integrated
The data warehouse _______ the analytically useful data from the various operational databases (and possibly other sources) ________ refers to this process of bringing the data from multiple data sources into a singular data warehouse.
ETL infrastructure
-The _________ that facilitates the retrieval of data from operational databases into the data warehouses -______ includes the following tasks: o EXTRACTING analytically useful data from the operational data sources o TRANSFORMING such data so that it conforms to the structure of the subject-oriented target data warehouse model (while ensuring the quality of the transformed data) o LOADING the transformed and quality assured data into the target data warehouse
Operational data
-Typical Time-Horizon: Days/Months -Detailed -Current -Small Amounts used in a Process -High frequency of Access -Can be Updated -Non-Redundant -Used by all types of employees for tactical purposes -Application Oriented
Analytical Data
-Typical Time-Horizon: Years -Summarized (and/or Detailed) -Values over time (Snapshots) -Large Amounts used in a Process -Low/Modest frequency of -Access Read (and Append) Only -Redundancy not an Issue
Creating the data warehouse
-Using a DBMS to implement the ________ model as an actual _______ • Typically, _______ are implemented using a relational DBMS (RDBMS) software
Data warehouse modeling (logical data warehouse modeling )
Creation of the data warehouse _______ that is implementable by the DBMS software
A data warehouse
Is created within an organization as a separate ____ store whose primary purpose is ____ analysis
Subject-oriented
-The term refers to the fundamental difference in the purpose of an operational database system and a data warehouse. -An operational database system is developed in order to support a specific business operation -A data warehouse is developed to analyze specific business subject areas
10) Creating ETL infrastructure includes creating necessary procedures and code for: A) Designing and creating applications for indirect use by the end users B) The seamless load of the transformed data into the operational data bases C) Automatic extraction of relevant data from the data warehouses D) Transformation of the extracted data, so that its quality is assured and its structure conforms to the structure of the modeled and implemented data warehouse
D