Business Analytics Chapter 4
Data Consolidation
(Data wrangling) Reflects the reality that data from various sources are frequently mismatched and therefore need to be manipulated and adjusted to align them.
Data Warehouse
A single source of truth; that is, data are consistent, uniform, integrated, unambiguous, and reliable.
Data Source
In data warehousing, a collection of fields that have been extracted from the source system. The data source is usually a subset of all the fields available in the source system. Data from the source system are raw, unchanged data.
What is ETL?
ETL (extract, transform, load) is the process of populating data structures such as InfoCubes, data marts, and data warehouses from one or many sources. ETL is designed to ensure that data are consistent.
Loading
The movement of data from source systems to their intermediate or final destinations.
Transformation
The process of mapping and harmonizing data from their sources to their targets so that they are consistent, cleansed, and reliable.
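As a rough illustration of mapping and harmonizing, the sketch below renames source fields to a common target layout and replaces variant country spellings with one code. The source names (`crm`, `erp`), the field names, and the lookup tables are all hypothetical, chosen only for the example.

```python
# Hypothetical field mappings: source field name -> target field name.
FIELD_MAP = {
    "crm": {"cust_name": "customer", "ctry": "country"},
    "erp": {"CustomerName": "customer", "Country": "country"},
}

# Hypothetical harmonization table: variant spellings -> one canonical code.
COUNTRY_CODES = {"usa": "US", "united states": "US", "us": "US"}


def transform(record, source):
    """Map one source record to the target layout and harmonize its values."""
    mapping = FIELD_MAP[source]
    # Mapping: rename source fields to target fields, dropping unmapped ones.
    target = {mapping[k]: v for k, v in record.items() if k in mapping}
    # Cleansing: trim stray whitespace from text values.
    target = {k: v.strip() if isinstance(v, str) else v for k, v in target.items()}
    # Harmonizing: replace country variants with a canonical code.
    country = target.get("country", "")
    target["country"] = COUNTRY_CODES.get(country.lower(), country)
    return target


crm_row = transform({"cust_name": " Acme Corp ", "ctry": "USA"}, "crm")
erp_row = transform(
    {"CustomerName": "Acme Corp", "Country": "United States", "Internal": 7}, "erp"
)
print(crm_row == erp_row)  # the two sources now agree on one representation
```

After transformation, records that described the same customer differently in each source compare as equal, which is the sense of "consistent" used in the definition.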
Persistent Storage Area
A staging area, associated with the data source, where data are stored.
ETL Process
Brings data from source systems into a data warehouse by first extracting them into a data staging area; from there, the data are transformed and loaded into a data cube.
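The three steps can be sketched end to end in a few lines. This is a toy illustration, not a real warehouse: the source rows are made up, the staging area is a plain list, and the "cube" is just a dictionary of totals keyed by (region, product).

```python
from collections import defaultdict

# Hypothetical source system: raw, unchanged sales rows.
SOURCE_ROWS = [
    {"region": "north ", "product": "a", "amount": "100.0"},
    {"region": "North", "product": "A", "amount": "50.5"},
    {"region": "south", "product": "B", "amount": "20"},
]


def extract(source_rows):
    """Copy raw rows into a staging area without changing them."""
    return [dict(row) for row in source_rows]


def transform(staged_rows):
    """Make staged rows consistent: normalized text, numeric amounts."""
    return [
        {
            "region": row["region"].strip().title(),
            "product": row["product"].strip().upper(),
            "amount": float(row["amount"]),
        }
        for row in staged_rows
    ]


def load(clean_rows):
    """Aggregate amounts by (region, product), a toy stand-in for a data cube."""
    cube = defaultdict(float)
    for row in clean_rows:
        cube[(row["region"], row["product"])] += row["amount"]
    return dict(cube)


staging_area = extract(SOURCE_ROWS)
cube = load(transform(staging_area))
print(cube)
```

Note that the staging area still holds the raw values (for example `"north "` with its trailing space); only the transformed copy is loaded into the cube.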
Extraction
The process of identifying data sources and source fields and acquiring (sourcing) the data. A program called an extractor must be provided for every type of source system in the data warehouse.
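The idea of one extractor per type of source system can be sketched as a lookup table of functions. The two source types here (`csv` and `json`), the function names, and the sample data are assumptions made for illustration; a real warehouse would supply its own extractors.

```python
import csv
import io
import json


def extract_csv(raw_text, fields):
    """Extractor for a CSV-type source: keep only the requested fields."""
    reader = csv.DictReader(io.StringIO(raw_text))
    return [{f: row[f] for f in fields} for row in reader]


def extract_json(raw_text, fields):
    """Extractor for a JSON-type source: keep only the requested fields."""
    return [{f: row[f] for f in fields} for row in json.loads(raw_text)]


# One extractor per type of source system (hypothetical type names).
EXTRACTORS = {"csv": extract_csv, "json": extract_json}


def extract(source_type, raw_text, fields):
    """Pick the extractor for the source type and acquire the chosen fields."""
    return EXTRACTORS[source_type](raw_text, fields)


csv_source = "id,name,notes\n1,Ann,x\n2,Bob,y\n"
json_source = '[{"id": 3, "name": "Cy", "notes": "z"}]'
rows = extract("csv", csv_source, ["id", "name"]) + extract(
    "json", json_source, ["id", "name"]
)
print(rows)
```

Each extractor returns only the selected source fields and leaves the values as they arrive (the CSV ids stay strings, the JSON id stays a number), since making them consistent is the job of the transformation step.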