Chapter 6 Key Terms
Distributed Database
a database that is stored in multiple physical locations.
Data Warehouse
a database that stores current and historical data of potential interest to decision makers throughout the company.
File
a group of records of the same type
Record
a group of related fields such as the courses taken, the date, and the grade
Database
a group of related files
Field
a grouping of characters into a word, a group of words, or a complete number
Data Lake
a repository for raw unstructured or structured data that, for the most part, have not yet been analyzed; the data can be accessed in many ways.
Database Management System (DBMS)
a specific type of software for creating, storing, organizing, and accessing data from a database.
Data Quality Audit
a structured survey of the accuracy and level of completeness of the data in an information system.
Data Mart
a subset of a data warehouse in which a summarized or highly focused portion of the organization's data is placed in a separate database for a specific population of users.
Data Cleansing
also known as data scrubbing; consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant.
Data Dictionary
an automated or manual file that stores definitions of data elements and their characteristics.
Hadoop
an open-source software framework managed by the Apache Software Foundation that enables distributed parallel processing of very large amounts of data across inexpensive computers.
Attributes
an entity has specific characteristics, and they are called...
In-memory computing
another way of facilitating big data analysis that relies primarily on a computer's main memory (RAM) for data storage.
Sentiment Analysis
can mine text comments or survey forms to detect favorable and unfavorable opinions about specific subjects.
Entity-Relationship Diagram
clarifies table relationships in a relational database.
Data Governance
encompasses policies and procedures through which data can be managed as an organizational resource.
Key Field
field that uniquely identifies each record so that the record can be retrieved, updated, or sorted
Entity
a generalized category, such as a person, place, thing, or event, about which we store information
Data Mining
provides insight into corporate data that cannot be obtained with OLAP by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behavior.
Database server
receives SQL requests and provides the required data.
Bit
represents the smallest unit of data a computer can handle
Byte
a group of bits, usually eight, that represents a single character such as a letter, a number, or another symbol
Online Analytical Processing
supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions.
Web Mining
the discovery and analysis of useful patterns and information from the web
Relational Database
the most common type of database today; organizes data into two-dimensional tables (relations) with columns and rows.
Structured Query Language
the most prominent data manipulation language today. A query is a request for data from a database.
Normalization
the process of streamlining complex groups of data to minimize redundant data elements and awkward many-to-many relationships and increase stability and flexibility.
Primary Key
the unique identifier for all of the information in any row of a table; it cannot be duplicated.
Referential Integrity
these are rules to ensure that relationships between coupled tables remain consistent.
Foreign Key
a field in one table that matches the primary key of another table; essentially a look-up field, for example to find data about the supplier of a specific part.
Text Mining
tool that helps businesses analyze data and extract key elements from unstructured big data sets, discover patterns and relationships, and summarize information.
Nonrelational Database Management Systems
use a more flexible data model and are designed for managing large data sets across many distributed machines and for easily scaling up or down.
Analytic Platforms
use both relational and nonrelational technologies that are optimized for analyzing large data sets.
Data Manipulation Language
used to add, change, delete, and retrieve the data in the database.
Data Definition
used to create database tables and to define the characteristics of the fields in each table.
Report Generator
Crystal Reports is a popular example of this
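
Worked example (illustrative only)
Several of the terms above (Data Definition, Data Manipulation Language, Primary Key, Foreign Key, Referential Integrity, Structured Query Language) can be seen together in one small sketch. The example below uses Python's built-in sqlite3 module with an in-memory database. The table names (supplier, part), the column names, and the sample rows are assumptions made up for illustration; they do not come from the chapter.

```python
import sqlite3


def build_demo():
    """Create a tiny in-memory database with two related tables."""
    conn = sqlite3.connect(":memory:")
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")

    # Data definition: create tables and define the fields in each one.
    conn.execute(
        "CREATE TABLE supplier ("
        " supplier_id INTEGER PRIMARY KEY,"  # primary key: unique per row
        " name TEXT NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE part ("
        " part_id INTEGER PRIMARY KEY,"
        " part_name TEXT NOT NULL,"
        # foreign key: look-up field pointing at supplier.supplier_id
        " supplier_id INTEGER NOT NULL REFERENCES supplier(supplier_id))"
    )

    # Data manipulation: add rows (hypothetical sample data).
    conn.execute("INSERT INTO supplier VALUES (1, 'Acme')")
    conn.execute("INSERT INTO part VALUES (100, 'Door latch', 1)")
    conn.commit()
    return conn


def supplier_of(conn, part_id):
    """Query: follow the foreign key from a part to its supplier's name."""
    row = conn.execute(
        "SELECT s.name FROM part AS p"
        " JOIN supplier AS s ON s.supplier_id = p.supplier_id"
        " WHERE p.part_id = ?",
        (part_id,),
    ).fetchone()
    return row[0] if row else None


def try_orphan_part(conn):
    """Referential integrity: a part that names a missing supplier is rejected."""
    try:
        conn.execute("INSERT INTO part VALUES (101, 'Hinge', 999)")
    except sqlite3.IntegrityError:
        return False  # the database refused the inconsistent row
    return True


if __name__ == "__main__":
    db = build_demo()
    print(supplier_of(db, 100))
    print(try_orphan_part(db))
```

Here the CREATE TABLE statements play the data definition role, INSERT and SELECT play the data manipulation role, and the rejected insert shows a referential integrity rule keeping the two tables consistent.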