Chapter 6 Key Terms

¡Supera tus tareas y exámenes ahora con Quizwiz!

Distributed Database

a database that is stored in multiple physical locations.

Data Warehouse

a database that stores current and historical data of potential interest to decision makers throughout the company.

File

a group of records of the same type

Record

a group of related fields such as the courses taken, the date, and the grade

Database

a group of related files

Field

a grouping of characters into a word, a group of words, or a complete number

Data Lake

a repository for raw unstructured data or structured data that for the most part have not yet been analyzed and the data can be accessed in many ways.

Database Management System (DBMS)

a specific type of software for creating, storing, organizing, and accessing data from a database.

Data Quality Audit

a structured survey of the accuracy and level of completeness of the data in an information system.

Data Mart

a subset of a data warehouse in which a summarized or highly focused portion of the organizations data is placed in a separate database for a specific population of users.

Data Cleansing

aka as data scrubbing, consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted or redundant.

Data Dictionary

an automated or manual file that stores definitions of data elements and their characteristics.

Hadoop

an open-source software framework managed by Apache Software Foundation that enables distributed parallel processing of very large amounts of data across inexpensive computers.

Attributes

and entity has specific characteristics, and they are called...

In-memory computing

another way of facilitating big data analysis that relies primarily on a computer main memory for data storage.

Sentiment Analysis

can mine text comments or survey form to detect favorable and unfavorable opinions about specific subjects.

Entity-Relationship Diagram

clarifies table relationships in a relational database.

Data Governance

encompasses policies and procedures through which data can be managed as an organizational resource.

Key Field

field that uniquely identifies each record so that the record can be retrieved, updated, or sorted

Entity

group of generalized categories in which we store information

Data Mining

provides insight into corporate data that cannot be obtained with OLAP by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behavior.

Database server

receives SQL requests and provides the required data.

Bit

represents the smallest unit of data a computer can handle

Byte

represents the smallest unit of data a computer can handle

Online Analytical Processing

supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions.

Web Mining

the discovery and analysis of useful patterns and information from the web

Relational Database

the most common type of database today.

Structured Query Language

the most prominent data manipulation language today. Query- a request for data from a database.

Normalization

the process of streamlining complex groups of data to minimize redundant data elements and awkward many-to-many relationships and increase stability and flexibility.

Primary Key

the unique identifier for all of the information in any row of the table and cannot be duplicated.

Referential Integrity

these are rules to ensure that relationships between coupled tables remain consistent.

Foreign Key

this is essentially a look-up field to find data about the supplier of a specific part.

Text Mining

tool that helps businesses analyze data and extract key elements from unstructured big data sets, discover patterns and relationships, and summarize information.

Nonrelational Database Management Systems

use a more flexible data model and are designed for managing large data sets across many distributed machines and for easily scaling up or down.

Analytic Platforms

use both relational and nonrelational technology that are optimized for analyzing large data sets.

Data Manipulation Language

used to add, change, delete, and retrieve that data in the database.

Data Definition

used to create database tables and to define the characteristics of the fields in each table.

Report Generator

Crystal Reports is a popular example of this


Conjuntos de estudio relacionados

Unit 2 Chemistry Study Guide: Matter , Nomenclature & Bonding

View Set

Human Growth and Development Ch 7-11

View Set

NUR 108 Ch 30 Perioperative Nursing

View Set

Chapter 11 PrepU: Maternal Adaptation During Pregnancy

View Set

Private Pilot Written Test Ch. 4

View Set

BJU Cultural Geography Chapter 22 Review

View Set

FLVS Personal Fitness: Final Exam

View Set