MIS Managing the Digital Firm -- Chapter 6: Foundations of Business Intelligence, Databases, and Information Management

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

Join Operation

Combines relational tables to provide the user with more information than is otherwise available in individual tables

Types of Information Obtainable from Data Mining

1) Associations 2) Sequences 3) Classifications 4) Clustering 5) Forecasting

DBMS Capabilities

1. Data definition language 2. Data dictionary 3. Data manipulation language

Three Basic Operations of Relational Databases to Develop Useful sets of Data

1. Select 2. Join 3. Project

3Vs of Big Data

1. The extreme "volume" of data 2. The wide "variety" of data types and sources 3. The "velocity" at which data must be processed

Bit

A binary digit representing the smallest unit of data in a computer system. It can only have one of two states, representing 0 or 1

Database

A collection of data organized to service many applications at the same time by storing and managing data so that they appear to be in one location

Database Server

A computer in a client/server environment that is responsible for running a DBMS to process SQL statements and perform database management tasks

Data Warehouse

A database, with reporting and query tools, that stores current and historical data extracted from various operational systems and consolidated for management reporting and analysis

Key Field

A field in a record that uniquely identifies instances of that record so that it can be retrieved, updated, or sorted

File

A group of records of the same type

Record

A group of related fields

Database (hierarchy)

A group of related files

Field

A grouping of characters into a word, a group of words, or a complete number, such as a person's name or age

Data Manipulation Language

A language associated with a database management system that end users and programmers use to manipulate data in the database

Entity-Relationship Diagram

A methodology for documenting database illustrating the relationship between various entities in the database; the boxes represent entities and the lines connecting the boxes represent relationships; a line connecting two entities that ends in two short marks designates a one-to-one relationship; a line connecting two entities that ends with a crow's foot topped by a short mark indicates a one-to-many relationship

Entity

A person, place, thing, or event about which information must be kept

Attribute

A piece of information describing a particular entity

Tuples

A row or record in a relational database

Data Mart

A small data warehouse containing only a portion of the organization's data for a specified function or population of users

Byte

A string of bits, usually eight, used to store one number or character in a computer system

Data Quality Audit

A survey and/or sample of files to determine accuracy and completeness of data in an information system; surveying entire data files, surveying samples from data files, or surveying end users for their perceptions of data quality

Relational DBMS

A type of logical database model that treats data as if they were stored in two-dimensional tables. It can related data stored in one table to data in another as long as the two tables share a common data element; tables may be referred to as files; each table contains data on an entity and its attributes; each field represents an attribute for that entity; fields in a relational database are also called columns; rows are commonly referred to as records

Data Cleansing (Data Scrubbing)

Activities for detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant; not only corrects errors but also enforces consistency among different sets of data that originated in separate information systems

Conceptual Design of a Database

An abstract model of the database from a business perspective; describes how the data elements in the database are to be grouped

Data Dictionary

An automated or manual tool for storing and organizing information about the data maintained in a database

Data Mining

Analysis of large pools of data to find patterns and rules that can be used to guide decision making and predict future behavior

Online Analytical Processing (OLAP)

Capability for manipulating and analyzing large volumes of data from multiple perspectives; each aspect of information represents a different dimension

Foreign Key

Field in a database table that enables users to find related information in another database table

Web Content Mining

The process of extracting knowledge from the content of web pages, which may include text, image, audio, and video data

Smart Contracts

Computer programs that implement the rules governing transactions between firms; what is the price of products, how will they be shipped, when will the transaction be completed, who will finance the transaction, what are finacning terms

Select Operation

Creates a subset consisting of all records in the file that meet stated criteria

Project Operation

Creates a subset consisting of columns in a table, permitting the user to create new tables that contain only the information required

Data Definition

DBMS capability that specifies the structure and content of the database; used to create database tables and to define the characteristics of the fields in each table

Blocks

The records of a blockchain; each block contains a timestamp and a link to a previous block; once a block of data has been recorded, it cannot be changed retroactively

Big Data

Data sets with volumes so huge that they are beyond the ability of typical relational DBMS to capture, store, and analyze. The data are often unstructured or semi-structured

Nonrelational Database Management System

Database management system for working with large quantities of structured and unstructured data that would be difficult to analyze with a relational model; useful for accelerating simple queries against large volumes of structured and unstructured data, including web, social media, graphics, and other forms of data that are difficult to analyze with traditional SQL-based tools; NoSQL

Distributed Database

Database stored in multiple physical locations

Clustering

Discover different groupings within data; works in a manner similar to classification when no groups have yet been defined

Web Mining

Discovery and analysis of useful patterns and information from the World Wide Web

Text Mining

Discovery of patterns and relationships from large sets of unstructured data

Blockchain

Distributed ledger system that stores permanent and tamper-proof records of transactions and shares them among a distributed network of computers; the system stores transactions as a distributed ledger among a network of computers; when someone wants to add a transaction, participants in the network run algorithms to evaluate and verify the proposed transaction

Sequences

Events linked over time

Web Structure Mining

Examines data related to the structure of a particular website

Web Usage Mining

Examines user interaction data recorded by a web server whenever requests for a web sites resources are received

Application Server Software

Handles all application operations, including transaction processing and data access, between a user and an organization's back-end business applications or databases; takes requests from the web server, runes the business logic to process transactions based on those requests, and provides connectivity to the organization's back-end systems or databases

Structured Query Language (SQL)

The standard data manipulation language for relational database management systems

Hadoop Distributed File System (HDFS)

Links together the file systems on the numerous nodes in a Hadoop cluster to turn them into one big file system

Sentiment Analysis

Mining text comments in an email message, blog, social media conversation, or survey form to detect favorable and unfavorable opinions about specific subjects

Associations

Occurrences linked to a single event

Hadoop

Open source software framework that enables distributed parallel processing of huge amounts of data across many inexpensive computers; breaks a big data problem down into sub-problems, distributes them among up to thousands of inexpensive computer processing nodes, and then combines the result into a smaller data set that is easier to analyze

Data Governance

Policies and processes for managing the availability, usability, integrity, and security of the firm's data; establishes the organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information

Analytic Platforms

Preconfigured hardware-software system that is specifically designed for high-speed analysis of large datasets; feature preconfigured hardware-software systems that are specifically designed for query processing and analytics

Logical View of Databases

Presents data as they would be perceived by end users or business specialists

Classification

Recognizes patterns that describe the group to which an item belongs by examining existing items that have been classified and by inferring a set of rules

Data Lake

Repository for raw unstructured data or structured data that for the most part have not yet been analyzed; stores data in their native format until they are needed

Query

Request for data from a database

Referential Integrity

Rules to ensure that relationships between coupled database tables remain consistent

Physical View of Databases

Shows how data re actually organized and structured on physical storage media

Physical Design of a Database

Shows how the database is actually arranged on direct-access storage devices

Database Management System (DBMS)

Special software to create and maintain a database and enable individual business applications to extract the data they need without having to create separate files or data definitions in their computer programs; the DBMS acts as an interface between application programs and the physical data files

In-Memory Computing

Technology for very rapid analysis and processing of large quantities of data by storing the data in the computer's main memory rather than in secondary storage; users access data stored in a system's primary memory, thereby eliminating bottlenecks from retrieving and reading data in a traditional, disk-based database and dramatically shortening query response times

Program-Data Dependence

The close relationship between data stored in files and the software programs that update and maintain those files. Any change in data organization or format requires a change in all the programs associated with those files

Data Inconsistency

The presence of different values for same attribute when the same data are stored in multiple locations

Data Redundancy

The presence of duplicate data in multiple data files

Normalization

The process of creating small stable data structures from complex groups of data when designing a relational database

Primary Key

Unique identifier for all the information in any row of a database table

Forecasting

Uses a series of existing values to forecast what other values will be


Kaugnay na mga set ng pag-aaral

Technische Grundlagen - Böhm (Altklausuren)

View Set

Essentials of Human Anatomy and Physiology(11) Activity lab 5

View Set

Statistic: 2.2 Organizing Quantitative Data

View Set

Saunders Med Administration NCLEX Questions

View Set