MIS Ch. 11 Multiple Choice
NoSQL
A category of recently introduced data storage and retrieval technologies that are not based on the relational model
Business Intelligence
A set of methodologies, processes, architectures and technologies that transform raw data into meaningful and useful information
Pig
A tool that integrates a scripting language and an execution environment intended to simplify the use of MapReduce
Hive
An Apache project that supports the management and querying of large data sets using HiveQL, an SQL-like language that provides a declarative interface for managing data stored in Hadoop
MapReduce
An algorithm for massive parallel processing of various types of computing tasks
Hadoop
An open source implementation framework of MapReduce
Predictive Analytics
Applies statistical and computational methods and models to data regarding past and current events to predict what might happen in the future
Big Data
Data that exist in large volumes and in many different varieties (data types) and that need to be processed at a very high velocity (speed)
Data Mining
Knowledge discovery using a sophisticated blend of technologies from traditional statistics, artificial intelligence, and computer graphics
Data Lake
Large integrated repository for internal and external data that does not follow a predefined schema
Multidimensional OLAP (MOLAP)
OLAP tools that load data into an intermediate structure, usually a three or higher dimensional array
Relational OLAP (ROLAP)
OLAP tools that view the database as a traditional relational database in either a star schema or other normalized or denormalized set of tables
Descriptive Analytics
Oldest form of analytics. It describes the past status of the domain of interest using a variety of tools through techniques such as reporting, data visualization, dashboards and scoreboards
Text Mining
Process of discovering meaningful information algorithmically based on computational analysis of unstructured textual information
Analytics
Refers to systematic analysis and interpretation of data. It typically uses mathematical, statistical and computational tools to improve our understanding of a real-world domain
HDFS
Stands for Hadoop Distributed File System. It is a file system designed for managing a large number of potentially very large files in a highly distributed environment
Online Analytical Processing (OLAP)
The use of a set of query and reporting tools that provides users with multidimensional views of their data and allows them to analyze the data using simple windowing techniques
Prescriptive Analytics
Uses results of predictive analytics together with optimization and simulation tools to recommend actions that will lead to a desired outcome