OTM Chapter 5+7 Study Guide PART 2
How do Organizations Use Data Warehouses to Acquire Data? (4) *CSPB*
Contains data from many core operational transaction systems Standard enterprise data Provides analysis and reporting tools Both current and historical data
Describe Hadoop (2)
Enables distributed parallel processing of big data across inexpensive computers Used by Facebook, Yahoo, NextBio
What is text mining?
Extracts key elements from large unstructured data sets
Operational data is divided into: (4)
Operational DBs Social Data Purchased Data Employee knowledge
Describe MapReduce (2)
Technique for harnessing the power of thousands of computers working in parallel The BigData collection is broken down into pieces, and hundreds/thousands of independent processors search these pieces for something of interest
ICANN is owned by:
The US government
What are the benefits/components of web mining? (4)
discovery/analysis of useful patterns or info on the web Web content mining Web structure mining; analyzing links to/from webpage Web usage mining-mining user interactions
Measures and dimensions are 2 main features of what kind of cube?
information cube
Acquiring data involves: (4)
obtaining cleansing organize/relate catalog
publishing results involves:
print web servers report servers automation
Performing analysis involves: (3)
reporting data mining biodata
What is IofT
the internet of things; a network of smart objects (devices, cars, buildings, etc.)
Virtualization software enables the creation of what?
virtual environments nearly instantaneously
Describe analytic platforms
•High-speed platforms optimized for large datasets
Analytical Tools (Data Analytics) are: (4)
•Multidimensional data analysis (OLAP) that View data in multiple dimensions •Data mining •Text mining •Web mining
How do Organizations Use Data Marts to Acquire Data? (3)
•Subset of data warehouse •Summarized data, specific population of users •Typically focuses on single subject or line of business
Describe In-memory computing (4)
•Used in big data analysis •Use RAM for data storage to avoid delays in retrieving data •Reduce hours/days of processing to seconds •Requires optimized hardware