MIS Chapter 3
Challenges of Content Management
-Databases are huge -Content is dynamic -Documents do not exist in isolation -Contents are perishable -In many languages
BI analysis types
-Reporting -Data mining -Big Data
Advances in AI
-computing power increasing exponentially -availability of large data sets -cloud computing making scalable resources available at low costs -rapid increase in network connected smart devices is producing vast amounts of data for AI applications -fundamental breakthroughs in AI techniques have made AUCuseful for a variety of tasks -advances driven by demand for applications that solve practical problems
2 benefits of Knowledge management (KM)
-improves process quality -increases team strength
Benefits of Automated Labor
-reduction in labor costs -won't have to pay for any additional benefits required with human labor -immediately trained -no breaks -no time-wasting activities -no arguments -no scheduling issues -more accurate, precise, consistent
By 2025, nearly ________ million workers could be taken out of the current US labor force of 146 million workers
100 million
Neural network
A computing system modelled after the human brain that is used to predict values and make classifications
Corpus of knowledge
A large set of related data and texts
OLAP cube
A presentation of an OLAP measure with associated dimensions. The reason for this term is that some products show these displays using three axes, like a cube in geometry. Same as OLAP report.
Algorithm
A set of procedures used to solve a mathematical problem
RFM analysis
A technique readily implemented with basic reporting operations to analyze and rank customers according to their purchasing patterns. (R) recently, (F) frequency, (M) money
Turing Test
A test proposed by Alan Turing in which a machine would be judged "intelligent" if the software could use conversation to fool a human into thinking it was talking with a person instead of a machine.
Cluster analysis
A type of common, unsupervised data mining. Statistical techniques identify groups of entities that have similar characteristics
Superintelligence
AI beyond Strong AI: capable of intelligence more advances than human intelligence
unsupervised data mining
Analysts do not create a model or hypothesis before running the analysis. They create hypotheses afterwards, in order to explain the patterns found
Static reports
BI documents that are fixed at the time of creation and do not change
dynamic reports
BI documents that are updated at the time they are requested
Problematic data are termed ________.
Dirty Data (999-999-9999 for a phone number)
Weak AI
Focuses on completing a single specific task
Tasks of BI
Informing, deciding, problem solving, project management
OLAP measure and dimension
Measure = data item of interest Dimension = characteristic of measure (purchase data, customer type, etc)
Regression analysis
Measures the effect of a set of variables in another variable
Deep learning
Multilayered neural network techniques applied to learning task. This has greatly improved accuracy/practical usefulness of AI
Hadoop
Open source program supported by the Apache Foundation that implements MapReduce on potentially thousands of computers
Hadoop's query language is _____
Pig
Naïve Bayes Classifier
Predicts The probability of a certain outcome based on prior occurrences of related events
Natural Language Processing (NLP)
The ability of a computer system to understand spoken human language to answer questions
artificial intelligence (AI)
The ability of a machine to simulate human abilities such as vision, communication, recognition, learning, and decision making in order to achieve a goal
The Singularity
The point at which an AI becomes sophisticated enough that it can adapt and create its own software, and hence, adapt its behavior without human assistance
Knowledge Management (KM)
The process of creating value from intellectual capital and sharing that knowledge with employees, managers, suppliers, customers, and others who need it.
Automation
The process of making systems operate without human intervention
What's the goal of AI?
To create strong AI: AI that can complete all the same tasks a human can
Data Mart
a data collection, smaller than the data warehouse, that addresses the needs of a particular department or functional area of the business
Data Warehouse
a facility for managing an organization's BI data. The functions are to: -Obtain data -Cleanse data -Organize and relate data -Catalog data
decision support system
a synonym for decision-making BI systems
MapReduce
a technique for harnessing the power of thousands of computers working in parallel
Big Data
a term used to describe data collections that are characterized by huge volume, rapid velocity, and great variety.
3 primary activities in the BI process
acquire data, perform analysis, publish results
Structured Data
data in the form of rows and columns
Supervised data mining
data miners develop a model prior to the analysis and apply statistical techniques to data to estimate parameters of the model
Push Publishing
delivers business intelligence to users without any request from the users
Business Intelligence (BI) Systems
information systems that process operational, social, and other data to identify patterns, relationships, and trends for use by business professionals and other knowledge workers
Content Management System (CMS)
information systems that support the management and delivery of documents including reports, web pages, and other expressions of employee knowledge
Reporting Application
is a BI application that inputs data from one or more sources and applies reporting operations to that data to produce business intelligence
Two functions of a BI server
management and delivery
Online Analytical Processing (OLAP)
more generic reporting tool than RFM, that provides the ability to sum, count, average, and other simple arithmetic Reports are dynamic—viewer can change report's format
Granularity
refers to the level of detail represented by the data
Exception reports
reports produced when something out of predefined bounds occurs (hospital asking for a report showing doctors who prescribe more than 2x the amount of pain meds than average doctors)
Pull Publishing
requires the user to request BI results
Data mining
the application of statistical techniques to find patterns and relationships among data for classification and prediction
Machine Learning
the extraction of knowledge from data based on algorithms created from training data
BI analysis
the process of creating business intelligence
Publish Results
the process of delivering business intelligence to the knowledge workers who need it
Data Aquisition
the process of obtaining, cleaning, organizing, relating, and cataloging source data
Reporting Analysis
the process of sorting, grouping, summing, filtering, and formatting structured data
BI application
the software component of a BI system
Drill down
to further divide the data into more detail
Subscriptions
user requests for particular BI results on a particular schedule or in response to particular events