Ch 6 isom
is there an incorrect value in the information?
accurate
determines which things go together
affinity grouping
mathematical formulas placed in software that performs an analysis on a data set
algorithms
encompasses all organizational information, and its primary purpose is to support the performing of managerial analysis tasks
analytical information
sales projections, future growth and trends, product statistics
analytical information
the science of fact-based decision making
analytics
the process of identifying rare or unexpected items or events in a data set that do not conform to other items in the data set
anomaly detection
what are the data elements associated with an entity?
attributes
using data about people's behaviors to understand intent and predict future actions
behavioral analysis
__________________ data is a collection of large, complex data sets, including structured and unstructured data, which cannot be analyzed using traditional database methods and tools
big
a data ____________ is a business that collects personal information about consumers and sells that information to other organizations
broker
what is the solution to the problem of being data rich and information poor
business intelligence
what defines how a company performs certain aspects of its business and typically results in either a yes/no or true/false answer?
business rule
gain a clear understanding of the business problem that must be solved and how it impacts the company
business understanding
what enforces business rules vital to an organization's success and often require more insight and knowledge than relational integrity constraints?
business-critical integrity constraints
assigns records to one of a predefined set of classes
classification
____________________ analysis is a technique used to divide information sets into mutually exclusive groups such that the members of each group are as close together as possible to one another and the different groups are as far apart as possible.
cluster
segments a heterogeneous population of records into a number of more homogeneous subgroups
clustering
summary information level
coarse granularity
what can compare two or more data sets to identifying patterns and trends?
comparative analysis
is a value missing from the information
complete
is aggregate or summary information in agreement with detailed information?
consistent
who is the person responsible for creating the original website content?
content creator
who is the person responsible for updating and maintaining website content?
content editor
determines a statistical relationship between variables, often for the purpose of identifying predictive factors among the variables
correlation analysis
select the four functions that a database management system can perform on data in a database?
create, read, update, delete data
business intelligence _______________ track corporate metics such as critical success factors and key performance indicators and include advanced capabilities such as interactive controls allowing users to manipulate data for analysis.
dashboards
_________- mining tools use a variety of techniques to find patterns and relationships in large volumes of information that predict future behavior and guide decision making
data
what is the collection of data from various sources for the purpose of data processing
data aggregation
the three focus areas of big data
data analysis, data mining, data visualization
who is a business analytics specialist who uses visual tools to help people understand complex data
data artist
select the two terms representing the smallest of basic unit of information
data field and data element
what refers to the overall management of the availability, usability, integrity, and security of company data
data governance
what is a storage repository that holds a vast amount of raw data in its original format until the business needs it?
data lake
What contains a subset of data warehouse information?
data mart
apply mathematical techniques to identify trends and patterns in the data
data modeling
what are logical data structures that detail the relationships among data elements using graphics or pictures?
data models
gather and organize the data in the correct formats and structures for analysis
data preparation
What is the process of collecting statistics and information about data in an existing source?
data profiling
what determines the accuracy and completeness of its data?
data quality audits
What is the process of sharing information to ensure consistency between multiple data sources?
data replication
select the statement that accurately describes a situation in which there is too much data to properly understand or make use of it
data rich and information poor
what is an organized collection of data?
data set
who is responsible for ensuring the policies and procedures are implemented across the organization and acts as a liaison between the MIS department and the business?
data steward
analyze the current data along with identifying any data quality issues
data understanding
What describes technologies that allow users to "see" or visualize data to transform information into a business perspective?
data visualization
what moves beyond excel graphs and charts into sophisticated analysis techniques such as pie charts, controls, instruments, maps, time-series graphs, and more?
data visualization tools
what is a logical collection of information, gathered from many different operational databases that supports business analysis activities and decision-making tasks?
data warehouse
what is an interactive website kept constantly updated and relevant to the needs of its customers using a database
data-driven website
select the four primary reasons low-quality information occurs in a system
data-entry personnel enter abbreviated information to save time, online customers intentionally enter inaccurate information to protect their privacy, third-party and external information contains inconsistencies and errors, different systems have different information standards and formats
what maintains information about various types of objects (inventory), events (transactions), people (employees), and places (warehouses)?
database
data-driven ______________ management is an approach to business governance that values decisions that can be backed up with verifiable data
decision
deploy the discoveries to the organization for work in everyday business
deployment
a data __________________ compiles all of the metadata about the data elements in the data model
dictionary
select the four primary reasons low-quality information occurs in a system
different systems have different information entry standards and formats, online customers intentionally enter inaccurate information to protect their privacy, data-entry personnel enter abbreviated information to save time, third-party and external information contains inconsistencies and errors
_________ data is erroneous or flawed data
dirty
What process and manages algorithms across many machines in a computing environment?
distributing computing
progress information through increasing levels of detail
drilling down
proceed in reverse through decreasing levels of detail
drilling up
which of the following problems associated with dirty data?
duplicate data, non-formatted data, incorrect data
what is an area of a website that stores information about products in a database?
dynamic catalog
what includes data that change based on user actions?
dynamic information
select the statement that accurately defines the relationship between entities and attributes in a relational database
each attribute of an entity occupies a separate column of a table
select the three advantages of a data-driven website
easy to store large amounts of data, easy to eliminate human errors, easy to manage content
what stores information about a person, place, thing, transaction, or event?
entity
determines values for an unknown continuous variable behavior or estimated future value
estimation
analyze the trends and patterns to assess the potential for solving the business problem
evaluation
identifies patterns in data, including outliers, uncovering the underlying structure to understand relationships between the variables
exploratory data analysis
what is a process that extracts information from internal and external databases, transforms it using a common set of enterprise definitions, and load is into a data warehouse
extraction, transformation, and loading
what is the application of big data analytics to smaller data sets in near-real or real-time in order to solve a problem or create business value?
fast data
______________ are predictions based on time-series information
forecasts
a ________________ key is a primary key of one table that appears as an attribute in another table and acts to provide a logical relationship between the two tables
foreign
users could not get the data they needed, what was collected was not always useful for intended pruposes
inadequate data usefulness
Every department had its own method for recording data so when trying to share information, data did not match and users did not get the data they really needed.
inconsistent data definitions
a data point is an ____________________ item on a graph or a chart
individual
most data stored in operational databases did not allow users direct access; users had to wait to have their similar queries or questions answered by MIS professionals who could code SQL
ineffective direct data access
______________ present the results of data analysis, displaying the patterns, relationships, and trends in a graphical format
infographics
Select two terms that describe the process for weeding out, fixing, or discarding inconsistent, incorrect, or incomplete information
information cleansing and information scrubbing
what is the common term for the representation of multidimensional information
information cube
select the four primary traits of the value of information
information governance, information timeliness, information type, information quality
what refers to the extent of detail within the information?
information granularity
what occurs when the same data element has different values?
information inconsistency
what occurs when a system produces incorrect, inconsistent, or duplicate data?
information integrity issues
what is the duplication of data, or the storage of the same data in multiple places?
information redundancy
information _______________ is a measure of the quality of information
integrity
what are rules that help ensure the quality of information?
integrity constraints
managers need to perform cross-functional analysis using data from all department, which differed in granularities, formats, and level
lack of data standards
the __________________ view of information focuses on how individual users logically access information to meet their own particular business needs
logical
What analyzes such items as websites and checkout scanner information to detect customers' busying behaviors and predict future behavior by identifying affinities among customers' choices of products and services?
market basket analysis
what is the practice of gathering data and ensuring that it is uniform, accurate, consistent, and complete, including such entities as customers, suppliers, products, sales, employees, and other critical entities that are commonly integrated across organizational systems?
master data management
what provides details about data?
metadata
data _______________ is the process of analyzing data to extract information not offered by the raw data alone
mining
one primary goal of a database is to eliminate information redundancy by recording each piece of information in __________________ place(s) in the database
only one
a data value that is numerically distant from most of the other data points in a set of data
outliers
analysis ____________________ occurs when the user goes into an emotional state of over-analysis (or-over-thinking) a situation so that a decision or action is never taken, in effect paralyzing the outcome. In the time of big data, analysis paralysis is a growing problem.
paralysis
the classification or labeling of an identified pattern in the machine learning process
pattern recognition analysis
the _____________ view of information deals with the physical storage of information on a storage device
physical
the data, if available, were often incorrect or incomplete, therefore users could not rely on the data to make decisions
poor data quality
a __________________ is a statement about what will happen or might happen in the future; for example, predicting future sales or employee turnover
prediction
what is a field that uniquely identifies a given record in a table
primary key
a _______________-by-example tool helps users graphically design the anser to a question against a database
query
___________-time information means immediate, up-to-data information
real
What is a data mining algorithm that analyzes a customer's purchases and actions on a website and then uses the data to recommend complementary products?
recommendation engine
a ____________ is a collection of related data elements.
record
select the three statements reflecting the business advantages of a relational database
reduced information redundancy, increased information security, increased information integrity
________________ integrity constraints are rules that enforce basic and fundamental information-based constraints.
relational
a _____________ database management system allows users to create, read, update, and delete data in a relational database
relational
what stores information in the form of logically related two-dimensional tables?
relationship database model
what is a central location in which data is stored and managed
repository
A data _______________ extracts knowledge from data by performing statistical analysis, data mining, and advanced analytics on big data to identify trends, market changes, and other relevant information.
scientist
analyzes text flowing across the internet, including unstructured text from blogs and messages
social media analysis
what identifies the primary location where data is collected
source data
the process of analyzing recorded calls to gather information
speech analysis
what includes fixed data incapable of change in the event of a user action?
static information
data __________________ is the management and oversight of an organization's data assets to help provide business users with high-quality data that is easily accessible in a consistent manner
stewardship
a ______________________ query language asks users to write lines of code to anser questions against a database
sturctured
real-time _____________ provide real-time information in response to requests.
systems
a data map is a technique for establishing a match, or balance, between the source data and the ______________ data warehouse.
target
analyzes unstructured data to find trends and patterns in words and sentences
text analysis
_____________-series information is timestamped information collected at a particular frequency
time
is the information current with respect to business needs?
timely
airline ticket, packing slip, sales receipt
transactional information
encompasses all of information contained within a single business process or unit of work, and its primary purpose is to support daily operational tasks
transactional information
true or false: business-critical integrity constraints tend to mirror the very rules by which an organization achieves success.
true
true or false: databases today scale to exceptional levels, allowing all types of users and programs to perform information processing and information-searching tasks
true
true or false: gender, for instance can be referred to in many times (male, female, M/F, 1/0), but it should be standardized on a data warehouse with one common way of referring to each data element that stores gender (M/F)
true
true or false: information is everywhere in an organization. Managers in sales, marketing, human resources, and management need information to run their departments and make daily decisions. When addressing a significant business issue, employees must be able to obtain and analyze all the relevant information so they can make the best decision possible.
true
is each transaction and event represented only once in the information?
unique
data __________________ includes the tests and evaluations used to determine compliance with data governance policies to ensure correctness of data
validation
different forms of structured and unstructured data; data from spreadsheets and databases as well as from email, videos, photos, and PDFs, all of which must be analyzed
variety
why does a database offer increased information security?
various security features of databases ensure that individuals have only certain types of access to certain types of information
the analysis of streaming data as it travels around the internet; analysis necessary of social media messages spreading globally
velocity
The uncertainty of data, including biases, noise, and abnormalities; uncertainty of untrustworthiness of data; data must be meaningful to the problem being analyze
veracity
virtualization is the creation of a virtual (rather than actual ) version of computing resources, such as an operating system, a server, a storage device, or network resources
virtual
the scale of data; includes enormous volumes of data generated daily; massive volume created by machines and networks
volume
analyzes unstructured data associated with websites to identify consumer behavior and website navigation
web analysis
which of the following are answers to tough business questions BI can answer
where is the business now? where is the business going? where has the business been?
select the statement below that accurately reflects a database
while a database has only one physical view, it can easily support multiple logical views that provides for flexibility