Big Data Exam 1
Bill Inmon advocates the data mart bus architecture whereas Ralph Kimball promotes the hub-and-spoke architecture, a data mart bus architecture with conformed dimensions.
False
Data is the contextualization of information, that is, information set in context.
False
Data is the main ingredient for any BI, data science, and business analytics initiative.
False
Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure.
False
Data warehouses are subsets of data marts.
False
Due to industry consolidation, the analytics ecosystem consists of only a handful of players across several functional areas.
False
Managing information on operations, customers, internal procedures and employee interactions is the domain of cognitive science.
False
Moving the data into a data warehouse is usually the easiest part of its creation.
False
OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.
False
Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.
False
Properly integrating data from various databases and other disparate sources is a trivial process.
False
Successful BI is a tool for the information systems department, but is not exposed to the larger organization.
False
The BPM development cycle is essentially a one-shot process where the requirement is to get it right the first time.
False
The data storage component of a business reporting system builds the various reports and hosts them for, or disseminates them to users. It also provides notification, annotation, collaboration, and other services.
False
The growth in hardware, software, and network capacities has had little impact on modern BI innovations.
False
User-initiated navigation of data through disaggregation is referred to as "drill up."
False
Visual analytics is aimed at answering, "What is it happening?" and is usually associated with business analytics.
False
With the balanced scorecard approach, the entire focus is on measuring and managing specific financial goals based on the organization's strategy.
False
How does the use of cloud computing affect the scalability of a data warehouse?
Hardware resources are dynamically allocated as use increases.
OLTP systems are designed/optimized for
Inputs like inserts, updates, deletes in the database
Computer applications have moved from transaction processing and monitoring activities to problem analysis and solution applications.
True
Data is the main ingredient for any BI, data science, and business analytics initiative.
True
In the 2000s, the DW-driven DSSs began to be called BI systems.
True
Many business users in the 1980s referred to their mainframes as "the black hole," because all the information went into it, but little ever came back and ad hoc real-time querying was virtually impossible.
True
Predictive algorithms generally require a flat file with a target variable, so making data analytics ready for prediction means that data sets must be transformed into a flat-file format and made ready for ingestion into those predictive algorithms.
True
Structured data is what data mining algorithms use and can be classified as categorical or numeric.
True
The data warehousing maturity model consists of six stages: prenatal, infant, child, teenager, adult, and sage.
True
There are basic chart types and specialized chart types. A Gantt chart is a specialized chart type.
True
Visualization differs from traditional charts and graphs in complexity of data sets and use of multiple dimensions and measures.
True
With key performance indicators, driver KPIs have a significant effect on outcome KPIs, but the reverse is not necessarily true.
True
Which type of question does visual analytics seeks to answer?
Why is it happening?
What is Six Sigma?
a methodology aimed at reducing the number of defects in a business process
When you tell a story in a presentation, all of the following are true EXCEPT
a well-told story should have no need for subsequent discussion.
Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data?
data granularity
Which characteristic of data means that all the required data elements are included in the data set?
data richness
When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is
drill down
What is the fundamental challenge of dashboard design?
ensuring that the required information is shown clearly on a single screen
Which of the following is LEAST related to data/information visualization?
graphic artwork
Data warehouses provide direct and indirect benefits to organizations. Which of the following is an indirect benefit of data warehouses?
improved customer service
Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
independent data mart
All of the following are true about in-database processing technology EXCEPT
it is the same as in-memory storage technology.
What is the management feature of a dashboard?
operational data that identify what actions to take to resolve a problem
The very design that makes an OLTP system efficient for transaction processing makes it inefficient for
output like end-user ad hoc reports, queries, and analysis.
Which of the following is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies?
BI
Kaplan and Norton developed a report that presents an integrated view of success in the organization called
balanced scorecard-type reports.
Why is a performance management system superior to a performance measurement system?
because measurement alone has little use without action
This plot is a graphical illustration of several descriptive statistics about a given data set.
box-and-whiskers plot
Which kind of chart is described as an enhanced version of a scatter plot?
bubble chart
This measure of dispersion is calculated by simply taking the square root of the variations.
standard deviation
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?
star schema