Big data homework
Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data?
Data granularity
Which process is NOT part of ETL? Extraction Deletion Transformation Load
Deletion
An operational data store provides a long-term form oof customer information file.
False
Analytics represents a single discipline to solve real problems.
False
Correlation describes the dependence of a response variable on one (or more) explanatory variables.
False
Data Warehouse is the same as Data Warehousing.
False
Data warehouses are subsets of data marts.
False
Multiple Linear Regression has more than one response variable.
False
The amount of structured data is much larger than that of unstructured data.
False
The growth in hardware, software, and network capacities has had little impact on modern BI innovations.
False
The target variable of logistic regression is a numerical variable.
False
What kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?
Independent data mart
Which of the following is NOT part of descriptive analytics? OLAP OTP Inferential statistics Descriptive statistics
Online Transaction Processing OTP
Which type of visualization tool can be very helpful when the intention is to show relative proportions of dollars per department allocated by a university administration?
Pie chart
What type of analytics seeks to determine what is likely to happen in the future?
Predictive
What type of analytics seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible?
Prescriptive
BI is the descriptive analytics portion of business analytics continuum.
True
Data is worthless if it does not provide business value.
True
Executive Information Systems (EIS) were designed as graphical dashboards and scorecards so that they could serve as visually appealing displays while focusing on the most important factors for decision-makers to keep track of the key performance indicators. T/F
True
The Inmon Model, is known as the EDW approach, emphasizes top-down development, employing established database development methodologies and tools.
True
The idea behind Operations Research (OR) is to do the best with limited resources. T/F
True
The validity of the linear model built depends on its ability to comply with these assumptions.
True
Which of the following can NOT be used to measure central tendency?
Variance
Which of of the following characteristics is NOT related to data warehousing? Subject-Oriented Time Variant Integrated Volatile
Volatile
When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is
drill down
Choose the CORRECT statement from the following: Big data is the data that can be stored in a single-storage unit. Big data has one unified form Big data comes from different sources. It is OK to pass all computation to one powerful computer.
Big data comes from different sources.
In order to show progress toward a goal, which one of the following charts should be used?
Bullet
Which of the following is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies?
Business Intelligence (BI)
When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?
Star schema