IDC3931 Exam 1
Given the following range of numbers, what is the value of the 2nd Quartile? 10 11 11 13 14 15 15 16 17 19 20 22
15
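As a quick check on the card above, the 2nd quartile is the median. A minimal sketch using Python's standard `statistics` module (the variable names are illustrative):

```python
import statistics

# The twelve values from the question, already sorted.
values = [10, 11, 11, 13, 14, 15, 15, 16, 17, 19, 20, 22]

# The 2nd quartile (Q2) is the median. With an even count, the median
# is the average of the two middle values (the 6th and 7th here).
q2 = statistics.median(values)
print(q2)
```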
Data warehouse (DW), business analytics, business performance management, and a user interface such as a dashboard.
4 main components of BI
You have a survey question that asks: "What do you think the likelihood is that the FSU football team will win the ACC championship?" If you have survey results from 100 people and the average response is 40% with a standard deviation of 5. Which of the following can you approximate from the results?
95% of the respondents think that there is a 30% - 50% chance that the FSU football team will win the ACC championship.
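The 30% - 50% range comes from the empirical rule: roughly 95% of values fall within two standard deviations of the mean. This sketch assumes the responses are approximately normally distributed, which the question does not state outright:

```python
mean = 40.0  # average response, in percent
sd = 5.0     # standard deviation, in percentage points

# Empirical rule (assumes an approximately normal distribution):
# about 95% of values lie within mean +/- 2 standard deviations.
low = mean - 2 * sd
high = mean + 2 * sd
print(f"About 95% of responses fall between {low}% and {high}%")
```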
Today, many vendors offer diversified tools, some of which are completely preprogrammed (called shells). How are these shells utilized?
All a user needs to do is insert the numbers
A variable that contains the values of either Yes or No would best be categorized as which of the following variable types?
Binary
Species: Cat, Human. Can't be a Batman or Catwoman.
Categorical
Statistical Data Variable Type: A variable that contains a countable number of distinct values would best be categorized as which of the following variable types?
Discrete
countable number of distinct values (usually whole numbers). Examples: # of children, points scored in a football game
Discrete
For the following Frequency Distribution graph, what can be surmised about the data from looking at the Skewness represented by the graph?
Extreme values exist on the right tail
Almost all BI applications are constructed with shells provided by an outsourcing provider who may themselves create a custom solution for a vendor or work with another client.
False
Computerized support is only used for organizational decisions that are responses to external pressures, not for taking advantage of opportunities.
False
Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure.
False
Data warehouses are subsets of data marts.
False
Information systems that support such transactions as ATM withdrawals, bank deposits, and cash register scans at the grocery store represent transaction processing, one of the critical components of BI.
False
OLTP databases are optimized for output (querying/asking questions of the data) and data warehouses are optimized for input (getting new or updated data into the database).
False
OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.
False
One of the four components of BI systems, business performance management, is a collection of source data in the data warehouse.
False
Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.
False
Subject oriented databases for data warehousing are organized by detailed subjects such as disk drives, computers, and networks
False
The term intelligence in a BI context is used to describe clandestine operations dedicated to stealing corporate secrets, in the manner of the government's CIA and other covert agencies.
False
The two critical partnerships required for BI governance are (a) a partnership between functional area users and/or product/service area employees, and (b) a partnership between representatives of the marketing and vendor sides.
False
There is a web-based survey that asks you, "On a rating of 1(hated it) to 5(loved it), how much did you like the movie." This value is stored in your database and you need to categorize the statistical variable type. Which of the following variable types would be best?
Interval
For the following Frequency Distribution graph, what can be said about the Skewness?
It is <0
Which of the following measures of central location would be best to use when the Skewness is approximately zero?
Mean
Which of the following measures of central location would be best to use when the Skewness is highly positive or negative?
Median
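A small illustration of why the median is preferred for skewed data. The two datasets are made up for this sketch: one symmetric, one with a single extreme value on the right tail.

```python
import statistics

# Hypothetical data: the second list replaces 5 with an extreme value.
symmetric = [1, 2, 3, 4, 5]
skewed = [1, 2, 3, 4, 50]

sym_mean, sym_median = statistics.mean(symmetric), statistics.median(symmetric)
skw_mean, skw_median = statistics.mean(skewed), statistics.median(skewed)

# With no skew, mean and median agree, so the mean is a fine summary.
print(sym_mean, sym_median)
# With a long right tail, the mean is pulled up; the median stays put.
print(skw_mean, skw_median)
```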
A variable that contains the values of United States Zip Codes (aka Postal Codes) would best be categorized as which of the following variable types?
Nominal
Two things that are equivalent in some sense are given the same name or number. The name or number is nothing more than a way of grouping similar things together (you can't do math on them). NFL player numbers mean things (QB numbers are between 1-19, OL are in 60-79, etc.), but adding 12 to 65 does not get you an offensive QB. Zip Codes are a way of grouping buildings together, but adding zip codes means nothing.
Nominal
A variable that contains one of the three values "1st Place", "2nd Place", or "3rd Place" would best be categorized as which of the following variable types?
Ordinal
When categories are ordered and the order means something. Olympic finishes: 1st Place/2nd Place/3rd Place. Shirt sizes: S, M, L, XL.
Ordinal
Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is
Speed of Data Transfer
Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are
Subject-Oriented and Nonvolatile.
Which of the following is NOT a qualitative data type?
Surveys with numerical answers
You have a survey question that asks: "What is your likelihood that you will watch the next asteroid shower?" If you have survey results from 100 people (sample A) and the average response is 20% with a standard deviation of 5. You ask another 100 people (sample B) the same question and they have the same average of 20% but their standard deviation is 10. What can you say about the two different survey results?
There was a wider range of responses in Sample B than in Sample A but the overall likelihood was the same.
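To see how two samples can share a mean but differ in spread, here is a sketch with tiny made-up samples (not the survey data, which the card does not provide):

```python
import statistics

# Hypothetical responses in percent; both samples average 20.
sample_a = [15, 20, 25]
sample_b = [10, 20, 30]

mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
sd_a, sd_b = statistics.pstdev(sample_a), statistics.pstdev(sample_b)

# Same central value, but sample B's responses are spread further
# from the mean, so its standard deviation is larger.
print(mean_a, mean_b)
print(round(sd_a, 2), round(sd_b, 2))
```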
How are descriptive analytics methods different from the other two types?
They answer "what-is?" queries, not "what will be?" queries
Actionable intelligence is the primary goal of modern-day Business Intelligence (BI) systems vs. historical reporting that characterized Management Information Systems (MIS).
True
Because of performance and data quality issues, most experts agree that the federated architecture should supplement data warehouses, not replace them.
True
Data warehouse and BI initiatives typically follow a process similar to that used in military intelligence initiatives.
True
Many business users in the 1980s referred to their mainframes as "the black hole," because all the information went into it, but little ever came back and ad hoc real-time querying was virtually impossible.
True
One way an operational data store differs from a data warehouse is the recency of their data.
True
The overwhelming majority of competitive actions taken by businesses today feature computerized information system support.
True
Traditional BI systems use a large volume of static data that has been extracted, cleansed, and loaded into a data warehouse to produce reports and analyses.
True
Volume, velocity, and variety of data characterize the Big Data paradigm.
True
Online transaction processing (OLTP) systems handle a company's routine ongoing business. In contrast, a data warehouse is typically
a distinct system that provides storage for data that will be made use of in analysis.
For a column in your dataset, your data analysis tool is telling you that the standard deviation is zero. What does this say about the data in that column?
all values are the same
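A standard deviation of zero means no value deviates from the mean. A minimal check with an illustrative column of identical values:

```python
import statistics

# Hypothetical column in which every value is identical.
column = [7, 7, 7, 7]

# Every deviation from the mean is 0, so the standard deviation is 0.
sd = statistics.pstdev(column)
print(sd)
```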
Yes or No
binary
what is the best channel to reach my customer in each segment
channel optimization
Infinitely many, uncountable values on whatever scale we are using. Examples: time, number line, return on investment.
continuous
Customer risk of leaving?
customer attrition
what is the lifetime profitability of my customer
customer profitability
what market segments do my customers fall into and their characteristics?
customer segmentation
Business intelligence (BI) can be characterized as a transformation of
data to info to decision to action
The very design that makes an OLTP system efficient for transaction processing makes it inefficient for what?
end-user ad hoc reports, queries, and analysis
Which of the following is NOT an example that falls within the four major categories of business environment factors for today's organizations? a.) globalization b.) increased pool of customers c.) fewer government regulations d.) increased competition
fewer government regulations
In answering the question "Which customers are likely to be using fake credit cards?" you are most likely to use which of the following analytic applications?
fraud detection
Which of the following is a common way of visualizing the frequency distribution of data points over a range of possible values?
histogram
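A histogram is built by counting how many data points fall into each bin. The sketch below prints a text histogram; the data and the bin width of 5 are assumptions chosen for illustration.

```python
from collections import Counter

# Illustrative data and an assumed bin width of 5.
values = [10, 11, 11, 13, 14, 15, 15, 16, 17, 19, 20, 22]
bin_width = 5

# Map each value to the lower edge of its bin, then count per bin.
counts = Counter((v // bin_width) * bin_width for v in values)

# Print one bar per bin; the bar length is the frequency.
for start in sorted(counts):
    print(f"{start}-{start + bin_width - 1}: {'#' * counts[start]}")
```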
What can the BI users in an organization help guide and direct?
how the DW is structured and the types of BI tools and other supporting software that are needed
Once a data warehouse is in place, the general process of intelligence creation begins with
identifying and prioritizing specific BI projects
RateMyProfessor on a continuous scale between 1 & 5. Grading scales: 90-92 = A-, 92-100 = A. Salaries. Temperatures: the difference between 100 & 90 is the same as between 90 & 80.
interval
If a company's strategy is properly aligned with DW and BI initiatives, and if the company's IS organization can be made capable of playing its role in such a project, and if the requisite user community is in place and has the proper motivation, then
it has the basics down and you can go to the next steps of starting BI and establishing a BI Competency Center (BICC) within the company.
a graph that plots the cumulative frequency or the cumulative relative frequency of each class against the upper limit of the corresponding class
ogive
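The points of an ogive can be sketched by accumulating class frequencies and pairing each running total with the class's upper limit. The class limits and frequencies below are made up for illustration.

```python
from itertools import accumulate

# Hypothetical classes 10-14, 15-19, 20-24 and their frequencies.
upper_limits = [14, 19, 24]
frequencies = [5, 5, 2]

# Running total of frequencies gives the cumulative frequency.
cumulative = list(accumulate(frequencies))
total = cumulative[-1]

# Each ogive point: (upper limit, cumulative freq, cumulative relative freq).
ogive_points = [(u, c, c / total) for u, c in zip(upper_limits, cumulative)]
for point in ogive_points:
    print(point)
```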
Prescriptive BI capabilities are viewed as more powerful than predictive ones for all the following reasons EXCEPT
only prescriptive BI capabilities have monetary value to top-level managers.
In the Magpie Sensing case study, the automated collection of temperature and humidity data on shipped goods helped with various types of analytics. Which of the following is an example of prescriptive analytics?
optimal temperature setting
In answering the question "Which customers are most likely to click on my online ads and purchase my goods?" you are most likely to use which of the following analytic applications?
propensity to buy
Expressed verbally rather than numerically. Examples: conversations, magazine articles, media broadcasts.
qualitative
Numerical values. Examples: surveys, experiments.
quantitative
Goes a step further than interval by requiring that the ratios of values along the scale are meaningful. A rating of 4 is twice as good as a rating of 2. Example: weight. When a variable = 0.0, there is NONE of that variable. °F vs. °C vs. K = no/no/yes.
ratio
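The temperature example can be made concrete: 20 °C looks like "twice" 10 °C, but Celsius has an arbitrary zero, so that ratio is not meaningful. Converting to Kelvin, which has a true zero, gives the physically meaningful ratio. The helper name below is illustrative.

```python
def celsius_to_kelvin(celsius: float) -> float:
    """Convert degrees Celsius to kelvins (0 K is a true zero)."""
    return celsius + 273.15

# On the Celsius (interval) scale the naive ratio suggests "twice as hot".
naive_ratio = 20 / 10

# On the Kelvin (ratio) scale the ratio is meaningful and much smaller.
true_ratio = celsius_to_kelvin(20) / celsius_to_kelvin(10)

print(naive_ratio)
print(round(true_ratio, 4))
```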
Organizations counter the pressures they experience in their business environments in multiple ways. Which of the following is NOT an effective way to counter these pressures? a.) retroactive actions b.) reactive actions c.) anticipative actions d.) adaptive actions
retroactive actions
Which of the following Visualization charts would you use to plot the relationship between TWO variables?
scatterplot
Extreme values on the left tail. These values pull the mean DOWN.
skewness<0
means data evenly distributed on both sides of the mean
skewness=0
Means the data is positively skewed. Extreme values on the right tail. These values pull the mean UP. The handle of the "Bell" determines the skewness.
skewness>0
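The sign of the skewness in the three cards above can be checked numerically. This sketch uses the moment coefficient of skewness (third central moment divided by the second central moment raised to 1.5), which is one common definition; a given analysis tool may apply a slightly different sample correction. The datasets are made up.

```python
def skewness(data):
    """Moment coefficient of skewness: m3 / m2**1.5 (population form)."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in data) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical datasets: extreme value on the left, none, on the right.
left_tail = [1, 47, 48, 49, 50]
symmetric = [1, 2, 3, 4, 5]
right_tail = [1, 2, 3, 4, 50]

print(skewness(left_tail))   # negative: mean pulled DOWN
print(skewness(symmetric))   # zero: evenly distributed around the mean
print(skewness(right_tail))  # positive: mean pulled UP
```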
Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is
the processing power needed for the centralized model would overload a single computer.
In addition to deploying business intelligence (BI) systems, companies may also perform other actions to counter business pressures, such as improving customer service and entering business alliances.
true
The use of statistics in baseball by the Oakland Athletics, as described in the Moneyball case study, is an example of the effectiveness of predictive analytics.
true