Econ ch. 1 prep test
The entities on which data are collected are:
elements.
The sample size:
is always smaller than the population size.
Examples of quantitative data include all of the following except:
your zip code.
In a random sample of 200 items, 5 items were defective. An estimate of the percentage of defective items in the population is:
2.5% 5/200 = .025 = 2.5%
The number of observations in a complete data set having 20 elements and 3 variables is:
20
What percentage of countries have tax rates above the mean tax rate for the dataset? GDP - per capita compares GDP on a purchasing power parity (PPP) basis divided by population as of 1 July for the year 2014. HISTOGRAM SHOWING THE FREQUENCY OF TAX RATES
50%
Which of the following is not a type of data acquisition errors?
A particularly extreme value is verified, and is still included in the data set.
What is the principal connection between a sample and a population?
A random sample drawn from a population seeks to describe the characteristics of that population.
An experiment is conducted to study the effects of a new blood pressure medicine on subjects. A random sample of 60 people are placed into three groups of 20 according to their age (young, middle-aged, and senior). Each group of 20 people are randomly assigned to two treatment groups (new medicine and placebo). How many total treatment groups are there?
6 (Three age groups) x (two treatments) = 6 treatment groups.
Data obtained from a nominal scale:
can be either numeric or nonnumeric.
The major applications of data mining have been made by companies with a strong _____ focus.
consumer
What is the principal difference between time series data and cross-sectional data?
Cross-sectional data are limited to an approximate window of time, while time series data are collected over several time periods.
In a sample of 1600 registered voters, 912, or 57%, approve of the way the President is doing his job. The 57% approval rating is an example of:
descriptive statistics.
The Department of Homeland Security has noted that on average 1120 suspicious vehicles are stopped and searched each day in the United States. This number is used to estimate the number of cars stopped in an average yearly period. The average number of cars stopped is not an example of:
descriptive statistics.
The Department of Transportation of a city has noted that on average there are 17 accidents per day. The average number of accidents is an example of:
descriptive statistics.
The owner of a factory regularly requests a graphical summary of all employees' salaries. The graphical summary of salaries is an example of:
descriptive statistics.
The summaries of data, which may be tabular, graphical, or numerical, are referred to as:
descriptive statistics.
Statistical Inference:
is the process of drawing conclusions about the population based on the evidence taken from the sample.
Data mining deals with methods for developing useful decision-making information from large databases. It performs all actions except the:
reselling of the packaged data.
A company wants to enhance the benefits for its yearly healthcare package offering. It observes the set of employees who smoke on their break times at 10 a.m., 12 p.m., and 2 p.m. It records the number of cigarettes smoked by each individual. This is an example of:
an observational study.
Data collected through a survey attached to this month's pay stub:
is considered an observational study because no control is imposed.
In a sample of 200 students in a university, 40, or 20%, are communications majors. Based on the above information, the school's paper reported that "an estimated 20% of all the students at the university are communications majors." This report is an example of:
statistical inference.
All of the following are examples of observational studies except:
the behavior of Walmart shoppers after they are given a $20 gift card from the store.
Examples of cross-sectional data include all of the following except:
the comparison of sales output of all 10 salespeople in the Western Sales Region for the 3rd quarter.
The collection of all elements of interest in a particular study is:
the population of interest.
A time series is a sequence of data points, typically consisting of successive measurements made over a time interval. Examples of the time series include all of the following except:
the volume of shares traded today in the stock market.
The population of countries (n = 60) with the top GDPs in 2014 is shown above. Assume for the purposes of this problem that the mean and median GDP of this population are unknown. In reality, they are ($51,195, $44,600). A random sample of 10 countries (GDPs) was selected from this unknown population. What range would the sample mean most likely fall into?
$50K — $59.9K -The average is sensitive to outliers. The outlier to the far right pulls the mean toward it. Income distributions are commonly skewed right because there are far fewer super-rich people than impoverished in the world's population.
Examples of categorical data include all of the following except:
.228 lbs.
The American Statistical Association describes eight general topic areas and specifies important ethical considerations under each topic. One area is "Responsibilities to Research Subjects." It describes the requirements for protecting the interests of human and animal subjects of research not only during data collection but also in the analysis, interpretation, and publication of results of the findings. Which of the following is not an example of this?
Assuming legal privacy and confidentiality protections where they may not apply
The American Statistical Association describes eight general topic areas and specifies important ethical considerations under each topic. One area is "Professionalism." Professionalism points out the need for competence, judgment, diligence, self-respect, and worthiness of the respect of other people. Which of the following does not adhere to upholding the ethical guidelines for statistical practice?
Engage in discrimination based on personal characteristics.
Which of the following is used for data-driven decision making?
analytics
Statistical studies in which researchers do not control variables of interest are:
observational studies.
Which of these is a continuous and quantitative variable?
The time it takes to grill a steak
Some hotels ask their guests to rate the hotel's services as excellent, very good, good, and poor. This is an example of the:
ordinal scale.
The largest experimental statistical study ever conducted is believed to be for:
polio
A group of employees at a major grocery chain is tasked with analyzing point of sale data to predict the success of a marketing campaign they are considering launching. This is an example of:
predictive analysis.
In data mining, statistical models play an important role in developing _____.
predictive models
Anyone who wants to use the data and statistical analysis as aids to decision making must be aware of the time and cost issues. If important data are not readily available, it would be best to:
use a cross-sectional data set.
Big data is often defined according to the four v's of data: volume, variety, veracity, and ___________.
velocity
The main difference between prescriptive analysis and descriptive/predictive analysis is that prescriptive analysis:
yields a course of action.