Ch 4, Descriptive Statistics

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

In a neighborhood there are five houses listed for sale for the following amoints: $250,000; $275,000; $280,000, $295,000, and $515,000. What is the best measure of center for the price of a house on the neighborhood?

Median

The second quartile is also the

Median 50th percentile

Which of the following are measures of center of a data set?

Median Mode Mean

Match the following terms with their meaning Mesokurtic= Platykuric= Leptokurtic=

Mesokurtic= Normal bell-shaped distribution Platykuric= A flatter distribution than normal with heavier trails Leptokurtic= A sharply peaked distribution with thinner trails

Which of the following correlation coefficients indicate the strongest inverse relationship between two variables?

-0.87

When the correlation coefficient approaches the value ____, it indicates that there is a weak relationship between the two variables.

0

In which of the following data sets would the arithmetic mean not be a good measure of central locations

0, 8, 8, 9, 10

If a data set has a standard deviation of 4 units and a mean of 10 units, the coefficient of variation is

0.4 (4/10)

Place in order, from beginning to end, the steps to calculate the mean absolute deviation

1. Calculate the arithmetic mean for the data set 2. Find the absolute difference between each data set value and the mean 3. Sum the absolute difference 4. Divide the sample (or the population) size

Place the following steps in order, from beginning to end, to create a box plot

1. Calculate the five-number summary 2. Plot the five-number summary values in numerical order on a horizontal or vertical axis 3. Draw a box from Q1 to Q3 then add lines from Q1 to the minimum value and Q3 to the maximum value

Place the for the method of medians in finding quartiles in the proper order

1. Sort the observations 2. Find the median for the entire data set, Q2 3. Find the median for the value above the below Q2

Place the step for using the method of medians in finding quartiles in the proper order

1. Sort the observations. 2. Find the median for the entire data set, Q2. 3. Find the median of the data values above and below Q2.

The population standard deviation of the data set 3, 4, 5, 6, and 7 is ______ (Round your final answer to 1 decimal place)

1.4

Inner fences on a boxplot are ____ x IQR above Q3 and below Q1. Outer fences are ___ x IQR above Q3 and below Q1.

1.5 3

The sample standard deviation of the data set 3, 4, 5, 6, and 7 is _____ (Round your final answer to 1 decimal place.)

1.6

Calculate the standardized score of the following data value. Assume the mean + 100 and the standard deviation = 25: x= 60, z =

1.6.

A certain value has a standard score = 1.75. How man y standard deviation from the mean does the value fall? Is the value greater than or less than the mean

1.75 greater than the mean

A certain value has a standardized sore = 1.75. how many standard deviations from the mean does this value fall? Is the value greater than or less than the mean?

1.75. Greater than the mean

Randall Racer runs the 100 meter dash in an average time of 10.4 seconds with a standard deviation of 0.1 seconds. If Randall's time are normally distributed, we have a 99.7% expectation that his finishing time for his race will be grayer than ____ seconds.

10.4

The maximum value of a data set is 200 and the minimum value is 80. The midrange is equal to ____

140

Nadia purchased 400 shares of XYZ stocks at $20 per share. When the stock decreased in value to $16 a share, Nadia purchased 600 more shares of XYZ stock. The weighted average price per share that Nadia paid for XYZ stock is $ ____ (use 2 decimal places).

17.60

The midhinge for data with Q1 = 10 and Q3 = 45 is _____.

27.5

If the revenue over a four-year period was $2000, $2000, $3000, and $5000, what is the geometric mean revenue? Round to a whole number

2783

If a company sold 1000 units in its first year of operation, and 1400 units in its second year of operation, and 1680 units in the third year of operation. The average growth rate of the company's sales for years one to three is ____ (Round the final answer to a decimal answer with four places and then convert to % with 2 decimals).

29.61%

If a company sold 1000 units in its first year of operation, and 1400 units in its second year of operation, then the growth rate of the company sales is

40%

The mean for the data sets 6, 4, 9, 5 is

5.5

If the median price for a home is $200,000, then ____ of homes cost less than $200,000.

50%

For the data set 4, 5, 6, and 9 the arithmetic mean is

6

The range of the data set: 2, 5, 5, 7, and 10 is

8

Using Chebyshev's Theorem at least ____ % of observations should fall within 2.5 standard deviation of the mean

84

The empirical rule states that approximately_____ of observations will fall within two standard deviation of the mean.

95%

The owner of BevaMart wants to study the relationship between the temperature and hot chocolate sales. The owner compared the covariance between temperature and hot chocolate sales to be -81.46. Based on the covariance, which option best describes the liner relationship between temperature and hoy chocolate?

As the temperature increases, hot chocolate sales decrease.

When calculating a percentile, the first step is to arrange the data set in

Ascending order. (from least to greatest)

Which of the following can be used to determine the proportion of data point that fall within a specific number of standard deviations from the mean?

Chebyshev's Theorem The empirical rule-assuming a normal distribution

The skewedness coefficient can be used to

Compare two samples with different measurement units Compare one sample to a known reference distribution.

The skewness coefficient can be sued to

Compare two samples with different measurements units Compare one sample to a known reference distribution.

The correlation coefficient values

Fall between -1 and +1, inclusive.

True or False: Summaries of grouped observations are just as accurate as summaries of a data set of individual observations.

False

True or false Chebyshev's Theorem should only be applied to data sets that are normally distributed

False

Place the steps in order, from beginning to end to calculate a mean for grouped data

Find the midpoint for each class of grouped data Multiply the midpoint of each class by the number of observations in its class Sum the products of the midpoint and observations Divide by the total number of observations

Standard deviation can be compared

For data sets with the same measurement units and similar magnitude. For data sets with the same measurement units.

Which of the following situations are valid reasons for removing an outlier from a data set

If the observed value was taken from a population different from the one under study If the data point was typed incorrectly into the spreadsheet.

The interquartile range of a data set

Is calculated by subtracting the first quartile from the third quartile Represents the middle 50% of the data.

Which of the following is not a characteristic of the midrange?

It is robust to outliers

Which shape matches the mean and median relationship Lest skewed= Right skewed= Symmetrical=

Lest skewed= Mean < Median Right skewed= Mean > Median Symmetrical= Mean = Median

When estimating sigma using the following formula: (Xmax - Xmin) / 6, one is assuming the distribution is

Normal

The summary measures for grouped data are

Only approximate values

Find the first and third quartiles from yhe following data set using the method of medians: 2, 3, 3, 5, 6, 8, 12.

Q1= 1 Q3= 8

Which of the following characteristics can be seen on a boxplot?

Shape Variability Center

When comparing two data sets with different units of measurements, what is the relative measure of dispersion

The coefficient of variation

If the midhinge > median

The data are skewed right

Accuracy of grouped estimates depend on

The distribution of data within the bins. The bin frequencies. The number of bins.

A box plot is contracted using several different values. Which of the following values from a data set are included in a box plot?

The first quartile The largest value

Generally, skewness can be assessed by comparing

The mean and median

A box plot contracted using several different values. Which of the following values are included in a box plot?

The smallest value The third quartile The second quartile

Two widely used measure of dispersion are

The variance and the standard deviation

The mode for the data set: 4, 5, 6, 9 is

There is no mode

Which of the items below describes the usefulness of a standard deviations?

To gauge the relative position of data values within the data set.

True or false: The arithmetic mean in the average of a data set.

True

When monitoring a process distribution, both the _____ and the _____ must be tracked

center variability

The standard value of the covariance is called the ___ (one word) coefficient.

correlation

The variance measures the average squared _________ from the ______

deviation mean

When a data set is symmetrical the mean and median are approximately ____

equal

Suppose a data set has 80 data points. A 5% trimmed mean would be calculated by removing the _____ highest values and the ______ lowest values.

four four

If the covariance is positive, then as one variable increases the other variable will generally

increase

The average of the absolute difference between the values of the data set and the mean is the

mean absolute deviation

Generally, the ______ is the best measure of center when outliers are present.

median

Generally, the _______ is the best measure of center when outliers are present.

median

The measure of center where half the value of data set lies above this measure and half the values of the data set lie below this measure is known as the ______

median

An owner of a grocery store wanted to determine the brands of soda that consumers purchase at the store. When summarizing the data about soda brand purchase the meaningful measure of center is the

mode

The _____ is the measure of center that identifies the most frequency occurring value in the data set.

mode

When summarizing a qualitative data set the ______ is the best measure of central location.

mode

When a quantitative data set is significantly affected by ______, then the arithmetic mean is usually not a good measure of central location.

outliers

The _____ measures the difference between the largest and smallest values in a data set.

range

The square root of the average squared deviation of data values from their mean is known as the_______ _______.

standard deviation

In general, a data point is considered an outlier if it falls more than _____ standard deviation away from the average.

three

The average squared difference of data values from their mean is the _______

variance

Multiply data values by a fraction (where the fraction add to 1) and summing results in a _____ mean.

weighted


Ensembles d'études connexes

RE Exam Prep: Contract law Subtopic

View Set

Construction Productivity Midterm

View Set

A&P Chapter 5.1 Functional Anatomy of the Skin

View Set