Bus211
Mary has been tasked with reviewing a large data file. She wants to begin by inspecting the number of values in each cell, both numeric and non-numeric, for any blank entries. The plan is to find the blank or missing values for an initial review. Using Excel, what function(s) should she use to complete this task?
Both COUNT and COUNTA
In Analytic Solver, Aimee is trying to create a new column called RFM that merges multiple values into one cell. What is the function that accomplishes this called?
CONCATENATE
Kara is reviewing categories where a series of numbers represent the type of loan. She would prefer the actual name of the loan be retained when running her analysis. Using Analytic Solver, what function will allow Kara to retain the category name instead of recording them in numbers?
IF function
Henry wants to analyze income, but the sheer number of categories in the data's current form will make a clear analysis less meaningful. In Excel with Analytic Solver, how will Henry determine the frequency of each category to transform his data?
The income variable is selected, and Analytic Solver produces frequency levels for each income category from most to least frequent.
Using the omission strategy, what value would be placed in the missing observation in x1?
No value because excluded
When performing an analysis, one technique is called RFM. Which of the following is not reflective of RFM?
Relevancy
Select, From, and Where keywords are statements used in __________.
SQL
concatenate
allows for multiple cells to be merged into one cell.
IF function
allows for statements to be crafted to transform numbers into category names.
After creating bins using quantile(), use the __________ function to divide data into each bin and label it.
cut()
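A minimal sketch of the quantile() then cut() pattern, assuming a made-up income vector and hypothetical bin labels Q1 to Q4 (neither comes from the course material):

```r
# Hypothetical income values, invented for illustration only.
income <- c(12, 25, 31, 38, 47, 52, 60, 75, 88, 120)

# Bin edges at the 0%, 25%, 50%, 75% and 100% quantiles.
edges <- quantile(income, probs = seq(0, 1, by = 0.25))

# Assign each value to a bin and label it.
# include.lowest = TRUE keeps the minimum value inside the first bin.
income_bin <- cut(income, breaks = edges,
                  labels = c("Q1", "Q2", "Q3", "Q4"),
                  include.lowest = TRUE)

# Frequency of observations per bin.
table(income_bin)
```

Without include.lowest = TRUE, the smallest value would fall outside the first interval and be recorded as NA.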
The primary purpose of a(n) _____________ is to support decision-making and provide a composite view of the organization.
data warehouse
In R, Mary wants to understand the number of days between rain events in Chicago, IL. What function is used to find the number of days between today and January 1, 2019?
difftime
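A short sketch of difftime(), assuming two fixed, hypothetical dates so the result is reproducible (the card's "today" would be Sys.Date() instead):

```r
# Hypothetical dates, chosen only for illustration.
start_date <- as.Date("2019-01-01")
end_date   <- as.Date("2019-03-01")

# difftime(later, earlier) returns the elapsed time in the requested units.
days_between <- difftime(end_date, start_date, units = "days")

# Convert the difftime object to a plain number of days.
as.numeric(days_between)
```

Replacing end_date with Sys.Date() would give the number of days from January 1, 2019 to the current date.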
Which term represents data items, events, or things stored in a database file?
entity
In R, the following returns the value in column 3, row 2: myData[3,2].
false
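The answer is false because R indexes as [row, column]. A small sketch, assuming a hypothetical myData data frame invented for illustration:

```r
# Hypothetical data frame with 3 rows and 3 columns.
myData <- data.frame(id   = 1:3,
                     city = c("Chicago", "Peoria", "Aurora"),
                     rain = c(0.4, 1.2, 0.0))

# Row 3, column 2: the city in the third row.
myData[3, 2]

# Row 2, column 3: the rain value in the second row.
myData[2, 3]
```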
Using R, what function is used to evaluate the categories in the variable to identify the dummy variables?
ifelse
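One way ifelse() might be used to build a dummy variable, assuming a hypothetical loan_type vector (the category names are invented for illustration):

```r
# Hypothetical loan categories.
loan_type <- c("auto", "mortgage", "auto", "personal")

# Dummy variable: 1 when the loan is an auto loan, 0 otherwise.
is_auto <- ifelse(loan_type == "auto", 1, 0)

is_auto
```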
difftime
is used to determine the number of days between dates.
In R programming, the _________ function removes missing values from the dataset given in the parentheses.
na.omit()
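A brief sketch of na.omit(), assuming a hypothetical data frame with columns x1 and x2 and a few missing entries:

```r
# Hypothetical data with missing values in both columns.
scores <- data.frame(x1 = c(5, NA, 7, 9),
                     x2 = c(1, 2, NA, 4))

# na.omit() drops every row that contains at least one NA.
complete <- na.omit(scores)

# Number of rows that remain.
nrow(complete)
```

This is the omission strategy in practice: rows with a missing observation are excluded rather than filled in.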
Mark wants to have a better understanding of his client base at the credit union. To do so, he is running a report to show loan amount approval with corresponding credit scores. He realized the data set is quite large and wants to create categories by grouping. To do this, he needs to do all of the following except:
remove 20% of the data to create a training set
Which of the following is NOT a process of the data management system?
summarize
The function that provides a natural logarithm in Excel is
the LN function
Megan took a phone survey where each question posed had an answer range of unsatisfied to completely satisfied describing her purchase experience. Because the categories are in equal increments, the category can be recoded into a number transforming the category into what is called a category score.
true
In R programming, to see the entire data in a spreadsheet format, you use the _________ function.
View()