Business Analytics Final Review
Much like the IMPACT cycle, requesting data is often an iterative process a. true b. false
a. true
Comparing the number of records within the data is an example of which of the following? a. validating the data for completeness b. cleaning the data c. obtaining the data d. validating the data for integrity
a. validating the data for completeness
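A completeness check of this kind can be sketched in a few lines. The example below is a minimal illustration, assuming a hypothetical `invoices` table in an in-memory SQLite database and a hypothetical extract held as a Python list; none of these names come from the course material.

```python
import sqlite3


def record_count(conn, table):
    """Return the number of rows in `table` (the table name is trusted here)."""
    return conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]


# Hypothetical source database holding three invoice records.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE invoices (invoice_id INTEGER PRIMARY KEY, amount REAL)")
source.executemany(
    "INSERT INTO invoices VALUES (?, ?)",
    [(1, 100.0), (2, 250.5), (3, 75.25)],
)

# Hypothetical extract that silently dropped one record.
extracted_rows = [(1, 100.0), (2, 250.5)]

source_count = record_count(source, "invoices")
extract_count = len(extracted_rows)

# The extract is complete only if both counts agree.
is_complete = source_count == extract_count
print(source_count, extract_count, is_complete)
```

A mismatch in the counts signals that the extraction should be repeated or investigated before any analysis starts.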
Removing leading zeroes and non-printable characters from the data is an example of which of the following? a. validating the data for completeness b. cleaning the data c. obtaining the data d. validating the data for integrity
b. cleaning the data
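As a rough illustration of this cleaning step, the sketch below strips non-printable characters and leading zeroes from a text field. The function name `clean_field` is hypothetical, and the rule of keeping a single zero for an all-zero field is an assumption made for the example.

```python
def clean_field(value):
    """Remove non-printable characters and leading zeroes from a text field."""
    # Keep only printable characters (drops tabs, newlines, NUL bytes, etc.),
    # then trim surrounding spaces.
    printable = "".join(ch for ch in value if ch.isprintable()).strip()
    # Strip leading zeroes.
    stripped = printable.lstrip("0")
    # Assumption: a field that was all zeroes becomes a single "0", not "".
    if printable and not stripped:
        return "0"
    return stripped


print(clean_field("00042\x00"))
print(clean_field("\t007A\n"))
```

Leading zeroes are sometimes meaningful (for example in postal codes), so in practice this rule would be applied only to fields where they are known to be noise.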
A composite primary key is made up of three or more primary keys in the tables that it is linking a. true b. false
b. false
A foreign key is an attribute that is required to exist in each table of a relational database and serves as the unique identifier for each record in a table a. true b. false
b. false
Comparing the number of records that were extracted to the number of records in the source database is an example of validating the data for integrity a. true b. false
b. false
Once you have extracted the data of interest, it will need to be validated for completeness and existence a. true b. false
b. false
The M in IMPACT Cycle represents Manipulating the Data a. true b. false
b. false
Unlike the IMPACT cycle, requesting data is not an iterative process a. true b. false
b. false
Relational databases help to reduce redundant data. Which of the following is NOT a reason to reduce redundant data? a. it increases the risk of data-entry errors b. it is easier to perform analysis in spreadsheets c. it takes up unnecessary space d. it is expensive
b. it is easier to perform analysis in spreadsheets
At which step of the ETL process should you try to answer the question "What tools will be used to perform data analytics tests or procedures and why?" a. step 5: loading the data for data analytics b. step 2: obtain the data c. Step 1: determine the purpose and scope of the data request d. Step 3 or 4: transformation
b. step 2: obtain the data
There are a variety of methods that you could take to retrieve the data, including SQL. What does SQL stand for? a. structured question language b. structured query language c. systems question language d. systems query language
b. structured query language
Which of the following best describes the purpose of a primary key? a. to provide business information, but are not required to build a database b. to ensure that each row in the table is unique c. to support business processes across the organization d. to create the relationship between two tables
b. to ensure that each row in the table is unique
The objective of loading data is a. to identify which approach to data analytics should be used b. to load the data into the appropriate tool for analysis c. to identify and obtain the data from the appropriate source d. to validate the data for completeness and integrity
b. to load the data into the appropriate tool for analysis
What is the purpose of the Audit Data Standards? a. to increase the cost of audits b. to provide a guide to standardize audit data requests c. to create standardized data storage systems d. to help auditors learn SQL code
b. to provide a guide to standardize audit data requests
Comparing descriptive statistics for numeric fields within the data is an example of which of the following? a. obtaining the data b. validating the data for completeness c. validating the data for integrity d. cleaning the data
b. validating the data for completeness
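One way to picture this check is to compute the same summary figures on the source and on the extract and then list where they differ. The sketch below uses made-up amounts and a hypothetical helper `describe`; it is an illustration, not a prescribed procedure.

```python
import statistics


def describe(values):
    """Return basic descriptive statistics for a numeric field."""
    return {
        "count": len(values),
        "sum": round(sum(values), 2),
        "min": min(values),
        "max": max(values),
        "mean": round(statistics.mean(values), 2),
    }


# Hypothetical amounts in the source system.
source_amounts = [100.0, 250.5, 75.25, 980.0]
# Hypothetical extract in which the last value lost a digit.
extract_amounts = [100.0, 250.5, 75.25, 98.0]

source_stats = describe(source_amounts)
extract_stats = describe(extract_amounts)

# Names of the statistics that do not match between source and extract.
mismatches = sorted(k for k in source_stats if source_stats[k] != extract_stats[k])
print(mismatches)
```

Here the record counts agree, yet the sum, maximum, and mean do not, which is why comparing record counts alone may not be enough.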
There are many times when using SQL is the best option for extracting data, but sometimes it is not preferred. Which of the following is an example of when SQL would NOT be a preferred method of data extraction? a. when the data is stored across different tables in a relational database b. when the data is already stored in Excel c. when you wish to extract precise attributes and records that fit your criteria d. when the data in the table you wish to analyze is too large for Excel's resources
b. when the data is already stored in Excel
Formatting negative numbers in the data is an example of which of the following? a. obtaining the data b. validating the data for integrity c. cleaning the data d. validating the data for completeness
c. cleaning the data
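Negative amounts often arrive in several notations, such as parentheses or a trailing minus sign. The sketch below normalizes a few such notations into ordinary negative numbers; the function name `parse_amount` and the set of formats handled are assumptions chosen for illustration.

```python
def parse_amount(text):
    """Convert a formatted amount string into a float with a standard sign."""
    # Drop surrounding spaces, thousands separators, and a dollar sign.
    cleaned = text.strip().replace(",", "").replace("$", "")
    negative = False
    if cleaned.startswith("(") and cleaned.endswith(")"):
        # Accounting style: (1,234.50)
        negative = True
        cleaned = cleaned[1:-1]
    elif cleaned.endswith("-"):
        # Trailing minus: 1234.50-
        negative = True
        cleaned = cleaned[:-1]
    elif cleaned.startswith("-"):
        # Leading minus: -1234.50
        negative = True
        cleaned = cleaned[1:]
    amount = float(cleaned)
    return -amount if negative else amount


for raw in ["(1,234.50)", "1234.50-", "-1234.50", "$99.00"]:
    print(raw, parse_amount(raw))
```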
Mastering the data can also be described via the ETL process. The ETL process stands for: a. enter, transform, and load data b. extract, total and load data c. extract, transform, and load data d. enter, total, and load data
c. extract, transform, and load data
Which of the following describes a means of maintaining all of your data in one place, instead of across different related tables? a. Microsoft Access b. Microsoft SQL Server c. flat file d. SQLite
c. flat file
The data dictionary is paramount in helping data analysts do which of the following? a. track outcomes b. communicate insights c. identify the data they need to use d. maintain databases
c. identify the data they need to use
All of the following are Audit Data Standards (ADS) developed by the American Institute of Certified Public Accountants except a. general ledger standards b. order-to-cash subledger c. investments subledger standards d. procure-to-pay subledger standards
c. investments subledger standards
At which step of the ETL process should you try to answer the question "What business problem will the data address?" a. Step 2: obtain the data b. Step 3 or 4: transformation c. Step 1: determine the purpose and scope of the data request d. step 5: loading the data for data analytics
c. step 1: determine the purpose and scope of the data request
Which of the following is most likely to be the primary key in an Employee table? a. Employee ID b. Employee Type c. Employee Name d. Employee Social Security Number
a. Employee ID
Correcting inconsistencies across data is an example of which of the following? a. cleaning the data b. validating the data for integrity c. validating the data for completeness d. obtaining the data
a. cleaning the data
Removing headings or subtotals from data is an example of which of the following? a. cleaning the data b. validating the data for completeness c. validating the data for integrity d. obtaining the data
a. cleaning the data
All of the following are benefits of using a normalized relational database except a. data is stored in one place b. no redundancy c. business rules are enforced d. completeness
a. data is stored in one place
A data dictionary is paramount in helping database administrators do which of the following? a. maintain databases b. communicate insights c. track outcomes d. identify the data they need to use
a. maintain databases
At which step of the ETL process should you try to answer the question "What other information will impact the nature, timing and extent of the data analysis?" a. Step 1: determine the purpose and scope of the data request b. step 3 or 4: transformation c. step 5: loading the data for data analysis d. step 2: obtain the data
a. step 1: determine the purpose and scope of the data request
At which step of the ETL process should you try to answer the question "Where are the data located in the financial or other related systems?" a. Step 2: obtain the data b. Step 5: loading the data for data analysis c. step 1: determine the purpose and scope of the data request d. step 3 or 4: transformation
a. step 2: obtain the data
Which of the following is NOT true regarding Primary Keys? a. the purpose of the primary key is to uniquely identify each column in a table b. primary keys do not provide meaningful business information c. a primary key is typically a sequential number instead of something truly descriptive about each record d. primary keys are required in each table in a relational database
a. the purpose of the primary key is to uniquely identify each column in a table
Which of the following best describes the purpose of a foreign key? a. to create the relationship between two tables b. to support business processes across the organization c. to provide business information d. to ensure that each row in the table is unique
a. to create the relationship between two tables
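The roles of the primary key and the foreign key can be seen in a small example. The sketch below uses Python's built-in `sqlite3` module with two hypothetical tables, `customer` and `sales_order`; the table names, columns, and data are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,   -- primary key: unique per row
    name        TEXT NOT NULL          -- descriptive (non-key) attribute
);
CREATE TABLE sales_order (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customer(customer_id),  -- foreign key
    total       REAL NOT NULL
);
INSERT INTO customer VALUES (1, 'Acme'), (2, 'Globex');
INSERT INTO sales_order VALUES (10, 1, 500.0), (11, 1, 125.0), (12, 2, 80.0);
""")

# The foreign key lets the two tables be joined back together.
rows = conn.execute("""
    SELECT c.name, SUM(o.total)
    FROM sales_order AS o
    JOIN customer AS c ON c.customer_id = o.customer_id
    GROUP BY c.name
    ORDER BY c.name
""").fetchall()
print(rows)

# An order that points at a non-existent customer is rejected.
try:
    conn.execute("INSERT INTO sales_order VALUES (13, 99, 10.0)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
print(orphan_rejected)
```

The customer name is stored once in `customer` and referenced by key from `sales_order`, which is how a relational design avoids repeating it on every order.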
The objective of data extraction is: a. to identify and obtain the data from the appropriate source b. to validate the data for completeness and integrity c. to load the data into the appropriate tool for analysis d. to identify which approach to data analytics should be used
a. to identify and obtain the data from the appropriate source
Which of the following best describes the purpose of a non-key attribute? a. to provide business information b. to support business processes across the organization c. to create the relationship between two tables d. to ensure that each row in the table is unique
a. to provide business information
The objective of data transformation is a. to validate the data for completeness and integrity b. to identify which approach to data analytics should be used c. to load the data into the appropriate tool for analysis d. to identify and obtain the data from the appropriate source
a. to validate the data for completeness and integrity
A flat file is a means of storing data in one place, such as in an Excel spreadsheet, as opposed to storing the data in multiple tables, such as in a relational database a. true b. false
a. true
A foreign key is an attribute that exists in relational databases in order to carry out the relationship between two tables a. true b. false
a. true
A primary key is an attribute that is required to exist in each table of a relational database and serves as the unique identifier for each record in a table a. true b. false
a. true
A template can make communication easier between data requestor and provider. a. true b. false
a. true
After obtaining the data and determining the purpose and scope of the data request, the next step is to validate the data a. true b. false
a. true
Data normalization can reduce data redundancy and improve data integrity a. true b. false
a. true
Descriptive attributes are attributes that exist in relational databases that are neither primary nor foreign keys a. true b. false
a. true
Formatting negative numbers is an example of cleaning the data a. true b. false
a. true
If the extraction and transformation steps have been done correctly, the loading part of the ETL process should be the simplest step a. true b. false
a. true
In order to obtain the right data, it is important to have a firm grasp of what data is available and how it is stored a. true b. false
a. true
Mastering the data requires a firm understanding of what data is available to you and where it is stored, as well as being skilled in the process of extracting, transforming, and loading (ETL) a. true b. false
a. true
When obtaining the data yourself, you should do all of the following before you begin except a. identify the tables that contain the information you need b. identify which attributes specifically hold the information you need in each table c. identify how those tables are related to each other d. identify any errors or issues from the extraction
d. identify any errors or issues from the extraction
When using [EmployeeID] as the unique identifier of the Employee table, [EmployeeID] is an example of which of the following? a. foreign key b. composite key c. key attribute d. primary key
d. primary key
Which of the following best exemplifies a way that data will need to be cleaned after extraction and validation? a. compare string limits for text fields b. remove trailing zeroes c. validate date/time fields d. remove headings and subtotals
d. remove headings and subtotals
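Report-style exports often mix detail rows with title, heading, and subtotal rows. The sketch below filters such an export down to its detail rows. The sample data and the rule used to recognize a detail row (a numeric invoice number in the second column) are assumptions made for this example.

```python
# Hypothetical report export: title, column headings, detail rows, subtotals.
raw_rows = [
    ["Sales Report", "", ""],
    ["Region", "Invoice", "Amount"],
    ["East", "1001", "200.00"],
    ["East", "1002", "150.00"],
    ["Subtotal East", "", "350.00"],
    ["West", "1003", "90.00"],
    ["Subtotal West", "", "90.00"],
    ["Grand Total", "", "440.00"],
]


def is_detail_row(row):
    # Assumption: only detail rows carry a numeric invoice number.
    return row[1].isdigit()


detail_rows = [row for row in raw_rows if is_detail_row(row)]

# With subtotals removed, the amounts can be summed without double counting.
total = sum(float(row[2]) for row in detail_rows)
print(len(detail_rows), total)
```

If the subtotal rows were left in, summing the amount column would count each sale more than once.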
All of the following are included in the five steps of the ETL process except a. obtain the data b. determine the purpose and scope of the data request c. validate the data for completeness and integrity d. scrub the data
d. scrub the data
Which of the following best describes the purpose of relational databases? a. to increase information redundancy in the organization b. to provide business information to data analysts c. to ensure that business rules are enforced d. to support business processes across the organization
d. to support business processes across the organization
What is one of Excel's tools for joining data from two separate spreadsheets? a. sumif b. sql c. datarequest d. vlookup
d. vlookup
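The idea behind an exact-match VLOOKUP (find a key in one sheet and pull a value from the matching row in another) can be sketched in Python with a dictionary. This is an analogy rather than Excel's implementation, and the two "sheets", their column names, and the data are hypothetical.

```python
# Hypothetical sheet 1: employees.
employees = [
    {"employee_id": "E01", "name": "Ana"},
    {"employee_id": "E02", "name": "Ben"},
    {"employee_id": "E03", "name": "Cho"},
]

# Hypothetical sheet 2: department assignments (E03 is missing).
departments = [
    {"employee_id": "E01", "department": "Audit"},
    {"employee_id": "E02", "department": "Tax"},
]

# Build a lookup table keyed on the shared identifier.
lookup = {row["employee_id"]: row["department"] for row in departments}

# For each employee, fetch the matching department; "#N/A" marks no match,
# mirroring what Excel shows when an exact-match lookup finds nothing.
joined = [
    {**row, "department": lookup.get(row["employee_id"], "#N/A")}
    for row in employees
]

for row in joined:
    print(row["employee_id"], row["name"], row["department"])
```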