ACCT4300 CH 4, 19, 5, 6
Suzette sends Jimmy a flat file with a list of all sales transactions the company made during the last year. Each line contains all the information about a single sale. Jimmy prepares a report that shows three different views of the data (1) the total sales for each quarter, (2) the total sales by customer, and (3) the total sales for the entire year. To make this report, Jimmy had to do which of the following to the data Suzette sent?
Aggregate data
Catalina was reviewing the data imputation formula for missing values in a customer's credit score. She found that two lines of data had the exact same customer name, birthdate, address, and buying history, but they had two different social security numbers. Assuming there was no fraud by the customer, what best describes what Catalina likely found?
Data contradiction error
A data owner sends you an e-mail with a file to prepare for analysis. The file contains data from multiple database tables all merged into a single file. There are multiple fields in the file each separated by a "~" symbol. For fields that contain large amounts of text, the file contains a "+" at the beginning and end of the text field. Indicate which of the following best describes (1) the type of file the data owner sent, (2), what the "+" is called, and (3) what the "~" is called.
Flat file, text qualifier, delimiter
Good questions should be "SMART." Which of the SMART objectives suggests that a question should relate to the objectives of the organization or the situation under consideration?
Relevant
Which of the following is the first stage of the database design process?
Systems analysis
Which statement is true?
The choice of cardinalities in an REA diagram reflects facts about the organization's business practices.
Chunhua has been building financial forecasting models for the company for several years. For each model, she saves all the data that could possibly be used in the model, even if she doesn't use all the data in her finished model. She does not document anything about the different items she has saved. When her intern, Minsuh pulls the data, she cannot understand what all the fields mean. How would Minsuh most accurately describe the data?
The data has become a data swamp
In most business environments, who is responsible for coordinating, controlling, and managing the database?
The database administrator
Which of the following statements is not true?
The minimum cardinality in a relationship in an REA diagram can be many.
____________ is a process of changing data into a format that another program can use.
Transforming
Which of the following techniques is most likely to discover an error where a data analyst did not correctly parse data from one field into two fields?
Visual inspection
One of the benefits of using a database is to allow an organization to change its data in one location without concern of affecting its data in another location. This benefit is called
data independence.
In an E-R diagram, relationships are depicted as
diamonds.
Annie Walker is a sales associate for Drug Co. Annie uses her company issued computer to access Drug Co.'s accounting information system when she travels across the country. She has access to view customer and product information. She also has access that allows her to enter and cancel customer orders. Annie's right to access those portions of Drug Co.'s AIS represents a(n) ________ in the company's database management system.
external-level schema.
The problem of not being able to add records to a database is called
insert anomaly.
If an instance of entity A can only be linked to one instance of entity B, the maximum cardinality for entity A in this relationship is
one.
The database used for processing current business transactions is called the
online transaction processing database (OLTP).
Extracting zip code from within an address field into a separate field is called _______________.
parsing
To answer the question of "What should be done," one would apply ____________________.
prescriptive analytics
Cardinalities in an REA diagram _________
reflect the business practices of the organization.
Using Robotic Process Automation is best for tasks that are ________________.
repetitive
In an REA diagram, suppliers would be examples of the type of entity called a(n):
Agent
Understanding what big data means helps to know what types of questions can be fruitfully examined using data. Big data differs from regular data in four ways, often called the "four V's." Which of the following is not one of the four V's?
.Validity
An update anomaly can occur if
an instance of the same data is stored in two or more places in the database.
External level schemas
are where rights are set for individual users.
When accountants build bots to help them with the tasks of their job, what description best explains the type of bot they would build?
A computer program that is designed to perform a specific task
Before becoming the CEO, Kurt designed a new toy for the company. Although the sales of the new toy are the same as other toys in the company, the CEO gives employees in the new toy division a reward and bonus. The CEO is likely showing what?
A confirmation bias
Which of the following statements about data aggregation is (are) true? (Check all that apply.)
A record usually has more than one field. A database usually has more than one file.
In an REA diagram, what kind of entity is "Employee"?
Agent
Using which of the following data validation techniques, can the validator estimate a likely error rate in the population of data?
Audit a sample
Jane needs to create a data dashboard for each employee showing their performance during the last quarter. To build this dashboard, she must download data from a system, reformat it, upload it to a new system, and then build a visualization. To do this, Jane uses a program to automatically do all of these steps. What Jane built is an example of which of the following?
Automation
Which of the following techniques is most likely to discover a very large data threshold violation in a dataset containing 10 billion transactions?
Basic statistical tests
At what stage(s) in the database design process is a data model created? (Check all that apply.)
Conceptual design Systems analysis
At which stage(s) of the database design process is the information needed to determine the cardinalities of relationships obtained? (Check all that apply.)
Conceptual design Systems analysis
Which of the following is an entity?
Customers Purchases Suppliers Sales
The DBMS language that builds the data dictionary is called
DDL.
Adi queries the company database to return all values from the field "FullAddress." Adi reviews the information and finds that half of the time the values store the city and country values before the street address and the other half of the time the street address is listed before the city and country. What type of error did Adi find in the database?
Data consistency error
Joleen queried the company database and returned 23 columns of information for her report. In examining the data, she noticed that one column only had values half of the time. Joleen decided to delete this column from her report. This is an example of which of the following?
Data filtering
________ is the process of changing the organization and relationships among data fields to prepare the data for analysis.
Data structuring
Santiago reviewed a recent extract of data about customer credit limits. He noticed that one company had a credit limit of $1,000,000,000 USD, whereas the next highest credit limit was $10,000 USD. What might have Santiago discovered?
Data threshold violation
Which data structure provides support for analytical processing by storing both detailed and summarized data?
Data warehouse
An analytic that answers the question, "why did this happen?" is best described as which of the following?
Descriptive analytic
Amitola created a dashboard showing key metrics about the accounts payable process at her organization. The dashboard showed various metrics including: the total number of vendors, the amount saved by paying vendors on early, and the number of late payments to vendors. Which of the following best describes the type of analytics included in the dashboard?
Descriptive analytics
Which of the following is the best example of correlation not being the same as causation?
During an economic downturn, a company changes its computer policy to only allow purchases of windows-based laptops and see profits go down. The company concludes that windows-based laptops cause profits to go down.
At what point in the ETL process should data validation take place?
During data cleaning .During data structuring During data standardization
The text mentions four guidelines for developing a properly structured relational database. Which of the following is not one of these guidelines?
Each object identified by the primary key must have at least three non-key attributes
Which of the following statements about database systems is (are) true? (Check all that apply.)
Encourages data sharing Minimizes data redundancy Provides data independence
In an REA diagram, information about all the payments received from customers would be stored in which type of entity?
Event
In an REA diagram, what kind of entity is "Receive Cash"?
Event
Which kind of entity in an REA diagram would contain information about business activities performed by an organization?
Event
Asking the right questions is the first step of an analytics mindset. Which of the following is not part of the analytics mindset as defined by the accounting firm EY?
Exercise professional skepticism when using data
The logical view of data is:
How people organize, view, and understand relationships among data items
Developing an REA diagram for a specific business cycle consists of which of the following step(s)?
Identify the events about which management wants to collect information. Determine the cardinalities of each relationship. Identify the resources affected by each event and the agents who participate in those events.
Which of the following statements are true? (Check all that apply.)
In an E-R diagram, attributes can be depicted in a two-column table that is separate from the graphical diagram itself In an E-R diagram, attributes can be depicted as named ovals attached to entities In an E-R diagram, relationships can be depicted as diamonds linked to two entities. In an E-R diagram, relationships can be depicted as a line between two entities
When data is aggregated, some of the detailed information is lost. Which of the following is needed if you want to show both the aggregated and disaggregated data together?
Joining the aggregated data with the disaggregated data
Analyn spent the entire day entering information about suppliers into the company database. She did not make a single spelling mistake in any of the entries. However, at the end of the day, Analyn notices that she entered the state into the country field for all of the data. The mistaken data values in the country field are best described as which of the following?
Misfielded data values
If analytics are performed well, it is certain that ____________________________.
None of these are necessarily certain.
Check each example of structured data in the list below.
Phone numbers of employees saved in a database Customer addresses saved in a customer relation database
At what stage(s) in the database design process is a data model used? (Check all that apply.)
Physical design Implementation and conversion Operation and maintenance
An analytic that answers the question, "what might happen in the future?" is best described as which of the following?
Predictive analytic
According to the EY Foundation, over which data analytic techniques should accountants gain mastery?
Querying, trends, forecasting
Check the likely benefits of using robotic process automation among the four options below.
RPA performs tasks faster than a human. RPA will make fewer mistakes for rules based tasks than a human.
Which of the following are access rights that can be granted in a subschema? (Check all that apply.)
Read Update Delete Create
In an E-R diagram, entities are represented as:
Rectangles
Which of the following statements is true?
Relationship cardinalities differ across companies because business practices differ across companies.
In an REA diagram, information about the various cash accounts that an organization has (checking, payroll, savings, etc.) would be found in which type of entity?
Resource
Which of the following best describes what data pivoting means?
Rotating data from rows to columns
Which is the correct sequence of steps in the database design process?
Systems analysis, conceptual design, physical design, implementation and conversion, operation and maintenance
Which details would not be lost if purchase data were at the level of aggregation associated with a Vendor?
TotalPurchases
Following the creation of an ETL process, the following action should be performed.
Update the data dictionary
Which data set would be considered to be at the level of aggregation associated with a Vendor?
VendorID, VendorName, Sum(PurchaseOrderTotal)
Assuming you need to validate data cleansing on a very large dataset, which technique would be most useful?
Visual inspection Audit a sample Statistical tests
A delimiter is _________.
a data element that separates field values
A set of interrelated, centrally coordinated data files that are stored with as little data redundancy as possible is called
a database.
Every employee at the Grizzley Corp can enroll in more than one training program, and each training program can have more than one employee to participate. Thus, the cardinality that exists between employee and training program is
a many-to-many (M:N) relationship.
The central organizing tenant of the file-oriented approach is to store data related to
a specific program.
The data manipulation language ______
changes database content, including updates, insertions, and deletions.
Joining ProductID with ProductDescription into a single field is an example of ____________________.
concatenation
The organization-wide view of the entire database that lists all data elements and the relationships between them is called
conceptual-level schema.
Using the number 4 to represent the East division of a football league is an example of ____________________.
cryptic data
A construction company classifies their projects into one of seven different types. To keep track of project classification, the clerk enters a number from 1 to 7 in the ProjectType field. The values 1 to 7 are best described as ________.
cryptic data values
When data is joined together it is called _________, when it is split apart it is called ________.
data concatenation, data parsing
While cleaning data, you reduce the dataset to rows with the same values in all columns except one. Most likely you are looking for ____________.
data contradiction errors
While cleaning data, you run a routine to reduce a dataset to rows with all the same values. Most likely you are looking for ____________.
data duplication
When an employee makes a mistake typing data into the system, it is called a _______.
data entry error
A collection of structure, semi-structured, and unstructured data stored in a single location is called a ___________.
data lake
Mary Grossman has been selling cupcakes for the past few years. She keeps meticulous record of every cupcake that she has ever made and sold. She has also kept records of customers' comments on all her cupcakes. These have been entered into a computerized accounting information system that her accountant refers to as a "data warehouse." Mary is considering rebranding her cupcake business and has asked her accountant to identify the top-selling cupcakes for the past few years. Mary wants her accountant to engage in
data mining.
The process of translating complex data analyses into easier to understand terms is ______________________.
data storytelling
The process of translating complex data into easier to understand terms is called ________.
data storytelling
A schema is defined as a(an)
description of data elements in a database, relationships among them, and logical model used to organize and describe data.
Normalization is the process of ______
designing a database to avoid anomalies.
In an REA model, activities about which management wants to collect information for planning or control purposes are known as
events.
An attribute in a table that serves as a unique identifier in another table and is used to link the two tables is a
foreign key.
The referential integrity rule states that
foreign keys must have values that correspond to the value of a primary key in another table.
Asking the right questions involves questions that are ____________.
measurable
Pivoting data _____________________.
often loses some detailed data
If at least one instance of entity A must participate in a relationship with an instance of entity B, the minimum cardinality for entity A in this relationship is
one.
The National Butterfly Society maintains a database of all types of butterflies within the United States. One database table will store the butterfly species name and family classification number, with each species uniquely identified by a registration number. A second table will contain butterfly families, uniquely identified by the family classification number, and will be linked to the species table by means of the family classification number. The primary key of the species table is
registration number.
The database view that allows a sales manager to view all customer information as if stored in a table is called
the logical view.
If it is optional whether a specific instance of entity A participates in a relationship with an instance of entity B, the minimum cardinality for entity A in this relationship is
zero.