Module 3
Data manipulation is performed in places like...
... Tableau Prep Builder (rather than a sheet or something)
The human brain can only distinguish approximately _______ colors at a time.
8
Dashboards
A collection of views from multiple worksheets
Heat Map
Data visualization in which two measures can be displayed using size and color to express value
When the decision maker must consider several possible outcomes for each alternative, each with a given probability of occurrence, this is decision making under __________.
Decision making under risk is when the decision maker must consider several possible outcomes for each alternative, each with a given probability of occurrence.
The Trust Services Framework reliability principle that states sensitive information be protected from unauthorized disclosure is known as ________________________.
The Trust Services Framework reliability principle that states sensitive information be protected from unauthorized disclosure is known as confidentiality.
At what point in the systems development life cycle does the company determine how the conceptual AIS design is to be implemented?
The company determines how the conceptual AID design is to be implemented in the physical design phase of the SDLC.
What is the fundamental challenge of dashboard design?
The fundamental challenge of dashboard design is ensuring that the required information is shown clearly on a single screen.
Which field would not be used to create a hash control total? a. item number b. amount c. sales order number d. quantity ordered
b. amount
Duplicate checking of calculations and preparing bank reconciliations and monthly trial balances are examples of what type of control?
Duplicate checking of calculations and preparing bank reconciliations and monthly trial balances are examples of detective controls..
How does Hadoop work?
It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
Picture Superiority Effect
Pictures are retained at much higher rates than words
Live Connection
Slower, but the data will update in real time; can create dynamic dashboards with real time updated information
What are the three fundamental information security concepts?
The time-based model of security focuses on the relationship between preventive, detective, and corrective controls. Security is a management issue, not a technology issue. The idea of defense-in-depth employs multiple layers of controls.
The most common method for solving a risk analysis problem is to select the alternative with the
greatest expected value.
Scatterplot
shows the relationship between data
When data is concatenated, it is...
... joined together by things like hyphens; a series/chain that is linked together
Data should be oriented so that...
... people can easily read it. For instance, it should ideally be oriented left to right, because we read English from left to right.
When you add a measure to the columns or rows shelf...
... you add an axis to the view that shows data points that lie within a range of values
The first step in analyzing your data should always be...
...to examine it visually. Visualization can play a critical role in helping you figure out what the interesting questions are.
Five Questions to Consider in Dashboard Review
1. What problem/question does this solve/answer? 2. Is this really the best way to display the information? 3. Does everything add value? 4. Is there functional interactivity? Are there clear labels?
A best practice for orienting data is:
A best practice for orienting data is left to right orientation.
Which kind of chart is described as an enhanced version of a scatter plot?
A bubble chart is an enhanced version fo a scatter plot.
Stories
A collection of sheets arranged in sequence purpose is to tell a story each sheet is called a story point
A ________________ is a collection of several worksheets and supporting information shown in a single place so you can compare and monitor a variety of data simultaneously.
A dashboard is a collection of several worksheets and supporting information shown in a single place so you can compare and monitor a variety of data simultaneously.
A ________________ is a large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption.
A data lake is a large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption.
Treemaps
A data visualization used to display data nested in rectangles; show proportions of the whole
hash control total
A hash control total is the sum of an attribute in a file that has no real meaning or use. For example the sum of the customer number field of all the records in a batch is a meaningless number for purposes other than as a control total. But if it is calculated when the batch is first assembled. the computer can recalculate it after the records have been entered for processing. If the computer-generated sum is the same as the original amount, we have some assurance that all records have been processed.
A _________ is a data visualization in which two measures can be displayed by using size and color to express values.
A heat map is a visualization in which two measures can be displayed by using size and color to express values.
A ______________________ can determine whether the necessary control procedures are in place.
A systems review can determine whether the necessary control procedures are in place.
A newly popular unit of data in the Big Data era is the exabyte (EB), which is ______ bytes.
An exabyte (EB) is 10 to the eighteenth power bytes.
An ____________________ is created separately from the enterprise data warehouse by a department and not reliant on it for updates.
An independent data mart is created separately from the enterprise data warehouse by a department and not reliant on it for updates.
A(n) _____________________________ audit is concerned with the economical and efficient use of resources and the accomplishment of established goals and objectives.
An operational or management audit is concerned with the economical and efficient use of resources and the accomplishment of established goals and objectives.
In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected?
Anomalies are detected and corrected in the Transformation stage of the ETL process.
Which of the following is not one of the basic actions that an organization must take to preserve the confidentiality of sensitive information? a. Identification of information to be protected. b. Controlling access to the information. c. Backing up the information. d. Training.
Backing up the information is not one of the basic actions that an organization must take to preserve the confidentiality of sensitive information.
___________________ is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies.
Business Intelligence (BI) is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies.
Pre-attentive Visual Attributes
Can be processed before we are even really paying attention
Discrete
Can only take on one of a limited set of distinct and separate values Discrete pills are blue Text and categories are inherently discrete
Continuous
Can take on any value within a range Continuous pills are green Numbers tend to be continuous (can be discrete tho)
What is a major drawback of canned software?
Canned software may not meet all of a company's information or data processing needs.
Classification of confidential information is the responsibility of whom, according to COBIT5?
Classification of confidential information is the responsibility of the information owner?
A customer forgot to include her account number on her check, and the AR clerk credited her payment to a different customer with the same last name. Which control could have been used to most effectively prevent this error?
Closed-loop verification
The four parts of the transaction processing cycle are:
Data input Data storage Data processing Information output
In which phase of the SDLC does developing a general framework for implementing user requirements occurDeveloping a general framework for implementing user requirements occur?
Developing a general framework for implementing user requirements occurs during the conceptual systems design phase of the SDLC.
A ________________ variable is a variable that can only take on a certain number of values. In other words, they don't have an infinite number of values. For example, the number of quarters in a purse, jar, or bank.
Discrete variables are variables that can only take on a certain number of values. In other words, they don't have an infinite number of values. For example, the number of quarters in a jar.
How are enterprise resources planning (ERP) systems related to supply chain management (SCM) systems?
ERP systems and SCM systems are complementary systems (siblings).
Extract Feature
Essentially taking a snapshot of the data and working with the data in memory; results in a faster experience
Dimensions
Generally contain qualitative data Discrete; usually (not always) categorical e.g. names. dates, geographic regions Come into view as themselves (as opposed to measures, which come into view as aggregates)
Identifying and preventing incorrect claim payments and fraudulent activities falls under which type of data mining applications?
Identifying and preventing incorrect claim payments and fraudulent activities falls under the insurance type of data mining applications.
If a simulation result does NOT match the intuition or judgment of the decision maker, what can occur?
If a simulation result does not match the intuition or judgement of the decision maker, a confidence gap can occur.
Important spreadsheet features for modeling include:
Important spreadsheet features for modeling include: macros goal seeking what-if analysis
__________________ is the primary output of an AIS.
Information is the primary output of an AIS.
Information that is free from error or bias and accurately represents the events or activities of the organization is ________________.
Information that is free from error or bias and accurately represents the events or activities of the organization is reliable.
KPIs are metrics typically used to measure __________________.
KPIs are metrics typically used to measure internal results.
Kaplan and Norton developed a report that presents an integrated view of success in the organization called _________________________.
Kaplan and Norton developed a report that presents an integrated view of success in the organization called balanced-scorecard type reports.
For discrete data, try to ([limit] or [emphasize]) color.
Limit color for discrete data. Under 5 colors is ideal. Using too many colors makes it hard to distinguish and also requires frequent referencing of the legend
Measures
Numeric values Continuous Can perform calculations Come into view as aggregates (as opposed to dimensions, which come into view s themselves)
__________________ is a method for comparing alternative vendor proposals for development of an accounting information system when the vendors differ with regard to their ability to meet the project criteria?
Point scoring is a method for comparing alternative vendor proposals for development of an accounting information system when the vendors differ with regard to their ability to meet the project criteria?
Pre-numbered shipping documents and pre-numbered invoices are examples of ________________.
Pre-numbered shipping documents and pre-numbered invoices are examples of sequence codes.
________________ seeks to determine what is likely to happen in the future.
Predictive analytics seeks to determine what is likely to happen in the future.
____________________ seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible.
Prescriptive analytics seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible.
______________________ are responsible for ensuring that the new system will meet the needs of users.
System analysts are responsible for ensuring that the new system will meet the needs of users.
Which internal control framework is widely accepted as the authority on internal controls?
The COSO integrated control framework is widely accepted as the authority on internal controls.
What is the correct sequence of phases in the systems development life cycle (SDLC)?
The sequence of phases in the SDLC is: system analysis, conceptual design, physical design, implementation and conversion, and operations and maintenance
Split Function
The split function is used to separate data that is concatenated
Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?
The variability feature of big data describes how data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage.
Five Second Test
The viewer should be able to understand what is going on with the dashboard in five seconds or less
Third party providers of publicly available data sets protect the anonymity of individuals in the data set primarily by ___________________________.
Third party providers of publicly available data sets protect the anonymity of individuals in the data set primarily by removing identifiers such as names and social security numbers.
Why are threats to accounting information systems increasing?
Threats to AIS are increasing because many companies do not realize that data security is crucial to their survival.
Which type of question does visual analytics seeks to answer?
Visual analytics seeks to answer "Why is it happening?"
When a request for proposal (RFP) is solicited based on ____________________, total costs are usually lower and less time is required for vendor preparation and company evaluation.
When a request for proposal (RFP) is solicited based on exact hardware and software specifications, total costs are usually lower and less time is required for vendor preparation and company evaluation.
When a subsystem's goals are inconsistent with the goals of another subsystem or with the system as a whole, it creates _______________.
When a subsystem's goals are inconsistent with the goals of another subsystem or with the system as a whole, it creates goal conflict.
Sheets (Tableau)
Where you build visualizations Contains a single view of all the shelves, cards, and legends, as well as a data pane and an analytics pane Not really for data manipulation
Histogram
a chart that displays the shape of a distribution; looks like a bar chart but groups values by continuous measures into ranges or bins
Table Calculation
a secondary tableau operation that is performed on top of a returned result set to create a new or complementary measure a transformation you apply to the values of a single measure in your view based on the dimensions and the level of detail
Story
a sequence of visualizations that work together to convey information -- can work this to create a data narrative, provide context, demonstrate how decisions relate to outcomes, or to simply make a compelling case
Calculations (tableau)
allow you to create new data from data that already exists in your data source as well as perform computations on your data
Understanding customers better has helped Amazon and others become more successful. The understanding comes primarily from
analyzing the vast data amounts routinely collected.
In talking about geographic maps: If the pill is continuous tableau will show a ____________; if it is discrete it will show a ___________________. [palette of distinct colors/color gradient]
color gradient; palette of distinct colors
Using color: for continuous data, __________________ are effective
color ramps
What is not a typical responsibility of an external auditor? a. assisting in the design and implementation of an AIS b. preparation of the company's financial statements c. helping management to improve organizational effectiveness d. all of the options provided are not typical
d. all of the options provided are not typical
BI applications must be integrated with a. databases. b. enterprise systems. c. legacy systems. d. all of the options provided
d. all of the options provided; databases, enterprise systems, and legacy systems
Scatter plot
data visualization used to visualize and compare relationships between two measures (in tableau you need at least one measure on the columns shelf and one measure on the rows shelf)
Anscombe's Quartet
developed to show the importance of graphing data as opposed to applying statistical tests to analyze information (if you just ran regressions and looked at statistics you might think they were all the same, but when you look at them you realize they are all very different)
In talking about geographic maps: A measure on color defaults to a
filled map
In talking about geographic maps: A dimension on colour defaults to a
symbol map