OMIS 472 EXAM 1
Statistics is a rigorous branch of mathematics that deals with: -data mining -visualization -understanding data -linear regression -all of the above
-understanding data
In SAP the program that loads and monitors the data load is called the: DGP DTP DLP DMP none of the above
DTP
Data analytics is a process that involves all of the following EXCEPT: -identifying the problem -gathering relevant data that frequently are not in a usable form -cleaning up the data to make them usable -normalizing the data -manipulating them to discover information that leads to actionable insights -making decisions based on those insights
Normalizing the data
If a key figure is a type quantity, then it must have a: -UoM key -BCD key -quality key -virtual key -none of the above
UoM key
A dimension table provides: -columns for data storage -rows for data storage -a more detailed view of the fact -a and b -none of the above
a more detailed view of the fact
After you have organized and filtered the data, you can then: cluster them sort them slice and dice them creat graphs aggregate them
aggregate them
NLP reflects the beliefs that: -when people are communicating with their computer they should be able to speak as they would.. -the computer should be able to translate the speech into inforation and commands it can... -all of the above -none of the above
all of the above
Tasks within data harmonizing include: data consolidation data cleansing data reformatting all of the above none of the above
all of the above
The enterprise data warehouse (EDW) layer refers to the layers in which data are: -acquired -transformed -stored -all of the above -none of the above
all of the above
The most prominent database management systems for developing relational databases are: -DB2 -MS-SQL -Oracle -all of above -none of above
all of the above
The process of populating the data warehouse and other informational data structures is called: extraction transformation loading all of the above none of the above
all of the above
Transactional systems are designed to process transactions: -quickly -reliably -accurately -all of the above -none of the above
all of the above
When aggregating data you may also perform: sorting filtering ranking all of the above none of the above
all of the above
_____________ can all change over time.: Attributes Texts Hierarchies all of the above none of the above
all of the above
satellites orbiting the earth provide scientists with data regarding: -changes to the environment -weather conditions -popular concentrations -all of the above -none of the above
all of the above
social media include social networking such as: -snapchat -facebook and twitter -reddit and digg -all of the above -none of the above
all of the above
to detect fraud and errors, auditors need to examine: -all of the data within a company's database -all accounting databases and excel spreadsheets -IT systems -none of the above
all of the data within a company's database
When we look at auditing and analysis of internal controls, data from the internal systems are used to: -analyze risk -review policies -find system vulnerabilities -find revenue anomalies -none of the above
analyze risk
Fuzzy logic deals with: raw data normalized data mismatched data approximate values rather than absolutes none of the above
approximate values rather than absolutes
The layer that enables users to access data stored in the warehouse logically and efficiently is... -architected data mart layer -data acquisition layer -persistent staging area -quality and harmonization layer -none of the above
architected data mart layer
Typically data from the web are in _________ format. XML HTML SEC both a and b All of the above
both a and b
one of the most useful aspects of social media is that in addition to analyzing the data: -business can get detailed customer information -police can monitor criminal activity -businesses can evaluate the effectiveness of their operations -businesses can evaluate the effectiveness of their marketing strategies -none of the above
businesses can evaluate the effectiveness of their marketing strategies
Time-dependent hierarchies: -change over time -are static -are dependent on other hierarchies -none of the above
change over time
A bubble chart uses: bullet points to represent data points in a chart. colored lines to represent data points in a chart. dynamic bubbles to represent data points in a chart. circle shapes to represent data points in a chart. none of the above
circle shapes to represent data points in a chart.
Identifying specific data values within the dataset requires not only sorting and ranking the data, but also a process known as: conditional massaging. conditional formatting. non-conditional formatting. non-conditional massaging. aggregation
conditional formatting
Flat files are utilized to: -consolidate or synchronize data -visualize data -create graphs -store programs -none of the above
consolidate or synchronize data
__________ refers to the total number of values in the list. Sum MAX Average Min Count
count
A pivot table creates a: crosstab structure. summarized database. static references. clustered data. none of the above.
crosstab structure
A layered scalable architecture (LSA) is a flexible framework for: -data acquisition -storage -retrieval -all of the above -none of the above
data acquisition
The database management system (DBMS) resides at the: -data services tier -business logic tier -programming tier -presentation tier -none of the above
data services tier
data for analysis are temporarily stored in a: -data staging area -temp file -relational database -data modeling cube -none of the above
data staging area
The process of modeling, implementing and managing the data warehouse is called: -data modeling -warehouse modeling -data warehousing -data management -none of the above
data warehousing
Consolidation is sometimes called: data consolidation data wrangling data deduping data encryption none of the above
data wrangling
sensor data are the data gathered from all of the following devices except: -heating units -vehicles -databases -electrical transformers -airplanes
databases
OLAP systems often use _____ structures: -denormalized database -relational database -flat file system -temp file -none of the above
denormalized database
sampling is the act of: -extracting only certain data value from a dataset -hitting various websites for information -asking questions of a few out of many -all of the above -none of the above
extracting only certain data value from a dataset
The process of identifying data sources and source fields and acquiring, or sourcing, the data required for analysis is called: Transformation Loading extraction none of the above all of the above
extraction
The first step in ETL is: data transformation data integration extractoin of data none of the above all of the above
extraction of data
Often an analysis of large datasets requires both sorting and: filtering. aggregating. normalizing. de-normalizing. none of the above
filtering
Whereas sorting lists fields in ascending or descending order, ____________ displays only select rows, regardless of order. filtering conditional formatting aggregating massaging none of the above
filtering
One use of transformational programming is: to transform fuzzy data to harmonize raw data for data acquired from web sources. normalize databases none of the above
for data acquired from the web sources
Usually, the data contained informational systems are: -dispursed -structured -fully integrated -all of the above -none of the above
fully integrated
The acronym GIGO means: greater input, greater output givin input, given output garbage-in, garbage-out none of the above
garbage-in, garbage-out
Data reformatting involves converting a data field from one format to another with the goal of: normalizing it harmonizing it deduping it transforming it none of the above
harmonizing it
Source systems that are transactional systems are: -highly effective at removing data duplication at the source because they are normalized. -ineffective at removing data duplication at the source because they are normalized. -highly effective at removing data duplication at the source because they are not normalized. -ineffective at removing data duplication at the source because they are not normalized.
highly effective at removing data duplication at the source because they are normalized.
A typical use of social media data is to: -identify consumer buying habits, likes and dislikes -bring down foreign governments -communicate privately with friends around the world -tell everyone what you are doing every minute of the day -none of the above
identify consumer buying habits, like and dislikes
Law enforcement uses data analysis to: -the visualization -the dashboard -the infographics -the columns and rows -the dataset -catch red-light runners -identify patterns of crime -help with department budgets -all of the above
identify patterns of crime
Noise reduction: increases the signal-to-noise ratio decreases the signal-to-noise ratio masks the signal-to-noise ratio all of the above none of the above
increases the signal-to-noise ratio
A delta load is also known as an: full load partial load random load incremental load none of the above
incremental load
Enterprise resource planning (ERP) systems are: -integrated OLAP systems that enable all.. -integrated database systems that enable all.. -integrated transactional systems that enable all.. -all of the above -none of the above
integrated transactional systems that enable all the functional ...
The primary key of the fact table: holds character based data is encrypted is a composite of the foreign keys all of the above none of the above
is a composite of the foreign keys
The most common approach to NULL values is to: correct the null values add more data delete them leave them as they are none of the above
leave them as they are
Demand forecasting is the core of: -retail -manufacturing -supply chain -customer service
manufacturing
OLTP systems are accessed by: -database systems -transactional systems -many, perhaps thousands, of users at the same time -web sites -none of the above
many, perhaps thousands, of users at the same time
The main goal of the transformation phase is to: map and harmonize data from multiple sources store the data harmonize the data transform and normalize data none of the above
map and harmonize data from multiple sources
The structure of a database is called a data: -cube -structure -network -model -all of the above
model
A data flow diagram (DFD) is used to: -model the flow of data from one object to another -diagram the virtual layer -diagram the reporting layer -visualize the transformation layer -none of the above
model the flow of data from one object to another
From the data warehouse, data are pushed to the ____ for data storage -transactional table -multidimensional model -OLAP area -none of the above
multidimensional model
A key characteristic of web services is that they have: -a robust interface -lots of hits -a web crawler -no user interface -none of the above
no user interface
A _________ is a series of rule-based schedules of data extracts and loading. schedule chain value chain rule schedule process rule none of the above
none of the above
One of the advantages a PowerPivot has over a standard pivot table is: it allows you to create graphs it's faster it allows you to use colors on rows or columns it allows you export to databases none of the above
none of the above
The federal government collects census data every: -year with tax returns -5 years -7 years -9 years -none of the above
none of the above
The most common model for storing data in a cube for analytics is the: -multidimensional model -transformational model -dimensional model -exceptional model -none of the above
none of the above
Transactional systems generally are configured to use a three-tiered architecture that consists of: -the user interface or presentation tier -the business service or business logic tier -the operations services and programming tier -all of the above -none of the above
none of the above
When they are extracted, they are stored in a staging area, sometimes called a: LSA CBT LSL FRG none of the above
none of the above
___________ displays only select rows, regardless of order. Formatting Clustering Sorting Aggregation none of the above
none of the above
a rogue load occurs during the: -data staging process -modeling process -data loading process -denormalizing process -none of the above
none of the above
the number of dimensions depends on: -the star structure -the size of the data warehouse -the data mart -the transformation layer -none of the above
none of the above
To eliminate anomalies, the database tables need to be: -syncronized -synthasized -massaged -normalized -all of the above
normalized
ODS stands for: odd data store operational data store open data store omni data store none of the above
operational data store
Data provisioning is the process of: -scrubbing the data -providing data to outside organizations -providing users and systems with access to data -all of above -none of above
providing users and systems with access to data
An interval-dependent hierarchy has nodes that depend on: -specific values -range of values -singualar data points -transactional data -all of the above
range of values
denormalized data bases in OLAP systems are: -read only -read/write -/read/write/update -write only -update only
read only
Throughout the loading process, ______________ constraints must be met referential integrity harmonization data loading encryption f. trigger
referential integrity
Web crawlers: -hack websites -create denials of attack modify web pages -search web sites one page at a time for information -all of the above -none of the above
search web sites one page at a time for information
Exception filtering entails: showing error exceptions. showing all data values that you have selected not to display. showing filtered exceptions showing all data values except those you have selected not to display. none of the above
showing all data values except those you have selected not to display.
The addition of the new linked dimension table is called: -data modeling -linked lists -snowflaking -busting -none of the above
snowflaking
Ranking utilizes a combination of filtering and: aggregating normalizing de-normalizing sorting none of the above
sorting
The snowflake schema creates ___________ for language-dependent fields. special tables binary temp files surroagate ID's none of the above
special tables
The most commonly used tool for slicing and dicing is: databases spreadsheets text processors OLAP none of the above
spreadsheets
We use a _______ table to map the alphanumeric master data primary key to the numeric characteristic. -static -binary -relational -surrogate ID -none of the above
surrogate ID
Probable the most well-known method for dealing with unstructured data is called: -normalizing -tagging -coding data -none of the above
tagging
The first step in the data warehousing process involves identifying the: -the data staging area -the data warehouse -the data mare (cube) -analytic tools -the source systems
the source systems
Insert anomalies result when: -the insert key hasn't been used properly -the customer record can't be found -there is no place within to store the new data -all of the above -none of the above
there is no place within the table to store the new data
the data package dimension is the : -first dimension -second dimension -third dimension -fourth dimension -none of the above
third dimension
clickstream analysis is the process of collecting and analyzing data about: -people who logged onto the website -website visitors' mouse clicks -the effectiveness of their marketing -none of the above
website visitors' mouse clicks
Of the four interactions, __ has no impact on the table. -write -read -edit -delete
-read
Replication ensures that the source data: -are encrypted and transferred properly -are copied using bit transfer -remain intact -are validated -none of the above
-remain intact
During the provisioning process data from a source system are: -replicated, or copied -analyzed and scrubbed -presented in a graphical format -all of the above -none of the above
-replicated, or copied
Small business owners in particular, need only the data in: -an excel spreadsheet to understand.. -their electronic cash register -their MS-Access database -the POS to understand their business -none of the above
-the POS to understand their business
Data scientists are specialists in: -mathematics -computer science -statistics -mathematics and statistics -math, computer science, statistics
-all of the above
A common business use of spreadsheets is for: -creating web pages -editing pictures -manipulating data -budgeting -all of above
-budgeting
The definition of the data and their relationships is called: -data staging -data processing -data modeling -data massaging -none of the above
-data modeling
Structured data are based on: -transactional records -flat files -hierarchical data -data models
-data models
Most businesses create forecasts and budgets based on: -gut instincts -historical data -knowledge of the business environment -seasonality -gut instincts and historical data
-gut instincts and historical data
Out-of-date computer systems are referred to as: -legacy systems -OLAP systems -transactional systems -none of the above
-legacy systems
exception reporting is ___ tool for managers and other individuals who are responsible for... -a common -an uncommon -not a useful -a legacy -none of the above
a common
Data science is the intersection of ALL of the following EXCEPT: -computer science -statistics -domain knowledge -algorithms
Algorithms
Analysts can collect unstructured data and tag them to derive their meaning for the computer using: -Python -HTML -XML or XBRL -NPL
XML or XBRL
When the dataset contains multiple fields, measures, and dimensions to be analyzed: each one can have a different sort order. the sort order is applied to one field at a time. the sort order is applied to all fields at the same time. each one can have the same sort order. a & b above b & d above
a & b above
