OMIS 472 EXAM 1

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

Statistics is a rigorous branch of mathematics that deals with: -data mining -visualization -understanding data -linear regression -all of the above

-understanding data

In SAP the program that loads and monitors the data load is called the: DGP DTP DLP DMP none of the above

DTP

Data analytics is a process that involves all of the following EXCEPT: -identifying the problem -gathering relevant data that frequently are not in a usable form -cleaning up the data to make them usable -normalizing the data -manipulating them to discover information that leads to actionable insights -making decisions based on those insights

Normalizing the data

If a key figure is a type quantity, then it must have a: -UoM key -BCD key -quality key -virtual key -none of the above

UoM key

A dimension table provides: -columns for data storage -rows for data storage -a more detailed view of the fact -a and b -none of the above

a more detailed view of the fact

After you have organized and filtered the data, you can then: cluster them sort them slice and dice them creat graphs aggregate them

aggregate them

NLP reflects the beliefs that: -when people are communicating with their computer they should be able to speak as they would.. -the computer should be able to translate the speech into inforation and commands it can... -all of the above -none of the above

all of the above

Tasks within data harmonizing include: data consolidation data cleansing data reformatting all of the above none of the above

all of the above

The enterprise data warehouse (EDW) layer refers to the layers in which data are: -acquired -transformed -stored -all of the above -none of the above

all of the above

The most prominent database management systems for developing relational databases are: -DB2 -MS-SQL -Oracle -all of above -none of above

all of the above

The process of populating the data warehouse and other informational data structures is called: extraction transformation loading all of the above none of the above

all of the above

Transactional systems are designed to process transactions: -quickly -reliably -accurately -all of the above -none of the above

all of the above

When aggregating data you may also perform: sorting filtering ranking all of the above none of the above

all of the above

_____________ can all change over time.: Attributes Texts Hierarchies all of the above none of the above

all of the above

satellites orbiting the earth provide scientists with data regarding: -changes to the environment -weather conditions -popular concentrations -all of the above -none of the above

all of the above

social media include social networking such as: -snapchat -facebook and twitter -reddit and digg -all of the above -none of the above

all of the above

to detect fraud and errors, auditors need to examine: -all of the data within a company's database -all accounting databases and excel spreadsheets -IT systems -none of the above

all of the data within a company's database

When we look at auditing and analysis of internal controls, data from the internal systems are used to: -analyze risk -review policies -find system vulnerabilities -find revenue anomalies -none of the above

analyze risk

Fuzzy logic deals with: raw data normalized data mismatched data approximate values rather than absolutes none of the above

approximate values rather than absolutes

The layer that enables users to access data stored in the warehouse logically and efficiently is... -architected data mart layer -data acquisition layer -persistent staging area -quality and harmonization layer -none of the above

architected data mart layer

Typically data from the web are in _________ format. XML HTML SEC both a and b All of the above

both a and b

one of the most useful aspects of social media is that in addition to analyzing the data: -business can get detailed customer information -police can monitor criminal activity -businesses can evaluate the effectiveness of their operations -businesses can evaluate the effectiveness of their marketing strategies -none of the above

businesses can evaluate the effectiveness of their marketing strategies

Time-dependent hierarchies: -change over time -are static -are dependent on other hierarchies -none of the above

change over time

A bubble chart uses: bullet points to represent data points in a chart. colored lines to represent data points in a chart. dynamic bubbles to represent data points in a chart. circle shapes to represent data points in a chart. none of the above

circle shapes to represent data points in a chart.

Identifying specific data values within the dataset requires not only sorting and ranking the data, but also a process known as: conditional massaging. conditional formatting. non-conditional formatting. non-conditional massaging. aggregation

conditional formatting

Flat files are utilized to: -consolidate or synchronize data -visualize data -create graphs -store programs -none of the above

consolidate or synchronize data

__________ refers to the total number of values in the list. Sum MAX Average Min Count

count

A pivot table creates a: crosstab structure. summarized database. static references. clustered data. none of the above.

crosstab structure

A layered scalable architecture (LSA) is a flexible framework for: -data acquisition -storage -retrieval -all of the above -none of the above

data acquisition

The database management system (DBMS) resides at the: -data services tier -business logic tier -programming tier -presentation tier -none of the above

data services tier

data for analysis are temporarily stored in a: -data staging area -temp file -relational database -data modeling cube -none of the above

data staging area

The process of modeling, implementing and managing the data warehouse is called: -data modeling -warehouse modeling -data warehousing -data management -none of the above

data warehousing

Consolidation is sometimes called: data consolidation data wrangling data deduping data encryption none of the above

data wrangling

sensor data are the data gathered from all of the following devices except: -heating units -vehicles -databases -electrical transformers -airplanes

databases

OLAP systems often use _____ structures: -denormalized database -relational database -flat file system -temp file -none of the above

denormalized database

sampling is the act of: -extracting only certain data value from a dataset -hitting various websites for information -asking questions of a few out of many -all of the above -none of the above

extracting only certain data value from a dataset

The process of identifying data sources and source fields and acquiring, or sourcing, the data required for analysis is called: Transformation Loading extraction none of the above all of the above

extraction

The first step in ETL is: data transformation data integration extractoin of data none of the above all of the above

extraction of data

Often an analysis of large datasets requires both sorting and: filtering. aggregating. normalizing. de-normalizing. none of the above

filtering

Whereas sorting lists fields in ascending or descending order, ____________ displays only select rows, regardless of order. filtering conditional formatting aggregating massaging none of the above

filtering

One use of transformational programming is: to transform fuzzy data to harmonize raw data for data acquired from web sources. normalize databases none of the above

for data acquired from the web sources

Usually, the data contained informational systems are: -dispursed -structured -fully integrated -all of the above -none of the above

fully integrated

The acronym GIGO means: greater input, greater output givin input, given output garbage-in, garbage-out none of the above

garbage-in, garbage-out

Data reformatting involves converting a data field from one format to another with the goal of: normalizing it harmonizing it deduping it transforming it none of the above

harmonizing it

Source systems that are transactional systems are: -highly effective at removing data duplication at the source because they are normalized. -ineffective at removing data duplication at the source because they are normalized. -highly effective at removing data duplication at the source because they are not normalized. -ineffective at removing data duplication at the source because they are not normalized.

highly effective at removing data duplication at the source because they are normalized.

A typical use of social media data is to: -identify consumer buying habits, likes and dislikes -bring down foreign governments -communicate privately with friends around the world -tell everyone what you are doing every minute of the day -none of the above

identify consumer buying habits, like and dislikes

Law enforcement uses data analysis to: -the visualization -the dashboard -the infographics -the columns and rows -the dataset -catch red-light runners -identify patterns of crime -help with department budgets -all of the above

identify patterns of crime

Noise reduction: increases the signal-to-noise ratio decreases the signal-to-noise ratio masks the signal-to-noise ratio all of the above none of the above

increases the signal-to-noise ratio

A delta load is also known as an: full load partial load random load incremental load none of the above

incremental load

Enterprise resource planning (ERP) systems are: -integrated OLAP systems that enable all.. -integrated database systems that enable all.. -integrated transactional systems that enable all.. -all of the above -none of the above

integrated transactional systems that enable all the functional ...

The primary key of the fact table: holds character based data is encrypted is a composite of the foreign keys all of the above none of the above

is a composite of the foreign keys

The most common approach to NULL values is to: correct the null values add more data delete them leave them as they are none of the above

leave them as they are

Demand forecasting is the core of: -retail -manufacturing -supply chain -customer service

manufacturing

OLTP systems are accessed by: -database systems -transactional systems -many, perhaps thousands, of users at the same time -web sites -none of the above

many, perhaps thousands, of users at the same time

The main goal of the transformation phase is to: map and harmonize data from multiple sources store the data harmonize the data transform and normalize data none of the above

map and harmonize data from multiple sources

The structure of a database is called a data: -cube -structure -network -model -all of the above

model

A data flow diagram (DFD) is used to: -model the flow of data from one object to another -diagram the virtual layer -diagram the reporting layer -visualize the transformation layer -none of the above

model the flow of data from one object to another

From the data warehouse, data are pushed to the ____ for data storage -transactional table -multidimensional model -OLAP area -none of the above

multidimensional model

A key characteristic of web services is that they have: -a robust interface -lots of hits -a web crawler -no user interface -none of the above

no user interface

A _________ is a series of rule-based schedules of data extracts and loading. schedule chain value chain rule schedule process rule none of the above

none of the above

One of the advantages a PowerPivot has over a standard pivot table is: it allows you to create graphs it's faster it allows you to use colors on rows or columns it allows you export to databases none of the above

none of the above

The federal government collects census data every: -year with tax returns -5 years -7 years -9 years -none of the above

none of the above

The most common model for storing data in a cube for analytics is the: -multidimensional model -transformational model -dimensional model -exceptional model -none of the above

none of the above

Transactional systems generally are configured to use a three-tiered architecture that consists of: -the user interface or presentation tier -the business service or business logic tier -the operations services and programming tier -all of the above -none of the above

none of the above

When they are extracted, they are stored in a staging area, sometimes called a: LSA CBT LSL FRG none of the above

none of the above

___________ displays only select rows, regardless of order. Formatting Clustering Sorting Aggregation none of the above

none of the above

a rogue load occurs during the: -data staging process -modeling process -data loading process -denormalizing process -none of the above

none of the above

the number of dimensions depends on: -the star structure -the size of the data warehouse -the data mart -the transformation layer -none of the above

none of the above

To eliminate anomalies, the database tables need to be: -syncronized -synthasized -massaged -normalized -all of the above

normalized

ODS stands for: odd data store operational data store open data store omni data store none of the above

operational data store

Data provisioning is the process of: -scrubbing the data -providing data to outside organizations -providing users and systems with access to data -all of above -none of above

providing users and systems with access to data

An interval-dependent hierarchy has nodes that depend on: -specific values -range of values -singualar data points -transactional data -all of the above

range of values

denormalized data bases in OLAP systems are: -read only -read/write -/read/write/update -write only -update only

read only

Throughout the loading process, ______________ constraints must be met referential integrity harmonization data loading encryption f. trigger

referential integrity

Web crawlers: -hack websites -create denials of attack modify web pages -search web sites one page at a time for information -all of the above -none of the above

search web sites one page at a time for information

Exception filtering entails: showing error exceptions. showing all data values that you have selected not to display. showing filtered exceptions showing all data values except those you have selected not to display. none of the above

showing all data values except those you have selected not to display.

The addition of the new linked dimension table is called: -data modeling -linked lists -snowflaking -busting -none of the above

snowflaking

Ranking utilizes a combination of filtering and: aggregating normalizing de-normalizing sorting none of the above

sorting

The snowflake schema creates ___________ for language-dependent fields. special tables binary temp files surroagate ID's none of the above

special tables

The most commonly used tool for slicing and dicing is: databases spreadsheets text processors OLAP none of the above

spreadsheets

We use a _______ table to map the alphanumeric master data primary key to the numeric characteristic. -static -binary -relational -surrogate ID -none of the above

surrogate ID

Probable the most well-known method for dealing with unstructured data is called: -normalizing -tagging -coding data -none of the above

tagging

The first step in the data warehousing process involves identifying the: -the data staging area -the data warehouse -the data mare (cube) -analytic tools -the source systems

the source systems

Insert anomalies result when: -the insert key hasn't been used properly -the customer record can't be found -there is no place within to store the new data -all of the above -none of the above

there is no place within the table to store the new data

the data package dimension is the : -first dimension -second dimension -third dimension -fourth dimension -none of the above

third dimension

clickstream analysis is the process of collecting and analyzing data about: -people who logged onto the website -website visitors' mouse clicks -the effectiveness of their marketing -none of the above

website visitors' mouse clicks

Of the four interactions, __ has no impact on the table. -write -read -edit -delete

-read

Replication ensures that the source data: -are encrypted and transferred properly -are copied using bit transfer -remain intact -are validated -none of the above

-remain intact

During the provisioning process data from a source system are: -replicated, or copied -analyzed and scrubbed -presented in a graphical format -all of the above -none of the above

-replicated, or copied

Small business owners in particular, need only the data in: -an excel spreadsheet to understand.. -their electronic cash register -their MS-Access database -the POS to understand their business -none of the above

-the POS to understand their business

Data scientists are specialists in: -mathematics -computer science -statistics -mathematics and statistics -math, computer science, statistics

-all of the above

A common business use of spreadsheets is for: -creating web pages -editing pictures -manipulating data -budgeting -all of above

-budgeting

The definition of the data and their relationships is called: -data staging -data processing -data modeling -data massaging -none of the above

-data modeling

Structured data are based on: -transactional records -flat files -hierarchical data -data models

-data models

Most businesses create forecasts and budgets based on: -gut instincts -historical data -knowledge of the business environment -seasonality -gut instincts and historical data

-gut instincts and historical data

Out-of-date computer systems are referred to as: -legacy systems -OLAP systems -transactional systems -none of the above

-legacy systems

exception reporting is ___ tool for managers and other individuals who are responsible for... -a common -an uncommon -not a useful -a legacy -none of the above

a common

Data science is the intersection of ALL of the following EXCEPT: -computer science -statistics -domain knowledge -algorithms

Algorithms

Analysts can collect unstructured data and tag them to derive their meaning for the computer using: -Python -HTML -XML or XBRL -NPL

XML or XBRL

When the dataset contains multiple fields, measures, and dimensions to be analyzed: each one can have a different sort order. the sort order is applied to one field at a time. the sort order is applied to all fields at the same time. each one can have the same sort order. a & b above b & d above

a & b above


Ensembles d'études connexes

PrepU Chp 28: Assessment of Hematologic Function and Treatment Modalities

View Set

Origins and Insertions (Abductor Pollicis Longus)

View Set

Business Management II - VB Management Reading

View Set