database final

All of the following are ways to consolidate data EXCEPT: - Application integration - Data rollup and integration - Business process integration - User interaction integration

- Data rollup and integration

The real-time data warehouse is characterized by which of the following? - It accepts batch feeds of transaction data - Data are immediately transformed and loaded into the warehouse - It provides periodic access for the transaction processing systems to an enterprise data warehouse - It is based on Oracle technology

- Data are immediately transformed and loaded into the warehouse

When we consider data in the data warehouse to be time variant, we mean: - That the time of storage varies - Data in the warehouse contain a time dimension so that they may be used to study trends and changes - That there is a time delay between when the data are posted and when we report on the data - That time is relative

- Data in the warehouse contain a time dimension so that they may be used to study trends and changes

Converting data from the format of its source to the format of its destination is called: - Data transformation - Data Loading - Data Scrubbing - Data Storage

- Data transformation

Loading data into the warehouse does NOT involve: - Appending new rows and tables to the data warehouse - Purging data that may have become obsolete or were incorrectly loaded - Formatting the hard drive - Updating rows with new data

- Formatting the hard drive

The level of detail in a fact table determined by the intersection of all components of the primary key, including all foreign keys and any other primary key elements, is called the: - Span - Grain - Selection - Aggregation

- Grain

The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a: - Key-value store - Document store - Wide-column store - Graph oriented database

- Graph oriented database

A dependent data mart: - Is filled with data extracted directly from the operational system. - Is filled exclusively from the enterprise data warehouse with reconciled data. - Is dependent upon an operational system. - Participates in a relationship with an entity.

- Is filled exclusively from the enterprise data warehouse with reconciled data.

The process of combining data from various sources into a single table or view is called: - Extracting - Updating - Selecting - Joining

- Joining
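
As an illustration of joining, here is a minimal Python sketch that combines rows from two sources on a shared key. The tables, column names, and the `inner_join` helper are all made up for this example; a real warehouse would normally do this in SQL.

```python
# Hypothetical source tables, represented as lists of dicts.
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 2, "name": "Grace"},
]
orders = [
    {"order_id": 10, "customer_id": 1, "total": 25.0},
    {"order_id": 11, "customer_id": 2, "total": 40.0},
    {"order_id": 12, "customer_id": 1, "total": 15.0},
]


def inner_join(left, right, key):
    """Combine rows from two sources that share the same key value."""
    # Index the left table by key so each right row finds its matches quickly.
    index = {}
    for row in left:
        index.setdefault(row[key], []).append(row)

    joined = []
    for right_row in right:
        for left_row in index.get(right_row[key], []):
            # Merge the matching rows into one wider row.
            joined.append({**left_row, **right_row})
    return joined


combined = inner_join(customers, orders, "customer_id")
```

Each row of `combined` carries the customer name alongside the order fields, which is the "single table or view" the card describes.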

The NoSQL model that includes a simple pair of a key and an associated collection of values is called a: - Key-value store - Document store - Wide-column store - Graph database

- Key-value store

It is true that in an HDFS cluster the DataNodes are the: - Large number of slaves - Single master servers - Language barriers - Business intelligence

- Large number of slaves

The Hadoop framework consists of the ____ algorithm to solve large-scale problems. - MapSystem - MapReduce - MapCluster - MapComponent

- MapReduce

________ technologies are allowing more opportunities for real-time data warehouses.

RFID

Operational and informational systems are generally separated because of which of the following factors? - A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications. - A properly designed data warehouse decreases value to data. - A separate data warehouse increases contention for resources. - Only operational systems allow SQL statements.

- A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications.

The process of transforming data from a detailed to a summary level is called: - Extracting - Updating - Joining - Aggregating

- Aggregating
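
A small Python sketch of aggregating, assuming hypothetical detail rows with `region` and `amount` fields (both names are invented here). It rolls individual sales up to one total per region.

```python
from collections import defaultdict

# Hypothetical detail rows: one row per individual sale.
detail_rows = [
    {"region": "East", "amount": 100},
    {"region": "West", "amount": 50},
    {"region": "East", "amount": 25},
]


def aggregate_by(rows, group_key, value_key):
    """Roll detail rows up to one summed total per group."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[group_key]] += row[value_key]
    return dict(totals)


summary = aggregate_by(detail_rows, "region", "amount")
```

The three detail rows collapse into two summary rows, one per region.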

Which of the following factors drive the need for data warehousing? - Informational data must be kept together with operational data - Businesses need an integrated view of company information - Data Warehouses generally have better security - None of the above

- Businesses need an integrated view of company information

When using HDFS, the __ node creates new files by communicating with the __ node. - Client; name - Data; name - Data; client - Host; client

- Client; name

Apache Cassandra is a leading producer of _____ NoSQL database management systems. - Key value store - Columnar - Relational - Graph

- Columnar

All of the following are tasks of data cleansing EXCEPT? - Decoding data to make them understandable for data warehousing applications. - Adding time stamps to distinguish values for the same attribute over time. - Generating primary keys for each row of a table. - Creating foreign keys.

- Creating foreign keys.

_____ duplicates data across databases - Data propagation - Data duplication - Redundant replication - A replication server

- Data propagation

An organization that requires a graph database that is highly scalable would select the _____ database management system. - Access - Excel spreadsheet - Neo4j - Redis

- Neo4j

An organization using HDFS realizes that hardware failure is a(n): - Norm - Anomaly - Irregularity - Inconsistency

- Norm

_______ includes concern about data quality issues. - Velocity - Vigilant - Veracity - Variety

- Veracity

The three V's commonly associated with big data include: - Viewable, volume, and variety - Volume, variety, velocity - Verified, variety, and velocity - Vigilant, viewable, and verified

- Volume, variety, velocity

Factless Fact tables may apply when: - We are deleting sales - We are tracking sales - We are taking inventory of a set of possible occurrences - We are deleting correlated data

- We are taking inventory of a set of possible occurrences

_____ is the most popular key-value store NoSQL database management system.

Redis

________ includes the value of speed in a NoSQL database

Velocity

One characteristic of independent data marts is complexity for end users when they need access to data in separate data marts. This complexity is caused by not only having to access data from separate databases, but also from: - The possibility of a new generation of inconsistent data systems, the data marts themselves - Lack of user training - Denormalized data - Incongruent data formats

- The possibility of a new generation of inconsistent data systems, the data marts themselves

The key discovery that triggered the development of data warehousing was: - Computer viruses - New ways to present information using mobile devices - The recognition of the differences between transactional and informational systems - The invention of the iPad

- The recognition of the differences between transactional and informational systems

Every key used to join the fact table with a dimension table should be a _______ key. - Primary - Surrogate - Foreign - Secondary

- Surrogate

_______ includes NoSQL accommodation of various data types. - Velocity - Vigilant - Verified - Variety

- Variety

A data mart is a(n): - Enterprise-wide data warehouse - Smaller system built upon file processing technology - Data warehouse that is limited in scope - Generic on-line shopping site

- Data warehouse that is limited in scope

A star schema contains both fact and _____ tables. - Narrative - Cross functional - Dimension - Starter

- Dimension

Big Data: - Requires a normalized dataset to 3rd Normal Form - Does not require a strictly defined data model. - Requires a strictly defined schema - Requires a normalized dataset to BCNF

- Does not require a strictly defined data model.

Grain and duration have an impact on the size of ________ tables. - Selection - Grain - Fact - Figure

- Fact
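
A rough worked estimate of how grain and duration drive fact table size, sketched in Python. The dimension sizes and the `density` factor (the share of key combinations that actually occur) are hypothetical figures chosen for illustration, and the formula is a simplification.

```python
import math


def estimate_fact_rows(dimension_sizes, density=1.0):
    """Estimate fact rows as the product of dimension sizes times a density.

    density is an assumed fraction of key combinations that really occur.
    """
    return round(math.prod(dimension_sizes) * density)


# Hypothetical retailer: 1,000 stores, 10,000 products, half of them selling
# in any given period at any given store.
monthly_grain = estimate_fact_rows([1000, 10000, 24], density=0.5)   # 24 months
daily_grain = estimate_fact_rows([1000, 10000, 730], density=0.5)    # 730 days
```

Under these assumptions the monthly grain gives 120,000,000 rows and the daily grain gives 3,650,000,000 rows for the same two-year duration, so a finer grain or a longer duration both enlarge the fact table.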

NoSQL focuses on: - Avoidance of replication of data - Minimizing storage space - Normalized data - Flexibility

- Flexibility

Informational and operational data differ in all of the following ways EXCEPT: - Level of detail - Normalization level - Scope of data - Data quality

- Level of detail

Big data requires effectively processing: - A single data type (numeric) - Two data types (text and numeric) - Many data types - A single data type (text)

- Many data types

When using MapReduce, a _____ function takes a collection of data and sorts and filters it into a set of key-value pairs. - Reduce - Map - Data - Block

- Map
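
To make the map step concrete, here is a plain-Python word-count sketch. It imitates the map, shuffle, and reduce stages in memory and does not use the Hadoop API; the function names are invented for this example.

```python
from collections import defaultdict


def map_words(line):
    """Map step: emit a (word, 1) key-value pair for every word in a line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: summarize the values collected for one key."""
    return key, sum(values)


lines = ["big data big", "data node"]
pairs = [pair for line in lines for pair in map_words(line)]
counts = dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())
```

Here `pairs` is the set of key-value pairs produced by the map function, and `counts` is the summarized result after the reduce step.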

The process of transforming data from a detailed to a summary level is called? - Irregularity - Norm - Inconsistency - Normalization

- Normalization

According to your text, NoSQL stands for: - Numbered SQL - No SQL - Not Only SQL - Numeric Only SQL

- Not Only SQL

_____ is an important scripting language to help reduce the complexity of MapReduce. - Pig - Horse - Dog - Cat

- Pig

Data federation is a technique which: - Creates an integrated database from several separate databases - Creates a distributed database - Provides a virtual view of integrated data without actually creating one centralized database - Provides a real-time update of shared data

- Provides a virtual view of integrated data without actually creating one centralized database
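
One way to picture data federation is a view object that reads the source databases on demand and keeps no copy of its own. The sketch below is a toy model in Python; `FederatedView` and the two in-memory "databases" are hypothetical stand-ins.

```python
class FederatedView:
    """Answers queries by reading the sources on demand; it stores no copy."""

    def __init__(self, *sources):
        # Each source is a callable returning that system's current rows.
        self._sources = sources

    def rows(self):
        for source in self._sources:
            yield from source()


# Hypothetical separate databases, modeled as lists of dicts.
crm_db = [{"customer": "Ada", "system": "crm"}]
billing_db = [{"customer": "Ada", "system": "billing"}]

view = FederatedView(lambda: crm_db, lambda: billing_db)
before = list(view.rows())

# A change in a source shows up in the view without any reload step.
billing_db.append({"customer": "Grace", "system": "billing"})
after = list(view.rows())
```

Because the view holds no data, `after` reflects the new billing row immediately, while nothing was ever consolidated into a central store.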

Event driven propagation: - Provides a means to duplicate data for events - Pushes data to duplicate sites as an event occurs - Pulls duplicate data from redundant sites - Triggers a virus

- Pushes data to duplicate sites as an event occurs
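
A minimal Python sketch of the push idea behind event-driven propagation. `PrimarySite` and the plain dicts standing in for duplicate sites are hypothetical; real systems would send the change over a network.

```python
class PrimarySite:
    """Hypothetical primary copy that pushes each write to its replicas."""

    def __init__(self):
        self.data = {}
        self.replicas = []  # plain dicts standing in for duplicate sites

    def write(self, key, value):
        self.data[key] = value
        # The write itself is the event: push the change right away.
        for replica in self.replicas:
            replica[key] = value


primary = PrimarySite()
site_b, site_c = {}, {}
primary.replicas.extend([site_b, site_c])
primary.write("order:1", "shipped")
```

After the single `write` call, both duplicate sites already hold the new value; nothing had to poll or pull.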

Conformed dimensions allow users to do the following: - Delete correlated data - Query across fact tables with consistency - Identify viruses in websites - Fix viruses in HTML documents

- Query across fact tables with consistency

Informational systems are designed for all of the following EXCEPT: - Running a business in real time - Supporting decision making - Complex queries - Data mining

- Running a business in real time

NoSQL systems allow ______, by incorporating servers that can be easily added to the architectural solution. - Scaling down - Scaling out - Scaling up - Scaling over

- Scaling out

When the organization of the data for reporting and analysis is determined at the time the data is used, the approach is called a(n): - Entity relationship diagram - Schema binding - Schema on read - Cognitive schema

- Schema on read
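
A short Python sketch of schema on read, assuming raw JSON strings were stored without any declared structure. The records, field names, and the `read_with_schema` helper are invented for this example.

```python
import json

# Hypothetical raw records, stored as-is with no schema enforced on write.
raw_records = [
    '{"user": "ada", "clicks": "3"}',
    '{"user": "grace", "clicks": "5", "referrer": "search"}',
]


def read_with_schema(raw, schema):
    """Apply a schema (field name -> type converter) at read time."""
    rows = []
    for text in raw:
        record = json.loads(text)
        rows.append(
            {
                field: convert(record[field])
                for field, convert in schema.items()
                if field in record
            }
        )
    return rows


# Two different analyses impose two different structures on the same data.
clicks_view = read_with_schema(raw_records, {"user": str, "clicks": int})
referrer_view = read_with_schema(raw_records, {"referrer": str})
```

The stored data never changes; each reader decides which fields and types it needs when it runs its query.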

It is true that in an HDFS cluster the NameNode is the: - Large number of slaves - Single master server - Language library - Business intelligence

- Single master server

An expanded version of a star schema in which all tables are fully normalized is called a(n): - Snowflake schema - Operational schema - DSS schema - Complete schema

- Snowflake schema

______ focuses on filtering data as it enters the system to determine which data to keep and which to discard. - Scaling up - Feedback loop processing - Stream processing - Scaling out

- Stream processing
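
Stream processing can be sketched in Python as a generator that decides, event by event, what to keep as data arrives. The sensor readings and the temperature threshold below are hypothetical.

```python
def keep_or_discard(stream, should_keep):
    """Yield only the events worth keeping; everything else is dropped."""
    for event in stream:
        if should_keep(event):
            yield event


# Hypothetical incoming sensor readings.
readings = iter(
    [
        {"sensor": "a", "temp": 20},
        {"sensor": "b", "temp": 95},
        {"sensor": "a", "temp": 101},
    ]
)

# Keep only readings above an assumed alert threshold of 90.
kept = list(keep_or_discard(readings, lambda event: event["temp"] > 90))
```

Only the two hot readings are retained; the first reading is discarded as it enters and is never stored.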

Big Data includes: - large volumes of data with many different data types that are processed at very high speeds. - large volumes of entity-relationship diagrams (ERD) with many different data types that are processed at very high speeds. - large volumes of data entry with a single data type processed at very high speeds. - large volumes of entity-relationship diagrams (ERD) with a single data type processed at very high speeds.

- large volumes of data with many different data types that are processed at very high speeds.

