BUAL 5650 Test 1 Review Questions Joonghee Lee

Ace your homework & exams now with Quizwiz!

Which of the following is true regarding a data warehouse? A) A data warehouse stores historical data. B) A data warehouse is an OLTP database. C) Data stored in a data warehouse is unorganized. D) It is application-oriented

A) A data warehouse stores historical data.

Which of the following is a characteristic of NoSQL databases? A) Avoid the rigid schemas of relational model B) Continuous consistency C) Vertical scalability D) None of these

A) Avoid the rigid schemas of relational model

Which of the following is not a reason NoSQL has become a popular solution for organizations? _______ A) Improved ability to keep data consistent B) Handling large volume of data C) Better scalability D) Handling varied datatypes

A) Improved ability to keep data consistent

Which of the following must be true for two tables to be related? _____ A) One table must have a primary key and the other must have a foreign key B) Both tables must have composite keys C) Both tables must have a matching attribute D) None of the above is true

A) One table must have a primary key and the other must have a foreign key

Consistency, in the context of ACID, means that ___________.

At the end of the transaction, all data must be left in a consistent state

Choose all of the incorrect statements regarding SQL and NewSQL databases: _______ A) A NewSQL database is optimized for horizontal scaling out. B) A NewSQL database does not follow the relational data model. C) A SQL database is better than a NewSQL database to handle a large volume of data. D) A SQL and NewSQL database use a same data model.

B) A NewSQL database does not follow the relational data model. C) A SQL database is better than a NewSQL database to handle a large volume of data.

Which of the following is an example of semi-structured data? _______ A) Database data B) Twitter data C) Text data D) Image data

B) Twitter Data

Which of the following is not true regarding a data mart? A) A small version of a data warehouse B) Storing data relevant to a specific business line C) A data mart cannot be an alternative to a data warehouse. D) A data mart can be connected to a data warehouse

C) A data mart cannot be an alternative to a data warehouse.

Which one is NOT TRUE about the distributed computing? A) Parallel processing B) The ability to handle a growing amount of work C) Discontinue in operation after a fault has occurred D) Sharing recourses

C) Discontinue in operation after a fault has occurred

Which one refers to the process of gathering data from multiple sources, organizing it, and placing it into a data warehouse? _______ A) Collect, Transformation, Load B) Extract, Transportation, Load C) Extract, Transformation, Load D) Read, Clean, Write

C) Extract, Transformation, Load

Which of the following is not a data model used in NoSQL databases? _______ A) Key-Value B) Document C) Relational D) Graph

C) Relational

NewSQL databases use which of the following data models? A) Graph model B) Document model C) Relational model D) Key-value model

C) Relational Model

What are the characteristics of a data warehouse? A) Subject-oriented, Integrated, Time-variant, and Volatile B) Subject-oriented, Integrated, Time-invariant, and Non-volatile C) Subject-oriented, Integrated, Time-variant, and Non-volatile D) Application-oriented, Integrated, Time-variant, and Non-volatile

C) Subject-oriented, Integrated, Time-variant, and Non-volatile

In ( ), the structure of that data is strictly defined before any data is stored in the database. A) schema-on-read B) structure-on-store C) schema-on-write D) schema-on-use

C) schema-on-write

Which of the following is an incorrect comparisons of big data types? ______ Structured data Unstructured data A)Well-defined Structure not obvious B)Data in database tables Images, video, text C)Easy to enter, store, and analyze Difficult to analyze D)RDBMS is not a good fit RDBMS is a good fit

D)RDBMS is not a good fit RDBMS is a good fit

What is a repository of data containing all enterprise data in its natural or raw format?

Data Lake

is a federated repository for the data collected by an enterprise's various operational systems.

Data Warehouse

Limited resource and the need to build a data warehouse solution to satisfy an immediate business pain often lead organizations to select a(n) __________approach to satisfy their information needs.

Data mart

A (__________) refers to a technique that allows computers to be connected and work together.

Distributed Computing

is the idea that all of the data will become consistent over time

Eventual Consistency

A Hadoop data lake and data warehouses cannot be used to complement each other. (True / False)

False

A commodity cluster refers to using a larger number of high-performance computers in serial (True / False)

False

A core task in the "data analytics" is building databases (True / False)

False

A data warehouse is a repository for unstructured data. (True / False)

False

Online Transaction Processing (OLTP) supports data analysis rather than data processing. (True / False)

False

Parallel processing is the act of processing one task at a time (True / False)

False

Semi-structure data does not have any organizational property (True / False)

False

distributed processing framework that is widely used for most big data technologies

Hadoop

( ) is used to store data in Hadoop. ( ) holds very large amount of data and provides easier access. To store such huge data, ( ) breaks the file (entire data) into one or more segments and distributes them to different nodes.

Hadoop Distributed File System (HDFS)

______architecture comprises a central data warehouse as well as data marts for business units which are connected.

Hub and Spoke

With (________), the queries and data reside in the server's random access memory (RAM). Because stored data is accessed much more quickly when it is placed in RAM

In-memory database

The high cost of data warehouses limits their use to large companies. As an alternative, many firms use a lower-cost, scaled-down version of a data warehouse referred to as a(n)

Independent data mart

RDBMS stands for

Relational Database Management System

(__________) refers to the capability to sustain a certain level of performance under increasing loads.

Scalability

Atomicity refers to the fact that

Tasks in a transaction must be either completed, or none of them is performed.

An entity is in a relational database is a(n)

Thing that users want to store

Both data warehouse and data lake are a centralized repository to store data (True / False)

True

Data Management is more focused on the acquisition and preparation of data (True / False

True

Data analytics includes the application of statistical modeling tools or other techniques to get information from the data (True / False)

True

Data lakes primarily store raw, unprocessed data, while data warehouse store processed and refined data. (True / False)

True

Volume, velocity, and variety are the three Vs of big data. What is the fourth?

Veracity


Related study sets

Chapter 7 The Therapeutic Response to Angry, Aggressive, Abused, or Abusive Clients

View Set

Chapter 2: Strategy and Technology: Concepts and Frameworks for Achieving Success

View Set

Life Policy Provisions, Riders and Options

View Set

Chapter 3: Running Meetings & Asking Questions

View Set