Data Management Final

Lakukan tugas rumah & ujian kamu dengan baik sekarang menggunakan Quizwiz!

True

A person is an example of an entity. True/False

drill down.

Going from a summary view to progressively lower levels of detail is called data:

data replication.

A form of denormalization where the same data are purposely stored in multiple places in the database is called:

incremental

A method of capturing only the changes that have occurred in the source data since the last capture is called ________ extract.

many-to-many

A student can attend five classes, each with a different professor. Each professor has 30 students. The relationship of students to professors is a ________ relationship.

unplanned duplicate data files are the rule rather than the exception.

Because applications are often developed independently in file processing systems:

large volumes of data with many different data types that are processed at very high speeds.

Big data includes:

many data types.

Big data requires effectively processing:

Physical design

DDL is typically used during which phase of the development process?

tables

Data is represented in the form of:

SQL commands (Insert/Update).

Data may be loaded from the staging area into the warehouse by following:

data characteristics are important in the design of programs and other systems components.

Data modeling may be the most important part of the systems development process because:

True

Descriptive analytics answers the question: "What happened yesterday?" True/False

True

Descriptive analytics is the oldest form of analytics. True/False

True

Enforcement of business rules can be automated through the use of software that can interpret the rules and enforce them. True/False

True

File processing systems have been replaced by database systems in most critical business applications today. True/False

True

Human intervention is an important part of big data analytics. True/False

True

Information is processed data. True/False

False

OLAP, ROLAP, and TLAP are tools commonly used to load data into intermediate hypercube structures. True/False

relation.

Relational databases establish the relationships between entities by means of common fields included in a file called a(n):

informational processing.

The analysis of data or information to support decision making is called:

descriptive analytics.

The oldest form of analytics is:

True

Using data to predict events is an example of predictive analytics. True/False

Relates to a technical characteristic of the system

Which of the following is NOT a good characteristic of a data name?

Business rules

Which of the following is a component of processing logic?

Structured query language (SQL)

________ is a tool even non-programmers can use to access information from a database.

causes two tables with a common domain to be combined into a single table or view.

A join operation:

constraint

A rule that CANNOT be violated by database users is called a:

dimension

A star schema contains both fact and ________ tables.

API

A(n) ________ is a set of application routines that programs use to direct the performance of procedures by the computer's operating system.

Not Only SQL.

According to your text, NoSQL stands for:

personal finances.

All of the following are applications for big data and analytics EXCEPT:

efficiency.

An advantage of partitioning is:

snowflake schema.

An expanded version of a star schema in which all of the tables are fully normalized is called a(n):

authorization rules.

Controls designed to restrict access and activities are called:

data transformation.

Converting data from the format of its source to the format of its destination is called:

True

Customers leave clues about their preferences when navigating a company's Web site. True/False

entities.

Customers, cars, and parts are examples of:

flexibility.

NoSQL focuses on:

the possibility of a new generation of inconsistent data systems, the data marts themselves.

One characteristic of independent data marts is complexity for end users when they need to access data in separate data marts. This complexity is caused by not only having to access data from separate databases, but also from:

government.

One major source of data for analytics is:

False

Predictive analytics answers the question: "How can we make it happen?" True/False

True

Smartphones can produce millions of observations per second making them Business Intelligence and Analytics 3.0. True/False

concurrency control

The actions that must be taken to ensure data integrity is maintained during multiple simultaneous transactions are called ________ actions.

FROM

The first part of an SQL query to be read is the ________ statement.

degree.

The number of entity types that participate in a relationship is called the:

True

The three major types of analytics are: descriptive, predictive, and prescriptive. True/False

How many products have product descriptions in the Product Table?

Which of the following questions is answered by the SQL statement? SELECT COUNT (Product_Description) FROM Product_T;

pointer

A(n) ________ is a field of data used to locate a related field or record.

creating foreign keys.

All of the following are tasks of data cleansing EXCEPT:

BLOB

An appropriate datatype for adding a sound clip would be:

as short as possible.

An entity type name should be all of the following EXCEPT:

location on disk.

Metatdata typically describes all of the following EXCEPT:

Inconsistent

Which of the following is NOT a characteristic of a good business rule?

Redis

________ is the most popular key-value store NoSQL database management system.

extra space and update time.

A disadvantage of partitioning is:

data model.

A graphical system used to capture the nature and relationships among data is called a(n):

outer join.

A join in which rows that do not have matching values in common columns are still included in the result table is called a(n):

True

A join in which the joining condition is based on equality between values in the common column is called an equi-join. True/False

the data is unstructured in big data systems.

A major difference between data warehouses and transactional systems as compared to big data is:

True

A natural join is the same as an equi-join, except that it is performed over matching columns that have been defined with the same name, and one of the duplicate columns is eliminated. True/False

attributes.

A person's name, birthday, and social security number are all examples of:

entity

A person, place, object, event, or concept about which the organization wishes to maintain data is called a(n):

composite key.

A primary key that consists of more than one attribute is called a:

enterprise key.

A primary key whose value is unique across all relations is called a(n):

attribute.

A property or characteristic of an entity type that is of interest to the organization is called a(n):

unary

A relationship between the instances of a single entity type is called a ________ relationship.

mandatory one

A relationship where the minimum and maximum cardinality are both one is a(n) ________ relationship.

explanatory

A researcher trying to explain why sales of garden supplies in Hawaii have decreased would be an example of ________ data mining.

referential integrity constraint.

A rule that states that each foreign key value must match a primary key value in the other relation is called the:

transaction.

A sequence of steps that constitute a well-defined business activity is called a:

ternary

A simultaneous relationship among the instances of three entity types is called a ________ relationship.

data scrubbing.

A technique using artificial intelligence to upgrade the quality of raw data is called:

data scrubbing.

A technique using pattern recognition to upgrade the quality of raw data is called:

any changes in the database are not revealed to the user until the transaction is complete.

A transaction is considered isolated when:

time stamp.

A value that indicates the date or time of a data value is called a:

horizontal partitioning.

A(n) ________ is a field of data used to locate a related field or record.

exclusive lock

A(n) ________ prevents another transaction from reading and therefore updating a record until it is unlocked.

trash collection.

All of the following are categorizations of human activity affected by big data analytics EXCEPT:

it is often more expedient to build a data mart than a data warehouse.

All of the following are limitations of the independent data mart EXCEPT:

use a lot of storage space.

All of the following are objectives when selecting a data type EXCEPT:

Persistence.

All of the following are well-accepted characteristics of transactions EXCEPT:

descriptive analytics.

Allowing users to dive deeper into the view of data with online analytical processing (OLAP) is an important part of:

veracity and value.

Although volume, variety, and velocity are considered the initial three v dimensions, two additional Vs of big data were added and include:

refresh mode.

An approach to filling a data warehouse that employs bulk rewriting of the target data periodically is called:

CHAR.

An appropriate datatype for one wanting a fixed-length type for last name would include:

primary key.

An attribute (or attributes) that uniquely identifies each row in a relation is called a:

composite

An attribute that can be broken down into smaller parts is called a(n) ________ attribute.

required attribute.

An attribute that must be present for every entity (or relationship) instance is a(n):

associative entity.

An entity that associates the instances of one or more entity types and contains attributes peculiar to the relationships is called a(n):

False

An equi-join is a join in which one of the duplicate columns is eliminated in the result table. True/False

periodic

Data that are never physically altered once they are added to the store are called ________ data.

metadata

Data that describe the properties of other data are:

transaction volumes.

Database access frequencies are estimated from:

design web pages.

Database management involves all of the following EXCEPT:

False

Databases were developed as the first application of computers to data processing. True/False

prescriptive

Descriptive, predictive, and ________ are the three main types of analytics.

once a transaction is committed, no subsequent failure of the database can reverse the effect of the transaction.

Durability means that:

surrogate

Every key used to join the fact table with a dimension table should be a ________ key.

fact

Grain and duration have a direct impact on the size of ________ tables.

when different categories of a table's rows are processed separately.

Horizontal partitioning makes sense:

two

In an E-R diagram, there are/is ________ business rule(s) for every relationship.

WHERE

In an SQL statement, which of the following parts states the conditions for row selection?

provide rapid random and sequential access to base-table data.

Indexes are created in most RDBMSs to:

level of detail.

Informational and operational data differ in all of the following ways EXCEPT:

running a business in real time.

Informational systems are designed for all of the following EXCEPT:

True

Joining tables or using a subquery may produce the same result. True/False

True

Many developing countries are using advanced applications of analytics to utilize data collected from mobile devices. True/False

key-value store.

NoSQL systems allow ________ by incorporating commodity servers that can be easily added to the architectural solution.

scaling out

NoSQL systems allow ________ by incorporating commodity servers that can be easily added to the architectural solution.

A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications.

Operational and informational systems are generally separated because of which of the following factors?

referential integrity.

Requiring a CustomerID to exist in a Primary table before it can exist in a foreign table is an example of:

wide-column store.

The NoSQL model that incorporates 'column families' is called a:

graph-oriented database.

The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a:

subject-oriented.

The characteristic that indicates that a data warehouse is organized around key high-level entities of the enterprise is:

no primary key attribute can be null.

The entity integrity rule states that:

subquery.

The following code is an example of a: SELECT CustomerName, CustomerAddress, CustomerCity, CustomerState, CustomerPostalCode FROM Customer_T WHERE Customer_T.CustomerID = (SELECT Order_T.CustomerID FROM Order_T WHERE OrderID = 1008);

only rows that match both Customer_T and Order_T Tables.

The following code would include: SELECT Customer_T.CustomerID, Order_T.CustomerID, CustomerName, OrderID FROM Customer_T, Order_T WHERE Customer_T.CustomerID = Order_T. CustomerID;

joining.

The process of combining data from various sources into a single table or view is called:

Data are immediately transformed and loaded into the warehouse.

The real-time data warehouse is characterized by which of the following?

Data Administration.

The role that is responsible for developing overall policies and procedures for database security is:

field.

The smallest unit of application data recognized by system software is a:

volume, variety, and velocity.

The three 'v's' commonly associated with big data include:

default value.

The value a field will assume unless the user enters an explicit value for an instance of that field is called a:

DISTINCT

To eliminate duplicate rows in a query, the ________ qualifier is used in the SQL Select command.

Selects all the fields from the Customer table for each row with a customer labeled "Best"

What does the following SQL statement do? SELECT * From Customer WHERE Cust_Type = "Best"

The Customer_Name and telephone of all customers living in either Boston, New York or Denver

What result set is returned from the following query?SELECT Customer_Name, telephone FROM customers WHERE city in ('Boston','New York','Denver');

The Item_No of all orders that had more than 10 items

What result set will the following query return? SELECT Item_No FROM Order_V WHERE quantity > 10;

The Item_No and description for all items weighing between 101 and 199

What result set will the following query return? SELECT Item_No, description FROM item WHERE weight > 100 and weight < 200;

The average Standard_Price of all products in Product_V

What result will the following SQL statement produce? SELECT Avg(Standard_Price) as average from Product_V;

The total price of all products that are of type wood

What results will be produced by the following SQL query? SELECT SUM(Standard_Price) as Total_Price from Product_V where Product_Type = 'WOOD';

The first product description alphabetically in Product_V will be shown.

What will result from the following SQL Select statement? SELECT MIN(Product_Description) FROM Product_V;

data lake.

When a data repository (including internal and external data) does NOT follow a predefined schema, this is called a:

prescriptive analytics.

When an organization must decide on optimization and simulation tools to make things happen it is using:

descriptive analytics.

When online analytical processing (OLAP) studies last year's sales, this represents:

data in the warehouse contain a time dimension so that they may be used to study trends and changes.

When we consider data in the data warehouse to be time variant, we mean:

No two rows in a relation are identical.

Which of the following are properties of relations?

COUNT

Which of the following counts ONLY rows that contain a value?

Target marketing

Which of the following data-mining applications identifies customers for promotional activity?

Businesses need an integrated view of company information.

Which of the following factors drive the need for data warehousing?

Eliminate the need for application software

Which of the following is NOT an objective of derived data?

To specify syntax and semantics of SQL data definition and manipulation languages

Which of the following is a purpose of the SQL standard?

Improve data integrity

Which of the following is an objective of selecting a data type?

Database management system (DBMS)

Which of the following is software used to create, maintain, and provide controlled access to databases?

Downsizing

Which of the following organizational trends does not encourage the need for data warehousing?

SELECT MIN(Standard_Price) FROM Product_V;

Which of the following will produce the minimum of all standard prices?

Relationships

________ are established between entities in a well-structured database so that the desired information can be retrieved.

Smartphones

________ are examples of Business Intelligences and Analytics 3.0 because they have millions of observations per second.

Word processing programs

________ are not used for querying and analyzing data stored in data warehouses.

Transaction processing

________ generally processes the largest quantities of data.

Variety

________ includes NoSQL accommodation of various data types.

Veracity

________ includes concern about data quality issues.

Velocity

________ includes the value of speed in a NoSQL database.

Data integrity

________ is a component of the relational data model included to specify business rules to maintain the integrity of data when they are manipulated.

DCL

________ is a set of commands used to control a database, including security.

Big data

________ is an ill-defined term applied to databases where size strains the ability of commonly used relational DBMSs to manage the data.

foreign key.

An attribute in a relation of a database that serves as the primary key of another relation in the same database is called a:

how they write code.

Companies learn a lot about customers because they leave clues in all of the following ways EXCEPT:

the maximum price customers are willing to pay.

First-degree or complete price discrimination relates to:

not based on the relational model.

NoSQL includes data storage and retrieval:

True

The applications of analytics are not just limited to business but extend to a wide-array of human activity. True/False

all rows of the Customer_T Table regardless of matches with the Order_T Table.

The following code would include: SELECT Customer_T.CustomerID,CustomerName, OrderID FROM Customer_T LEFT OUTER JOIN Order_T ON Customer_T.CustomerID = Order_T.CustomerID;

all rows of the Order_T Table regardless of matches with the Customer_T Table.

The following code would include: SELECT Customer_T.CustomerID,CustomerName, OrderID FROM Customer_T RIGHT OUTER JOIN Order_T ON Customer_T.CustomerID = Order_T.CustomerID

exploratory.

The goal of data mining related to analyzing data for unexpected relationships is:

entity-relationship model.

The logical representation of an organization's data is called a(n):

targeting market communications to specific customers.

One of the best known uses of data analytics in business is:

one.

The number of entity types that participate in a Unary relationship is:

derived attribute.

An attribute of an entity that is computed is a(n):

sharding

NoSQL systems enable automated ________ to allow distribution of the data among multiple nodes to allow servers to operate independently on the data located on it.

aggregating

The process of transforming data from a detailed to a summary level is called:

Database Administration.

The role responsible for administering database security is:

Personal privacy

________ is arguably the most common concern by individuals regarding big data analytics.

thin client.

A PC configured to handle user interfaces with limited local storage is called a(n):

business rule

A ________ defines or constrains some aspect of the business.

cardinality constraint

A ________ specifies the number of instances of one entity that can be associated with each instance of another entity.

True

A business rule is a statement that defines or constrains some aspect of the business. True/False

several actions against the database.

A business transaction requires:

NoSQL.

A class of database technology used to store textual and other unstructured data is called:

fat client.

A client PC that is responsible for processing presentation logic, extensive application and business rules logic, as well as many DBMS functions is called a:

database server.

A computer that provides database storage and access in a client/server environment is called a(n):

data warehouse that is limited in scope.

A data mart is a(n):

various operational data sources.

A data warehouse derives its data from:

event.

A database action that results from a transaction is called a(n):

is filled exclusively from the enterprise data warehouse with reconciled data.

A dependent data mart:

poor; mainframes

Legacy systems often contain data of ________ quality and are generally hosted on ________.

the recognition of the differences between transactional systems and informational systems.

The key discovery that triggered the development of data warehouses was:

ORDER BY

The last part of an SQL query to be read is the ________ statement.

grain.

The level of detail in a fact table determined by the intersection of all the components of the primary key, including all foreign keys and any other primary key elements, is called the:

predictive analytics.

Application of statistical and computational methods to predict data events is:

managing employees.

Business policies and rules govern all of the following EXCEPT:

repository

A centralized knowledge base of all data definitions, data relationships, screen and report formats, and other system components is called a(n):

analysis and interpretation of data.

At a basic level, analytics refers to:

does not require a strictly defined data model.

Big data:

wearable.

Devices which collect personal health data are:

Sam Hinz

Which of the following violates the atomic property of relations?

DML

________ is a set of commands used to update and query a database.

Middleware

________ is/are any of several classes of software that allow an application to interoperate with other software without requiring the user to understand all software involved.

Vertical

________ partitioning distributes the columns of a table into several separate physical records.

Systems analysts

________ specify computer systems requirements.

MOLAP

________ tools commonly load data into intermediate hypercube structures.


Set pelajaran terkait

GOVT-2306 Chapter 7,8,9 Exam 3 Review

View Set

A&P Chapter 9.6, Bone Resorption by Osteoclasts

View Set

Chapter 13 Learn Smart Microeconomics torture

View Set

Health Psychology Exam 1: Chapter 1+2

View Set

Social Justice: Unit 3 Test Capital Punishment

View Set

Digital Marketing Analytics 4650S - Osbon - Google Analytics Test

View Set

Questions: Around the World in 80 Days Chapters 17-24

View Set