Exam 2 Database Design

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

Two common data access privilege DCL commands include:

grant and revoke

What does a query do?

answer a question(s)

The performance of operational day-to-day tasks involving data use can be severely diminished if such tasks have to compete for computing resources with analytical queries.

True

A data warehouse is created within an organization as a separate data store whose primary purpose is data analysis.

True

Which of the following would typically be stored in target (data warehousing) systems?

All of these are typically present in target systems.

The data dictionary created by the ________ is often referred to as the catalog.

DBMS

Which of the following differences, renders normalization of data warehouses unnecessary?

Data Update Difference and Data Redundancy Difference

Which of the following is NOT true:

Data marts typically have more subjects of analysis than data warehouses

What does ETL stand for?

Extract - Transform - Load

Retrieval of data from operational data sources takes place during _____________.

Extraction

________ contain measures related to the subject of analysis.

Fact tables

Queries can only come from one table?

False

Implicit changes of data warehouse requirements are permitted during:

Implicit changes of requirements are not permitted

Which of the following is true when comparing a data warehouse to a data lake?

In ETL, a data lake has no transform part

What will be the entire outcome of the following SQL statement issued in a doctorâ s database? GRANT SELECT, INSERT, ALTER, UPDATE ON specialty TO katie;

Katie can read data from SPECIALTY, change data in SPECIALTY, change the metadata of SPECIALTY, insert data in SPECIALTY

Which data types can have up to 65,535 characters?

Long Text and Hyperlink

In the case of so-called active data warehouses, the loads occur in ________ that happen continuously, ensuring that the data in the data warehouse is updated close to real time (enabling analysis of the latest data)

Micro batches

Querying and presenting data from data warehouses and/or data marts for analytical purposes is known as:

OLAP

Updating, querying, and presenting data from databases for operational purposes is known as:

OLTP

Which of the following is a Data Control Language (DCL) command

REVOKE

OLAP/BI tools are:

Read-only tools

Which of the following would NOT be found in an authorization matrix?

Recovery log checkpoint

The application development DBMS component provides functionalities to the developers of:

Security and backup procedures

Internal deployment to the members of the development team for initial testing of the functionalities of a data warehouse is known as _____________.

The alpha release

Creating surrogate key values takes place during _____________.

Transformation

Data cleansing is a part of:

Transformation

Which of the following items would tend to have the most detail:

a current sales transaction

Access to data and privileges by users is implemented by maintaining a ________________________ .

authorization matrix

Which data type is 9 digits?

autonumber

Two common abbreviations with data warehouses are BI and OLAP. What does each stand for?

business intelligence and online analytical processing

Which data type is accurate to 15 digits on the left side of the decimal point and to 4 digits on the right side

currency

_____________ tables typically have the most foreign keys.

fact tables

Granularity describes _____________________ in the fact table

level of detail

A data dictionary is a repository of the ____________________ .

metadata

Data lake repositories of big data are not created through the process of formal database __________________.

modeling

You must _________ each field, table, and other object in an Access database

name

What is the primary reason for an Access form?

to make data input easier

Which of the following data examples has the most structure?

A row in a relational table

Which of the following is an example of a subject-oriented data set.

None of these

CONSTANT DOCUMENTARIAN is an organization that receives each day a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-words email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets: - Set A: collection of daily one-hour videos from the 50 contributors - Set B: collection of daily 200-words emails from its contributors - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters - Set D: relational table containing first name, last name, phone number, and email address of each contributor. Which data set is exhibiting the highest volume?

Set A

Operational queries typically process:

Smaller amounts of data than analytical queries

The dimensional schema is often referred to as the:

Star schema

Which of the following is a part of the data warehouse requirements step?

Conceptual modeling

Due to the amount of details that have to be considered, creating _________________ is often the most time- and resource-consuming part of the data warehouse development process

ETL infrastructure

T/F Fields can be renamed in Datasheet view or Design view

True

T/F: Big data is mostly unstructured

True

The data definition DBMS component uses the SQL commands from the DDL category.

True

Deleting an extracted row from the set of extracted rows (because an identical row was retrieved from another data sourc before the data is loaded into the data warehouse, is an example of ________________.

Active transformation

OLAP/BI tools can be used for:

Ad-hoc direct analysis and creation of front-end (BI) applications

The data manipulation DBMS component is used:

By end users, either directly or via front-end applications

__________________ data marts are sourced exactly from enterprise data-warehouses.

Dependent

Production release takes place during _____________.

Deployment

The set of processes that is used to populate the data warehouse tables with the appropriate relevant data retrieved from the operational databases is known as:

ETL

Big data sets, when compared with most databases and data warehouses:

Have higher possibility of different interpretations

A typical organization maintains and utilizes a number of operational data sources.

True

Which data type is 1 character only?

Yes/No

Operational queries are typically issued:

More often than analytical queries

Adding timestamp to a record to indicate when the row is valid/applicable is an example of ________________.

Passive transformation

CONSTANT DOCUMENTARIAN is an organization that receives each day a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-words email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets: - Set A: collection of daily one-hour videos from the 50 contributors - Set B: collection of daily 200-words emails from its contributors - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters - Set D: relational table containing first name, last name, phone number, and email address of each contributor. Which data set is exhibiting the highest velocity?

Set C

Big data does not easily fit into tables with rows and columns

True

Periodic backup copies of the database are made through use of recovery logs and _______________ .

check points

To relate two tables together, you must always have a _________________ field.

common

Which data type is 8 bytes?

date

________________ data marts are sourced from data captured from one or more operational systems or external data providers, or data generally locally within a different department or geographic area

Independent

Big data is not formally modeled for querying and retrieval and are not accompanied with detailed metadata

True

T/F Access does not allow duplicate values in the primary key field?

True

The data definition component is a part of every DBMS package.

True

The data manipulation component is a part of every DBMS package.

True

When you format a field, you change the way data is displayed, but not the _____________________.

actual values stored in the table

One way integrity of data in a database can be compromised is ______________ .

all of these ways

Metadata is __________________________.

data about the data

The _____________ determines what field values you can enter for the field and what other properties the field will have

data type

Choose the list that CORRECTLY has sizes of data sets from smallest to largest.

database, data warehouse, data lake

________ contain analytically useful information.

Both dimension and fact tables

The purpose of the source system is:

Both the original operational purpose and as a source system for the data warehouse

Which of the following is a Transaction Control Language (TCL) command

COMMIT

If, during the use of the database system, it becomes apparent that a new relation needs to be added to the database, the decision to add the new relation to the database will rest with the:

Database developer

Due to multiple subjects of analysis sharing dimensions, the following occurs:

A dimensional model contains more than one fact table

Data warehouse requirements can be refined and/or expanded following:

All of these

Which of the following is NOT a data administration task?

Creating a conceptual database model

Which of the following processes is typically the most time- and resource-consuming?

Creating the ETL infrastructure

Which of the following allows use of the same dimensions with multiple fact tables?

Galaxy of stars

What does BI stand for?

Business Intelligence

To create a data warehouse based on the star schema with 5 dimensions and 1 fact table, we will need ____ CREATE TABLE statements.

6

Data types are assigned automatically by ______________ or specifically by the _______________ .

Access Table Designer

Which of the following would typically be stored in source (operational databassystems?

Detailed Data

Typically, in a star schema all ________ are given a simple, non-composite system-generated primary key, also called a surrogate key.

Dimension tables

________ contain descriptions of the business, organization, or enterprise to which the subject of analysis belongs.

Dimension tables

A dimensional model cannot be based on a single source.

False

A large fact table in a data warehouse is an example of big data.

False

All sources for a dimensional model must be operational data sources from the company that is building the dimensional model.

False

SQL provides all the necessary functionalities for managing and analyzing big data.

False

T/F You cannot import data from Excel into Access due to software restrictions and formatting of the fields?

False

The application development component is a part of every DBMS package.

False

The operational data sources include the data warehouses and data marts.

False

If applicable, the archived data from operational sources is loaded in the data warehouse during the:

First load

In the process of query optimization, query cost refers to:

How long it takes to execute a query.

Where on the Spectrum of Solutions for Large Analytical Data Repositories, would the following example best fit? Data from 10 data sources is extracted. Two of those sources have the exact same structure, so the data from those two sources is pasted together before loading. There is a small overlap of data in those two sources, so the duplicates are eliminated before loading. The rest of the sources are loaded as they were.

Inside the spectrum, closer to the left edge of the spectrum (Pure Data Lake).

In a typical properly-designed star schema, the number of records/rows in any of the dimension tables is:

No typical ruleâ there is a similar number of dimension tables with larger, smaller and same number of records as in the fact table

Data time-horizon is:

Typically, shorter in operational systems than in analytical systems

Which of the following statements is true.

detailed fact tables contain the most information

Which of the following means to get more detailed data (higher granularity)?

drill down

When purchase items online, you are actually entering data for the vendor into a database. When ordering online, what does the graphic below represent in relation to a database?

form

Comparing a Data Warehouse vs a Data Lake, a common analogy is data warehouses are like agriculture while data lakes are like _____________________ .

hunting/gathering

Access responds faster to your requests for specific records based on the ______________.

primary key

Within a Data Lake, the large data pool in which the schema and data requirements are not defined until the data is ______________ .

queried

Data lakes store vast amount of __________________ data in its native format until it is needed

raw

Which field type has these characteristics: 0 to 255 characters; default is 255

short text

____________________ adds, replaces, or eliminates specified dimension attributes (or particular values of the dimension attributes) from the already displayed result

slice and dice

Which of the following is likely to be a user of a bank's data warehouse

Bank's CEO

CONSTANT DOCUMENTARIAN is an organization that receives each day a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-words email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets: - Set A: collection of daily one-hour videos from the 50 contributors - Set B: collection of daily 200-words emails from its contributors - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters - Set D: relational table containing first name, last name, phone number, and email address of each contributor. Which data set has the most structure?

Set D

Label the graphic below correctly between course and fine granularity: A) Students = 5 Classes = 10 B) Students = A. Johnson, K. Thessalonious, A. Palmer, K Walker, Z. Bundrick Classes = CIT 105, ENG 101, MAT 150, SOC 101, PHY 110, MAT 171, CIT 149, CIT 180, CIT 293, HIS 109

A = coarse, B = fine

Operational queries are typically issued:

By more users than analytical queries

The ________ slowly changing dimension approach creates a new additional dimension record using a new value for the surrogate key every time a value in a dimension record changes.

Type 2

The ________ slowly changing dimension approach involves creating a previous and current column in the dimension table for each column where changes are anticipated.

Type 3

Examples of sources generating large amounts of data

ALL OF THESE ANSWERS

CONSTANT DOCUMENTARIAN is an organization that receives each day a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-words email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets: - Set A: collection of daily one-hour videos from the 50 contributors - Set B: collection of daily 200-words emails from its contributors - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters - Set D: relational table containing first name, last name, phone number, and email address of each contributor. Which data set is exhibiting the lowest velocity?

Set D

CONSTANT DOCUMENTARIAN is an organization that receives each day a one-hour log video from 50 contributors around the world, recording mundane daily-life scenes. It also receives one daily 200-words email from its contributors. Consider the following CONSTANT DOCUMENTARIAN data sets: - Set A: collection of daily one-hour videos from the 50 contributors - Set B: collection of daily 200-words emails from its contributors - Set C: video footage of its 24/7 CCTV camera constantly recording scenes outside its headquarters - Set D: relational table containing first name, last name, phone number, and email address of each contributor. Which data set is exhibiting the lowest volume?

Set D

Which of the following column names would not appear in a fact table?

Product Name

Which way(s) can you import data into an Access database

ALL of these ways


Kaugnay na mga set ng pag-aaral

Chapter 24 Nursing Assessment of the Newborn

View Set

Manufacturing, ROA, Purchasing, Vertical Integration, Outsourcing, Insourcing, Supplier Certification, RFI, RFP, RFQ, Purchase Order, Total Cost of Manufacturing, Spend Analysis, Six Sigma, Strategic Sourcing, Preferred Suppliers, Sourcing Sustainability,

View Set

8.4 Volumes of Rectangular Prisms

View Set

Psych Ch. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 Next Step

View Set