DW Quiz 6, DW Quiz 5, DW Quiz 3, DW Quiz 4, DW-Quiz1, DW Quiz2

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

A goal of data mining includes which of the following? a. To explain some obsserved event or condition b. To confirm that data exists c. To analyze data for expected relationships d. To create a new data warehouse

c

In an operational system, data is usually kept at the lowest level of detail. a. True b. False

a

Let business requirements drive your data warehouse, not technology a. True b. False

a

OLTP is stand for Online Transaction Processing. a. True b. False

a

ROLAP refers to ............OLAP. a. Relational b. Rapid c. Running d. Roll

a

Which dimension does a data warehouse usually have? a. Time b. Category c. User d. Invoice

a

............systems focus on individual events. a. Data warehouse b. OLTP c. Business system d. OLAP

b

............. indicator of high-quality data ensures the form and content of a data field is the same across multiple source systems. a. Accuracy b. Domain Integrity c. Data Type d. Consistency

d

..... table contains the values we want to analyze. a. Dimension b. Fact c. Aggregate d. Center

a

Which statement is TRUE? a. In SQL Server, primary key is clustered index as default. This makes it a very fast index when accessing data, though it may slow down writes if your primary key is not a sequential number. b. Clustered index uses a huge amount of storage to store indexing data. c. Clustered index can increase both read and write operations. d. None of the others.

a

Which statement is correct? a. We have to choose OLAP Cube when performance is a key factor, the key decision makers of the company can ask for statistics from the data anytime from your huge database. b. We have to choose OLAP Cube when storage is a key factor, the key decision makers of the company can ask for statistics from the data anytime from your small database. c. We have to choose OLAP Cube when we do not want to implement ETL. d. None of the others.

a

............. indicator of high-quality data ensures the data value of an attribute falls in the range of allowable, defined values. a. Accuracy b. Domain Integrity c. Data Type d. Consistency

b

DATA INTEGRITY characteristic of strategic information means: a.Every business factor must have one and only one value. b.Easily accessible with intuitive access paths, and responsive for analysis. c.Information must be accurate and must conform to business rules. d.Must have a single, enterprise-wide view.

c

............ may answer questions on overall process. a. OLTP b. OLAP c. Business Intelligence d. Data warehouse

d

........ focus is on how managers view the business. a. Data warehouse b. OLTP c. Business system d. OLAP

a

......... reveals business trends. a. Data warehouse b. OLTP c. Business system d. OLAP

a

........... acts as the glue that connects all parts of the data warehouse. a. Metadata b. ETL c. Operational Metadata d. Extraction and Transformation Metadata

a

............ systems capture details of events or transactions. a. OLTP b. Data warehouse c. OLAP d. Business Intelligence

a

............... signifies that the data warehouse stores the information around a particular subject such as product, customer, sales etc. a. Subject oriented b. Integrated c. Time-Variant d. Nonvolatile

a

................. means populating all the data warehouse tables for the very first time. a. Initial load b. Incremental load c. Load all d. Load

a

...........means applying ongoing changes as necessary in a periodic manner. a. Incremental load b. Updating load c. Full refresh d. Immediate load

a

A Web-enabled data warehouse adapts the Web for information delivery and collaboration among the users. Is this true? a. YES b. NO

a

Accurate requirements definition in a data warehouse project is many times more important than in other types of projects. a. True b. False

a

BIDS (Business Intelligence Development Studio) provides environment for developing your OLAP Cube and Deploy on SQL Server. a. True b. False

a

Benefits from a data warehouse accrue only after the users put it to full use. a. True b. False

a

Benefits from a data warehouse accrue only after the users put it to full use. a.True b.False

a

Bitmapped indexes are ideally suitable for .................data. a. low-cardinality b. high-cardinality c. numerical d. textual

a

Building a data warehouse is very different from building an operational system. a. True b. False

a

CREDIBLE characteristic of strategic information means: a. Every business factor must have one and only one value. b. Easily accessible with intuitive access paths, and responsive for analysis. c. Information must be accurate and must conform to business rules. d. Must have a single, enterprise-wide view.

a

Data granularity in a data warehouse refers to the level of detail. a. True b. False

a

Data on salary may be represented as monthly salary, weekly salary, and bimonthly salary in different source payroll systems. What do you call the above problem in ETL? a. Inconsistency b. Consistency c. Diverse d. Multiple-form

a

Degenerated dimensions ............ a. contain many attributes b. contain no attribute c. contain only one attribute d. contain a primary key and a fact value

a

Dimension tables contain the ...... of analysis. a. criterions b. measurements c. facts d. values

a

Entity-Relationship Modeling is suitable for............ a. OLTP systems b. Data warehouse systems c. OLTP and Data warehouse systems d. OLAP

a

Fact tables are which of the following? a. Completely denoralized b. Partially denoralized c. Completely normalized d. Partially normalized

a

Fact tables generally occupy the most of the storage capacity. a. True b. False

a

In Bottom-Up Approach of designing a data warehouse, data marts are created first to provide analytical and reporting capabilities for specific business subjects based on the dimensional data model. a. True b. False

a

In any analytical system, time is a critical dimension. a. True b. False

a

In data warehouse, Data Extraction function has to deal with numerous data sources. You have to employ the appropriate technique for each data source. a. True b. False

a

In designing a data warehouse, Top-Down Approach will create a data warehouse as a centralized repository for the entire enterprise. a. True b. False

a

Is that true, if one says that OLAP report on the past while Data Mining predicts the future? a. YES b. NO

a

MOLAP refers to ............OLAP. a. Multidimensional b. Multiple c. Many-to-Many d. Mix

a

Metadata in a data warehouse is similar to the .............or the data catalog in a database management system. a. data dictionary b. data structure c. data tables d. data constraints

a

OLAP systems provide ............. and roll-up capabilities. a. drill-down b. drop-down c. top-down d. up-down

a

One says that: ER modeling will have logical and physical model but Dimensional modeling will have only Physical model. ER Modeling is used for normalizing the OLTP database design whereas Dimensional Modeling is used for de-normalizing the ROLAP and MOLAP design. IS THIS TRUE? a. YES b. NO

a

Operational systems are online transaction processing (OLTP) systems. a. TRUE b. False

a

Operational systems are the systems that are used to run the day-to-day core business of the company. a. TRUE b. FALSE

a

Regarding OLAP, which statement is true? a. An OLAP cube is a technology that stores data in an optimized way to provide a quick response to various types of complex queries by using dimensions and measures. b. Most cubes store dimensional values with its special storage structure to provide quick response to queries. c. An OLAP cube is a technology that stores data in data warehouse. d. None of the others

a

Regarding to Star schema, which statement is NOT true? a. In star schema some dimension tables are normalized. b. In star schema each dimension is represented with only one dimension table. c. There is a fact table at the centre. This fact table contains the keys to each of dimension. d. None of the others.

a

SSAS (SQL Server Analysis Services) is Microsoft BI Tool for creating Online Analytical Processing and data mining functionality. a. True b. False

a

The data warehouse is an informational environment that: a. Makes the enterprise's current and historical information easily available for strategic decision making. b. Makes the enterprise's current information easily available for strategic decision making. c. Makes the enterprise's historical information easily available for strategic decision making. d. None of the others

a

The data warehouse is an informational environment that: a. Provides an integrated and total view of the enterprise. b. Store all the business data c. Data is often updated and inserted d. All company's staff access it everyday.

a

The detailed plan results in the definition of the transformation/conversion rules. a. True b. False

a

The initial load (in Data Loading) moves large volumes of data using up substantial amounts of time. a. True b. False

a

The online transaction processing applications support the day-to-day operations of an enterprise. a. True b. False

a

Using parallel processing technologies (e.g. multithreading), the performance of ETL can be improved. Is it true? a. Yes b. No

a

What are the three major areas in the data warehouse? a. Data Acquisition, Data storage, Information delivery b. Data Collection, Data storage, Information delivery c. Data Acquisition, Database, Reporting System d. Data Acquisition, Data storage, Business Intelligence

a

What is Business intelligence? a. The systems and technologies for gathering, cleansing, consolidating, and storing corporate data. The tools, techniques, and applications for analyzing the stored data. b. The tools for decision makers to get the strategic information. c. The tools for data warehouse developer to manipulate and monitor the data warehouse database. d. The most common Operation System for a data warehouse.

a

What should we do before executing ETL? a. Remove indexes in data warehouse database b. Create some non-clustered indexes in data warehouse database c. Do nothing d. None of the others

a

Which is the act of picking a rectangular subset of a cube by choosing a single value for one of its dimensions, creating a new cube with one fewer dimension? a. Slice c. Dice b. Roll-up c. Drill Down/Up

a

Which is the challenge of ETL? a. Source systems are very diverse and disparate. b. Source systems are usually in one platform. c. Specify all the source systems. d. None of the others

a

Which is the challenge of ETL? a. Source system structures keep changing over time because of new business conditions. b. Source system structures are very different. c. Source systems are hosted in very low performance systems. d. None of the others

a

Which is the question that may NOT answered by Data Mining? a. Who are our top 100 best customers for the last three years? b. Which 100 customers offer the best profit potential? c. For the next two years, which stores are likely to have best performance? d. What is the expected return for next year's promotions?

a

Which is the traditional data warehouse project life cycle? a. 1)Project plan 2)Requirements definition 3)Design 4)Construction 5)Deployment 6)Growth and maintenance b. 1)Requirements definition 2)Project plan 3)Design 4)Construction 5)Deployment 6)Growth and maintenance c. 1)Project plan 2)Requirements definition 3)Design 4)Construction 5)Deployment d. None of the others

a

Which operation allows an analyst to rotate the cube in space to see its various faces? a. Pivot b. Dice c. Roll-up d. Slice

a

Which operation allows the user to navigate among levels of data ranging from the most summarized to the most detailed? a. Drill Down b. Dice c. Roll-up d. Slice

a

Which statement is correct? a. With OLAP cube, we are able to create data mining structure on this data which can be helpful in forecasting, prediction. b. OLAP cube help us reduce the cost (time, CPU usage) of updating the data warehouse database. c. OLAP is an optimized way to design the data warehouse system. d. None of the others.

a

Which statement is the most correct definition of data warehouse system? a. Data warehouse is Subject Oriented, Integrated, Time-Variant and Nonvolatile collection of data that support management's decision making process. b. Data warehouse is Subject Oriented, Integrated, Time-Variant and Nonvolatile collection of data that support daily management process. c. Data warehouse is Subject Oriented, Integrated, Time-Variant and Nonvolatile collection of data that support auditing process. d. None of the others.

a

Who are the users that need information from the data warehouse? a. Decision makers b. Customers e. Staff f. Database Administrators

a

............. indicator of high-quality data ensures the value stored in the system for a data element is the right value for that occurrence of the data element. a. Domain Integrity b. Accuracy c. Data Type d. Consistency

b

...............represents the level of detail in the fact table. a. Aggregation b. Data granularity c. Data level d. Data size

b

A fact is a measure taken at the .............of all the dimensions. a. union b. intersection c. combination d. cross

b

ACCESSIBLE characteristic of strategic information means: a.Every business factor must have one and only one value. b.Easily accessible with intuitive access paths, and responsive for analysis. c.Information must be accurate and must conform to business rules. d.Must have a single, enterprise-wide view.

b

Data Cleansing is a. Large collection of data mostly stored in a computer system b. The removal of noise errors and incorrect input from a database c. The systematic description of the syntactic structure of a specific database. It describes the structure of the attributes the tables and foreign key relationships. d. None of these.

b

Data Mining means we want to mining the data? a. YES b. NO

b

Data warehouse architecture is just an overall guideline. It is not a blueprint for the data warehouse. a. True b. False

b

Dimensional Modeling is suitable for............ a. OLTP systems b. Data warehouse systems c. OLTP and Data warehouse systems d. OLAP

b

In SQL (Structured Query Language), Which statement(s) is (are) TRUE regarding to the term cardinality? 1) Low-cardinality refers to columns with few unique values. 2) The lower the cardinality, the less duplicated elements in a column. 3) SQL databases use cardinality to help determine the optimal query plan for a given query. a. 1 and 2 b. 1 and 3 c. 2 and 3 d. There isn't statement.

b

In a table, there is only one ............. index. a. bitmap b. clustered c. non-clustered d. tree

b

In the data warehouse architecture, ................ sits on top of all the other components. It coordinates the services and activities within the data warehouse. a. Metadata Component b. Management and Control Component c. Data source Component d. Data Storage Component

b

In the dimensional model, the attributes of the data warehouse are divided into ..........and dimension tables a. center b. fact c. summary d. aggregate

b

Justification the benefits of a data warehouse through stiff ROI (Return on investment) calculations is always easy. a. True b. False

b

The traditional methods of collecting requirements that work well for operational systems can be directly applied to data warehouses. a. True b. False

b

When a user queries the data warehouse for analysis, he or she usually starts by looking at very details of data. a. True b. False

b

Which operation produces a subcube by allowing the analyst to pick specific values of multiple dimensions? a. Slice b. Dice c. Roll-up d. Drill Down/Up

b

Which statement(s) is (are) correct? 1) Maintaining metadata in a modern data warehouse is just for documentation. 2) Metadata is needed by IT for data warehouse administration. a. Only statement 1 b. Only statement 2 c. Both statement 1 and 2 d. None of the others

b

Who needs strategic information in an enterprise? (choose the most correct) a. Staff b. Decision Markers c. Managers d. Accountants e. Directors

b

............. indicator of high-quality data ensures value for a data attribute is actually stored as the data type defined for that attribute. a. Accuracy b. Domain Integrity c. Data Type d. Consistency

c

A table may contain some ............. indexes. Select one answer only. a. bitmap b. clustered c. non-clustered, bitmap d. non-clustered

c

In which case, indexes can improve the performance? a. Insert data b. Update data c. Read data d. None of the others

c

Indexing in data warehouse database reduces performance of ........ a. Information delivery b. ETL c. Loading data into data warehouse d. OLAP

c

Select the correct comment on the two statements bellow. 1) When you perform the logical design of the database, your goal is to produce a conceptual model that reflects the information content of the real-world situation. 2) In physical design, you are getting closer to the operating systems, the database software, the hardware, and the computing platform. a. Only statement 1 is correct b. Only statement 2 is correct c. Both statements are correct d. Both statements are incorrect

c

Select the most correct definition of the data warehouse concept. a. Take all the data you already have in the organization, clean and transform it, and then provide useful strategic information. b. Take all the data you already have in the organization, store it, and then provide useful strategic information. c. Take all the data you already have in the organization and also in the related business domains, clean and transform it, and then provide useful strategic information. d. None of the others

c

What are the benefits of partitioning? a. Easy management and Better performance b. Availability and Easier backup and recovery c. Both of A and B d. None of A and B

c

Which is NOT a data warehouse building block? a. source data b. data staging c. data schema d. information delivery e. metadata f. management and control.

c

Which operation involves summarizing the data along a dimension? a. Dice b. Slice c. Roll-up d. Drill Down

c

............................. are designed to overcome any limitations placed on the warehouse by the nature of the relational data model. a. Operational database b. Relational database c. Multidimensional database d. Data repository

d

INTEGRATED characteristic of strategic information means: a. Every business factor must have one and only one value. b. Easily accessible with intuitive access paths, and responsive for analysis. c. Information must be accurate and must conform to business rules. d. Must have a single, enterprise-wide view.

d

In ETL, .............tables are loaded first. a. Small b. Fact c. Dimension and Fact d. Dimension

d

In a fact table, there are 3 kinds of attributes: (choose one answer only) a. Dimension foreign keys b. Degenerated dimensions c. Facts d. All the others

d

In data quality, which is (are) the Sources of Data Pollution? (choose one answer only) a. Input Errors b. Incomplete Information at Data Entry c. Internationalization/Localization d. All the others

d

In data quality, which is (are) the Sources of Data Pollution? (choose one answer only) a. System Conversions b. Poor Database Design c. Heterogeneous System Integration d. All the others

d

Regarding to Online Analytical Processing (OLAP), which statement is INCORRECT? a. Designed to hold historical data for analyses and forecast business needs b. Data stored in denormalized format c. Used by users who are associated with the decision making process, e.g., Managers, CEO. d. Holds daily Latest Transactional Data related to your application

d

Regarding to data warehouse. Which statement is true? a. All indexes are always specified at the design time. b. All indexes are NOT always specified at the design time. c. All indexes are always specified at the maintenance time. d. Some indexes are specified at the design time, and some others are added at maintenance time.

d

The biggest drawback of the level indicator in the classic star schema is that is limits ............ a. ability b. quantify c. qualify d. flexibility

d

What are some aspects of Data Mining? a. Association Rules b. Outlier Analysis c. Predictive Analytics d. All of them

d

What exactly do we mean by strategic information? a. Strategic information is important for the continued health and survival of the corporation. b. Strategic information is for the day-to-day operations of the business. c. It is intended to produce an invoice, make a shipment, settle a claim, or post a withdrawal from a bank account. d. The types of information needed to make decisions in the formulation and execution of business strategies and objectives are broad based and encompass the entire organization.

d

What is NOT a Benefit of Improved Data Quality? a. Improved Productivity b. Reduced Costs and Risks c. Reliable Strategic Decision Making d. None of these.

d

What kind of datamining techniques help to find the natural group of objects? a. Classification b. Regression c. Association Rules d. Clustering

d

Which are the optimization strategies in data warehouse system? (choose ONE answer only) a. Use Index b. Aggregate Table c. Query Optimization d. All the other choices

d

Which is (are) related to the time-variant nature of the data in a data warehouse? (choose one answer only) a. Allows for analysis of the past b. Relates information to the present c. Enables forecasts for the future d. All the others

d

Which is NOT popular data warehouse architecture? a. Real b. Virtual c. Remote d. Distributed

d

Which statement is correct regarding to dimensional model? a. A fact value CANNOT be calculated from other attributes. b. Usually, the primary key of the fact table is an auto generated key. c. Dimension tables occupy of the storage capacity. d. Every fact table has at least one Foreign Key.

d

Which statement(s) is (are) related to Dimensional Modeling? a. Captures critical measures b. Views along dimensions c. Intuitive to business users d. All the others

d

Which statement(s) is (are) related to Entity-Relationship Modeling? (choose one answer only) a. Removes data redundancy b. Ensures data consistency c. Expresses microscopic relationships d. All the others

d

Which value(s) is(are) typically Low-cardinality column values?(choose ONE answer only) a. status flags b. Boolean c. major classifications such as gender d. status flags, Boolean values, or major classifications such as gender

d

Source data in data warehouse may come from: (choose one answer only) a. Production Data b. Internal Data c. Archived Data d. External Data e. All the others

e

Which is NOT a characteristic of a data warehouse? a. Separate b. Integrated c. Time stamped d. Subject oriented e. Volatile

e

Which is NOT a key issue while planning for your data warehouse? a. setting proper expectations b. assessing risks c. deciding between top-down or bottom-up approaches d. Choosing from vendor solutions. e. requirement changes

e

The operational system is an information system that: a. Provides an integrated and total view of the enterprise. b. Makes the enterprise's current and historical information easily available for strategic decision making. c. Makes decision-support transactions possible without hindering operational systems. d. Renders the organization's information consistent. e. Presents a flexible and interactive source of strategic information. f. Support the basic business processes of the company.

f

Which are NOT the desired characteristics of strategic information? a. INTEGRATED b. DATA INTEGRITY c. ACCESSIBLE e. CREDIBLE f. TIMELY g. None of the others

g


Ensembles d'études connexes

OB-GYN Penny Book Review Questions

View Set

Small Business Management: Chapters 14-18

View Set

Science Final Chemistry Questions

View Set

PSYCH 260 Physio Psychology chapter 5

View Set

Patent Cooperation Treaty (MPEP 1800)

View Set