IS 3003 Chapter 6

¡Supera tus tareas y exámenes ahora con Quizwiz!

A field identified in a table as holding the unique identifier of the table's records is called the: A) primary key. B) key field. C) primary field. (meaningless phrase) D) foreign key. (is the primary key in a different table)

Answer: A

In terms of the data relationships found by data mining, sequences refers to: A) events linked over time. B) patterns that describe a group to which an item belongs. C) occurrences linked to a single event. D) undiscovered groupings.

Answer: A

What is the first step you should take in managing data for a firm? A) Identify the data needed to run the business B) Cleanse the data before importing it to any database C) Normalize the data before importing to a database D) Audit your data quality

Answer: A

Within a corporate information services department, the task of creating the physical database and its logical relations are responsibilities of the ________ function. A) database administration B) data administration C) server administration D) data modeling

Answer: A

A data ________ stores current and historical data of potential interest to decision makers throughout the company. A) warehouse B) mart C) archive D) mine

Answer: A Diff: 2 Page Ref: 199

OLAP is a tool for enabling: A) users to obtain online answers to ad hoc questions rapidly. B) users to view both logical and physical views of data. C) programmers to quickly diagram data relationships. D) programmers to normalize data.

Answer: A Diff: 2 Page Ref: 201

In clustering, a data mining tool will: A) find new groupings within data. B) find related predictions from existing values. C) find several events grouped by time. D) find new associations.

Answer: A Diff: 2 Page Ref: 202

Which of the following would you use to find patterns in user interaction data recorded by Web servers? A) Web usage mining B) Web server mining C) Web structure mining D) Web content mining

Answer: A Diff: 2 Page Ref: 205

A(n) ________ is a structured survey of the accuracy and level of completeness of the data in an information system. A) data quality audit B) systems analysis C) systems audit D) data analysis

Answer: A Diff: 3 Page Ref: 208

A data mart usually can be constructed more rapidly and at lower cost than a data warehouse because: A) it typically focuses on a single subject area or line of business. B) all the information is historical. C) it uses a Web interface. D) all of the information belongs to a single company.

Answer: A I don't like the wording on this question. A data mart is a subset of a data warehouse. Usually, a data warehouse is created, and from that a subset focused on a single subject area is taken to create that subset. It would be unusual to create a datamart without creating the data warehouse first. The reason I am leaving this question is that I would like you to know that a data mart focuses on a single subject area or line of business.

The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called: A) normalization. B) data scrubbing. C) data cleansing. D) data administration.

Answer: A I need you to know that normalization is not easy. Users will not do it; database designers do it. If it is not done correctly, then the operation of the database will be clunky. It is an important step.

A DBMS makes the: A) physical database available for different logical views. B) logical database available for different analytical views. C) physical database available for different relational views. D) relational database available for different physical views

Answer: A Read the book and understand difference between physical and logical views of a database.

In a table for customers, the information about a single customer would reside in a single: A) field. B) row. C) column. D) table.

Answer: B Diff: 2 Page Ref: 188 AACSB: Application of knowledge CASE: Comprehension Learning Objective: 1 - How does a relational database organize data?

MySQL is a: A) DBMS for small handheld computing devices. B) popular open-source DBMS. C) mainframe relational DBMS. D) DBMS for desktop systems.

Answer: B Diff: 2 Page Ref: 193

Tools for consolidating, analyzing, and providing access to vast amounts of data to help users make better business decisions are known as: A) DSS. (to support making semi-structured decisions) B) business intelligence. C) OLAP. (to do multi-dimensional analysi) D) data mining.(to explore for patterns and relationships in data)

Answer: B Diff: 2 Page Ref: 198

Data mining is a tool for allowing users to: A) quickly compare transaction data gathered over many years. B) find hidden relationships in data. C) obtain online answers to ad hoc questions in a rapid amount of time. D) summarize massive amounts of data into much smaller, traditional reports.

Answer: B Diff: 2 Page Ref: 202

A field identified in a record as holding the unique identifier for that record is called the: A) primary key. B) key field. C) primary field. D) foreign key.

Answer: B NOTE: I don't like the distinction that the book has made between primary key and key field - it is subtle and can confuse people. I need you to know what a PRIMARY KEY is. Please read the book to get a clear idea.

All of the following are tools or technologies for extracting information from unstructured data sets except: A) sentiment analysis software. B) SQL queries. C) Hadoop. D) Non-relational DBMS.

Answer: B The key word here is 'unstructured'. SQL works only with structured data.

Businesses use ________ tools to search and analyze unstructured data sets, such as e-mails and memos. A) Hadoop B) Web mining C) text mining D) OLAP

Answer: C

Hadoop is a(n): A) NoSQL database technology that stores both data and procedures acting on the data as objects. B) data mining platform designed for storing and analyzing big data produced by Web communications and transactions. C) open-source software framework designed for distributing data-processing over inexpensive computers. D) pre-configured hardware-software system designed for analyzing big data.

Answer: C

The most prominent data manipulation language today is: A) Access. (this is a DBMS, not a language) B) DB2. (this is a DBMS, not a language) C) SQL. D) Crystal Reports. (generates reports, does not manipulate data)

Answer: C Diff: 1 Page Ref: 194

The type of logical database model that treats data as if they were stored in two-dimensional tables is the: A) two-tiered DBMS. (I don't think there is any such thing) B) pre-digital DBMS. (I don't think there is any such thing) C) relational DBMS. D) hierarchical DBMS. (these were used in earlier generations of databases; they are good for speedy retrieval of data if you know what you are looking for exactly; they are not as flexible as relational databases.)

Answer: C Diff: 2 Page Ref: 188

The tool that enables users to view the same data in different ways using multiple dimensions is: A) Hadoop. B) SQL. C) OLAP. D) data mining.

Answer: C Diff: 2 Page Ref: 201

Which of the following would you use to find out which Web sites with content related to database design were the most often linked to by other Web sites? A) Web usage mining B) Web server mining C) Web structure mining D) Web content mining

Answer: C Diff: 2 Page Ref: 203-205

A(n) ________ is a dedicated computer in a client/server environment that hosts a DBMS. A) web server B) application server C) database server D) client server

Answer: C Diff: 2 Page Ref: 206

Detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called: A) data auditing. B) defragmentation. C) data scrubbing. D) data optimization.

Answer: C When I defined data scrubbing earlier in this document, I gave one example of why data scrubbing would be needed. Here you are seeing more reasons why data scrubbing would be needed.

Data mining is more ________ than OLAP. A) data focused B) multidimensional C) query oriented D) discovery driven

Answer: D

Which of the following statements about data warehouses is not true? A) They store supply data to be used across the enterprise for management analysis and decision making. B) Data warehouse systems provide a range of ad hoc and standardized query tools, analytical tools, and graphical reporting facilities. C) They may include data from Web site transactions. D) Data warehouse systems provide easy-to-use tools for managers to easily update data.

Answer: D Diff: 2 Page Ref: 199 Key word is 'update', which means change (edit) data or add data. Managers/users should not add or change data in a data warehouse, so no 'easy' tools are provided for that for the user.

Which of the following is not a typical feature of DBMS? A) Data manipulation language B) Report generation tools C) Data dictionary D) Query wizard tool (there is no such thing)

Answer: D Food for thought: Does a DBMS have other features?

A grouping of characters into a word, a group of words, or a complete number is called a record.

Answer: FALSE A grouping of fields is a record. (The phrases used in the question are nonsensical.)

OLAP is used to find hidden patterns and relationships in large databases, and infer rules from these to infer future behavior.

Answer: FALSE Data mining is used to find hidden patterns and relationships.

Most back-end databases are able to interpret HTML commands.

Answer: FALSE Diff: 1 Page Ref: 206 Front-end refers to those parts of the system which are closer to the user - the user interface in particular. Back-end refers to files in which the organized information is stored.

A data warehouse is typically comprised of several smaller data marts.

Answer: FALSE Diff: 2 Page Ref: 199 Data warehouse includes historic and current data from several databases. Data mart is a subset of the data warehouse, created to try and answer specific questions.

An information policy specifies the procedures and rules for defining the structure and content of corporate databases.

Answer: FALSE Diff: 2 Page Ref: 206

The use of different terms for identifying data in a firm's various information systems is an example of redundant data.

Answer: FALSE Diff: 2 Page Ref: 208-209

End users use a data manipulation language to add, edit, delete data in the database.

Answer: TRUE

Every record in a file should contain at least one key field.

Answer: TRUE Diff: 1 Page Ref: 188

Web mining is the discovery of useful patterns on the Web.

Answer: TRUE Diff: 1 Page Ref: 203 So, contrast this with text mining and data mining.

A firm's information policy lays out who is responsible for updating and maintaining the information in a database system.

Answer: TRUE Diff: 1 Page Ref: 207 What else does the information policy include? Look it up.

A foreign key in a table of the database is a field that links to the primary key of another table.

Answer: TRUE Diff: 2 Page Ref: 189

A DBMS is special software for creating, storing, organizing, and accessing data from a database.

Answer: TRUE Diff: 2 Page Ref: 193

A physical view shows data as it is organized on the storage media.

Answer: TRUE Diff: 2 Page Ref: 193

Both Oracle and Microsoft Access use SQL to retrieve information from the database.

Answer: TRUE Diff: 2 Page Ref: 194-195 How is SQL pronounced? (Sequel) What does it stand for? (structured query language)

Hadoop can process large quantities of any type of data, including video and Twitter feeds.

Answer: TRUE Diff: 2 Page Ref: 199 Hadoop can handle both relational and non-relational data, including text, video etc:

A data warehouse may include information from legacy systems.

Answer: TRUE Diff: 2 Page Ref: 199 Recall data warehouses include historical data; historical data may reside in legacy systems. (What is a legacy system?).

Data administration is a special organizational function that manages the policies and procedures through which data can be managed as an organizational resource.

Answer: TRUE Diff: 2 Page Ref: 207

Data cleansing is the same process as data scrubbing.

Answer: TRUE Diff: 2 Page Ref: 208 This is done when creating a data warehouse. Data warehouses contain historic data. Historic data over 10 or 15 years may have inconsistencies. For instance, in some databases my name may have been entered as Chino, and in others as Srinivasan. When you combine the two databases as part of creating the data warehouse, the system has no way of knowing that Chino is the same as Srinivasan. So, while creating the data warehouse, you have to go through a process of data cleansing - that is detecting such inconsistencies (and other kinds of inconsistencies) and rectifying them.

In-memory computing relies on a computer's main memory (RAM) for storing data.

Answer: TRUE Diff: 3 Page Ref: 200 Usually in-memory computing needs lots more of RAM than those systems that do not do in-memory computing. In-memory computing is used to speed up processing of data.

DBMS simplify how end users work with databases by separating the logical and physical views of the data.

Answer: TRUE Physical view of data refers to where on the hard drive the different pieces of data are placed. Logical view refers to the way in which we think about data - in the form of tables with columns for each field and a row for each record.


Conjuntos de estudio relacionados

Concept Covered: Electron Transport Chain, The Mitochondria, Overview of Respiration, Krebs Cycle, Glycolysis

View Set

Chem I Final Exam Comprehensive Review

View Set

IT Security: Defense Against the Digital Dark Arts

View Set