C724 Information Systems Management Module 4 (Unit 4)

Companies rely on data to drive information and decision making. Why would a business or enterprise implement a data warehouse? A data warehouse improves decision making. A data warehouse stores legacy paper files. A data warehouse stores employee files.

A - A key reason for implementing a data warehouse is to store and access data from multiple sources to be queried and analyzed to help manage decision making.

When considering data mining and its benefits to business, what is required for an enterprise today to be competitive and gain business intelligence from day-to-day operations? Perform analytics on enterprise data stored in data warehouses. Hire consultants to provide business intelligence. Implement the office of chief business intelligence officer.

A - A large business or enterprise must implement a data warehouse application and perform data mining or analytics to gain business intelligence.

Chris is explaining to management some ways in which an operational database can be used to improve performance and business decision making. What is a common business use of an operational database? Operational databases can be used to track real time inventory. Operational database is a type of OLAP. Operational databases store statistical data for analysis.

A - An operational database is an online transaction processing database (OLTP) and can be used to gather real-time transactional or operational data.
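
As a rough sketch of the real-time inventory use case, assuming a hypothetical `inventory` table in an in-memory SQLite database (the table, column, and SKU names are illustrative, not from the course material), each sale can be applied as one small transaction that either commits or rolls back:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table; the CHECK constraint forbids negative stock.
conn.execute(
    "CREATE TABLE inventory (sku TEXT PRIMARY KEY, on_hand INTEGER CHECK (on_hand >= 0))"
)
conn.execute("INSERT INTO inventory VALUES ('WIDGET', 5)")
conn.commit()

def record_sale(connection, sku, quantity):
    """Apply one sale as a transaction; roll back if stock would go negative."""
    try:
        with connection:  # commits on success, rolls back on error
            connection.execute(
                "UPDATE inventory SET on_hand = on_hand - ? WHERE sku = ?",
                (quantity, sku),
            )
        return True
    except sqlite3.IntegrityError:
        return False

def on_hand(connection, sku):
    """Read the current stock level for one SKU."""
    return connection.execute(
        "SELECT on_hand FROM inventory WHERE sku = ?", (sku,)
    ).fetchone()[0]

first_sale = record_sale(conn, "WIDGET", 3)   # stock goes from 5 to 2
second_sale = record_sale(conn, "WIDGET", 4)  # would go negative, so it is rejected
remaining = on_hand(conn, "WIDGET")
```

Because every sale updates the stored quantity immediately, a query against the table always reflects current stock, which is the OLTP behavior the answer describes.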

Horizontal and vertical partitioning are the two most common processes in partitioning databases. What is horizontal partitioning? Horizontal partitioning restricts the rows to replicate in a back up process. Horizontal partitioning transforms the table from two dimensions to three dimensions. Horizontal partitioning restricts the columns to replicate in a back up process.

A - Horizontal partitioning in a relational database deals with rows, not columns.
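
A minimal sketch of the distinction, assuming a hypothetical `orders` table held as a list of dictionaries (the column names and the split by `region` are illustrative assumptions): horizontal partitioning divides the rows, while vertical partitioning divides the columns.

```python
# Hypothetical sample table; column names are assumptions for illustration.
orders = [
    {"order_id": 1, "region": "East", "amount": 120.0},
    {"order_id": 2, "region": "West", "amount": 75.5},
    {"order_id": 3, "region": "East", "amount": 42.0},
]

def horizontal_partition(rows, key):
    """Split a table by rows: each partition keeps every column."""
    partitions = {}
    for row in rows:
        partitions.setdefault(row[key], []).append(row)
    return partitions

def vertical_partition(rows, columns):
    """Split a table by columns: every row is kept, with fewer fields."""
    return [{col: row[col] for col in columns} for row in rows]

by_region = horizontal_partition(orders, "region")
ids_and_amounts = vertical_partition(orders, ["order_id", "amount"])
```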

Which of the following is an effective use of predictive analysis? An insurance company is analyzing medical records to determine malpractice rates. A local restaurant has decided to open another location across town. A fitness center had a decrease in profits last year, so they are increasing customer membership fees.

A - Insurance companies rely on predictive analysis to determine how to charge customers.

Predictive analytics of multidimensional data can lead to better decisions and lower costs. What is a multidimensional database? A relational database that is structured as a cube. A flat file database. An online transaction processing database.

A - Multidimensional databases are typically relational databases that store data in a cube, which contains aggregated data related to all dimensions and allow businesses to visualize data assets in multidimensions to see what is happening in real time.
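
To make the cube idea concrete, here is a small sketch that pre-aggregates a handful of made-up sales facts across three assumed dimensions (product, region, quarter). The `"*"` marker for "all values" is an illustrative convention, not how any particular product stores its cubes.

```python
from collections import defaultdict

# Hypothetical sales facts: (product, region, quarter, units).
sales = [
    ("Laptop", "East", "Q1", 10),
    ("Laptop", "West", "Q1", 4),
    ("Phone", "East", "Q1", 7),
    ("Laptop", "East", "Q2", 6),
]

def build_cube(facts):
    """Pre-aggregate units for every combination of dimensions ('*' means all)."""
    cube = defaultdict(int)
    for product, region, quarter, units in facts:
        for p in (product, "*"):
            for r in (region, "*"):
                for q in (quarter, "*"):
                    cube[(p, r, q)] += units
    return cube

cube = build_cube(sales)
laptops_all = cube[("Laptop", "*", "*")]  # roll-up over region and quarter
east_q1 = cube[("*", "East", "Q1")]       # slice by region and quarter
grand_total = cube[("*", "*", "*")]
```

Storing the aggregates ahead of time is what lets a user switch between views (by product, by region, by quarter) without rescanning the raw transactions.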

When designing a database, you must consider its use or function. One type of database is OLAP. What does it stand for and what is it used for? OLAP stands for online analytical processing and are databases used to assist decision making. OLAP stands for online analytical processing and are used for decision making and can be modified or deleted by users. OLAP stands for online analytical processing. The records are transactional and manipulated by users.

A - OLAP databases are analytical, fixed databases containing data used for making decisions. OLAP is used for multidimensional analysis of data.

Why does the ETL process run in parallel and not in sequential order? The ETL process to update a data warehouse runs in parallel because the extraction step is lengthy. The ETL process to update a data warehouse runs in parallel due to multicore CPU. The ETL process to update a data warehouse runs in parallel to minimize disruption.

A - The extraction process in ETL can be lengthy, so data already extracted can be processed and loaded.

Relational database models were first developed in the 1970s. It has become the most widely used database model in business. Which one of the following traits is NOT one of the benefits leading to widespread use of relational database? Tree-like structure Tabular format Use SQL language

A - Tree-like structure is a hierarchical database characteristic, not a relational database characteristic.

There are five rules that pertain to the relational database model. Which one of the following examples breaks one of the rules? The primary key of the orders table is the video title. The video table is sorted by production date. In the orders table, the video title field is before the video production date.

A - Video title is not a good primary key because it is possible to have two videos with the same name. A better choice would be a video ID number.
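
The problem can be demonstrated with a short SQLite sketch. The table names, titles, and years below are invented for illustration. With the title as the primary key a second video with the same name is rejected, whereas a video ID key accepts both rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical schema in which the title is (unwisely) the primary key.
conn.execute("CREATE TABLE videos_by_title (title TEXT PRIMARY KEY, year INTEGER)")
conn.execute("INSERT INTO videos_by_title VALUES ('Night Train', 1990)")

try:
    # A second video with the same title cannot be stored.
    conn.execute("INSERT INTO videos_by_title VALUES ('Night Train', 2020)")
    title_key_rejected = False
except sqlite3.IntegrityError:
    title_key_rejected = True

# With a video ID as the key, both rows fit.
conn.execute(
    "CREATE TABLE videos (video_id INTEGER PRIMARY KEY, title TEXT, year INTEGER)"
)
conn.executemany(
    "INSERT INTO videos (title, year) VALUES (?, ?)",
    [("Night Train", 1990), ("Night Train", 2020)],
)
video_count = conn.execute("SELECT COUNT(*) FROM videos").fetchone()[0]
```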

Which term describes the procedure in ETL (extract, transform, and load) that correlates the data with the data warehouse format? Consolidation algorithm Transfer procedure application Scrub algorithm

A consolidation algorithm correlates the data with the data warehouse format.

A data warehouse can be described or defined as a platform or capability for businesses to tap or develop business intelligence used for decision making. Describe what a data warehouse is and how it can benefit business.

A data warehouse is a collection of historical business transaction data, coupled with other, external market data or information. It serves to integrate data from multiple sources within the enterprise. The key benefit to business is that this data set can be queried and analyzed to support business decision making.

Select the best definition to describe a database. A database is a computer application that has business information. A database is a set of data that are organized and easily searchable through queries. A database is a file containing business intelligence.

A database is a set of data, in electronic format, organized in tables and easily searchable. This is a collection of data.

Assume you are explaining the types of servers involved in databases to a group of coworkers. You note that a __________ is a dedicated computer that stores database files and database management systems.

A database server is a powerful computer that stores the databases and DBMS to access and administer the database.

Which of the following terms represents a set of data, typically aligned as a row in a table? Record Field Table

A record is a set of related data typically aligned in a row.

Which of the following attributes or fields would be a good choice for a primary key? Student ID Address Customer Name

A unique identifier can be made for each new student. While multiple students may have the same name, no two students will have the same student ID.

What is a key advantage in business in having a web-based database rather than a client-server traditional database? Data integrity Access from anywhere Security of data

A web-based database allows users to access data from anywhere; it is useful for remote or mobile staff and to maintain database consistency. This is a cloud-based application.

Frank works as a database administrator and focuses a lot of time and attention on data integrity and the quality of data in the database. What are some of the desirable characteristics of data integrity that he should look for in his work? (Select all that apply.) Consistency Duplication Accuracy

Accuracy and consistency define data integrity in database design.

Which of the following terms defines a database entity? Something for which a business collects data. A collection of related fields. A collection of related records.

An entity is an event, person, or thing for which a business collects data.

What is one tool a business can use to map process requirements and data to develop a supporting relational database? Visio Decision tree Entity relationship diagram

An entity relationship diagram (ERD) outlines the business processes and subsequent relationships and possible data requirements.

A relational database which stores data in tables can be designed using a(n) ________________.

An entity-relationship model is the basic tool when designing a relational database.

In database modeling and design, what is meant by database normalization? It restores lost relationships. It means adjusting data values. It means removing redundancy in conceptual model.

B - Databases must go through normalization in the design process to simplify complexity where possible and remove redundancy between elements.

Which of the following is considered an efficient database backup strategy? Backing up the entire database. Reducing the volume of data for regular backups. Distributing the database.

B - Not all data in a database changes all the time; historical data, for example, does not. It is therefore good practice to reduce the volume of data that is backed up on a regular basis.
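
One way to reduce backup volume is to copy only the rows that changed since the last backup. The sketch below assumes each row carries a hypothetical `modified` timestamp. That column and the dates are illustrative assumptions, not part of the course material.

```python
from datetime import datetime

# Hypothetical rows with an assumed last-modified timestamp.
rows = [
    {"id": 1, "modified": datetime(2020, 1, 5)},  # historical, unchanged
    {"id": 2, "modified": datetime(2024, 3, 2)},
    {"id": 3, "modified": datetime(2024, 3, 9)},
]

def rows_to_back_up(table, last_backup):
    """Return only the rows changed since the previous backup."""
    return [row for row in table if row["modified"] > last_backup]

last_backup = datetime(2024, 3, 1)
changed = rows_to_back_up(rows, last_backup)
changed_ids = [row["id"] for row in changed]
```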

Which of the following examples represents a one-to-one type of relationship between entities? College classes have students. Each person has a social security number. A customer orders three videos.

B - One social security number is assigned to only one person; each person only has one social security number.

"Big data" has become a term that is all around us, and so is the term "the cloud." When we hear the term "big data," we also hear business intelligence, analytics, trend analysis, discoveries, etc. Please describe what is meant by big data and what big data offers to the business community.

Big data can be thought of as a very large repository or distributed data set that is too complex or too big for standard computers and applications to analyze or process. The business community can gain insights from analyzing big data. Those insights can lead to the exploitation of new trends, new innovations or discoveries, or new predictions that can be leveraged in business strategies.

There are many data management challenges created by big data. Which of the following items is NOT one of those challenges? Size Speed Variety Cost

Big data technologies and cloud-based analytics can provide substantial cost advantages.

Data mining, predictive analytics, and online analytic processing are all part of a category called ______________. Business intelligence is a strategic element and helps companies gain a ______________.

Business intelligence is a broad subject that encompasses many areas of data analysis. Large businesses that gather, store, and analyze their data using analytics will gain business intelligence, which will provide a competitive advantage.

_______________ can help with analyzing vast amounts of data from data warehouses and data marts to help run the business more efficiently.

Business intelligence tools

Consumers use data warehouses every day, many times without even knowing it. Facebook is an example of a data warehouse that most consumers use on a daily basis. Which of the following best defines a data warehouse? A data warehouse is where big data is stored. A data warehouse is a data set of competitive pricing. A data warehouse is a collection of business data from multiple systems used for data analysis and mining. A data warehouse is used to analyze sales data.

C - A data warehouse is a comprehensive business data set drawn from multiple systems that is analyzed and mined for business intelligence. Facebook gathers personal data about your friends, your hobbies, and what you like, and stores that data in one central repository.

Big data has many characteristics, each with their own challenges. What is meant by "velocity" when discussing big data characteristics? Velocity is the high speed of the CPU bus. Velocity is the change measurement of big data. Velocity is the speed required to process or analyze big data.

C - Big data analytics must be performed at high speeds with powerful computers. This is referred to as high velocity requirement.

Sarah is doing a presentation on big data at a company lunch and learn to help employees understand the ways IT can help the business. Which of the following is the best definition of big data? Big data is data in the form of media objects like video. Big data is another term for Big Blue, or IBM. Big data is used to describe large, complex data sets. Big data is all the data a business enterprise collects and stores.

C - Big data is a very large data set (structured or unstructured) that can be mined or analyzed to find new trends or relationships for business intelligence.

What is the process to update or backup a data warehouse? Identify, restore, and backup Identify, transfer, and load Extract, transform, and load

C - ETL refers to the process of data identification, consolidation and scrubbing, formatting, and transferring to the target data warehouse.

Transform

Clean and reformat the data according to requirements of the target database.

__________ are high-level models that map the scope of the enterprise data architecture model and help support system and documentation requirements.

Conceptual data models

What is the name of the notation used to diagram the relationship between entities? Crow's foot Cardinality Schema

Crow's foot notation is used to indicate one, one and only one, none, or many. For example, one and only one customer can buy none to many products. One sales rep can help none or many customers.

Summarizing data into useful information can help increase revenue or cut costs.

Data Mining

________ provides managers and employees with information from multiple systems that will help them make better decisions.

Data analysis

Data warehouses can be described as repositories of data from several sources and are mostly used for _____________ and _____________ to help management decision making.

Data warehouses are key for management reporting. Data warehouses are key for management analysis.

Because data warehouses consist of data from __________ systems with different formats, the ETL process must be carefully designed to ensure data _______ during the load process.

Data warehouses contain data sets from several databases or application systems. Data integrity means data is accurate and consistent.

Which database model connects or relates different data tables using common fields called primary keys? Hierarchical Relational

Databases are tables of information, and the relational database model links these tables using common primary keys.

There are several levels of analysis and methods. One data mining methodology is ____________. Another more complex data mining methodology is ______________. A third type of data mining is _____________.

Decision trees are used in data mining to generate rules and to classify data sets. Artificial neural networks are nonlinear algorithm models used in data mining. This can be described as machine learning or advanced algorithms. Rule induction is another type of data mining technique based on statistical importance.
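
To show how a decision tree turns into classification rules, here is a hand-built tree for a made-up credit-risk example. A real data mining tool would learn the splits from data. The feature names, thresholds, and labels here are assumptions chosen only for illustration.

```python
# Hand-built tree for illustration; the features and thresholds are assumptions.
tree = {
    "feature": "late_payments",
    "threshold": 2,
    "below": {"label": "low risk"},
    "at_or_above": {
        "feature": "income",
        "threshold": 40000,
        "below": {"label": "high risk"},
        "at_or_above": {"label": "medium risk"},
    },
}

def classify(node, record):
    """Walk from the root to a leaf, applying one rule per level."""
    while "label" not in node:
        if record[node["feature"]] < node["threshold"]:
            node = node["below"]
        else:
            node = node["at_or_above"]
    return node["label"]

labels = [
    classify(tree, {"late_payments": 0, "income": 30000}),
    classify(tree, {"late_payments": 3, "income": 30000}),
    classify(tree, {"late_payments": 3, "income": 55000}),
]
```

Each path from the root to a leaf reads as one rule, for example "two or more late payments and income below 40,000 means high risk".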

A database administrator will encounter problems feeding the data warehouse database during the ______________ scheduled activities. Therefore, it is important to apply effective ______________ strategies.

ETL - A key challenge in maintaining a data warehouse database is loading homogeneous data from various business systems during the ETL process. Partition - To improve database manageability and performance, a database partition strategy must be considered.

A person, item, event, etc., for which an organization collects data.

Entity - An example of an entity would be a customer or a student.

Data Warehousing Using ETL Process: __________, ____________, ___________.

Extract, Transform, and Load

True or False. For referential integrity to be enforced in a relational database, a primary key does not need a corresponding entry of equal value in the referenced table.

False - A row value or primary key in a table only exists if an item with an equal value (foreign key) exists in a referenced table.

An attribute of an entity, such as name, address, or student ID.

Field

___________ contain data describing attributes of an entity.

Fields

Data warehouses are implemented and widely used in large businesses, usually with more than 1,000 employees. Their purpose is to store all transactions and analytical operations data for data mining later. Describe the process and key tasks required to operate and maintain a data warehouse.

First, one must identify the data source that has the critical data to be warehoused. Second, one must create a consolidation algorithm that correlates the data with the data warehouse format. Third, one must create a scrub algorithm to ensure data quality and integrity. Finally, a transfer procedure application is run to update the data warehouse database.
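
The four steps can be sketched as a small pipeline. Everything below is a simplified illustration under stated assumptions: the two source systems, their field names, and the list standing in for the warehouse are all hypothetical.

```python
# Hypothetical source extracts; field names and formats are assumptions.
crm_rows = [{"cust": " Alice ", "total": "100.50"}, {"cust": "Bob", "total": "n/a"}]
web_rows = [{"customer_name": "Carol", "order_total": 20}]

def extract():
    """Identify and pull rows from each source system."""
    return [("crm", r) for r in crm_rows] + [("web", r) for r in web_rows]

def consolidate(source, row):
    """Map each source's layout onto the warehouse format."""
    if source == "crm":
        return {"customer": row["cust"], "amount": row["total"]}
    return {"customer": row["customer_name"], "amount": row["order_total"]}

def scrub(row):
    """Clean values; return None for rows that fail the quality check."""
    try:
        amount = float(row["amount"])
    except (TypeError, ValueError):
        return None
    return {"customer": row["customer"].strip(), "amount": amount}

def load(rows, warehouse):
    """Transfer the cleaned rows into the target store."""
    warehouse.extend(rows)
    return warehouse

warehouse = []
cleaned = [scrub(consolidate(src, row)) for src, row in extract()]
load([row for row in cleaned if row is not None], warehouse)
```

Here the row for Bob is dropped by the scrub step because its amount cannot be parsed, so only clean, consistently formatted rows reach the warehouse.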

A common field that exists in multiple tables that is used to form a relationship. For example, a customer might be assigned a specific sales representative. The customer's record might contain a sales rep ID that would link to the Sales Rep table.

Foreign Key

Extract

Get the data from various sources as needed.

In contrast with a ________________, a _______________ uses a flexible model in which data are distributed among several machines, often in a cloud-computing format.

In a machine-based relational database, the data are stored on one central machine and not multiple machines. A web- or cloud-based database uses a flexible model in which data are distributed among several machines, often in a cloud-computing format.

Data mining is sometimes referred to as ___________. knowledge discovery deep dive big data

Knowledge discovery is data mining or extracting useful information from data.

Load

Load the reformatted data into the targeted data warehouse.

A database that is optimized for data warehouse application.

Multidimensional Database

Isolating data so changes in a field only have to happen in one place, but carry over to the rest of the database

Normalization
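
A minimal sketch of the idea, assuming a hypothetical flat order list in which the customer's city is repeated on every order (all names and values are invented): after splitting the repeated facts into their own table, a change is made in one place and every order sees it.

```python
# Hypothetical denormalized rows: the customer's city repeats on each order.
flat_orders = [
    {"order_id": 1, "customer_id": 7, "city": "Austin", "item": "Desk"},
    {"order_id": 2, "customer_id": 7, "city": "Austin", "item": "Lamp"},
    {"order_id": 3, "customer_id": 9, "city": "Denver", "item": "Chair"},
]

def normalize(rows):
    """Split repeated customer facts into their own table."""
    customers = {}
    orders = []
    for row in rows:
        customers[row["customer_id"]] = {"city": row["city"]}
        orders.append(
            {
                "order_id": row["order_id"],
                "customer_id": row["customer_id"],
                "item": row["item"],
            }
        )
    return customers, orders

customers, orders = normalize(flat_orders)

# A change of city is now made in exactly one place.
customers[7]["city"] = "Dallas"
cities_for_orders = [customers[o["customer_id"]]["city"] for o in orders]
```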

________________ is a process or practice to eliminate data redundancy, and ________________ is the process of establishing relationships between tables using keys.

Normalization eliminates data redundancy. Referential integrity establishes relationships between objects in different tables.

__________ allows users to extract data from different points of view.

OLAP

What kind of database management system enables businesses to create new records and update and delete records, providing real time information for decision making? Online transaction processing Hierarchical Flat file Object oriented

OLTP stands for "online transaction processing" and is a database design used for browsing and manipulating business transaction activity to enable real time business analysis of records to help in business decision making.

Which of the following is not a characteristic of big data? Velocity of processing Large volume Object data

Object Data

Which of the following is not a database model? Hierarchical Relational Flat file Object defined

"Object defined" is not the correct name of any database model. Simply put, an object database is a database that is used to store objects and is the basis for an object-oriented DBMS.

Relational database systems are valuable to organizations because ________. Their design and implementation requirements are minimal. They are inexpensive systems, which makes them more affordable for businesses. They use SQL, which is easy to master, making it a very productive tool.

Organizations are easily trained in SQL, and therefore, users become very productive in a short period of time.

What is a required process in implementing and maintaining a data warehouse? Partitioning Formatting Defragging

Partitioning - A typical relational data warehouse is made up of indexed tables that can be implemented using a partitioned approach. Relational data warehouses benefit from partitioning the data as part of implementation and maintenance.

__________ is used to analyze credit risks to determine the likelihood that customers will pay their bills.

Predictive analysis

A field that uniquely identifies a record, such as customer ID or student ID.

Primary Key

A set of related fields, also called a tuple, that describe one occurrence of an entity, such as a customer.

Record

As a database administrator, you must avoid inconsistencies, which cause errors and integrity issues. Database inconsistencies are avoided by implementing what concept? Flowcharting Referential integrity Planning

Referential integrity stops database inconsistency by creating relationships between tables using primary and foreign keys.
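
As a sketch of how a DBMS enforces this, the SQLite example below uses hypothetical `sales_reps` and `customers` tables. A customer row that points to a nonexistent sales rep is rejected. Note that SQLite only checks foreign keys once `PRAGMA foreign_keys = ON` has been issued on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE sales_reps (rep_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE customers ("
    " customer_id INTEGER PRIMARY KEY,"
    " name TEXT,"
    " rep_id INTEGER REFERENCES sales_reps(rep_id))"
)
conn.execute("INSERT INTO sales_reps VALUES (1, 'Rep One')")
conn.execute("INSERT INTO customers VALUES (10, 'Customer A', 1)")  # valid reference

try:
    # There is no sales rep 99, so this row would be an orphan.
    conn.execute("INSERT INTO customers VALUES (11, 'Customer B', 99)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

customer_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
```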

This type of database stores related information.

Relational Database Model

____________ allow for complex queries to be carried out easily.

Relational databases

You are helping to design a spreadsheet table for accounting. You explain to the team that a spreadsheet table is similar to a database; the difference is that database tables have _____________________.

Relational databases have strong relationships between tables, allowing for queries that lead to information.

If you were setting up a new database, which of the following items would represent "objects" in your database? Mirror Reports Schema

Reports are objects in a database.

_________ are records; ____________ are fields.

Rows, columns

The complete definition of a database, including the tables, records, fields, and relationships.

Schema

Static data is data that does not change after being recorded. It is a fixed data set. In contrast, dynamic data may change after it is recorded, and has to be continually updated.

Static Data

Which item below is NOT a database model? Structured Relational Object

Structured is not a database model. Relational and object databases are two types of database models.

A collection of related records. For example, a set of customer records would all be contained in one table.

Table

______________ consist of rows and columns.

Tables

The decision to implement a relational database system depends mostly on two factors. The first factor to consider is the _____________. The second factor to consider when implementing a relational database is _____________________.

The amount of information in the form of data is one key determinant in choosing to implement a relational database. The second determinant is the number of users of the information in an organization.

What term is used to describe the properties of data components? Relationships Entities Attributes

The attributes are the properties of the data components.

We hear about "Business Intelligence," "Big Data," "Analytics," and "Data Mining" often. The business community has been analyzing data—operational data, analytical data, financial data, market data—for many years. Describe the process of data mining.

The business world has long analyzed operations and markets to increase profitability, that is, to increase revenues or reduce costs. The foundation of any analysis is the collection of large amounts of data or facts. When those data are analyzed or mined, patterns and relationships emerge, and analyzing them yields information. This information can lead to the discovery of trends or insights that businesses can leverage.

Big data is in the news a lot, specifically when it involves issues of personal privacy. Big data is a collection of an enormous amount of unrelated, raw data that is complex and difficult to work with using traditional database management tools. As a result, big data is usually associated with predictive analytics. Describe the objective of predictive analytics.

The main objective of predictive analytics is to offer statistical or probability trends that can influence business practices or business strategic decision making. Large businesses must create predictive models to ensure a competitive advantage. This requires the collection of large amounts of data along with data mining techniques, such as predictive analytics.

One goal of a database administrator is to have a database that has the same entities and values in one or more tables. The database should go through what process to optimize? Duplication Equalization Normalization

The process of normalization eliminates first-pass design redundancies that could create query and reporting problems in a database.

Relational database models were first developed in the 1970s. It has become the most widely used database model in business with leading products from Oracle, Sybase, and others, including Microsoft. Can you explain why this type of database model has become so widely used and how you might use it in your business?

Relational database models are widely used in business applications because of their ability to support transactions. They store data in tabular form; they use SQL, which offers ease of use; they have data relationships between tables; and there is no data redundancy. They dominate the market because of data reuse, normalization, separation of concerns, maturity, and momentum.

There are many types of data warehouses with many different applications. Which of the following items is not a data warehouse application? Data mining Transaction processing Information processing Analytical processing

Transaction Processing - The data in a data warehouse system comes from business transactions and other sources. Transaction processing involves running fundamental tasks used to run the business in an operational database.

Data mining is used by businesses to analyze internal factors in order to improve business performance. True False

True - Data mining is primarily an analysis of internal business data.

True or False. Relational database systems give organizations the ability to analyze data to make better and faster managerial decisions.

True - One key benefit to databases is that they provide businesses an ability to perform data analytics.

True or False. Predictive analytics is used by businesses to predict trends or behaviors from customers. True False

True - Predictive analytics is a process in data mining that attempts to deduce what might happen in the future.
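
One of the simplest predictive techniques is fitting a trend line to past values and extending it forward. The sketch below uses ordinary least squares on made-up monthly sales figures. It illustrates the idea only and is not meant to suggest that real predictive models are this simple.

```python
# Hypothetical monthly sales figures; the data are made up for illustration.
months = [1, 2, 3, 4, 5]
sales = [100.0, 110.0, 120.0, 130.0, 140.0]

def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    return slope, mean_y - slope * mean_x

slope, intercept = fit_line(months, sales)
forecast_month_6 = slope * 6 + intercept  # extend the trend one month ahead
```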

Big data can be structured, semistructured, or ____________.

Unstructured - Big data algorithms can handle any type of data set.

In computer science and computer programming, a data type or simply type is a classification of data which tells the compiler or interpreter how the programmer intends to use the data. Most programming languages support various types of data, for example: real, integer or Boolean. A Data type provides a set of values from which an expression (i.e. variable, function...) may take its values. The type defines the operations that can be done on the data, the meaning of the data, and the way values of that type can be stored.

Variable (referencing a data type)

Table names and ___________ must be unique; no two tables can have the same name in a database.

attribute names

Benefits of normalization include ___________ because data redundancy is eliminated.

data independence

A _____________ is a set or collection of data that is centrally located in electronic format, organized in tables, and easily searchable.

database

An _________ is an event, person, or thing for which a business collects data.

entity

The ______________ diagram is an essential starting point in database design.

entity-relationship

Big data has three characteristics:

large volume, variety, and the velocity at which it can be processed through analytics.

In contrast with a ________________, a non-relational database uses a flexible model in which data is distributed among several machines, often in a cloud-computing format.

machine-based relational database

Besides ________, businesses use their databases for many things, including something as simple as keeping track of basic transactions.

modeling

A __________________ is a type of database in which multiple information files are combined into one database in one location, where data are stored in two-dimensional tables known as relations, and contain multiple entities and attributes.

relational database

Database objects can be compressed and stored on different __________ to lower the cost of storing large amounts of data.

storage tiers

A primary key must be a _____________.

unique identifier. Social security numbers are not used as often for primary keys due to security risks. No two records can have the same primary key.

The typical relational database configuration includes what 3 servers?

web server, application server, and a database server.

