Is475 Final
Give an example of an Internet threat.
A Trojan horse program hides within the database system, reveals information, and removes all access controls within the database.
During the translation of business rules to data model components, you identify entities, attributes, relationships, and _____. A. Constraints B. Segments C. Tables D. Schemas E. None of the above
A. Constraints
Increased customer knowledge can be used to
A. Design a website that attracts more users B. Target specific markets C. Display features customized to individuals D. Increase customer loyalty
A major concern of operations information systems
A. Flexibility in report generation
Which of the following is a characteristic of an object-oriented system? a. Support complex objects b. Objects must be encapsulated c. Must support classes d. Must support inheritance e. All of the above
All of the above
Which of these is not an aggregate function? Sum Count Avg Min
All of these are aggregate functions
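A minimal sketch of the four aggregate functions, run through Python's built-in sqlite3 module. The sample rows are made up for illustration; the Order_Detail table and its columns are borrowed from the SQL cards elsewhere in this set.

```python
import sqlite3

# Hypothetical sample data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Order_Detail (O_No TEXT, Product_ID TEXT, Qty INTEGER)")
conn.executemany(
    "INSERT INTO Order_Detail VALUES (?, ?, ?)",
    [("O1", "P1", 2), ("O1", "P2", 4), ("O2", "P1", 6)],
)

# Each aggregate function collapses many rows into a single value.
total, count, average, smallest = conn.execute(
    "SELECT SUM(Qty), COUNT(*), AVG(Qty), MIN(Qty) FROM Order_Detail"
).fetchone()
print(total, count, average, smallest)
```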
Which of the following is NOT an advantage of NoSQL? a. low cost b. standardization c. distributed computing d. no complicated relationships e. all of the above are advantages of NoSQL
b. standardization
What is a "tuple" in a formal relation? a. Table b. Row or record c. Column or field d. Unique identifier e. Pool of legal values
B. row or record
What is the second layer of the e-commerce architecture?
Business Enabling Services
Explain the difference between data modeling and business process modeling.
Business process modeling describes a business process, that is, a set of activities that accomplishes a specific business task (for example, order handling, performing an audit, or identifying value-adding and value-reducing processes in the business). Whereas business process modeling is primarily concerned with how specific business tasks are performed to address specific business problems, data modeling is primarily concerned with how data should be collected and stored and with the associations among the data.
A Good Business Rule is which of the following?
Business-oriented, Declarative, Precise, Distinct
Which is NOT a common pitfall of data mining? a) Not understanding business need/problem b) Careless handling of data c) Data discovery d) Lack of data mining model e) Lack of participation by domain experts
c) Data discovery
Data can become dirty for what reason? A-Standard use of abbreviations in character data B-Proper implementation of constraints C-Use of synonyms and homonyms across the system D-All of the above E-None of the above
C. Use of synonyms and homonyms across the system
Based on the subscription model, you would: a. receive payments from advertisers b. earn payments on transaction volume c. make fixed payments on specific time frames d. earn payments on transaction volume e. make payments based on advertisements
C. make fixed payments on specific time frames
What data type should be used for the following example? The serial number of a LEGO set.
Char
Choose the contents of object oriented programming language in order of importance
Class, Object, Method
Which data mining tool is used mostly for "prediction"? Association Clustering Classification Data Mining None of the above
Classification
In data mining, what is classification analysis?
Classifying all objects into predefined classes.
An attribute which can be broken into several component parts is called a:
Composite Attribute
Which option lists the steps of database design (3)?
Conceptual, Logical, and Physical
What are the 3 techniques for data integration?
Consolidation, Federation, Propagation
How do you handle a multi-valued attribute?
Create a new table for the multi-valued attribute, and include the primary key of the original entity's table in it as a foreign key
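A minimal sketch of this rule, assuming a hypothetical Employee entity whose Phone attribute can hold several values. The table names, column names, and data are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.execute("CREATE TABLE Employee (Emp_ID TEXT PRIMARY KEY, Name TEXT)")

# The multi-valued attribute moves to its own table, which carries the
# original table's primary key as a foreign key. One row per value.
conn.execute("""
    CREATE TABLE Employee_Phone (
        Emp_ID TEXT REFERENCES Employee(Emp_ID),
        Phone  TEXT,
        PRIMARY KEY (Emp_ID, Phone)
    )
""")
conn.execute("INSERT INTO Employee VALUES ('E1', 'Ada')")
conn.executemany(
    "INSERT INTO Employee_Phone VALUES (?, ?)",
    [("E1", "555-0100"), ("E1", "555-0101")],
)

phones = [
    row[0]
    for row in conn.execute(
        "SELECT Phone FROM Employee_Phone WHERE Emp_ID = 'E1' ORDER BY Phone"
    )
]
print(phones)
```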
_______ is the ability to change the product to better fit the needs of the customer.
Customization
A common problem of file system data processing (i.e. non integrated database processing systems) is:
D. Data quality issues such as data redundancy, inconsistency, and lack of data sharing/integration
What is not a basic procedure for an ER Model? -Identifying relationships among the entities -Identifying attributes for the entities and relationships -Identifying the key identifier for each entity -Identifying major setbacks in the data system
D. Identifying major setbacks in the data system
Which of the following roles would be the most likely to focus on daily operations of the database?
DBA (Database Administrator)
One way of NOT improving data capture processes would be to: Automate data entry as much as possible Select from preset options, such as a drop down menu Allow for immediate data validation (i.e. enter password again) upon manually entering data Disregard simple user interface principles in lieu of a complex/holistic interface which has many options
Disregard simple user interface principles in lieu of a complex/holistic interface which has many options
What serves as the communication tool between the designers and the users?
ERD models
_______ is the process of inputting data in "plain text" to yield an "encoded" output of the data, making the data unintelligible to unauthorized users
Encryption
What does ERD stand for?
Entity-relationship diagram
(T/F) Browsing options on the Zappos website is an example of E-Commerce
F
Data and information are the same thing. (T/F)
F
Data mining process consists of the following four steps (order): (1) selection, (2) cleaning, (3) data mining, and (4) interpretation and evaluation. (T/F)
F
True / False In a data warehouse only correct complete information will be stored.
F
True / False Metadata describes data that is processed
F
True/False. A poorly designed database does NOT cause difficult-to-trace errors
F
True/False. NoSQL databases are standardized
F
The advantages of e-commerce are lack of hidden costs, invulnerability to technical failure, reduced costs of staying in business, increased security and privacy, higher service levels, and reduced legal issues.
False. E-commerce has disadvantages such as: hidden costs, vulnerability to technical failure, increased business costs, decreased security and privacy, lower service levels, and increased exposure to legal issues.
The (________) clause only appears in conjunction with the GROUP BY clause, for retrieving groups that satisfy certain conditions.
HAVING. Example: SELECT Order_Detail.O_No, SUM(Qty) FROM Order_Detail GROUP BY Order_Detail.O_No HAVING SUM(Qty) > 2;
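The query on this card can be tried with Python's built-in sqlite3 module. This is a sketch with made-up rows; only the table name, column names, and the query itself come from the card.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Order_Detail (O_No TEXT, Product_ID TEXT, Qty INTEGER)")
conn.executemany(
    "INSERT INTO Order_Detail VALUES (?, ?, ?)",
    [("O1", "P1", 1), ("O1", "P2", 1), ("O2", "P1", 2), ("O2", "P3", 3)],
)

# GROUP BY builds one group per order; HAVING then filters whole groups,
# keeping only the orders whose total quantity exceeds 2.
big_orders = conn.execute(
    """
    SELECT Order_Detail.O_No, SUM(Qty)
    FROM Order_Detail
    GROUP BY Order_Detail.O_No
    HAVING SUM(Qty) > 2
    ORDER BY Order_Detail.O_No
    """
).fetchall()
print(big_orders)
```

Order O1 totals 2 and is filtered out; order O2 totals 5 and is kept. A WHERE clause could not express this condition, because WHERE is applied to individual rows before grouping.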
_________ refers to data that have been presented or processed in a way that they become meaningful or can increase the knowledge of users.
Information
What are some disadvantages of distributed database?
Network dependency; communication via network relies on the reliability of network connectivity; software cost and complexity; cost for both client application and database server; database application setup complexity; slower response for certain queries; processing overhead; additional transaction control mechanism; network security concerns
Which of these is not a tier in three-tier architecture? Database Management Application Logic ODBC Presentation Layer
ODBC
When having many to many relationships a new entity will be created using what key(s) from the two connecting entities?
Parent keys (the primary keys of the two connecting entities)
explanatory analytics
Part A.) Explanatory analytics focuses on discovering and explaining data characteristics and relationships based on existing data. Predictive analytics focuses on predicting future outcomes with a high degree of accuracy. Part B.) Explanatory analytics would be used to identify the actual demographics of a product's buyers. For example, when you begin to sell a product you have a demographic in mind, but the product sometimes turns out to be purchased by a different demographic than intended.
Who developed the Entity Relationship Model?
Peter Chen
predictive analytics
Predictive analytics would be used when we track a store's previous summer orders for sunscreen in order to predict how many to order next year. This is a good idea because sunscreen has an expiration date: if we order too much, some of it is likely to expire before it is sold.
The three major functions for an Internet environment are:
Presentation, Application Logic, and data management
List order numbers of the orders which contain products supplied by vendor V003.
SELECT Order_Detail.O_No FROM Order_Detail, Product WHERE Product.VID='V003' AND Order_Detail.Product_ID=Product.Product_Id
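A sketch of the query above run against made-up data with Python's built-in sqlite3 module. The rows and the Product_Name column are assumptions for illustration, and an ORDER BY is added only to make the output order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Product (Product_ID TEXT, Product_Name TEXT, VID TEXT)")
conn.execute("CREATE TABLE Order_Detail (O_No TEXT, Product_ID TEXT, Qty INTEGER)")
conn.executemany(
    "INSERT INTO Product VALUES (?, ?, ?)",
    [("P1", "Widget", "V003"), ("P2", "Gadget", "V001")],
)
conn.executemany(
    "INSERT INTO Order_Detail VALUES (?, ?, ?)",
    [("O1", "P1", 1), ("O2", "P2", 1), ("O3", "P1", 2)],
)

# The join condition pairs each order line with its product; the VID
# condition then keeps only products supplied by vendor V003.
v003_orders = [
    row[0]
    for row in conn.execute(
        """
        SELECT Order_Detail.O_No
        FROM Order_Detail, Product
        WHERE Product.VID = 'V003'
          AND Order_Detail.Product_ID = Product.Product_ID
        ORDER BY Order_Detail.O_No
        """
    )
]
print(v003_orders)
```

If one order held several V003 products, its number would appear once per matching line; SELECT DISTINCT would remove the repeats.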
List Names and MSRP of all inactive products, ordered by MSRP
SELECT Product_Name, MSRP FROM Product WHERE status = 'inactive' ORDER BY MSRP;
What is the main format of an SQL query?
SELECT, FROM, WHERE
What are the steps of Data Mining Process?
Selection, Cleaning, Transformation, Data Mining, Interpretation and Evaluation
What are the key characteristics of a database?
Self-describing collection of data: A database contains a description of the overall database design and essential data structure and interrelationships. Related data: A logically coherent collection of data with some inherent meaning. Integrated data: A unification of data files, reducing or eliminating unplanned data redundancy. Shared data: A database provides a central information repository which can be accessed by different application programs and users.
What does SQL stand for?
Structured Query Language
Data can be ______ or _______.
Structured, Unstructured
(T/F) Data Manipulation Language (DML) consists of the commands that maintain and query a database.
T
(T/F) In transforming an entity-relationship diagram to a logical schema, a many-to-many relationship set and its involving entity sets will be transformed into three tables.
T
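A minimal sketch of the three-table result, assuming a hypothetical many-to-many relationship between Student and Course. All names and rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- One table per entity set.
    CREATE TABLE Student (Student_ID TEXT PRIMARY KEY, Name TEXT);
    CREATE TABLE Course  (Course_ID  TEXT PRIMARY KEY, Title TEXT);

    -- A third table for the relationship set, keyed by both primary keys.
    CREATE TABLE Enrollment (
        Student_ID TEXT REFERENCES Student(Student_ID),
        Course_ID  TEXT REFERENCES Course(Course_ID),
        PRIMARY KEY (Student_ID, Course_ID)
    );

    INSERT INTO Student VALUES ('S1', 'Ada'), ('S2', 'Grace');
    INSERT INTO Course  VALUES ('C1', 'Databases'), ('C2', 'Networks');
    INSERT INTO Enrollment VALUES ('S1', 'C1'), ('S1', 'C2'), ('S2', 'C1');
""")

tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
students_in_c1 = [
    row[0]
    for row in conn.execute(
        """
        SELECT Student.Name
        FROM Student, Enrollment
        WHERE Student.Student_ID = Enrollment.Student_ID
          AND Enrollment.Course_ID = 'C1'
        ORDER BY Student.Name
        """
    )
]
print(tables, students_in_c1)
```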
(T/F) Metadata are data that describe the properties or characteristics of end-user data and the context of that data.
T
A class is a collection of similar objects with shared structure (attributes) and behavior (methods).
T
An Entity can have one or multiple values for each attribute (T/F)
T
An attribute is a property or characteristic of an entity set of interest to the organization (T/F)
T
An entity is a person, place, object, event, or concept in the user environment about which we wish to maintain data. (T/F)
T
Client and server processes can reside in different computers connected by a network(T/F)?
T
Database design process captures and represents key data requirements of a database system.
T
ER model is a natural and intuitive way to model the focal business application or phenomenon (T/F)?
T
Encapsulation is the ability to hide an object's internal details (T/F)
T
SQL has become the de facto standard language for creating and querying relational databases. (T/F)
T
T/F Subtypes should have their own unique attributes
T
T/F Advantages of a distributed database are increased reliability/availability, modular growth, and faster response for certain queries
T
T/F Data management includes the generation, storage, and retrieval of data
T
TRUE OR FALSE The having clause is used to restrict the output of a GROUP BY query by applying conditional criteria to the grouped rows.
T
What is the purpose of an operational database?
To support a company's day-to-day operations.
How do you select all of the information from a specific table in SQL?
Use SELECT * FROM followed by the name of the specific table
What is Varchar and when do you use it?
VARCHAR (variable character field) holds character data of variable length, up to a declared maximum. I would use it for values whose length varies, such as email addresses entered by users.
What are the 3 V's of data?
Volume, Velocity, and Variety
What are the three words that define big data
Volume, velocity, variety
What is an entity set?
a) A collection of entities that share common properties
_______ are the glue that holds together the various components of the ER data model
a) Relationship sets
Which of the following about an entity set is NOT true? a. an entity set must have a primary key b. an entity set must involve at least 1 relationship c. an entity can have only one attribute d. if an attribute is surrounded by 2 lines, it is considered multi-valued e. each entity has a unique name
an entity can have only one attribute
A Foreign Key is a Primary Key That Consists of Multiple Attributes?
f
An advantage of e commerce is no hidden costs.
f
Data control language is considered the core of SQL and has a fixed format?
f
Data integration does not create a unified view of business data
f
To ensure data quality, while entering data, it is best to limit automation as much as possible
f
When employing a password to protect your database, it is best to use something easy and simple to remember such as "password".
f
When using SQL, it is required to use quotation marks for integers.
f
In SQL, whenever there is potential unambiguity, an attribute must be qualified by prefixing the relation name and a dot (.) to the attribute name when performing a JOIN operation. Example: SELECT Product_Name, Sales_Price, Eff_Date FROM Product, Promotion WHERE Product.Product_ID = Promotion.Product_ID
f. In SQL, whenever there is potential ambiguity, an attribute must be qualified by prefixing the relation name and a dot (.) to the attribute name when performing a JOIN operation.
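A sketch of what "potential ambiguity" means in practice, using Python's built-in sqlite3 module. The rows are made up; the Product and Promotion tables and their columns follow the example in the card. Product_ID exists in both tables, so SQLite rejects it when unqualified; other database systems report the same problem with different wording.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Product (Product_ID TEXT, Product_Name TEXT)")
conn.execute("CREATE TABLE Promotion (Product_ID TEXT, Sales_Price REAL, Eff_Date TEXT)")
conn.execute("INSERT INTO Product VALUES ('P1', 'Widget')")
conn.execute("INSERT INTO Promotion VALUES ('P1', 9.5, '2024-01-01')")

join_condition = "WHERE Product.Product_ID = Promotion.Product_ID"

# Unqualified: Product_ID could come from either table, so the query fails.
try:
    conn.execute(f"SELECT Product_ID FROM Product, Promotion {join_condition}")
    ambiguous_error = None
except sqlite3.OperationalError as exc:
    ambiguous_error = str(exc)

# Qualified with the relation name and a dot: the query succeeds.
# Product_Name and Sales_Price each exist in only one table, so they
# need no prefix.
qualified_rows = conn.execute(
    f"SELECT Product.Product_ID, Product_Name, Sales_Price "
    f"FROM Product, Promotion {join_condition}"
).fetchall()
print(ambiguous_error)
print(qualified_rows)
```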
ER model is supported by ______ CASE tools
mostly
Data mining extracts:
previously unknown, interesting, valid, and actionable data patterns
DDL can be explained as "data definition language"
t
One of the purposes of encryption is to ensure that data cannot be understood if intercepted
t
A server can provide services for more than one client.
t
Data integrity is composed of rules and mechanisms for ensuring data accuracy and consistency.
t
The effectiveness of BI depends on quality of data gathered at operational level.
t
The first step of transforming an entity set is transforming each entity into a relation.
t
