Survey of Database Systems Final Exam

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

Business Intelligence and data warehousing is used for_______ a) Forecasting b) Data mining c) Analysis of large volumes of product sales data d) All

All

Data mining can also be applied to other forms such as_____ i. Data streams ii. Sequence data iii. Networked data iv. Text data v. Spatial data A) i, ii, iii and v only B) ii, iii, iv and v only C) i, iii, iv and v only D) All

All

The various aspects of data mining methodologies is/are ................... i) Mining various and new kinds of knowledge ii) Mining knowledge in multidimensional space iii) Pattern evaluation and pattern or constraint-guided mining. iv) Handling uncertainty, noise, or incompleteness of data A) i, ii and iv only B) ii, iii and iv only C) i, ii and iii only D) All i, ii, iii and iv

All

Which statement is true (related to general concepts of XML)?

All XML elements must be properly closed

DynamoDB

Amazon, key-value data store

_____________ supports basic OLAP operations, including slice and dice, drill-down, roll-up and pivoting. a) Information processing b) Analytical Processing c) Data mining d) Transaction processing

Analytical Processing

How to specify the attributes with multiple values?

Attributes cannot have multiple values

What is the correct syntax of the declaration which defines the XML version?: A. <xml version="A.0" /> B. <?xml version="A.0"?> C. <?xml version="A.0" /> D. None of the above

B. <?xml version="A.0"?>

18) When processing an output XML "new line" symbols A. are copied into output "as is", i.e. "CR+LF" for Windows, CR for Macintosh, LF for Unix. B. are converted to single LF symbol C. are converted to single CR symbol D. are discarded

B. are converted to single LF symbol

CAP Theorem

Consistency, Availability, Partition Tolerance, cannot guarantee all three

The data from the operational environment enter ____________ of data warehouse. a) Current detail data b) Older detail data c) Lightly summarized data d) Highly summarized data

Current detail data

_______________ is the process of finding a model that describes and distinguishes data classes or concepts. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection

Data Classification

________________ is a subject-oriented, integrated, time-variant, nonvolatile collection or data in support of management decisions. A) Data Mining B) Data Warehousing C) Document Mining D) Text Mining

Data Warehousing

____________ is a summarization of the general characteristics or features of a target class of data. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection

Data characterization

The core multidimensional model is the _________ , which consists of a large set of facts and a number of dimensions. a) Multidimension cube b) Older detail data c) Lightly summarized data d) Highly summarized data

Data cube

__________ is a comparison of the general features of the target class data objects against the general features of objects from on or multiple contrasting classes. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection

Data discrimination

________ is an essential process where intelligent methods are applied to extract data patterns a) Data warehousing b) Data mining c) Text mining d) Data selection

Data mining

___________ are databases that store and maintain analytical data separately from transaction-oriented databases for the purpose of decision support

Data warehouses

MongoDB and CouchDB

Document stores

__________ are responsible for running queries and reports against data warehouse tables. a) Hardware b) Software c) End users d) Middle ware

End users

Cassandra

Facebook, uses both key-value and column-based systems

___________ is a good alternative to the star schema a) Star schema b) Snowflake schema c) Fact constellation d) Star-snowflake schema

Fact constellation

NoSQL prohibits SQL. True or False?

False

NoSQL was designed with security in mind, so developers or security teams don't need to worry about implementing a security layer. True or False?

False

BigTable

Googles NoSQL system, column-based

____________ was designed to display data and focus on how data looks - XML - HTML - XHTML - DTD

HTML

KDD is A) Knowledge Database B) Knowledge Discovery Database C) Knowledge Data House D) Knowledge Data Definition

Knowledge Discovery Database

__________ are designed to overcome any limitation placed on the warehouse by the nature of the relational data model. a) Operational database b) Relational database c) Multidimensional database d) Data respository

Multidimensional database

An________ system is market-oriented and is used for data analysis by knowledge workers, including managers, executives, and analysts a) OLAP b)OLTP c) both above d) none above

OLAP

The data is stored, retrieved and updated in_________ A)OLAP B) OLTP C) SMTP D) FTP

OLTP

The full form of OLAP is a) Online Analytical Processing b) Online Advanced Processing c) Online Advanced Preparation d) Online Analytical Performance

Online Analytical Processing

________ is a term used about text data that will be parsed by the XML parser

PCDATA

The full form of PICO is

Prediction Identification Classification Optimization

Data warehouse architecture is based on ____________ . a) DBMS b) RDBMS c) Sybase d) SQL server

RDBMS

Which types of problems can be solved by XQuery?

Recreational problems and puzzles

In terms of performance, which of the following parsers are more suitable? - DOM - SAX

SAX

Which of the following is not a data mining functionality? A) Characterization and Discrimination B) Classification and regression C) Selection and interpretation D) Clustering and Analysis

Selection and Interpretation

Which of the following is not a kind of data warehouse application? a) Information processing b) Analytical processing c) Data mining d) Transaction processing

Transaction processing

SQL command types include data manipulation language (DML) and data definition language (DDL). True or False?

True

XML preserves white spaces

True

XML tags are case sensitive

True

The out put of KDD is______ a) Data b) Information c) Query d) Useful information

Useful information

What can be used to transform XML into HTML?

XSLT

Can we use CSS in XML?

Yes

Which of the following is a wide-column store? a) Cassandra b) Riak c) MongoDB d) Redis

a) Cassandra

_________ analysis divides data into groups that are meaningful, useful or both. a) Cluster b) Association c) Classification d) Relation

a) Cluster

Point out the correct statement. a) Hadoop is ideal for the analytical, post-operational, data-warehouse-ish type of workload b) HDFS runs on a small cluster of commodity-class nodes c) NEWSQL is frequently the collection point for big data d) None

a) Hadoop is ideal for the analytical, post-operational, data-warehouse-ish type of workload

Which of the following are the simplest NoSQL databases? a) Key-value b) Wide-column c) Document d) All

a) Key-value

Point out the correct statement. a) MongoDB is classified as a NoSQL database b) MongoDB favors XML format more than JSON c) MongoDB is column-oriented database store d) All

a) MongoDB is classified as a NoSQL database

_____________ systems are scale-out file-based (HDD) systems moving to more uses of memory in the nodes. a) NoSQL b)NewSQL c) SQL d) All

a) NoSQL

A transactional database consists of a file where each __________ represents a transaction. a) Record b) Record graph c) Ordered record d) Attribute

a) Record

___________________ refers to the process of deriving high-quality information from text. a) Text Mining b) Image Mining c) Database Mining d) Multimedia Mining

a) Text Mining

Knowledge discovery in database refers to _____________. a) Whole process of extraction of knowledge from data b) Selection of data c) Coding d) Cleaning the data

a) Whole process of extraction of knowledge from data

Which of the following strings is a correct XML name? a) __myElement b) my Element c) #myElement d) None of the above

a) __myElement

Kind of Parsers are a) non validating and validating b) well-formed c) None of the above

a) non validating and validating

The need of using a DTD in developing XML document is a) required when validating XML documents b)no longer necessary after the XML editor has been customized c) used to direct conversion using an XSLT processor d) a good guide to populating a templates to be filled in when generating an XML document automatically

a) required when validating XML documents

The DTD includes the specifications about the markup that can be used within the document, the specifications consists of all except a) the browser name b) the size of element name c) entity declarations d) element declarations

a) the browser name

NoSQL databases is used mainly for handling large volumes of _________ data. a) unstructured b) structured c) semi-structured d) All

a) unstructured

Which of the following is not a NoSQl database? a)SQL Server b) MongoDB c) Cassandra d) None of the above

a)SQL Server

_____________ is an essential process where intelligent methods are applied to extract data patterns. a) Data Warehousing b) Data mining c) Data Base d) Data structure

b) Data mining

NoSQL databases are most often referred to as: a) Relational b) Distributed c) Object-oriented d) Network

b) Distributed

What does DTD stand for? a) Direct Type Definition b) Document Type Definition c)Dynamic Tool Description d) Dynamic Type Definition

b) Document Type Definition. XML uses DTD to describe the data/XML uses a description node to describe data

Which of the following is a NoSQL database type? a) SQL b) Document databases c) JSON d) All

b) Document databases

Which of the following represent column in NoSQL? a)Database b)Field c) Document d) Collection

b) Field

Cluster is : a) Group on a training data set to transform or simplify data in order to prepare it b) Group of similar objects that differ significantly from other objects c) Symbolic representation of facts or ideas from which information can potentially be extracted d) A and B

b) Group of similar objects that differ significantly from other objects

Which of the following is not a reason NoSQL has become a popular solution for some organizations? a) Better scalability b) Improved ability to keep data consistent c) Faster access to data than relational database management systems d) More easily allows for data to be held across multiple servers

b) Improved ability to keep data consistent

Which of the following is a reason to use an SQL database? a) It can easily store unstructured data b) Its ACID-compliant c) It can enable development in the cloud d) All

b) Its ACID-compliant

"Sharding" a database across many server instances can be achieved with __________ a) LAN b) SAN c) MAN d) All

b) SAN

What is the main technique employed for data selection? a) Noise b) Sampling c) Clustering d) Histogram

b) Sampling

SQL databases are: a) Horizontally scalable b) Vertically scalable c) Either horizontally or vertically d) They don't scale

b) Vertically scalable

Data mining is a) an extraction of explicit, known and potentially useful knowledge from information b) a non-trivial extraction of implicit, previously unknown and potentially useful information from data c) an essential process where intelligent methods are applied to extract data patterns that is also referred to database d) is an essential process where intelligent methods are applied to extract data that is also referred to data sets

b) a non-trivial extraction of implicit, previously unknown and potentially useful information from data

Which of the following strings is a correct XML name? a) xmlExtension b) xslNewElement c) XMLElement#123 d) XMLName

b) xslNewElement

______ is an online NoSQL developed by Cloudera. a) HCatalog b)Hbase c)Imphala d) Oozie

b)Hbase

Comments in XML document are given by a) < ?-- --> b) < !-- --!> c) < !-- --> d) < /-- -- >

c) < !-- -->

Which of the following is supported by MongoDB? a) SQL b) XML c) BSON d) all

c) BSON

What are the predefined attributes? a) xml:lang b) xml:space c) Both A and B d) None of the above

c) Both A and B

Data mining is used to refer _________ stage in knowledge discovery in database a) Selection b) Retrieving c) Discovery d) Coding

c) Discovery

XML document can be viewed in a) IE 3.0 b) IE 2.0 c) IE 6.0 d) IE X.0

c) IE 6.0

Two fundamental goals of data mining are: a) Analysis and description b) Data cleaning and organizing the data c) Prediction and description d) Data cleaning and organizing the data

c) Prediction and description

_________ is the application of data mining techniques to discover patterns from the Web a) Text Mining b) Multimedia Mining c) Web Mining d) Link Mining

c) Web Mining

When is it best to use a NoSQL database? a) When providing confidentiality, integrity and availability is crucial b) When the data is predictable c) When the retrieval of large quantities of data is needed d) When the retrieval speed of data is not critical

c) When the retrieval of large quantities of data is needed

Most NoSQL databases support automatic ___________ meaning that you get high availability and disaster recovery. a) processing b) scalability c) replication d) all

c) replication

Which of the following is not a component of a data warehouse? a) metadata b) current detail data c) lightly summarized data d) component key

component key

Which of the following are well formed XML fragments? a) < myElement myAttribute="someValue'/> b) < myElement myAttribute=someValue/> c) < myElement myAttribute='someValue'> d) < myElement myAttribute="someValue"/>

d) < myElement myAttribute="someValue"/>

Which of the following language is MongoDB written in? a) Javascript b) C c) C++ d) ALL

d) ALL

Which is an advantage of NewSQL? a) Less complex applications, greater consistency b) Convenient standard tooling c) SQL influenced extensions d) All

d) All

Which of the following statements are true? a) Cannot be used to comment an XML line b) XML elements names cannot contain spaces c) XML element names cannot start with a number d) All of the above are true

d) All of the above are true

How to describe XML data? a) XML uses a DTD to describe the data b) XML uses XSL to describe data c) XML uses a description node to describe data d) Both A and C

d) Both A and C

Data _____________ is the process of finding a model that describes and distinguishes data classes or concepts a) Characterization b) Mining c) Clustering d) Classification

d) Classification

Reduce amount of time and memory required by data mining algorithms_______ a) Target data b) Data sampling c) Data aggregation d) Data reduction

d) Data reduction

Which of the following companies developed NoSQL database Apache Cassandra? a) LinkedIn b) Twitter c) MySpace d) Facebook

d) Facebook

_________ stores are used to store information about networks, such as social connections. a) Key-Value b) Wide-column c) Document d) Graph

d) Graph

What is the aim of NoSQL? a) NoSQL provides an alternative to SQL databases to store textual data b) NoSQL databases allow storing non-structured data c) NoSQL is not suitable for storing structured data d) NoSQL is a new data format for storing large data sets

d) NoSQL is a new data format for storing large data sets

Which of the following is not an example of a NoSQL database? a) CouchDB b) MongoDB c) HBase d) PostgreSQL

d) PostgreSQL

XML uses the features of a) HTML b) XHTML c) VML d) SGML

d) SGML

Which of the following is a characteristic of a NoSQL database? a) Uses tables for storage b) Needs a schema c) Requires JOINs d) Uses JSON

d) Uses JSON

Which of the following is a primary classification for NoSQL architectures? a) Document databases b) graph databases c) Key-value databases d) all

d) all

Any XML must have: a) Closing tag b) Root element c) Nested child d) both a and b

d) both a and b

The _________ exposes the information being captured, stored and managed by operational systems. a) top-down view b) data warehouse view c) data source view d) business query view

data source view

The biggest drawback of the level indicator in the classic star schema is that it limits________ a) flexibility b) quantify c) qualify d) ability

flexibility

Neo4J and GraphBase

graph-based NoSQL systems

Given the association rule (A and B) → (C and D). Support can be computed as

occurrence of (A intersection B) → (C intersection D)/ total transaction

The type of relationship in star schema is__________ a) many to many b) one to one c) one to many d) many to one

one to many

A data warehouse is___________ a) Updated by end users b) contains numerous naming conventions and formats c) organized around important subject areas d) contain only current data

organized around important subject areas

Data warehouse contains____________ data that is never found in the operational environment. a) Normalized b) Informational c) Summary d) Denormalized

summary

Strategic value of data mining is _______ a) cost-sensitive b) work-sensitive c) time-sensitive d) technical-sensitive

time-sensitive

The ________ allows the selection of the relevant information necessary for the data warehouse. a) top-down view b) data warehouse view c) data source view d) business query view

top-down view


Ensembles d'études connexes

Cognitive Psychology Quiz 2 (Chapters 6-8)

View Set

Bio 100 Chapter 3 Launchpad Questions

View Set

COSC 2436 Exam 1 Final Exam review

View Set

A&P Chapter 8 (Short Answer + MCQ)

View Set