Survey of Database Systems Final Exam
Business Intelligence and data warehousing is used for_______ a) Forecasting b) Data mining c) Analysis of large volumes of product sales data d) All
All
Data mining can also be applied to other forms such as_____ i. Data streams ii. Sequence data iii. Networked data iv. Text data v. Spatial data A) i, ii, iii and v only B) ii, iii, iv and v only C) i, iii, iv and v only D) All
All
The various aspects of data mining methodologies is/are ................... i) Mining various and new kinds of knowledge ii) Mining knowledge in multidimensional space iii) Pattern evaluation and pattern or constraint-guided mining. iv) Handling uncertainty, noise, or incompleteness of data A) i, ii and iv only B) ii, iii and iv only C) i, ii and iii only D) All i, ii, iii and iv
All
Which statement is true (related to general concepts of XML)?
All XML elements must be properly closed
DynamoDB
Amazon, key-value data store
_____________ supports basic OLAP operations, including slice and dice, drill-down, roll-up and pivoting. a) Information processing b) Analytical Processing c) Data mining d) Transaction processing
Analytical Processing
How to specify the attributes with multiple values?
Attributes cannot have multiple values
What is the correct syntax of the declaration which defines the XML version?: A. <xml version="A.0" /> B. <?xml version="A.0"?> C. <?xml version="A.0" /> D. None of the above
B. <?xml version="A.0"?>
18) When processing an output XML "new line" symbols A. are copied into output "as is", i.e. "CR+LF" for Windows, CR for Macintosh, LF for Unix. B. are converted to single LF symbol C. are converted to single CR symbol D. are discarded
B. are converted to single LF symbol
CAP Theorem
Consistency, Availability, Partition Tolerance, cannot guarantee all three
The data from the operational environment enter ____________ of data warehouse. a) Current detail data b) Older detail data c) Lightly summarized data d) Highly summarized data
Current detail data
_______________ is the process of finding a model that describes and distinguishes data classes or concepts. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection
Data Classification
________________ is a subject-oriented, integrated, time-variant, nonvolatile collection or data in support of management decisions. A) Data Mining B) Data Warehousing C) Document Mining D) Text Mining
Data Warehousing
____________ is a summarization of the general characteristics or features of a target class of data. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection
Data characterization
The core multidimensional model is the _________ , which consists of a large set of facts and a number of dimensions. a) Multidimension cube b) Older detail data c) Lightly summarized data d) Highly summarized data
Data cube
__________ is a comparison of the general features of the target class data objects against the general features of objects from on or multiple contrasting classes. a) Data Characterization b) Data Classification c) Data discrimination d) Data selection
Data discrimination
________ is an essential process where intelligent methods are applied to extract data patterns a) Data warehousing b) Data mining c) Text mining d) Data selection
Data mining
___________ are databases that store and maintain analytical data separately from transaction-oriented databases for the purpose of decision support
Data warehouses
MongoDB and CouchDB
Document stores
__________ are responsible for running queries and reports against data warehouse tables. a) Hardware b) Software c) End users d) Middle ware
End users
Cassandra
Facebook, uses both key-value and column-based systems
___________ is a good alternative to the star schema a) Star schema b) Snowflake schema c) Fact constellation d) Star-snowflake schema
Fact constellation
NoSQL prohibits SQL. True or False?
False
NoSQL was designed with security in mind, so developers or security teams don't need to worry about implementing a security layer. True or False?
False
BigTable
Googles NoSQL system, column-based
____________ was designed to display data and focus on how data looks - XML - HTML - XHTML - DTD
HTML
KDD is A) Knowledge Database B) Knowledge Discovery Database C) Knowledge Data House D) Knowledge Data Definition
Knowledge Discovery Database
__________ are designed to overcome any limitation placed on the warehouse by the nature of the relational data model. a) Operational database b) Relational database c) Multidimensional database d) Data respository
Multidimensional database
An________ system is market-oriented and is used for data analysis by knowledge workers, including managers, executives, and analysts a) OLAP b)OLTP c) both above d) none above
OLAP
The data is stored, retrieved and updated in_________ A)OLAP B) OLTP C) SMTP D) FTP
OLTP
The full form of OLAP is a) Online Analytical Processing b) Online Advanced Processing c) Online Advanced Preparation d) Online Analytical Performance
Online Analytical Processing
________ is a term used about text data that will be parsed by the XML parser
PCDATA
The full form of PICO is
Prediction Identification Classification Optimization
Data warehouse architecture is based on ____________ . a) DBMS b) RDBMS c) Sybase d) SQL server
RDBMS
Which types of problems can be solved by XQuery?
Recreational problems and puzzles
In terms of performance, which of the following parsers are more suitable? - DOM - SAX
SAX
Which of the following is not a data mining functionality? A) Characterization and Discrimination B) Classification and regression C) Selection and interpretation D) Clustering and Analysis
Selection and Interpretation
Which of the following is not a kind of data warehouse application? a) Information processing b) Analytical processing c) Data mining d) Transaction processing
Transaction processing
SQL command types include data manipulation language (DML) and data definition language (DDL). True or False?
True
XML preserves white spaces
True
XML tags are case sensitive
True
The out put of KDD is______ a) Data b) Information c) Query d) Useful information
Useful information
What can be used to transform XML into HTML?
XSLT
Can we use CSS in XML?
Yes
Which of the following is a wide-column store? a) Cassandra b) Riak c) MongoDB d) Redis
a) Cassandra
_________ analysis divides data into groups that are meaningful, useful or both. a) Cluster b) Association c) Classification d) Relation
a) Cluster
Point out the correct statement. a) Hadoop is ideal for the analytical, post-operational, data-warehouse-ish type of workload b) HDFS runs on a small cluster of commodity-class nodes c) NEWSQL is frequently the collection point for big data d) None
a) Hadoop is ideal for the analytical, post-operational, data-warehouse-ish type of workload
Which of the following are the simplest NoSQL databases? a) Key-value b) Wide-column c) Document d) All
a) Key-value
Point out the correct statement. a) MongoDB is classified as a NoSQL database b) MongoDB favors XML format more than JSON c) MongoDB is column-oriented database store d) All
a) MongoDB is classified as a NoSQL database
_____________ systems are scale-out file-based (HDD) systems moving to more uses of memory in the nodes. a) NoSQL b)NewSQL c) SQL d) All
a) NoSQL
A transactional database consists of a file where each __________ represents a transaction. a) Record b) Record graph c) Ordered record d) Attribute
a) Record
___________________ refers to the process of deriving high-quality information from text. a) Text Mining b) Image Mining c) Database Mining d) Multimedia Mining
a) Text Mining
Knowledge discovery in database refers to _____________. a) Whole process of extraction of knowledge from data b) Selection of data c) Coding d) Cleaning the data
a) Whole process of extraction of knowledge from data
Which of the following strings is a correct XML name? a) __myElement b) my Element c) #myElement d) None of the above
a) __myElement
Kind of Parsers are a) non validating and validating b) well-formed c) None of the above
a) non validating and validating
The need of using a DTD in developing XML document is a) required when validating XML documents b)no longer necessary after the XML editor has been customized c) used to direct conversion using an XSLT processor d) a good guide to populating a templates to be filled in when generating an XML document automatically
a) required when validating XML documents
The DTD includes the specifications about the markup that can be used within the document, the specifications consists of all except a) the browser name b) the size of element name c) entity declarations d) element declarations
a) the browser name
NoSQL databases is used mainly for handling large volumes of _________ data. a) unstructured b) structured c) semi-structured d) All
a) unstructured
Which of the following is not a NoSQl database? a)SQL Server b) MongoDB c) Cassandra d) None of the above
a)SQL Server
_____________ is an essential process where intelligent methods are applied to extract data patterns. a) Data Warehousing b) Data mining c) Data Base d) Data structure
b) Data mining
NoSQL databases are most often referred to as: a) Relational b) Distributed c) Object-oriented d) Network
b) Distributed
What does DTD stand for? a) Direct Type Definition b) Document Type Definition c)Dynamic Tool Description d) Dynamic Type Definition
b) Document Type Definition. XML uses DTD to describe the data/XML uses a description node to describe data
Which of the following is a NoSQL database type? a) SQL b) Document databases c) JSON d) All
b) Document databases
Which of the following represent column in NoSQL? a)Database b)Field c) Document d) Collection
b) Field
Cluster is : a) Group on a training data set to transform or simplify data in order to prepare it b) Group of similar objects that differ significantly from other objects c) Symbolic representation of facts or ideas from which information can potentially be extracted d) A and B
b) Group of similar objects that differ significantly from other objects
Which of the following is not a reason NoSQL has become a popular solution for some organizations? a) Better scalability b) Improved ability to keep data consistent c) Faster access to data than relational database management systems d) More easily allows for data to be held across multiple servers
b) Improved ability to keep data consistent
Which of the following is a reason to use an SQL database? a) It can easily store unstructured data b) Its ACID-compliant c) It can enable development in the cloud d) All
b) Its ACID-compliant
"Sharding" a database across many server instances can be achieved with __________ a) LAN b) SAN c) MAN d) All
b) SAN
What is the main technique employed for data selection? a) Noise b) Sampling c) Clustering d) Histogram
b) Sampling
SQL databases are: a) Horizontally scalable b) Vertically scalable c) Either horizontally or vertically d) They don't scale
b) Vertically scalable
Data mining is a) an extraction of explicit, known and potentially useful knowledge from information b) a non-trivial extraction of implicit, previously unknown and potentially useful information from data c) an essential process where intelligent methods are applied to extract data patterns that is also referred to database d) is an essential process where intelligent methods are applied to extract data that is also referred to data sets
b) a non-trivial extraction of implicit, previously unknown and potentially useful information from data
Which of the following strings is a correct XML name? a) xmlExtension b) xslNewElement c) XMLElement#123 d) XMLName
b) xslNewElement
______ is an online NoSQL developed by Cloudera. a) HCatalog b)Hbase c)Imphala d) Oozie
b)Hbase
Comments in XML document are given by a) < ?-- --> b) < !-- --!> c) < !-- --> d) < /-- -- >
c) < !-- -->
Which of the following is supported by MongoDB? a) SQL b) XML c) BSON d) all
c) BSON
What are the predefined attributes? a) xml:lang b) xml:space c) Both A and B d) None of the above
c) Both A and B
Data mining is used to refer _________ stage in knowledge discovery in database a) Selection b) Retrieving c) Discovery d) Coding
c) Discovery
XML document can be viewed in a) IE 3.0 b) IE 2.0 c) IE 6.0 d) IE X.0
c) IE 6.0
Two fundamental goals of data mining are: a) Analysis and description b) Data cleaning and organizing the data c) Prediction and description d) Data cleaning and organizing the data
c) Prediction and description
_________ is the application of data mining techniques to discover patterns from the Web a) Text Mining b) Multimedia Mining c) Web Mining d) Link Mining
c) Web Mining
When is it best to use a NoSQL database? a) When providing confidentiality, integrity and availability is crucial b) When the data is predictable c) When the retrieval of large quantities of data is needed d) When the retrieval speed of data is not critical
c) When the retrieval of large quantities of data is needed
Most NoSQL databases support automatic ___________ meaning that you get high availability and disaster recovery. a) processing b) scalability c) replication d) all
c) replication
Which of the following is not a component of a data warehouse? a) metadata b) current detail data c) lightly summarized data d) component key
component key
Which of the following are well formed XML fragments? a) < myElement myAttribute="someValue'/> b) < myElement myAttribute=someValue/> c) < myElement myAttribute='someValue'> d) < myElement myAttribute="someValue"/>
d) < myElement myAttribute="someValue"/>
Which of the following language is MongoDB written in? a) Javascript b) C c) C++ d) ALL
d) ALL
Which is an advantage of NewSQL? a) Less complex applications, greater consistency b) Convenient standard tooling c) SQL influenced extensions d) All
d) All
Which of the following statements are true? a) Cannot be used to comment an XML line b) XML elements names cannot contain spaces c) XML element names cannot start with a number d) All of the above are true
d) All of the above are true
How to describe XML data? a) XML uses a DTD to describe the data b) XML uses XSL to describe data c) XML uses a description node to describe data d) Both A and C
d) Both A and C
Data _____________ is the process of finding a model that describes and distinguishes data classes or concepts a) Characterization b) Mining c) Clustering d) Classification
d) Classification
Reduce amount of time and memory required by data mining algorithms_______ a) Target data b) Data sampling c) Data aggregation d) Data reduction
d) Data reduction
Which of the following companies developed NoSQL database Apache Cassandra? a) LinkedIn b) Twitter c) MySpace d) Facebook
d) Facebook
_________ stores are used to store information about networks, such as social connections. a) Key-Value b) Wide-column c) Document d) Graph
d) Graph
What is the aim of NoSQL? a) NoSQL provides an alternative to SQL databases to store textual data b) NoSQL databases allow storing non-structured data c) NoSQL is not suitable for storing structured data d) NoSQL is a new data format for storing large data sets
d) NoSQL is a new data format for storing large data sets
Which of the following is not an example of a NoSQL database? a) CouchDB b) MongoDB c) HBase d) PostgreSQL
d) PostgreSQL
XML uses the features of a) HTML b) XHTML c) VML d) SGML
d) SGML
Which of the following is a characteristic of a NoSQL database? a) Uses tables for storage b) Needs a schema c) Requires JOINs d) Uses JSON
d) Uses JSON
Which of the following is a primary classification for NoSQL architectures? a) Document databases b) graph databases c) Key-value databases d) all
d) all
Any XML must have: a) Closing tag b) Root element c) Nested child d) both a and b
d) both a and b
The _________ exposes the information being captured, stored and managed by operational systems. a) top-down view b) data warehouse view c) data source view d) business query view
data source view
The biggest drawback of the level indicator in the classic star schema is that it limits________ a) flexibility b) quantify c) qualify d) ability
flexibility
Neo4J and GraphBase
graph-based NoSQL systems
Given the association rule (A and B) → (C and D). Support can be computed as
occurrence of (A intersection B) → (C intersection D)/ total transaction
The type of relationship in star schema is__________ a) many to many b) one to one c) one to many d) many to one
one to many
A data warehouse is___________ a) Updated by end users b) contains numerous naming conventions and formats c) organized around important subject areas d) contain only current data
organized around important subject areas
Data warehouse contains____________ data that is never found in the operational environment. a) Normalized b) Informational c) Summary d) Denormalized
summary
Strategic value of data mining is _______ a) cost-sensitive b) work-sensitive c) time-sensitive d) technical-sensitive
time-sensitive
The ________ allows the selection of the relevant information necessary for the data warehouse. a) top-down view b) data warehouse view c) data source view d) business query view
top-down view