Data Analytics (Lesson 1-3)

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

a.) analytics b.) System analytics c.) business analytics - Correct Answer d.) data analytics

The process of understanding the scope, objectives, and complexities of business projects.

Society

When purchasing items online, users may be unaware that the store is likely collecting data on the buying patterns of its customers, which may be used to recommend other items for purchase in the future.

Public data

Agglomerative Hierarchical

Density-Based

DBSCAN

Model-based

Here, all the clusters are hypothesized in order to find the data which is best suited for the model.

Database

A collection of data files that contain information about people, transactions, and /or inventory.

Grid-based

A fast processing time and it is dependent only on the number of cells in each dimension in the quantized space.

False

Analytics can be defined as a variable that involves the use of statistical techniques, information system software, and operations research methodologies to explore, visualize, discover, and communicate patterns or trends in data.

True

As a general technology, data mining can be applied to any kind of data as long as the data are meaningful for a target application.

User Interaction

Background knowledge, constraints, rules, and other information regarding the domain under study should be incorporated into the knowledge discovery process.

hierarchical

Can be agglomerative or divisive approach.

Hierarchical

Chameleon

Web Data

Characterize and classify web pages, and uncover web dynamics and the association and other relationships among different web pages, users, communities, and web-based activities.

Partitioning

Clusters are represented by the prototype and we use the iterative counterstrategy to optimize the clustering.

True

Data analytics became very useful not only for business sector but also in government services.

True

Data analytics helps the retailer in analyzing user behavior and figuring out the best way to organize website based on user preferences.

Information Retrieval

Finding the major topics in a collection of documents and, for each document in the collection, the major topics involved.

Prescriptive

One use of this is to properly allocate resources .

Statistics

This is useful for mining various patterns from data as well as for understanding the underlying mechanisms generating and affecting the patterns.

Does Include: - data discovery - market research - tracking applications Does Not Include: - identifying different areas

Which of the following is/are the application of clustering?

Data Cleanser

Hardware and software used for data remote storage, retrieval, and computational functions

Density-Based

Here, the given cluster will keep on growing continuously as long as the density in the neighborhood exceeds some threshold, i.e, for each data point within a given cluster.

Traffic

Hierarchical clustering

Sequence Data

Historical records, stock exchange data, and time-series and biological sequence data

Efficiency and Scalability

In order to effectively extract information from huge amounts of data in many data repositories or in dynamic data streams, data mining algorithms must have these two factors.

Stock Market

K-means

knowledge discovery from data

Many people treat data mining as a synonym for this another popularly used term

Sequence Data

Mining banking data for changing trends, which may aid in the scheduling of bank tellers according to the volume of customer traffic.

True

Mining multiple data sources of complex data often leads to fruitful findings due to the mutual enhancement and consolidation of such multiple sources.

True

Powerful and versatile tools are badly needed to automatically uncover valuable information from the tremendous amounts of data and to transform such data into organized knowledge.

Data Visualization

Presenting information graphically to illustrate trends and patterns.

Mining Methodology

Should consider issues such as data uncertainty, noise, and incompleteness.

Data Set

A simple collection of data or a data file.

False

Steps 1 through 5 of the knowledge discovery process are different forms of data preprocessing, where data are prepared for mining.

Statistics

Studies the collection, analysis, interpretation or explanation, and presentation of data.

Data mining

The knowledge discovery process step that applies intelligent methods to extract data patterns.

Pattern evaluation

The knowledge discovery process step that identifies the truly interesting patterns representing knowledge based on interestingness measures.

Data Mining

The process of discovering interesting patterns and knowledge from large amounts of data.

False

The transdisciplinary nature of data mining research and development contributes significantly to the success of data mining and its extensive applications.

Diversity of Database Types

The wide diversity brings about challenges to data mining which includes diverse applications that generate a wide spectrum of new data type.

False

Data Analytics helps the search engine in ranking the web pages regardless of its category.

True

Data analytics is a major component of today's manufacturing since most transactions are done in real-time.

Model-Based

EM

Capturing images

Mean Shift clustering

Data cleaning

The knowledge discovery process step that removes noise and inconsistent data.

Data transformation

The knowledge discovery process step that transforms and consolidates data into forms appropriate for mining by performing summary or aggregation operations.

Knowledge presentation (2nd One)

The knowledge discovery process step that uses visualization and knowledge representation techniques to present mined knowledge to users.

Transactional Data

These data are captured transactions such as a customer's purchase, a flight booking, or a user's clicks on a web page.

Database Data

This consists of a collection of interrelated data and a set of software programs to manage and access the data.

Database Systems Approach

Well known for their high scalability in processing very large, relatively structured data sets.

Includes: - more effective marketing -can continuously collect and analyze new data to update business understanding as conditions change. -can boost revenue through increased conversions, ad revenue or subscriptions Does not include: -can reveal hidden information of the employees

What are the benefits of data analysis?

Includes: - data engineer - data architect Does Not include: - data manager - data adminitsrator

What are the possible careers for those who know data analysis aside from data analyst?

Machine Learning

This investigates how computers can learn (or improve their performance) based on data.

Information Retrieval

This is the science of searching for documents or information in documents.

Data Streams

Video surveillance and sensor data, which are continuously transmitted

False

A cluster is a set of similar objects.

Data Model

A way to organize the elements of a data set.

True

Cluster is a connected region of a multidimensional space with a comparatively high density of objects.

Meteorological department

Density based subsequence clustering.

Spatial Data

Describing changes in metropolitan poverty rates based on city distances from major highways.

Multimedia Data

Detecting video sequences corresponding to goals of a hockey game

Multimedia Data

Identifying objects and classifying them by assigning semantic labels or tags from images

Text Data

Identifying the evolution of hot topics in literature

True

In Clustering, falling under the category of unsupervised machine learning, is one of the problems that machine learning algorithms solve.

Partitioning

K-means

Predictive

Need to use advanced statistical, IS or operations research methods that an identify ominous variables and create a model to identify trends and relationships

Mining Methodology

Not all the patterns generated by data mining processes are interesting.

Database Systems Approach

Research focuses on the creation, maintenance, and use of databases for organizations and end-users.

Descriptive

The application of simple statistical techniques that determine what is contained in a data set or database.

Data Warehouse

The data are stored to provide information from a historical perspective in a repository of information collected from multiple sources, stored under a unified schema, and usually residing at a single site.

Statistics

The data mining process can use these models to help identify and handle noisy or missing values in the data.

User Interaction

The data mining process should be highly interactive. It is important to build flexible user interfaces and an exploratory mining environment, facilitating the interaction with the system.

Engineering Design Data

The design of buildings, system components, or integrated circuits

Society

The improper disclosure or use of data and the potential violation of individual privacy and data protection rights are areas of concern that need to be addressed.


Kaugnay na mga set ng pag-aaral

Anatomy: Skeletal Microanatomy Review

View Set

¿Qué hacían? Fill in the blanks Activity Instructions Complete the sentences, describing the domestic activity in each drawing. Use the imperfect tense.

View Set

2nd Semester Exam - Honors English 12

View Set

Chpt. 18 Nursing Management of the Newborn

View Set

Midterm Foundations (Quiz 1, 2,3)

View Set

Final Exam Study Guide Ecology Evolution Diversity

View Set

Combo with Unit 8 Chapter 46 Shock, Multiple Organ Dysfunction & Burns and 1 other

View Set