Block 2: Methodology of Data Science

¡Supera tus tareas y exámenes ahora con Quizwiz!

What are the benefits of big Data in The European Union?

transform Europe's service industries by generating a wide range of innovative information products and services; increase the productivity of all sectors of the economy through improved business intelligence; better address many of the challenges that face our societies; improve research and speed up innovation; achieve cost reductions through more personalised services increase efficiency in the public sector.

A data scientist should be:

- Technical - Quantitative - Curious and creative - Communicative and Collaborative - Expert in the Area - Visual

What are the Steps of the Data Science Process?

1. Capture data 2. Prepare Data 3. Analyze Data 4. Communicate Results 5. Take Decisions

What is done in the Take Decisions step of the Data Science process?

1. Connect your results with your business question 2. Assess impact 3. Define next steps

From what is composed the Architecture of Data Science?

1. Data devices 2. Data collectors 3. Data aggregators 4. Data users and buyers

What is done in the Capture data step of the Data Science process?

1. Identify data sources 2 .Capture data 3. Scope structures and tools needed In Big data there are some tools specialized on gathering data

What is done in the Communicate results step of the Data Science process?

1. What to present 2. How to present: visualization techniques

What is done in the Prepare data step of the Data Science process?

1.Explore data ( Is data categorical or numerical?) 2.Clean data 2.1. Missing values: average or put values like 1 but always state the changes you have made. 2.2Outliers: Decide if you want to use outliers or delete them 2.3 Invalid data better to delete 2.4. Duplicate records 3. Transform data necessary to have data in a homogeneous way 3.1. Scale 3.2. Transform 3.3. Feature selection 3.4.Reduce dimensionality 4. Integrate

How does a data scientist should be?

A data scientist is someone who is better at statistics than any software engineer and better at software engineering than any statistician

For what does big Data apply knowledge?

to encourage action

What are Data buyers in the Architecture of data science?

Buy data from the data aggregators that have a value for them to sell more for example. Banks, etc.

What is done in the Analyze data step of the Data Science process?

Depending on the purpose of what you want to do you select different techniques. 1. Select technique 2. Build Model 3. Evaluate

What are Data devices in the Architecture of data science?

Devices that capture data. Smartphones, TVs, computers

That GPDR stands for, and what is it?

GPDR (General Data Protection Regulation) is the new law that preserves this protection)

What is Data Science

Is an interdisciplinary field about scientific methods, processes, and systems to extract knowledge or insights from big data similar to data mining.

What are Data aggregators in the Architecture of data science?

Make sense of the collected data. In turn, they can transform and packages data as products to be sold.

What are the Scopes of Data science

Mathematics Expertise, Technology/Software, Business/ Strategy

In what is the Data Science Methodology based?

On the Five P's of Data Science. The five P's as dimensions of modern data science

What are the 5 P's of data Science methodology

People, Purpose, Process, Platform, Programmability

What role can NoSql play in a Data Science Project?

Store the data Integrate Data

Why Big Data is a big opportunity for EU companies ?

The EU has the highest data protection standards in the world. This generates trust

Does the GDPR affect big data

True

What do we need to know in Programmability?

all the frameworks regarding Big Data and software

From what do Data users and buyers benefit in the Architecture of data science?

benefit directly from the collected and aggregated data.

What are Data users in the Architecture of data science?

can benefit myself from the information that is given there. Used Be informed about disasters and discuss about them using Social Media

From what do SMES will be benefitiated giving it an opportunity?

from four reductions in excessive legislation

When talking about Platform what is one of the most complicated aspects?

how to integrate everything

What are some techniques for data protection by design?

like anonymisation, pseudonymisation, encryption, and protocols for anonymous communications

What are NoSql databases

not structured data. Information gathered from sensors. Database without SQL structured data.


Conjuntos de estudio relacionados

Exam FX Chapter 4: Life Insurance Policy Provisions, Riders and Options

View Set

Construction Project Administration Textbook Exam Chapters 1-17

View Set

Food Service Systems Chapter 5-6

View Set

Algebra Quiz on 5-4, 5-5, 5-6, 5-7, & 5-9

View Set

Forensic and Legal Psychology Final Test (Part 2: Previous Notes)

View Set

Revolutionary War Battles and People

View Set

Insurance License Training: Annuities

View Set