Merit America
Visualization
(Refer to data visualization)
gap analysis
A method for examining and evaluating the current state of a process in order to identify opportunities for improvement in the future
action-oriented question
A question whose answers lead to change
5 Whys
A technique in which you repeatedly ask the question "Why?" to help peel away the layers of symptoms that can lead to the root cause of a problem
Phases of Data Analysis
Ask, Prepare, Process, Analyze, Share, Act
the six steps of data analysis
Ask, Prepare, Process, Analyze, Share, Act
5. Share
Communicating and interpreting results
ETL
Extract, Transform, Load
what is a method for examining and evaluating how a process works currently in order to get an improved future state?
Gap analysis
Data design
How information is organized
what type of data visualization might my analyst create in order to communicate data insights to others
Map, Graph, and Chart
6. Act
Putting your insights to work to solve the problem
SQL
Structured Query Language
dashboard
a tool that monitors live, incoming data
2. Prepare
identifying and locating data you can use to answer your question
select
to choose the columns you want to return
from
to choose the tables where the columns you want are located
where
to filter or certain information
4. Analyze
use the data to solve problems, make decisions, and support business goals
data scientist
uses expert skills and technology and social science to find trends through data analysis
what starts an equation in a spreadsheet
=
Math Expression
A calculation that involves addition, subtraction, multiplication, or division (also called an equation)
query language
A computer programming language p that allows you to retrieve and manipulate data from a database
Spreadsheet
A digital worksheet
Data Science
A field of study that uses raw data to create new ways of modeling and understanding the unknown
Fill in the blank: data analysis use a problem-oriented approach in order to identify, _____, and solve problems
Describe
EMC's data analysis process
Discovery, Pre-processing data, model planning, model building, communicate results, and operationalize
data analyst
Someone who collects, transforms, and organizes data in order to draw conclusions, make predictions, and drive informed decision-making
technical mindset
The ability to break things down into smaller steps or pieces and work with them in an orderly and logical way
Data Analysis
The collection, transformation, and organization of data in order to draw conclusions, make predictions, and drive informed decision-making
Data strategy
The management of the people, processes, and tools used in data analysis
Data-inspired decision-making
The process of exploring different data sources to find out what they have in common
business task
The question or problem data analysis answers for a business
root cause
The reason why a problem occurs
Data ecosystem
The various elements that interact with one another in order to produce, manage, store, organize, analyze, and share data
fill handle
a box in the lower right hand corner of a selected spreadsheet cell that can be drive through neighboring cells in order to continue an instruction
cell reference
a cell or range of cells and worksheet typically used in formulas and functions
attribute
a characteristic or quality of data used to label a column in a table
Database
a collection of data stored in a computer system
Data Set
a collection of data that can be manipulated or analyzed as one unit
data
a collection of facts
self-reporting
a data collection technique where participants provide information about themselves
cloud
a place to keep data online rather than a computer hard drive
function
a preset command that automatically performs a specific process or task using the data in a spreadsheet
Algorithm
a process or set of rules followed for a specific task
leading question
a question that steers people towards a certain response
query
a request for data or information from a database
formula
a set of instructions that performs a specific calculation using the data in a spreadsheet
COUNT
a spreadsheet function that counts the number of cells in a range that meet a specific criteria
AVERAGE
a spreadsheet function that returns an average of the values from a selected range
issue
a topic or subject to investigate
Observation
all of the attributes for something contained in a row of a data table
problem
an obstacle or complication that needs to be worked out
operations analyst
analyzes data to assess the performance of business operations and workflows
business analyst
analyzes data to help businesses improve processes, products, or services
financial analyst
analyzes financial status by collecting, moditorimg, and reviewing data
marketing analyst
analyzes market conditions to assess the potential sales of products and services
healthcare analyst
analyzes medical data to improve the business aspect of hospitals and medical facilities
hr/payroll analyst
analyzes payroll data for inefficiencies and errors
data analytics consultant
analyzes the systems and models for using data
brisk analyst
analyzing financial documents, economic conditions, and client data to help companies determine the level of risk involved in making a particular business decision
capitalization, indentation, and semicolons
are useful for making your SQL queries easier to read
equation
calculation that involves addition, subtraction, multiplication, or division (also called a math expression)
3. manage
care for and maintain the data. this includes determining how and where it is stored in the tools used to do so
3. Process
cleaning data, transforming data into more useful formats, committing two or more data sets to make information more complete, and removing outliers
2. Capture
collect or bring in data from a variety of different sources
Data analysis skills
curiosity, understanding of context, technical mindset, data design, and data strategy
what statements correctly describe data and data analysis
data is a collection of facts, collecting data is part of the data analysis process, and one goal of day analysis is to make predictions.
1. plan
decide what kind of data is needed, how it will be managed, and who will be responsible for it
question
designed to discover information
while planning a road trip, you figure out all of these specific stops you need to take along the way. you also consider how often you'll stop for gas, meals, and sleep. Having this information enables you to execute your plan. what does the scenario describe?
detail oriented thinking
Fairness
ensuring that your analysis doesn't create or reinforce bias
what are elements of data driven decision making
figure out the business need or problem to be solved
fill in the blank: analytical thinking involves _____ a problem, then solving it using data and organized, step by step manner
identifying and defining
Oversampling
increasing the sampling size of a non-dominant group in a population this can help you better represent them and address imbalanced data sets
archive
keep relevant data stored for long-term and future reference
Big Data
large, complex data sets typically involving long periods of time, which enabled data analyst to address far reaching business problems
Borders
lines that can be added around two or more cells in a spreadsheet
data specialist
organizes or converts data for use in databases or software systems
Syntax
p the predetermined structure of a language that includes all required words, symbols, and punctuation, as well as their proper placement the syntax of every SQL quarry is the same: SELECT FROM WHERE
Stakeholders
people who have invested time and resources into a project and are interested in the outcome
Data Life Cycle
plan, capture, manage, analyze, archive, destroy
Data Engineer
prepares and integrates data from different sources for analytical use
analytical skills
qualities and characteristics associated with using facts to solve problems
destroy
remove data from storage and delete any shared copies of the data.
fill in the blank: data science is creating new ways of modeling and _____ by using raw data
sharing insights
consider surrounding factors
so that you can understand all factors that can influence the insights you're gaining
considering all of the available data
so you're analysis reflects the truth and not just your own expectations
1. Ask
takes the time to fully understand stakeholder expectations, defines the problem to be solved, decides which question to answer in order to solve the problem
context
the condition in which something exists or happens
Header
the first row in a spreadsheet that labels the type of data in each column
data visualization
the graphical representation of data
analytical thinking
the process of identifying and defining a problem, then solving it by using data in an organized, step by step manner
over sampling
the process of increasing the sample size of non-dominant groups in a population. this can help you better represent them in address imbalanced data sets
Filtering
the process of showing only the data that meets a specific criteria while hiding the rest
data analytics
the science of data
4. Analyze
this is when you turn the day you gathered, prepared, and processed into actionable information
think about fairness from the beginning to end
to ensure that your analysis and final conclusions are fair, be sure to consider fairness from the earliest stages of a project to when you act on the data insight. this means that the data collection, cleaning, processing, and Analysis are all performed with fairness in mind.
data driven decision making
using facts to guide business strategy