Data Analytics - Terms

Ace your homework & exams now with Quizwiz!

Visualization:

(Refer to Data visualization)

SQL:

(Refer to Structured Query Language)

Fill handle:

A box in the lower-right-hand corner of a selected spreadsheet cell that can be dragged through neighboring cells in order to continue an instruction

Equation:

A calculation that involves addition, subtraction, multiplication, or division (also called a math expression)

Math expression:

A calculation that involves addition, subtraction, multiplication, or division (also called an equation)

Cell reference:

A cell or a range of cells in a worksheet typically used in formulas and functions

Attribute:

A characteristic or quality of data used to label a column in a table

attribute

A characteristic or quality of data used to label a column in a table

Pivot chart:

A chart created from the fields in a pivot table

Database:

A collection of data stored in a computer system

Dataset:

A collection of data that can be manipulated or analyzed as one unit

Data:

A collection of facts

Video file:

A collection of images, audio files, and other data usually encoded in a compressed format such as MP4, MV4, MOV, AVI, or FLV

Record:

A collection of related data in a data table, usually synonymous with row

Range:

A collection of two or more cells in a spreadsheet

Array

A collection of values in spreadsheet cells

Query language:

A computer programming language used to communicate with a database

Structured Query Language:

A computer programming language used to communicate with a database

Bias

A conscious or subconscious preference in favor of or against a person, group of people, or thing

Bad data source

A data source that is not reliable, original, comprehensive, current, and cited (ROCCC

Bad data source

A data source that is not reliable, original, comprehensive, current, and cited (ROCCC)

Pivot table:

A data summarization tool used to sort, reorganize, group, count, total, or average data

Boolean data

A data type with only two possible values, usually true or false

Boolean data:

A data type with only two possible values, usually true or false

bullet graph

A data visualization that displays data as a horizontal bar chart moving toward a desired value

bubble chart

A data visualization that displays individual data points as bubbles, comparing numeric values by their relative size

box plot

A data visualization that displays the distribution of values along an x-axis

area chart

A data visualization that uses individual data points for a changing variable connected by a continuous line with a filled in area underneath

bar graph

A data visualization that uses size to contrast and compare two or more values

Relational database:

A database that contains a series of tables that can be connected to form relationships

Long data:

A dataset in which each row is one time point per subject, so each subject has data in multiple rows

Wide data:

A dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject

Spreadsheet:

A digital worksheet

Data science:

A field of study that uses raw data to create new ways of modeling and understanding the unknown

Return on investment (ROI):

A formula that uses the metrics of investment and profit to evaluate the success of an investment

Math function:

A function that is used as part of a mathematical formula

agenda

A list of scheduled appointme

Agenda:

A list of scheduled appointments

Metric goal:

A measurable goal set by a company and evaluated using metrics

Gap analysis:

A method for examining and evaluating the current state of a process in order to identify opportunities for improvement in the future

Data element:

A piece of information in a dataset

Cloud:

A place to keep data online, rather than a computer hard drive

Function:

A preset command that automatically performs a specified process or task using the data in a spreadsheet

Algorithm

A process or set of rules followed for a specific task

Algorithm:

A process or set of rules followed for a specific task

Fairness:

A quality of data analysis that does not create or reinforce bias

Relevant question:

A question that has significance to the problem to be solved

Specific question:

A question that is simple, significant, and focused on a single topic or a few closely related ideas

Unfair question:

A question that makes assumptions or is difficult to answer honestly

Time-bound question:

A question that specifies a timeframe to be studied

Leading question:

A question that steers people toward a certain response

Measurable question:

A question whose answers can be quantified and assessed

Action-oriented question:

A question whose answers lead to change

Absolute Reference

A reference within a function that is locked so that rows and columns won't change if the function is copied

Query:

A request for data or information from a database

String data type:

A sequence of characters and punctuation that contains textual information (Refer to Text data type)

Text data type:

A sequence of characters and punctuation that contains textual information (also called string data type)

Formula:

A set of instructions used to perform a calculation using the data in a spreadsheet

Field:

A single piece of information from a row or column of a spreadsheet; in a data table, typically a column in the table

Metric:

A single, quantifiable type of data that is used for measurement

Cookie:

A small file stored on a computer that contains information about its users

Quantitative data:

A specific and objective measure, such as a number, quantity, or range

SUM:

A spreadsheet function that adds the values of a selected range of cells

COUNT:

A spreadsheet function that counts the number of cells in a range that meet a specified criteria

AVERAGE

A spreadsheet function that returns an average of the values from a selected range

AVERAGE:

A spreadsheet function that returns an average of the values from a selected range

AVERAGEIF

A spreadsheet function that returns the average of all cell values from a given range that meet a specified condition

MAX:

A spreadsheet function that returns the largest numeric value from a range of cells

MIN:

A spreadsheet function that returns the smallest numeric value from a range of cells

Business metric

A standard of measurement used to solve a business task

Report:

A static collection of data periodically given to stakeholders

Qualitative data:

A subjective and explanatory measure of a quality or characteristic

United States Census Bureau:

An agency in the U.S. Department of Commerce that serves as the nation's leading provider of quality data about its people and economy

Scope of work (SOW):

An agreed-upon outline of the tasks to be performed during a project

Data type:

An attribute that describes a piece of data based on its values, its programming language, or the operations it can perform

Digital photo:

An electronic or computer-based image usually in BMP or JPG format

Assignment Operator

An operator used to assign values to variables and vectors

Arithmetic Operator

An operator used to perform basic math operations such as addition, subtraction, multiplication, and division

Second-party data:

Data collected by a group directly from its audience and then sold

First-party data:

Data collected by an individual or group using their own resources

Structured data:

Data organized in a certain format such as rows and columns

Third-party data:

Data provided from outside sources who didn't collect it directly

Open data:

Data that is available to the public

Discrete data:

Data that is counted and has a limited number of values

Continuous data:

Data that is measured and can have almost any numeric value

Unstructured data:

Data that is not organized in any easily identifiable manner

Internal data:

Data that lives within a company's own systems

External data:

Data that lives, and is generated, outside of an organization

Audio file:

Digitized audio storage usually in an MP3, AAC, or other compressed format

audio file

Digitized audio storage usually in an MP3, AAC, or other compressed format

Data-inspired decision-making:

Exploring different data sources to find out what they have in common

Access Control

Features such as password protection, user permissions, and encryption that are used to protect a spreadsheet

Anscombe's Quartet

Four datasets that have nearly identical summary statistics but contain different plotted values

Data design:

How information is organized

Sample:

In data analytics, a segment of a population that is representative of the entire population

Population:

In data analytics, all possible data values in a dataset

Pixel:

In digital imaging, a small area of illumination on a display screen that, when combined with other adjacent areas, forms a digital image

Argument (R)

Information needed by a function in R in order to run

Big Data

Large, complex datasets typically involving long periods of time, which enable data analysts to address far-reaching business problems

Big data:

Large, complex datasets typically involving long periods of time, which enable data analysts to address far-reaching business problems

Borders

Lines that can be added around two or more cells on a spreadsheet

Borders:

Lines that can be added around two or more cells on a spreadsheet

Administrative metadata

Metadata that indicates the technical source of a digital asset

Stakeholders:

People who invest time and resources into a project and are interested in its outcome

Ordinal data:

Qualitative data with a set order or scale

Analytical skills

Qualities and characteristics associated with using facts to solve problems

Analytical skills:

Qualities and characteristics associated with using facts to solve problems

Small data:

Small, specific data points typically involving a short period of time, which are useful for making day-to-day decisions

Data analyst:

Someone who collects, transforms, and organizes data in order to draw conclusions, make predictions, and drive informed decision-making

Aliasing

Temporarily naming a table or column in a query to make it easier to read and write

annotation

Text that briefly explains data or helps focus the audience on a particular aspect of the data in a visualization

alternative text

Text that provides an alternative to non-text content, such as images and videos

Technical mindset:

The ability to break things down into smaller steps or pieces and work with them in an orderly and logical way

Problem domain:

The area of analysis that encompasses every activity affecting or affected by a problem

Ownership:

The aspect of data ethics that presumes individuals own the raw data they provide and have primary control over its usage, processing, and sharing

Observation:

The attributes that describe a piece of data contained in a row of a table

Data analysis:

The collection, transformation, and organization of data in order to draw conclusions, make predictions, and drive informed decision-making

Context:

The condition in which something exists or happens

Accuracy

The degree to which data conforms to the actual entity being measured or described

Balance

The design principle of creating aesthetic appeal and clarity in a data visualization by evenly distributing visual elements

Header:

The first row in a spreadsheet that labels the type of data in each column

Data visualization:

The graphical representation of data

Data strategy:

The management of the people, processes, and tools used in data analysis

Sorting:

The process of arranging data into a meaningful order to make it easier to understand, analyze, and visualize

aggregation

The process of collecting or gathering many separate pieces into a whole

Analytical thinking:

The process of identifying and defining a problem, then solving it by using data in an organized, step-by-step manner

analytical thinking

The process of identifying and defining a problem, then solving it by using data in an organized, step-by-step manner

Structured thinking:

The process of recognizing the current problem or situation, organizing available information, revealing gaps and opportunities, and identifying options

Reframing:

The process of restating a problem or challenge, then redirecting it toward a potential resolution

Filtering:

The process of showing only the data that meets a specified criteria while hiding the rest

A/B Testing

The process of testing two variations of the same web page to determine which page is more successful at attracting user traffic and generating revenue

Business task

The question or problem data analysis resolves for a business

Business task:

The question or problem data analysis resolves for a business

Turnover rate:

The rate at which employees voluntarily leave a company

Root cause:

The reason why a problem occurs

Data analytics:

The science of data

Data life cycle:

The sequence of stages that data experiences, which include plan, capture, manage, analyze, archive, and destroy

Data analysis process:

The six phases of ask, prepare, process, analyze, share, and act whose purpose is to gain insights that drive informed decision-making

Revenue:

The total amount of income generated by the sale of goods or services

Data ecosystem:

The various elements that interact with one another in order to produce, manage, store, organize, analyze, and share data

Problem types:

The various problems that data analysts encounter, including categorizing things, discovering connections, finding patterns, identifying themes, making predictions, and spotting something unusual

Data-driven decision-making:

Using facts to guide business strategy

Order of operations:

Using parentheses to group together spreadsheet values in order to clarify the order in which operations should be performed

Social media:

Websites and applications through which users create and share content or participate in social networking

Operator:

A symbol that names the operation or calculation to be performed

SMART methodology:

A tool for determining a question's effectiveness based on whether it is specific, measurable, action-oriented, relevant, and time-bound

Data model:

A tool for organizing data elements and how they relate to one another

Dashboard:

A tool that monitors live, incoming data

Nominal data:

A type of qualitative data that is categorized without a set order

Aesthetic (R)

A visual property of an object in a plot


Related study sets

Primary Professional Military Education (Enlisted) - Block - 2 - The Culture of the Navy; Enlisted Professionalism; Overview of Manpower Management for the Senior Enlisted Leader.

View Set

RBT - Behavioral Challenges of Autism

View Set