Foundations of Data Science - Week 2
List the three activities covered by the data professions
1. Statistical Inference 2. Machine Learning 3. Data Analytics
Nonprofit
A group whose main purpose is to further a social cause or provide a benefit to the public
Sample
A segment of a population that is representative of the entire population
Edge Computing
A way of distributing computational tasks over a bunch of nearby processors (i.e., computers) that is good for speed and resiliency and does not depend on a single source of computational power.
Expert _________ explore vast and complex datasets in order to identify worthwhile business initiatives.
Data Analysts
Data professionals come together during _____________ to create a solution to an existing problem using technology.
Hackathons
Artificial intelligence is the development of computer systems that are able to perform tasks that normally require ___________ intelligence.
Human
Personally Identifiable Information (PII)
Information that permits the identity of an individual to be inferred by either direct or indirect means
What is the term for organizations that are specifically created to foster a collective, public, or social advantage, rather than maximizing revenue?
Nonprofits
What type of data is available to the public for free and includes guidance for navigating the datasets and acknowledging the source?
Open Data
What is the term for data that can be used to determine an individual's identity, either by direct or indirect means?
Personally Identifiable Information
Data Stewardship
The practices of an organization that ensure that data is accessible, usable, and safe
Data Aggregation
The process of collecting and combining details from a significant number of users in terms of totals or summary.
Data anonymization
The process of protecting people's private or sensitive data by eliminating Personally Identifiable Information (PII) Data anonymization involves blanking, hashing, or masking personal information, often by using fixed-length codes to represent data columns or hiding data with altered values.
What does a Product Development team do?
The professionals in these roles manage a portfolio of customers and stakeholder analytic projects and initiatives. They often manage the analytical strategy for the organization. In these role, experience is most likely required, and responsibilities are larger and more global. Key Attributes: ~ What they do: Manage analytical strategy within a project team ~ How they do it: They are less hands-on with data analysis, serving as the persona a data scientist or analyst would report to Sample Job Titles: ~ Product Manager ~ Product Developer ~ Product Lead ~ Digitial Product Manager ~ Customer Product Manager
What does the C-Suite do with data in their roles?
This classification of roles covers high-ranking executives within an organization. The 'C' in C-suite stands for chief. In general, there's a trend from the C-suite to build data-driven decision-making into their processes. Individuals filling these roles within organizational leadership teams are expected to be familiar with data and analytics. Key Attributes: ~ What they do: Responsible for data and data professionals across an entire organization ~ How they do it: They are decision-makers found at the top end of a company's hierarchy Sample Job Titles: ~ Chief Marketing Officer ~ Chief Data Officer ~ Chief Analytics Officer ~ Chief Information Officer ~ Chief Data Scientist
A data team collects information from enough people to ensure the information represents the population as a whole. What does this scenario describe?
Aggregating
A data professional collects feedback from many different individuals, then gathers it together to inform a data project. What does this scenario describe?
Aggregation
At a business, who is responsible for ensuring socially beneficial and inclusive practices, applying scientific and ethical principles, and staying aware of possible bias?
All data professionals
Hackathon
An event where data professionals and programmers come together and work on a project
A team of data professionals discusses the potential of their personal background and beliefs affecting their data findings. They establish processes to ensure that they interpret and communicate sensitive information impartially. What does this scenario describe?
Avoiding subtle biases in data work
A person's background, experiences, and beliefs lead to __________, which may negatively affect data work.
Bias Bias is a conscious or subconscious preference in favor of or against a person, groups of people, or thing.
Artificial intelligence is the development of _________ able to perform tasks that normally require human intelligence.
Computer Systems
Artificial Intelligence (AI)
Computer systems able to perform tasks that normally require human intelligence
What does a Business Intelligence professional do?
Data analytics and business intelligence share a lot of commonalities. Both fields have professionals that use data to create insights that inform decision-making. A major difference is that business intelligence is more focused on creating processes and information channels that transform relevant data. Business intelligence professionals create tables, reports, and dashboards that empower stakeholders, facing them access to the data they need to inform the entire decision-making process on a continual basis. These roles often serves as a complement to core data analytics/data science professionals. Key Attributes: ~ What they do: Perform predictive analysis that enables organizations to determine likely future trends ~ How they do it: Create tables, reports and dashboards that empower their organization Sample Job Titles: ~ BI Architect ~ BI Analyst ~ BI Solution Developer ~ BI Software Engineer ~ Data Viz & BI Analyst
Aggregated Information
Data from a significant number of users that has eliminated personal information
What do data management and infrastructure professionals do?
Data professionals that work in data management and infrastructural roles are primarily responsible for the systems that distribute data and maintain its integrity. They work alongside data analytics professionals to help support their work. Their main responsibility is to ensure the functionality of data systems and the compliance with local, state, and federal regulations involving data security and ethics. Key Attributes: ~ What they do: Manage data sources and the overall data infrastructure ~ How they do it: Work with the tools and databases used to manage data within a business Sample Job Titles: ~ Data Engineer ~ Technology Engineer ~ Data Manager ~ Data Steward ~ IT Architect
What do data scientists and data analysts do?
Data scientist and data analysts are roles that work directly with data. These professionals gather, clean, analyze, and share insights from data with stakeholders. An increasing number of industries turn to data analysis to create insights that inform various tasks like guide decision-making, identify user preferences, or determine how to use resources more effectively. Key Attributes: ~ What they do: Uncover trends, patterns, and insights from data ~ How they do it: Employ advanced modeling and statistical analytics techniques Sample Job Titles: ~ Data Scientist ~ Marketing Analyst ~ Data Analyst ~ AI Analyst ~ Business Analyst
Open Data
Data that is available to the public and free to use, with guidance on how to navigate the datasets and acknowledge the source
A good sample is a segment of a population that is representative of what?
The entire population
What type of data professionals are business intelligence professionals and technical project managers?
Strategic
What type of data professionals include expert data analysts, machine learning engineers, and statisticians?
Technical data professionals
Agricultural
~ Develop new approaches ~ Improve harvesting technologies
Fiance
~ Ealy adopter of data science ~ Assess Risks ~ Monitor Markets ~ Reduce Fraud ~ Create a more stable financial system
What are examples of technical data professional roles?
~ Expert data analysis ~ Statisticians ~ Machine learning engineers
Technical Data Professionals
~ Expertise in mathematics, statistics, and computing ~ Build models and make predictions ~ Explore datasets ~ Transform raw data into useful information for decision-making
Strategic Data Professionals
~ Interpret information for an organization's operations, finance, research, and development ~ Work aligns with business strategy
In what way is building diverse teams an effective method for countering human bias in data work?
~ It promotes wider representation. ~ It incorporates a wide range of perspectives. ~ It yields more accurate project results.
Manufacturing
~ Predict when to perform preventative maintenance ~ Maximize quality assurance ~ Respond to logistical issues ~ Enable clear communication ~ Maintain optimal restocking levels
Healthcare
~ Wearable Tech ~ Process clinical data ~ Increase early detection ~ Improve Diagnosis ~ Create individualized wellness plans