Introduction to Analytics - D491
Which project is considered a data analytics project?
Creating a dashboard to visualize sales data and monitor inventory levels for a grocery store chain
Which tools are commonly used for communicating results in data analytics projects?
Data visualization tools and presentation software
What is the advantage of using a decision tree over a linear regression model in a data analytics project?
Decision trees can handle nonlinear relationships between variables.
A data analyst is tasked with creating a comprehensive report about a media company's user base for advertisers. Which data is most useful to include?
Demographic
What are the different types of data analytics projects?
Descriptive, diagnostic, predictive, and prescriptive analytics
What is a primary responsibility of a data engineer?
Designing and implementing data storage solutions
What is the role of a business intelligence analyst?
Designing and maintaining data visualizations and dashboards
What is a primary responsibility of a machine learning engineer?
Developing predictive models using machine learning algorithms
Which type of data analytics project aims to determine why something happened in the past?
Diagnostic
In which phase of the data mining process does the data science team investigate the problem, develop context and understanding, learn about available data sources, and formulate initial hypotheses?
Discovery
A popular travel booking platform receives a large volume of web traffic, GPS location data, and user-generated content from various sources. The data analytics team is preparing this data for analysis to better understand customer behavior and preferences. Which tool would be most suitable for preparing this data?
Hadoop
A marketing company has a client who wants to know their social media engagement for the past month. They have accounts on several social media platforms and want to compare their engagement across these platforms. Which visualization metric should be used to find the social media engagement for the client?
Heat map
What is the most appropriate data analytics technique for analyzing website traffic patterns?
Heat map
A data analyst is planning a new analytics project for a retail company and needs to collect data from different sources to complete the project. Which question should be asked regarding the sources and quality of the available data for the project?
What is the time frame of the data, and how often is it updated?
A manufacturing company collected data on production processes, equipment downtime, and maintenance logs. Which question can a data analytics project answer using diagnostic analytics?
What was the cause of the production process inefficiency that resulted in a six-hour delay yesterday?
Which question should be asked to determine if a data set is biased?
Is the data from a self-reported survey?
Why is quality control/assurance crucial for data engineers in a data analytics project?
It ensures that the data is accurate and reliable.
Which tool is commonly used during the model planning phase?
KNIME
A manufacturing company wants to compare the productivity of different teams in its factory over time. Which visualization technique should be used to present the findings of the comparison?
Line chart
A data analyst works at an e-commerce company that wants to understand its customer churn rate. Their manager has tasked them with conducting a data analytics project to identify customers at risk of churn and offer these customers targeted promotions to retain their business. What is the most suitable form of deliverable in this scenario?
Lists of at-risk customers
A company wants to predict the likelihood of a customer responding to a marketing campaign. The data set contains both numerical and categorical variables. Which analytics technique should the company use?
Logistic regression
A healthcare company wants to predict which patients are at risk of developing a certain medical condition. Which model is commonly used for this type of analysis?
Logistic regression
A data analyst is analyzing the employees' salaries at a company to find a representative value that summarizes the central tendency of the data. Which metric should be used to summarize the central tendency of the data?
Median
During a data analytics project, which phase focuses on developing training and test datasets, refining models, and assessing the validity and predictive power of the models?
Model execution
In the data analytics process, which phase focuses on identifying candidate models for clustering, classifying, or finding relationships and ensuring analytical techniques align with business objectives?
Model planning
What should analysts do with the findings discovered during the operationalize phase of a data analytics project?
Modify reports and dashboards
Which tool is commonly used for data preparation?
OpenRefine
What role does a project manager play within a data analytics project?
Oversee the project team and ensure the project is completed on time and within budget
Which activities should be the focus of the model planning phase?
Partitioning the data into training, validation, and test sets
Which type of data is necessary for performing machine learning analysis?
Preprocessed data
What is the purpose of the communicate results phase in a data analytics project?
Presenting findings and outcomes to stakeholders
Which activity should the data analytics team focus on during the communicate results phase
Presenting key findings to stakeholders and evaluating the project's success
Which data analytic technique is best suited for identifying outliers in a dataset?
Principal component analysis (PCA)
Which groups make up the key stakeholders in a data analytics project?
Project team members and senior management
What role do stakeholders play in the project cycle?
Provide guidance and feedback throughout the project
An organization is building a theme park where the temperature can vary wildly. All rides should be built to handle the extremes of the temperature spectrum. Which metric should be used in this scenario?
Range
What is the most appropriate analytics technique for predicting sales for the next quarter?
Regression analysis
Which stakeholder should conduct literature reviews for a data analytics project?
Researcher
Which tool is suitable for a data analytics team to use during the model execution phase of a project?
SAS Enterprise Miner
Which sequence of steps should you follow during the data preparation phase?
Set up sandbox, extract and transform data, condition data, explore visually
A company recently completed a data analytics project to identify the most energy-efficient products to add to the catalog. The project team comprised business users, project sponsors, analysts, data scientists, data engineers, and database administrators. Now, the team needs to share their findings with various stakeholders. What should the data scientists, data engineer, and database administrator do to share their findings?
Share code and provide implementation details
Which data source for a retail company analyzing customer behavior is an example of an external source?
Social media activity of the company's competitors
A team working for a social media company needs to analyze customer feedback on a newly launched product using sentiment analysis. What is the most appropriate approach for sentiment analysis in this scenario?
Text mining
A retail company wants to improve its sales and customer satisfaction by analyzing customer data. The company hired a data analytics team, which has access to the company's customer database, including transaction records, demographic information, and customer feedback. The data analytics team will work closely with the marketing and IT departments to create actionable insights for the company. The team has three months to complete the project, and the company's budget allows purchasing additional software tools or training, if necessary. What is the most critical resource for the data analytics project?
The customer database
What is a data requirement for logistic regression?
The dependent variable has to be binary.
What is data analytics?
The process of analyzing data to extract insights
What is data science?
The process of analyzing data to extract insights
Why is a project sponsor a key stakeholder in a data analytics project?
They ensure that the project aligns with business goals and objectives.
Why are financial operation stakeholders important in a data analytics project?
They interpret data and provide insights to improve financial performance.
What is the role and function of a decision scientist within an organization?
To analyze data and provide insights to support informed decision-making
What component of a data analytics project is typically completed by a data analyst?
To clean and preprocess data to prepare it for analysis
What is the primary purpose of the data preparation phase in a data analytics project?
To clean, normalize, and transform data
What is the function of a data scientist in an organization?
To conduct statistical analysis and machine learning modeling
What is the main purpose of the model execution phase in a data analytics project?
To develop datasets, refine models, and assess validity
What is the primary purpose of the operationalize phase in a data analytics project?
To pilot the model, refine it, and fully deploy it
A data analyst works at an e-commerce company that wants to understand its customer churn rate. Their manager has tasked them with conducting a data analytics project to identify customers at risk of churn and offer these customers targeted promotions to retain their business. What is the primary purpose of the data analytics project's results in this scenario?
To predict customer churn risk
What is the primary purpose of the discovery phase in the data science process?
To understand the business problem and develop initial hypotheses
Which data migration skill is necessary for database administrators?
Transferring data between different systems or formats
A data analyst for a retail company has collected data on customer demographics, purchase history, and marketing campaigns. Which data analytic technique should be used to predict demand for the upcoming holiday season?
Use a machine learning algorithm to predict future demand and determine the reorder quantity for each product.
A pharmaceutical company collected data on patient outcomes for a new drug it is testing. Which question regarding the source or quality of the available data is most appropriate to ask before analysis?
Was the data collected from electronic health records (EHRs) of patients using the drug?
A data analyst is planning a new analytics project for a toy manufacturing company. Customer survey data is provided. Which question should be asked regarding the sources or quality of the data?
Was the survey sent to a random sample of customers?
Which type of data is needed to assess whether a new type of web content is increasing user engagement?
Web log
Which data sources would be most relevant for analyzing factors affecting patient satisfaction in a healthcare company?
Web log data, call-center records, and survey responses
A grocery store chain collected data on customer purchases, sales transactions, and inventory levels. Which question can a data analytics project answer using descriptive analytics?
What are the most popular products at each store's location?
A retail company collected data on customer demographics, purchase history, and marketing campaigns. Which question can a data analytics project answer using prescriptive analytics?
What is the best marketing strategy to target specific customer segments based on their purchase history and demographics?
What does a data analyst do in a data analytics project?
Conducts exploratory data analysis to identify trends and patterns
Which type of data is necessary to perform cluster analysis?
Continuous
A retail grocer wants to use association rules in retail marketing to increase sales. What would be the impact of using an association rule on sales data?
By analyzing sales data, the data analyst can apply association rules to discover frequent item sets, which are groups of items often purchased together.
Which information tool is a possible source of data in a data analytics project?
Corporate information system
A data analyst is assigned to analyze sales data for a multinational retail company to identify which products have the highest profit margins. Which data quality requirement is most critical for this project?
Accuracy
A media firm is in talks with a larger conglomerate about a possible merger. Which data is relevant for a data analyst to include in a report for its manager?
Competitor analysis
Which job skill is necessary for a researcher in a data analytics project?
Analyzing and interpreting data to inform questions
What should business users and project sponsors do with their findings during the operationalize phase of a data analytics project?
Assess benefits, implications, and business impact
What are the necessary skills for partners in a data analytics project?
Business domain knowledge and communication
Which metric should be used to measure the percentage of website visitors who leave after viewing only one page?
Bounce rate
How does a data analyst interact with stakeholders during a data analytics project?
By presenting data analysis results in an easily understandable format
How do stakeholders interact with data analytics projects?
By providing input throughout the project lifecycle
Which task is the data analyst responsible for within a data analysis project?
Conducting statistical analyses and generating reports
What is a primary responsibility of a data analyst?
Conducting statistical analysis to identify patterns and trends
A company in the renewable energy industry is working on a data analytics project to identify which areas are more likely to adopt solar power. The data science team needs to gather relevant data sources for this project. Which data sources are most relevant for a renewable energy company looking to identify areas more likely to adopt solar power?
Census and economic data, hourly weather readings, and demographic data
Which technique is the most appropriate for analyzing customer demographics?
Clustering
Which technique is the most effective for identifying patterns in large datasets?
Clustering
What do data analytics teams do in the operationalize phase of a data analytics project?
Communicate project benefits, set up the pilot project, and deploy in production
Which phase of a data analytics project involves articulating findings and outcomes for stakeholders while considering caveats, assumptions, and limitations?
Communicate results
How is data science different from data analytics?
Data science focuses on developing new algorithms and models, while data analytics focuses on using existing models to analyze data.
Which comparison describes the difference between data analytics and data science?
Data analytics is the process of analyzing data to extract insights, while data science involves building and testing models to make predictions.
Which phase of the data analytics lifecycle involves cleaning data, normalizing datasets, and performing transformations?
Data preparation
Which activities should the data analytics team perform during the model execution phase of this project?
Generating training and test sets and refining models to enhance performance
What is the difference between exploratory and confirmatory data analytics projects?
Exploratory projects involve testing hypotheses and finding patterns in data, while confirmatory projects involve verifying existing hypotheses.
What is the primary purpose of the model planning phase in the data analytics process?
Identifying methods and aligning techniques with objectives
An online retail company wants to use data analytics to improve customer satisfaction and increase sales. The company has collected data on customer behavior, purchase history, and customer support interactions. Which outcome is most appropriate for the online retail company's data analytics project?
Increasing customer satisfaction and sales through targeted recommendations and improved customer support
A retail company wants to improve its sales and customer satisfaction by analyzing customer data. The company hired a data analytics team, which has access to the company's customer database, including transaction records, demographic information, and customer feedback. The data analytics team will work closely with the marketing and IT departments to create actionable insights for the company. The team has three months to complete the project, and the company's budget allows purchasing additional software tools or training, if necessary. Which constraint should impact the data analytics project the most?
Insufficient time for comprehensive data analysis
An e-commerce company is interested in improving the conversion rate of its website. In which scenario should the company's analyst use an A/B test?
When they want to find out whether changing the color of the "Add to Cart" button will have a significant impact on sales
A data analyst is tasked with understanding customer satisfaction data and is emailed a file with the data. Which question should the data analyst ask about the data regarding where it is sourced from?
When was the data collected?
Which question of interest is appropriate for a data analytics project to increase a store's sales?
Which customer segments will most likely respond to a marketing campaign?
A healthcare company collected data on patient demographics, medical history, treatment outcomes, and hospital readmissions. Which question can a data analytics project answer using predictive analytics and the data collected by the healthcare company?
Which treatments are most likely to result in lower readmission in the future?