Data Analytics Journey Pre-Assessment #1

10. An oil company uses robots and sensors to detect how pipeline corrosion changes over time. The collected data is then used in a predictive model that estimates when a pipe should be replaced. How does the predictive model serve this oil company? A. To minimize interruptions from maintenance shutdowns B. To minimize the need for workforce safety training C. To improve compliance with pipeline construction standards D. To improve compliance with pipeline disposal standards

A

21. Which party has the primary vision for a data analytics project and brings resources to complete it? A. Project sponsors B. Project managers C. Customers D. Data analysts

A

27. Which feature is commonly found in collaboration tools like Jira, Slack, Teams, and PivotalTracker? A. Real-time messaging B. Multivariate analysis C. Equation editor D. Source code management

A

35. A restaurant owner wants to sponsor a data analytics project to provide insights regarding hamburger sales before developing a strategy for increasing sales. Which question is framed appropriately for the data analytics project? A. What are the characteristics of customers who buy hamburgers? B. What does the supply and demand curve look like for hamburgers? C. Which discount coupons should we send to neighborhood residents? D. Which varieties of hamburgers are featured by competitors?

A

36. Which organizational objective could be accomplished with a descriptive data analytics project using website request logs as a data source? A. Explain why web data transfer has increased 25% B. Estimate the traffic increase for a new product launch C. Improve the speed of server request processing D. Recommend a strategy to increase network capacity

A

51. What is an example of random sampling of college students? A. Surveying students chosen arbitrarily from around the entire college campus B. Surveying every student in the college library C. Surveying students chosen arbitrarily in the library of the university D. Surveying every student on campus

A

54. Which technique can be used to determine the likelihood that the disease is actually present, given a positive diagnostic test result? A. Bayes' theorem B. Central limit theorem C. Regression D. Optimization

A
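
As a worked illustration of option A, the sketch below applies Bayes' theorem to a diagnostic test. The prevalence, sensitivity, and specificity values are made-up assumptions for illustration and do not come from the assessment.

```python
def posterior_positive(prevalence, sensitivity, specificity):
    """Return P(disease | positive test) using Bayes' theorem."""
    # P(positive and disease)
    true_pos = sensitivity * prevalence
    # P(positive and no disease)
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Hypothetical inputs: 1% prevalence, 95% sensitivity, 90% specificity.
p = posterior_positive(0.01, 0.95, 0.90)
print(f"P(disease | positive) = {p:.3f}")
```

With a rare disease, most positive results are false positives, so the posterior probability stays well below the test's sensitivity.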

58. Which tool has libraries that expand its visualization capabilities? A. Python B. Tableau C. Adobe Infographics D. D3.js

A

57. Which two characteristics are used to group data together in a cluster analysis? A. Distance B. Similarity C. Shape D. Size

AB

17. What are two purposes of the reporting phase of the data analytics life cycle? A. Provide the conclusions from the analysis in an engaging manner B. Provide a tool for decision-makers to import and analyze more data C. Provide actionable insights that can inform decision-making D. Provide an automated way for decision-makers to test their own models

AC

2. In which phase of the data analytics life cycle does an analyst build a histogram? A. Data acquisition B. Data exploration C. Discovery D. Predictive modeling

B

23. A data analytics project manager has been asked to complete a project on a very short timeline. Which action is likely to yield positive results? A. Outsource the skilled work to an unproven vendor B. Expand the team with experienced staff C. Require current team to work overtime D. Accept lowered quality standards

B

24. Which type of project management problem occurs when a data mining task has started but a data acquisition task has not been completed? A. Scope B. Schedule C. Procedure D. Cost

B

28. Which action can the project manager take to keep the team engaged in the analytics project? A. At the end of the project, the team publishes an extensive research report and includes it in an email to project stakeholders. B. Throughout the project, the project manager communicates insights from the data analytics team and provides ideas of ways to act on those insights. C. At the end of the project, the project manager sends an email with the predictive model to the stakeholders so they can use it. D. Throughout the project, the project manager holds regular meetings so the entire data analytics team can showcase their work to different departments.

B

3. An analyst applies a statistical formula to obtain the average temperature for a city over the last 50 years. Which phase of the data analytics life cycle is represented by this activity? A. Data acquisition B. Exploratory data analysis C. Predictive modeling D. Data reporting

B

30. What is a characteristic of active listening? A. Actively working on a task while listening to the speaker B. Seeking to understand the speaker's emotions and intent C. Focusing intently on the content of the message D. Waiting patiently to share one's own thoughts

B

32. A data analytics project team is preparing to develop a predictive model that will be included within a business intelligence tool for upper management. Which step should be considered for inclusion when creating the project schedule? A. Model testing and validation for users B. Business intelligence tool interface training C. Model training and testing for stakeholders D. Business intelligence tool data transformation training

B

43. A U.S. company collects and sells information on consumers. Which law prevents the company from collecting information on European Union consumers without their permission? A. Electronic Communications Privacy Act B. General Data Protection Regulation C. Stored Communication Act D. Information Nondiscrimination Act

B

46. What do open-source software tools and widely available analysis tools, such as spreadsheets, help accomplish? A. Data schemas B. Data democratization C. Data security D. Data compliance

B

55. Which concept should be considered when choosing variables for inclusion in a linear regression model? A. Feasibility of merging the variables B. Feasibility of controlling the variables C. Feasibility of testing the variables D. Feasibility of classifying the variables

B

56. A neural network algorithm in machine learning endeavors to recognize underlying relationships in a set of data. What does this process mimic? A. The way a computer processes data B. The way the human brain operates C. The way architects establish functionality D. The way that social media builds networks

B

39. Which two outcomes should be expected when working with data aggregated from multiple sources? A. Consistently named fields B. Inconsistently named fields C. Data needs cleaning D. Data does not need cleaning

BC

47. What are two features of SQL? A. It is an object-oriented programming language. B. The basic language is the same across database servers. C. It has built-in chart and graph creation. D. It is used with structured data and unstructured data.

BD

11. During which phase in the data analytics life cycle would a churn analysis be performed? A. Data cleaning B. Data acquisition C. Predictive analysis D. Representation and reporting

C

13. Why might a data analyst resample a data set with replacement in a data mining project? A. Misidentification of causation due to correlation B. Wrong variables chosen for analysis C. Too little data for training and testing data sets D. Skewed data resulting from outliers

C
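
Resampling with replacement is often called bootstrapping. The following is a minimal sketch, assuming a tiny made-up data set and a hypothetical helper named `bootstrap_sample`; it only shows the mechanics of drawing repeated samples from limited data.

```python
import random


def bootstrap_sample(data, rng):
    """Draw len(data) items from data, with replacement."""
    return [rng.choice(data) for _ in data]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
observations = [12, 15, 9, 21, 18]  # hypothetical small data set

# Build many resampled data sets from the same few observations.
resamples = [bootstrap_sample(observations, rng) for _ in range(1000)]
means = [sum(s) / len(s) for s in resamples]
print(f"bootstrap means range from {min(means):.1f} to {max(means):.1f}")
```

Because each draw is made with replacement, a resample can repeat some observations and omit others, which is what lets a small data set stand in for many.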

14. A data analyst has identified combinations of sales transactions that frequently occur together in data over the past 5 years. Which phase of the data analytics life cycle is represented by this analysis? A. Data acquisition B. Representation and reporting C. Data mining D. Predictive modeling

C

15. An analyst realizes that the data set has been reduced significantly, resulting in sample sizes that are too small. In which phase of the data analytics life cycle did this likely occur? A. Data exploration B. Data modeling C. Data mining D. Data discovery

C

16. What strategy will contribute to effective data representation and reporting? A. Creating a new training data set B. Selecting data for a prediction model C. Excluding unrelated data D. Extracting data from source repositories

C

18. During which phase of the data analytics life cycle does an analyst create a story to report data? A. Data acquisition B. Data mining C. Data reporting D. Data cleaning

C

19. What is a common duty of a database administrator? A. Set project timelines, milestones, and goals B. Acquire funding for data analytics projects C. Maintain data on the IT infrastructure D. Define business needs at the onset of a project

C

20. What is an example of an external stakeholder for a data analytics project? A. President/CEO B. Project manager C. Regulatory body D. Data analyst's supervisor

C

22. What does the critical path represent in data analytics project management? A. Minimum time to complete independent tasks B. Maximum time to complete independent tasks C. Minimum time to complete dependent tasks D. Maximum time to complete dependent tasks

C
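
To make the idea concrete, the sketch below computes the critical path length for a small set of dependent tasks. The task names, durations, and dependencies are hypothetical; the point is that the longest chain of dependent tasks sets the minimum time in which the project can finish.

```python
from functools import lru_cache

# Hypothetical tasks: name -> (duration in days, prerequisite tasks)
tasks = {
    "acquire": (5, []),
    "clean": (8, ["acquire"]),
    "explore": (3, ["clean"]),
    "model": (6, ["explore"]),
    "report": (2, ["model", "explore"]),
    "schedule_review": (1, ["acquire"]),
}


@lru_cache(maxsize=None)
def earliest_finish(name):
    """Earliest day a task can finish, given that all prerequisites finish first."""
    duration, prereqs = tasks[name]
    return duration + max((earliest_finish(p) for p in prereqs), default=0)


# The critical path length is the latest of all earliest-finish times.
project_length = max(earliest_finish(t) for t in tasks)
print(f"Minimum project duration: {project_length} days")
```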

25. How can an organization improve interprofessional communication among team members? A. By setting work priorities for team members B. By requiring weekly updates on project deadlines C. By using tools that provide a team-based collaboration space D. By ensuring employees can recite the desired outcomes

C

29. What is an effective method for a data analyst to prepare for a one-on-one meeting with a manager? A. Make a written list of all source code comments B. Ask other inside employees about the manager's reputation C. Bring a set of questions to draw on to keep the conversation going D. Create an essay summarizing steps in the source code

C

31. Which circumstance could cause a data analyst to have difficulty developing a model to answer a business question? A. Project scope creep B. Poor project budgeting C. Lack of relevant data sources D. Lack of stakeholder support

C

33. Which task would an analyst consider first during the discovery phase of the data analytics life cycle? A. Seek out necessary data sources. B. Formulate a project plan. C. Identify project goals. D. Develop key metrics.

C

34. Numerical measurements of the amount of a toxic chemical substance are recorded in a large database. Which hypothesis can the data analyst answer through exploratory data analytic methods? A. The chemical will not cause harm to the habitat's native species. B. The chemical contamination is a result of human activity. C. The statistical distribution of the chemical measurements is normal. D. The best analytic approach for analyzing the data is linear regression.

C

37. A travel website tabulated the results of their latest marketing campaign to understand the relationship of clicks-to-sales conversions. Which area of analytics does this activity represent? A. Prescriptive B. Proactive C. Descriptive D. Predictive

C

38. An analyst is looking at data that includes the customer's address, date of purchase, and age. Which question could be answered from this data? A. Which customer has spent the highest dollar amount? B. Which customer is most likely to respond favorably to the next marketing campaign? C. Which state has the highest total customers? D. Which product has sold the most in a certain state?

C

4. An analyst has been tasked with defining data columns that could contain null values. Which activity of the data acquisition phase is represented? A. Collecting data B. Disqualifying data sources C. Detecting missing values D. Transforming improperly formatted text

C

40. Which technique can a project manager use to foster the identification of quality data analytics questions? A. Organized project planning B. Rigorous data cleaning C. Frequent collaboration with the team D. Acquisition of abundant project resources

C

41. A data analyst notices that the data selected for an analytics project is slightly misaligned with the research question. How can the data analyst resolve this situation? A. Halt the data analytics project to pursue a new research question B. Dive deeper into the data to identify data quality issues C. Adjust the research question to reframe the analysis D. Transform the data to a new metric

C

42. An analyst has been asked to analyze the open-ended responses from customers on a satisfaction survey. Which type of data is the analyst working with on this project? A. Transactional B. Secondary C. Qualitative D. Quantitative

C

44. A consumer sues an entertainment streaming company for leaking personal information regarding her viewing habits. Which ASA ethical standard did the streaming company violate? A. Conflict of interest B. Biases C. Privacy D. Unfair discrimination

C

45. A specific drug is manufactured for the treatment of depression. The company decides to ignore research results on an alternative, less expensive, drug treatment in order to make higher profits. Which ASA ethical standard has the company violated? A. Unfair discrimination B. Reproducible results C. Conflict of interest D. Transparent assumptions

C

48. What is an example of unstructured data? A. Names, dates, and addresses B. Credit card numbers that include a credit score C. Text messages that include video D. Height, weight, and gender

C

49. Which tool should a researcher use to conduct a univariate analysis on complex statistical data? A. Tableau B. Power BI C. R D. SQL

C

5. Which activity in the data analytics life cycle occurs during the data acquisition phase and requires the most time and effort from the data analyst? A. Selecting the data sources B. Importing data into a database C. Cleaning data D. Defining goals

C

50. Which statistical technique should be used to draw conclusions about an entire population based on a representative sample? A. Correlation B. Bayes' theorem C. Hypothesis testing D. Measures of central tendency

C

52. Which type of analysis would be used to predict a binary outcome based on a set of independent variables? A. Hypothesis testing B. Descriptive statistics C. Regression D. Time series

C

53. Which type of data analysis is appropriate if the goal is to minimize the cost of a diet, using a data set consisting of the following variables: protein content, fat content, and cost per unit? A. Decision trees B. Calculus C. Optimization D. Bayes' theorem

C
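
The diet problem can be sketched as a small constrained search. The example below is a brute-force enumeration over whole units, not a linear programming solver, and every number in it (the foods, nutrient values, costs, and limits) is a hypothetical assumption chosen for illustration.

```python
from itertools import product

# Hypothetical foods: (protein in g, fat in g, cost in cents) per unit
foods = {
    "beans": (7, 1, 50),
    "chicken": (25, 3, 190),
    "cheese": (6, 9, 80),
}
MIN_PROTEIN, MAX_FAT, MAX_UNITS = 50, 20, 10  # assumed constraints

best = None  # (total cost, {food: units})
for qty in product(range(MAX_UNITS + 1), repeat=len(foods)):
    protein = sum(q * f[0] for q, f in zip(qty, foods.values()))
    fat = sum(q * f[1] for q, f in zip(qty, foods.values()))
    cost = sum(q * f[2] for q, f in zip(qty, foods.values()))
    # Keep the cheapest combination that satisfies both constraints.
    if protein >= MIN_PROTEIN and fat <= MAX_FAT:
        if best is None or cost < best[0]:
            best = (cost, dict(zip(foods, qty)))

print(best)
```

Enumeration is only practical for a handful of variables; larger diet problems are usually posed as linear programs and handed to a dedicated solver.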

60. Which type of data representation should a data analyst use to display expense categories as a percentage of total business expenses? A. Map visualization B. Line chart C. Pie chart D. Scatter plot

C

7. What can be identified using a box plot? A. Frequency B. Correlation C. Interquartile range D. Mean

C
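
The box in a box plot spans the first to the third quartile, so its height is the interquartile range. A minimal sketch using Python's standard `statistics` module follows; the measurements are made up, and quartile values can differ slightly between tools depending on the interpolation method used.

```python
import statistics

values = [2, 4, 4, 5, 7, 9, 10, 12, 15]  # hypothetical measurements

# quantiles(n=4) returns the three cut points: Q1, median, Q3.
q1, median, q3 = statistics.quantiles(values, n=4)
iqr = q3 - q1  # the length of the box in a box plot
print(f"Q1={q1}, median={median}, Q3={q3}, IQR={iqr}")
```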

8. What will be a consequence of poor attention to detail during the data exploration phase? A. Not enough variables will be considered in the analysis. B. The outcome of the analysis will be misaligned to business needs. C. The analyst will lack insight into the structure of the data set. D. The model will be built using the wrong data set.

C

9. Which aspect of data exploration occurs when an analyst writes code to compile a bar graph of dog food sales per month? A. Performance of a correlation analysis B. Analysis of data anomalies C. Verification through visualization D. Determination of variabilities

C

59. Which two tools can be used for performing statistics and creating interactive data visualizations for large data sets from various sources? A. Gantt Chart B. SQL C. Tableau D. R

CD

1. Which activity does an analyst perform in the discovery phase of the data analytics life cycle? A. Collecting data B. Cleaning data C. Identifying outliers D. Identifying business needs

D

12. Which mistake is commonly made during the predictive analytics phase? A. The data are separated into different sets. B. The variables are separated into response and independent variables. C. The data are prepared before the model is developed. D. The model is developed before the research question is known.

D

26. A data analyst needs to contact a specific member of the database administration team. Which method should be used to discover the person's email address? A. Ask the project's customers B. Ask the project's sponsors C. Send an email to project stakeholders D. Send an email to the team member's manager

D

6. What might be developed by data analysts when acquiring data from a data warehouse? A. The procedures for extracting files from the data warehouse B. The procedures for updating tables in the data warehouse C. The relational structure of tables D. The SQL queries of data within the tables

D

