SU 20 Gleim


Company A needs data that are relevant and reliable. Which of the "4 Vs" best describes Company A's needs? A.Veracity. B.Volume. C.Variety. D.Velocity.

A.Veracity.Answer (A) is correct. Veracity refers to the trustworthiness of the data (relevance and reliability).

Which of the following is a true conclusion based on the visualization? A.Level IV contains the largest number of employees with a college degree. B.Employees with a college degree account for the majority of Level II employees. C.The higher the level of employment, the higher the proportion of employees with a postgraduate degree. D.Employees with a college degree account for the majority of employees among the four levels.

B.Employees with a college degree account for the majority of Level II employees.Answer (B) is correct. Among the three categories of Level II employees, employees with a college degree account for about 52% (90% - 38%).

The type of visualization displayed is a A.Bar graph. B.Heat map. C.Histogram. D.Table.

B.Heat map.Answer (B) is correct. Heat maps present data using colors and shadings.

What is the approximate ratio between direct materials costs and product costs for Product E? A.54% B.30% C.37% D.43%

C.37%Answer (C) is correct. Direct materials costs for Product E are $15 per unit. Product costs of Product E are the sum of direct materials costs, direct labor costs, and variable and fixed overhead costs, which are approximately $40. Thus, the ratio between direct materials costs and product costs for Product E is about 37% ($15 ÷ $40).
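The arithmetic in this answer can be checked with a short sketch. Only the $15 direct materials cost and the approximate $40 product cost come from the card; the variable names are illustrative.

```python
# Figures read from the card (per unit).
direct_materials = 15.0
product_cost = 40.0  # approximate: materials + labor + variable and fixed overhead

# Ratio of direct materials costs to product costs.
ratio = direct_materials / product_cost
print(f"{ratio:.1%}")  # 37.5%, which the card rounds to "about 37%"
```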

Which of the following are key technologies of big data? I In-memory analytics II Data mining III Text mining A.I and III only. B.II only. C.I only. D.I, II, and III.

D.I, II, and III.Answer (D) is correct. Key technologies of big data include data mining, text mining, data management, in-memory analytics, predictive analytics, and Hadoop.

Data visualization A.Assists with big data analytics to display data results in a visual context. B.Is most useful to an audience when visualization tools present various items in different scales and skip values to highlight items emphasized by the presenter. C.Ensures the presentation of trends and correlations are understood by a wider audience without bias. D.Includes visualization tools such as bubble charts, Pareto diagrams, boxplots, and what-if analyses.

A.Assists with big data analytics to display data results in a visual context.Answer (A) is correct. One limitation of big data is that user-level data results require interpretation prior to use. Data visualizations assist big data in identifying trends and correlations that run the risk of going undetected in text-based data.

A company uses big data analytics in marketing. Which of the following is a limitation of using big data? A.Big data often cannot explain why customers behave in certain ways. B.Data collected only represent untapped customers but not tapped customers. C.The company can use big data to predict customer behaviors. D.Data results cannot be visualized to identify and forecast customer trends.

A.Big data often cannot explain why customers behave in certain ways.Answer (A) is correct. One limitation of big data is that determining why the analysis results are what they are is difficult. While big data analysis can show that there is a certain pattern in monthly sales, it fails to show what causes the pattern. Further and more complicated analyses are needed, the results of which tend to be more difficult for non-technical people to understand.

Useless information, or noise, is a limitation of big data that A.Corrupts the results. B.Restricts available information. C.Requires interpretation before use. D.Makes it more difficult for non-technical people to choose a course of action.

A.Corrupts the results.Answer (A) is correct. Data are subject to useless information (commonly known as noise). A single incorrect or useless variable can corrupt the results and require additional labor hours to work with the data in order to obtain meaningful results.

Rank the products in ascending order of direct labor costs. A.D, E, B, C, A. B.A, C, B, E, D. C.E, B, C, D, A. D.A, D, C, B, E.

A.D, E, B, C, A.Answer (A) is correct. The direct labor costs of the five products can be compared by the relative lengths of the bars. Among the five products, Product D has the shortest bar for direct labor costs, followed by Products E, B, C, and A, indicating that the ranking in ascending order of direct labor costs is D, E, B, C, A.

Generally, greater values are represented by A.Darker colors. B.Hash marks. C.Stripes. D.Lighter colors.

A.Darker colors.Answer (A) is correct. Generally, greater values are represented by darker colors and lower values are represented by lighter colors.

Qualitative and quantitative methodologies and procedures used to retrieve data from data sources is the definition of A.Data analytics. B.Data mining. C.Data management. D.In-memory analytics.

A.Data analytics.Answer (A) is correct. Data analytics involves using qualitative and quantitative methodologies and procedures to retrieve data from data sources and then inspecting the data, based on data type, to facilitate the decision-making process.

Flushing out useless information is a step in A.Data cleaning. B.Data normalization. C.Data mining. D.Data discovery.

A.Data cleaning.Answer (A) is correct. Data cleaning consists of flushing out useless information and identifying missing data.
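The two parts of data cleaning named above (flushing out useless information and identifying missing data) can be sketched as follows. The records, the field names, and the choice of `session_id` as the "useless" field are all hypothetical.

```python
# Hypothetical raw records; None marks a missing value.
raw_rows = [
    {"customer": "Ana", "amount": 120.0, "session_id": "x1"},
    {"customer": "Ben", "amount": None, "session_id": "x2"},
    {"customer": None, "amount": 75.5, "session_id": "x3"},
]

# Assumption: session_id has no analytical value here, so it is treated as noise.
USELESS_FIELDS = {"session_id"}


def clean(rows):
    """Drop useless fields and report which rows have missing data."""
    cleaned, missing = [], []
    for index, row in enumerate(rows):
        kept = {k: v for k, v in row.items() if k not in USELESS_FIELDS}
        cleaned.append(kept)
        gaps = sorted(k for k, v in kept.items() if v is None)
        if gaps:
            missing.append((index, gaps))
    return cleaned, missing


cleaned_rows, missing_report = clean(raw_rows)
print(missing_report)  # [(1, ['amount']), (2, ['customer'])]
```

The sketch only flags missing values; whether to fill or discard them is a separate decision that the card does not cover.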

Data normalization focuses on conserving A.Data integrity. B.Data availability. C.Data compliance. D.Data confidentiality.

A.Data integrity.Answer (A) is correct. Data normalization involves storing each data element as few times as necessary. It results in the reduction of data and strengthened data integrity, which is ensuring that data accurately reflect the business events underlying them and that any anomalies are rectified.
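A minimal sketch of "storing each data element as few times as necessary", using hypothetical order records in which the customer's city is repeated on every order:

```python
# Hypothetical denormalized rows: the city is stored once per order.
orders_flat = [
    {"order_id": 1, "customer_id": "C1", "city": "Austin", "total": 50.0},
    {"order_id": 2, "customer_id": "C1", "city": "Austin", "total": 20.0},
    {"order_id": 3, "customer_id": "C2", "city": "Denver", "total": 35.0},
]

# Normalized form: each customer's city is stored exactly once.
customers = {}
orders = []
for row in orders_flat:
    customers[row["customer_id"]] = {"city": row["city"]}
    orders.append(
        {"order_id": row["order_id"],
         "customer_id": row["customer_id"],
         "total": row["total"]}
    )

# A change of address is now a single update, so copies cannot drift apart.
customers["C1"]["city"] = "Dallas"
cities_for_c1 = {customers[o["customer_id"]]["city"]
                 for o in orders if o["customer_id"] == "C1"}
print(cities_for_c1)  # {'Dallas'}
```

Because the city lives in one place, the inconsistent copies that threaten data integrity cannot arise.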

A study finds a strong correlation between sales of beer and diapers. This is an example of A.Data mining. B.Predictive analytics. C.Data management. D.Error detection.

A.Data mining.Answer (A) is correct. Data mining examines large amounts of data to discover patterns in the data. Finding the correlation between seemingly unrelated data (e.g., sales of beer and diapers) is an example of data mining.
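As an illustration of finding a correlation between seemingly unrelated series, the sketch below computes a Pearson correlation coefficient by hand. The weekly sales figures are invented for the example, and real data mining tools apply far broader pattern-discovery techniques than this single statistic.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical weekly unit sales for the two products.
beer = [12, 18, 25, 31, 40]
diapers = [10, 15, 22, 30, 38]

r = pearson(beer, diapers)
print(round(r, 3))  # a value close to 1 indicates a strong positive correlation
```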

Which of the following data analytics methods should an auditor use to report on actual results? A.Descriptive analysis. B.Information discovery. C.Diagnostic analysis. D.Text analysis.

A.Descriptive analysis. Answer (A) is correct. Descriptive analysis is the most basic and commonly used data analytics method and concentrates on the reporting of actual results.

Fishbone diagrams are most often used in A.Diagnostic analysis. B.Prescriptive analysis. C.Predictive analysis. D.Descriptive analysis.

A.Diagnostic analysis.Answer (A) is correct. A fishbone diagram is a total quality management process improvement method that is useful in studying causation (why the actual and desired situations differ). It is often used in diagnostic analysis, which provides insights into the reason certain results occur.

Which of the following is not an example of text mining? A.Loading stock prices of a company into a database. B.Converting social media comments into rating scales. C.Extracting addresses from tax returns. D.Extracting customer information from emails.

A.Loading stock prices of a company into a database.Answer (A) is correct. Text mining analyzes text data from the Web, comment fields, books, and other text-based sources through the use of machine learning or natural language processing technology. Stock prices, which are stored as numbers, can be directly loaded into a database without the process of text mining.
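To make the contrast concrete, here is a rough sketch of pulling customer contact details out of free text. The card describes text mining as relying on machine learning or natural language processing; the regular expression below is a much simpler stand-in, and the message and pattern are hypothetical.

```python
import re

# Simplified pattern for email addresses; an assumption, not a full standard.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

# Hypothetical unstructured email body.
message = (
    "Hi, this is Dana Lee. Reach me at dana.lee@example.com "
    "or dlee@example.org."
)

found = EMAIL_PATTERN.findall(message)
print(found)  # ['dana.lee@example.com', 'dlee@example.org']
```

Stock prices need no such step because they already arrive as numbers in a known field.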

Information discovery is included in what stage of data analytics application? A.Obtain relevant data. B.Communicate results. C.Analyze data. D.Define questions.

A.Obtain relevant data.Answer (A) is correct. One of the five stages of implementing data analytics includes obtaining relevant data (commonly referred to as information discovery).

Pie charts are best used for displaying A.Relative proportions at a specific period in time. B.The relationship between two quantitative variables. C.Changes in components over time. D.Trends or variability over time.

A.Relative proportions at a specific period in time.Answer (A) is correct. Pie charts use circles to display the whole set of data, with each category displayed as a segment of the circle or a percentage of the total. Pie charts generally can only depict the relative proportions at a specific period.

The new purchasing director is analyzing purchase orders for the organization. Which of the following analyses would best be displayed on a histogram? A.The organization purchased US $27 million worth of inventory in the past year. Distribute by value, using US $500 increments, the quantity of purchase orders that fall within each range. B.In the past year the organization placed 10,000 purchase orders. Organize the number of orders placed with each supplier, sorted in descending order. C.The average turnaround time from issuing a purchase order to receiving the merchandise is 7 days. Review the last 2,000 purchase orders, and using 10 days as the upper control limit and 4 days as the lower control limit, graph the turnaround time for each order. D.Identify and organize the reasons the average turnaround time for purchase orders falls outside the control parameters of 4-10 days.

A.The organization purchased US $27 million worth of inventory in the past year. Distribute by value, using US $500 increments, the quantity of purchase orders that fall within each range.Answer (A) is correct. The histogram displays a continuous frequency distribution of the independent variable in the form of a bar graph. The y axis is the quantity of purchase orders and the x axis is the purchase order amount. The histogram would best display the quantity of purchase orders by dollar value.
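The binning behind such a histogram can be sketched as below. The US $500 increment comes from the question; the purchase order values are made up for illustration.

```python
from collections import Counter

BIN_WIDTH = 500  # US $500 increments, as in the question

# Hypothetical purchase order values in US dollars.
order_values = [120.0, 480.0, 510.0, 999.99, 1000.0, 1499.0, 2250.0]


def bin_start(value):
    """Lower edge of the $500 range that contains the value."""
    return int(value // BIN_WIDTH) * BIN_WIDTH


counts = Counter(bin_start(v) for v in order_values)

# x axis: purchase order amount range; y axis: quantity of purchase orders.
for start in sorted(counts):
    print(f"{start:>5} to under {start + BIN_WIDTH:>5}: {'#' * counts[start]}")
```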

Bubble charts, while similar to scatter plots, add a third variable, which is A.The size of data points. B.Colors and shading. C.Rectangles of different colors and sizes. D.Time-series data.

A.The size of data points.Answer (A) is correct. Bubble charts have two quantitative variables plotted on the x- and y-axes to depict the relationship between the variables. Bubble charts add a third variable to scatter plots by utilizing the sizes of the data points.

Which of the following conclusions can be drawn from the visualization? A.The price of Product A is the highest among the products. B.Variable costs of Product B are higher than those of Product C. C.The range for product costs of the five products is between $48 and $62. D.The ratio of direct labor cost to total cost for Product C is higher than that for Product B.

D.The ratio of direct labor cost to total cost for Product C is higher than that for Product B.Answer (D) is correct. Product B has the same total costs as Product C but lower direct labor costs than Product C. Thus, the ratio of direct labor cost to total cost for Product C is higher than that for Product B.

Which one of the following statements defines data mining? A.A system used to organize and interpret complex data to ensure the data has been accurately recorded in the database. B.A process of using statistical techniques to extract and analyze data from large databases to discern patterns and trends. C.A process of using algorithms that serve to facilitate efficient communication within a firm. D.A system used to develop a firm's performance metrics.

B.A process of using statistical techniques to extract and analyze data from large databases to discern patterns and trends.Answer (B) is correct. Data mining examines large amounts of data to discover patterns using statistical models and techniques. The term data mining is somewhat misleading because its purpose is the discovery of patterns in large amounts of data, not the extraction of the data itself.

Which of the following is included in the Four Vs of big data? A.Velocity. B.All of the answers are correct. C.Variety. D.Volume.

B.All of the answers are correct.Answer (B) is correct. The four Vs of big data include volume, velocity, variety, and veracity. Some people include the value of the data as a fifth V.

All of the following are correct statements regarding businesses deciding to utilize cloud computing for big data projects except A.Businesses only pay for the storage and computing time actually used. B.Analysts are not required to have a detailed understanding of the available data and possess some sense of what answer(s) they're looking for. C.A public cloud provider can store petabytes of data and scale up thousands of servers just long enough to accomplish the big data project. D.Businesses are hesitant to invest in an extensive server and storage infrastructure that might only be used occasionally to complete big data tasks.

B.Analysts are not required to have a detailed understanding of the available data and possess some sense of what answer(s) they're looking for.Answer (B) is correct. Analysts must have a detailed understanding of the available data and possess some sense of the answers they are looking for. Data are only as valuable as the business outcomes they make possible. How a business uses its data determines whether it fully recognizes the data's value and its potential to improve decision-making capabilities, which can then be measured against the resulting business outcomes.

A hospital has observed an increase in the number of cases of a disease and has asked an analyst to collect data on the cases over the last 3 years. The analyst noted that the disease appeared 3 years ago during the second quarter of the year. Since then, the third and fourth quarters of each year showed significant spikes in the number of cases when compared to the first two quarters. What is the best way to present these findings? A.Pie chart, showing the number of cases in each quarter for the last 3 years. B.Bar graph, showing the number of cases in each quarter for the last 3 years. C.Table, showing the number of cases in each month for the last 3 years. D.Scatter plot, showing the change in the number of cases for each quarter for the last 3 years.

B.Bar graph, showing the number of cases in each quarter for the last 3 years.Answer (B) is correct. A bar chart (also called bar graph) is the best way to present the findings because it shows the number of cases each quarter in comparison to other quarters.

All of the following are correct statements regarding big data except A.Big data analytic tools complete missing pieces through data fusion, which is the process of integration of multiple data and knowledge representing the same real-world object into a consistent, accurate, and useful representation. B.Big data is an evolving term that describes any voluminous amount of structured data that has the potential to be mined for information. C.Big data uses inductive statistics and concepts from nonlinear system identification (e.g., output is not directly proportional to the input) to infer laws from large sets of data to reveal relationships and dependencies, or to perform predictions of outcomes and behaviors. D.Big data needs to be transformed to "Smart Data." Collecting large amounts of statistics and numbers translates into minimal benefit if there is no layer of added intelligence.

B.Big data is an evolving term that describes any voluminous amount of structured data that has the potential to be mined for information.Answer (B) is correct. Big data is an evolving term that describes any voluminous amount of structured, semi-structured, and unstructured data that has the potential to be mined for information. Thus, the statement is incorrect because big data includes semi-structured and unstructured data in addition to structured data.

Which of the following is a correct statement regarding big data? A.Once collected, data are readily available for the user. B.Big data is only as valuable as the business outcomes it makes possible. C.Big data is suitable for all applications. D.Businesses are quick to invest in the infrastructure needed to complete big data tasks.

B.Big data is only as valuable as the business outcomes it makes possible.Answer (B) is correct. Use of big data is only as valuable as the business outcomes it makes possible. The way a business uses big data may enable the business to fully recognize the data's value and its potential to improve decision-making capabilities and enhance positive business outcomes.

Which of the following best describes a characteristic of big data? A.Data collected are free from useless information or incorrect variables. B.Data of untapped markets is often not collected. C.Big data is in a visual context, such as a graph or chart, rather than a text format. D.Collected data often provides straightforward answers to users.

B.Data of untapped markets is often not collected.Answer (B) is correct. One limitation of big data is that user-level data results are incomplete. Generally, the data available to an organization are restricted to data of persons who have had some contact with the organization (e.g., visited the organization's website or called the organization). The data are only representative of the target market; thus, untapped markets could potentially exist, the data of which are not being captured.

By observing the color density, the relative values of data or changes over time A.Show the strength of the linear relationship. B.Reflect the proportions of components. C.Are less understandable at a glance. D.Are more intuitive.

D.Are more intuitive.Answer (D) is correct. By observing the color density, the relative values of data or changes over time are more intuitive.

Which of the following is a true statement regarding data visualization? A.Data visualization tends to convey more complete information than raw data. B.Data visualization can take various forms. C.Data visualization is the use of computers to convey information. D.Data visualization is always the most appropriate way for presenting data.

B.Data visualization can take various forms.Answer (B) is correct. Data visualization may take various forms depending on the purposes and needs of a given situation. Examples of data visualization include tables, graphs, charts, maps, and images.

Under which category of data analysis should "anomaly detection" be classified? A.Predictive analysis. B.Descriptive analysis. C.Diagnostic analysis. D.Prescriptive analysis.

B.Descriptive analysis.Answer (B) is correct. The purpose of anomaly detection is to identify unusual patterns or deviations from the norm or expected results. The focus of anomaly detection is on the reporting of historical information (i.e., descriptive analysis).

Business analytics can be used by management to A.Build and maintain standards for data quality. B.Evaluate opportunities for enhancement or advancement. C.Specify the type of value and the applicable mathematical or logical operation methodology. D.Identify new topics and term relationships.

B.Evaluate opportunities for enhancement or advancement. Answer (B) is correct. Management utilizes data analytics to evaluate operational, financial, and other data to identify any deviations from the norm and opportunities for enhancement or advancement.

Which of the following is a correct statement regarding in-memory analytics? A.It is an open source software framework that stores large amounts of data and runs applications on clusters of commodity hardware. B.It analyzes data from system memory instead of hard drives. C.It is a technology that uses data, statistical algorithms, and machine-learning techniques to identify the likelihood of future outcomes based on historical data. D.It examines large amounts of data to discover patterns in the data.

B.It analyzes data from system memory instead of hard drives.Answer (B) is correct. In-memory analytics analyzes data from system memory instead of hard drives.

An organization's primary focus is on what it needs to do in order to accomplish its predicted future. Which of the following data analytics methods would best address this concern? A.Anomaly detection. B.Prescriptive analysis. C.Network analysis. D.Predictive analysis.

B.Prescriptive analysis.Answer (B) is correct. Prescriptive analysis concentrates on what an organization needs to do in order for the predicted future results to actually occur and therefore addresses this concern.

The type of data analytics that is most likely to yield the most impact for an organization but is also the most complex is called A.Descriptive analysis. B.Prescriptive analysis. C.Diagnostic analysis. D.Predictive analysis.

B.Prescriptive analysis.Answer (B) is correct. Prescriptive analysis uses descriptive, diagnostic, and predictive analytics to improve business strategy. It concentrates on what an organization needs to do in order for the predicted future results to actually occur. This type of analytics provides the most benefit but requires the most inputs.

Data fusion is best described as the A.Assurance of data quality. B.Process of integrating data and knowledge. C.Examination of data to discover unexpected patterns. D.Prediction of outcomes and behaviors.

B.Process of integrating data and knowledge.Answer (B) is correct. Data fusion is the process of integrating data and knowledge representing the same real-world object into a more consistent, accurate, and useful representation than the individual sources.

Velocity of data refers to the A.Wide variety of data file types. B.Speed at which big data are generated. C.Validation of data. D.Trustworthiness of data.

B.Speed at which big data are generated.Answer (B) is correct. Velocity refers to the speed at which big data are generated and must be analyzed.

What is the approximate percentage of Level III employees with a college degree or above? A.65% B.92% C.77% D.52%

C.77%Answer (C) is correct. Employees with a college degree or above include those with a college degree and those with a postgraduate degree. The percentage of Level III employees without a college degree is about 23%. Thus, the percentage of Level III employees with a college degree or above is about 77% (100% - 23%).

Entities that can benefit from using data analytics include A.Not-for-profit entities. B.Government agencies. C.All of the answers are correct. D.Business organizations.

C.All of the answers are correct.Answer (C) is correct. All organizations can utilize data analytics to reach conclusions based on evidence and reasoning to make well-supported decisions and formulate strong business models.

Under what data analytics method would dashboards and score cards be used? A.Prescriptive. B.Descriptive. C.Diagnostic. D.Predictive.

C.Diagnostic.Answer (C) is correct. Dashboards and score cards break down an observation into different aspects to facilitate the identification of the reason certain results occur.

Which of the following is a critical success factor in data mining a large data store? A.Effective search engines. B.Image processing systems. C.Pattern recognition. D.Accurate universal resource locator (URL).

C.Pattern recognition.Answer (C) is correct. Data mining allows a user to discover hidden relationships, such as associations, sequences of events, classifications (descriptions of the groups to which the item belongs), or clusters (new groupings previously not known). Typical applications of data mining are identification of potential customers and purchasing power.

Which element of visualization determines how to look for information? A.Title. B.Legend. C.Presentation. D.Axes.

C.Presentation.Answer (C) is correct. Presentation determines what information to look for and how to look for it.

Which of the following visualization methods is most suitable for retaining data details? A.Dot maps. B.Line chart. C.Table. D.Histogram.

C.Table.Answer (C) is correct. Tables present data in as close to their raw form as possible and are able to retain the details of the data.

Which of the following is a correct statement regarding volume-based value? A.Rapid analysis capabilities provide businesses with the right decision in time to achieve their customer relationship management objectives. B.The faster businesses can inject data into their data and analytics platform, the more time they will have to ask the right questions and seek answers. C.The more data businesses have on the customers, both recent and historical, the greater the insights. D.In the digital era, capability to acquire and analyze varied data is extremely valuable.

C.The more data businesses have on the customers, both recent and historical, the greater the insights.Answer (C) is correct. Volume-based value means that the more data businesses have on their customers, both recent and historical, the greater the insights.

Which of the following best represents the application of predictive analytics? A.A consultant organizes an analysis of causation for dissatisfied workers and possible interactions among causes. B.A cost accountant monitors whether direct materials used are within the acceptable variations for the last 6 months. C.The website recommends pet toys and bedding after the customer purchases pet food. D.The human resource manager prepares an analysis to show which departments have the highest employee turnover.

C.The website recommends pet toys and bedding after the customer purchases pet food.Answer (C) is correct. A common use of predictive analytics in the retail sector occurs when a customer selects an item to purchase online and prepares to finalize the transaction; the web page then displays additional products other customers purchased in conjunction with the initial item.
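The "other customers also purchased" behavior described above can be approximated by counting which items appear together in past baskets. This is only a co-occurrence sketch with invented baskets; the card does not specify how such recommendations are actually computed.

```python
from collections import Counter

# Hypothetical past purchases, one set of items per transaction.
baskets = [
    {"pet food", "pet toys"},
    {"pet food", "bedding", "pet toys"},
    {"pet food", "bedding"},
    {"shampoo", "towel"},
]


def recommend(item, history, top_n=2):
    """Items most often bought together with `item` in past baskets."""
    together = Counter()
    for basket in history:
        if item in basket:
            together.update(basket - {item})
    # Sort by count (descending), then by name so ties are deterministic.
    ranked = sorted(together.items(), key=lambda pair: (-pair[1], pair[0]))
    return [name for name, _ in ranked[:top_n]]


suggestions = recommend("pet food", baskets)
print(suggestions)  # ['bedding', 'pet toys']
```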

Each of the following represents a characteristic of big data except A.Mixture. B.Speed. C.Uniformity. D.Size.

C.Uniformity.Answer (C) is correct. Big data is often characterized by the "4 Vs" - volume, variety, velocity, and veracity. Thus, uniformity is not a characteristic of big data.

Financial statements and their notes are examples of A. Financial statements: unstructured data; notes to financial statements: unstructured data. B. Financial statements: structured data; notes to financial statements: structured data. C. Financial statements: unstructured data; notes to financial statements: structured data. D. Financial statements: structured data; notes to financial statements: unstructured data.

D. Financial statements: structured data; notes to financial statements: unstructured data. Answer (D) is correct. Structured data refers to data that are highly organized into predefined groupings and typically maintained in relational databases. The data are predefined such that each item falls into a specific anticipated data type. Unstructured data refers to information that has little or no predefined organizational structure, which makes it more difficult for computer programs to search, sort, and analyze. Financial statement amounts are reported in XBRL and stored as floats that can easily be sorted and searched by computer programs. Notes to financial statements usually have little predefined organizational structure and are difficult to sort and analyze using computer programs.

Match the following purposes with the elements of data visualization. A. Help convey how data are measured and presented: Title; provide additional information for understanding the visualization: Legend(s). B. Help convey how data are measured and presented: Presentation; provide additional information for understanding the visualization: Legend(s). C. Help convey how data are measured and presented: Presentation; provide additional information for understanding the visualization: Axes. D. Help convey how data are measured and presented: Axes; provide additional information for understanding the visualization: Legend(s).

D. Help convey how data are measured and presented: Axes; provide additional information for understanding the visualization: Legend(s). Answer (D) is correct. Axes convey how the data are measured and presented through the use of (1) labels (i.e., what the axis measures and presents), (2) range (i.e., whether the axis captures all the data), and (3) scale (i.e., the intervals or scaling used to present data). Legends provide additional information that helps users understand the visualization. They usually provide an explanation about the different colors, shapes, and sizes depicted in the visualization.

A company identified increasing operating costs as the reason for the decline in its profit margin for the previous year. To achieve its targeted profit margin for the coming year, the company develops a plan to cut its operating costs by 20%. Which of the following correctly matches the actions of the company and the type of data analytics methods used? A. Reason identification: diagnostic analysis; plan development: predictive analysis. B. Reason identification: predictive analysis; plan development: prescriptive analysis. C. Reason identification: descriptive analysis; plan development: predictive analysis. D. Reason identification: diagnostic analysis; plan development: prescriptive analysis.

D. Reason identification: diagnostic analysis; plan development: prescriptive analysis. Answer (D) is correct. Diagnostic analysis provides insight on the reason certain results occurred. Identifying the underlying reason for the declined profit margin is thus an example of diagnostic analysis. Prescriptive analysis concentrates on what an organization needs to do for the predicted future results to actually occur. By developing a plan to achieve the targeted profit margin, the company is applying a prescriptive analysis.

A car insurance company is considering opening branches in a foreign country. The country's population has been growing rapidly for the last 5 years. The company wants to know whether the number of car accidents is correlated with the recent population growth. Which of the following charts prepared by an analyst would be most helpful to the company? A.A table showing the number of car accidents in rows and the population size in columns. B.A line chart showing each month on the x-axis and the number of car accidents on the y-axis. C.A pie chart showing the number of car accidents in each city for the last 5 years. D.A scatter plot showing the number of car accidents on the y-axis and the population size on the x-axis.

D.A scatter plot showing the number of car accidents on the y-axis and the population size on the x-axis.Answer (D) is correct. A scatter plot is the best way to present the relationship by illustrating the correlation between the number of car accidents (y variable) and the population size (x variable).

Deviation from expected results can be identified by which application of data analytics? A.Data fusion. B.Prescriptive analysis. C.Diagnostic analysis. D.Anomaly detection.

D.Anomaly detection.Answer (D) is correct. Anomaly detection is used to identify unusual patterns or deviations from the norm or expected results.
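One simple way to flag deviations from the norm is a z-score test, sketched below. The monthly usage figures and the threshold of 2 standard deviations are assumptions chosen for illustration; the card does not prescribe a particular technique.

```python
from statistics import mean, pstdev

# Hypothetical monthly direct materials usage (units).
usage = [100, 102, 98, 101, 99, 150]

THRESHOLD = 2.0  # assumed cutoff, in standard deviations from the mean


def find_anomalies(values, threshold=THRESHOLD):
    """Return (index, value) pairs lying more than `threshold` std devs from the mean."""
    center = mean(values)
    spread = pstdev(values)
    if spread == 0:
        return []  # all values identical, nothing deviates
    return [
        (i, v) for i, v in enumerate(values)
        if abs(v - center) / spread > threshold
    ]


anomalies = find_anomalies(usage)
print(anomalies)  # [(5, 150)]
```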

All of the following are correct statements regarding big data except A.Big data is often characterized by the "4 Vs" - volume, variety, velocity, and veracity. B.Big data processes data with analytic and algorithmic tools to reveal meaningful information. C.Big data is an evolving term that describes any voluminous amount of structured, semi-structured, and unstructured data that has the potential to be mined for information. D.Big data includes information collected from social media, data from Internet-enabled devices, machine data, video, and voice recordings. The information collected is converted from high-density data into low-density data.

D.Big data includes information collected from social media, data from Internet-enabled devices, machine data, video, and voice recordings. The information collected is converted from high-density data into low-density data.Answer (D) is correct. Big data includes information collected from social media, data from Internet-enabled devices, machine data, video, and voice recordings. The information collected is converted from low-density data into high-density data, not from high-density data to low-density data.

Which product line contributed the greatest percentage of revenue in 2019? A.Jewelry. B.Home goods. C.Sporting goods. D.Clothing.

D.Clothing.Answer (D) is correct. In 2019, clothing contributed the greatest percentage of revenue, as depicted by the largest area in the center of the chart.

The Department of Transportation collects and combines acoustic, image, and other sensor data to better monitor the real-time traffic of a city. This is an example of A.Data scrubbing. B.Data normalization. C.Data mining. D.Data fusion.

D.Data fusion.Answer (D) is correct. Acoustic, image, and other sensor data are individual sources of data. The combination of the individual sources of data to better monitor real-time traffic is an example of data fusion, the process of integrating data and knowledge representing the same real-world object into a more consistent, accurate, and useful representation than their individual sources.

To ensure data quality, the IT department of a company establishes a master program to build standards for data quality. These standards are maintained and improved iteratively with inputs from personnel from different levels within the company. This process is an example of A.Data normalization. B.Data mining. C.Data scrubbing. D.Data management.

D.Data management.Answer (D) is correct. Data need to be high-quality and well-governed in order to be reliably analyzed. Data management consists of repeatable processes to ensure data quality and governance by establishing a master data management program and to build and maintain standards for data quality.

Which of the following best describes semi-structured data? A.Data organized into predefined groupings that can be easily sorted or searched. B.Large amounts of data collected from various sources. C.Information that has little or no predefined organizational structure. D.Data that are not highly organized but still have some identifying information.

D.Data that are not highly organized but still have some identifying information.Answer (D) is correct. Semi-structured data refers to data that are not as highly organized as structured data but still have some identifying information that can be used for organization by computer programs.
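JSON is a commonly cited example of semi-structured data: records carry field names (the "identifying information") but need not share the same fields. The sketch below uses made-up customer records to show how a program can still organize such data by its keys. The records and variable names are hypothetical.

```python
import json

# Hypothetical records: each has labels (keys), but the fields vary by record.
raw = """
[
  {"id": 1, "name": "Ana", "email": "ana@example.com"},
  {"id": 2, "name": "Ben", "phones": ["555-0100", "555-0101"]},
  {"id": 3, "name": "Chi", "email": "chi@example.com", "notes": "prefers email"}
]
"""

records = json.loads(raw)

# No fixed schema: collect every field name that appears in any record.
all_keys = sorted({key for record in records for key in record})

# The keys still let a program sort or filter the records.
with_email = [record["name"] for record in records if "email" in record]

print(all_keys)
print(with_email)
```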

Which of the following best describes unstructured data? A.Conforms with the organization of data models associated with relational databases. B.Data with a high level of organization. C.Data systematically stored with markers to enforce hierarchies of records and fields within the data. D.Information that is not organized in a pre-defined manner (e.g., text-heavy facts, dates, numbers, and images).

D.Information that is not organized in a pre-defined manner (e.g., text-heavy facts, dates, numbers, and images).Answer (D) is correct. Unstructured data refers to information that is not organized in a pre-defined manner (e.g., text-heavy facts, dates, numbers, and images).

Which of the following is a correct statement regarding Hadoop? A.It analyzes data from system memory instead of hard drives. B.It analyzes text data from the web, comment fields, books, and other text-based sources through the use of machine learning or natural language processing technology. C.It is a technology that uses data, statistical algorithms, and machine-learning techniques to identify the likelihood of future outcomes based on historical data. D.It is an open-source software framework that stores large amounts of data and runs applications on clusters of commodity hardware.

D.It is an open-source software framework that stores large amounts of data and runs applications on clusters of commodity hardware.Answer (D) is correct. Hadoop is an open-source software framework that stores large amounts of data and runs applications on clusters of commodity hardware.

Which of the following is a false conclusion regarding Product A? A.Direct costs for Product A are the highest among the products. B.Indirect costs for Product A are the highest among the products. C.Product costs for Product A are the highest among the products. D.Period costs for Product A are the highest among the products.

D.Period costs for Product A are the highest among the products.Answer (D) is correct. Period costs are costs that are expensed as incurred but not capitalized as part of inventory. In the cost categories listed in the visualization, the selling, general, and administrative (SG&A) costs are the period costs. Among the five products, Product A does not have the highest period costs (i.e., the bar for SG&A costs is not the longest).

Which of the following can be discovered using a data-mining process? A.Standard query reporting. B.Artificial intelligence. C.Data structure. D.Previously unknown information.

D.Previously unknown information.Answer (D) is correct. Data mining examines large amounts of data to discover patterns in the data (i.e., unexpected relationships among data). A classic example of the use of data mining is the discovery by convenience stores that diapers and beer often appear on the same sales transaction in the late evening. Thus, previously unknown information can be discovered using a data-mining process.
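The diapers-and-beer example can be sketched as a simple co-occurrence count over sales transactions. The code below is a minimal illustration rather than a full data-mining algorithm; the transactions and the function name `pair_counts` are invented for the example.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sales transactions, invented for illustration only.
transactions = [
    {"diapers", "beer", "chips"},
    {"diapers", "beer"},
    {"milk", "bread"},
    {"diapers", "beer", "milk"},
    {"bread", "chips"},
]

def pair_counts(baskets):
    """Count how often each pair of items appears in the same transaction."""
    counts = Counter()
    for basket in baskets:
        # Sort so each pair is always recorded in the same order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return counts

counts = pair_counts(transactions)
top_pair, top_count = counts.most_common(1)[0]

# Support: the share of all transactions that contain the pair.
support = top_count / len(transactions)
print(top_pair, top_count, support)
```

Here the pair (beer, diapers) appears in 3 of the 5 transactions, more than any other pair, which is the kind of unexpected relationship the card describes.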

Which product line contributed the least amount of revenue in 2020? A.Home goods. B.Jewelry. C.Clothing. D.Shoes.

D.Shoes.Answer (D) is correct. In the visualization, the shoes product line occupies the smallest area in 2020. Therefore, shoes contributed the least amount of revenue in 2020.

Which product line has shown continuous growth in revenue from 2018 to 2020? A.Home goods. B.Clothing. C.Electronics. D.Sporting goods.

D.Sporting goods.Answer (D) is correct. The continuous growth of sporting goods is depicted by the expanding percentage of revenue from 2018 to 2020.

Which of the following statements is correct if there is an increase in the resources available within an economy? A.The standard of living in the economy will rise. B.The technological efficiency of the economy will improve. C.More goods and services will be produced in the economy. D.The economy will be capable of producing more goods and services.

D.The economy will be capable of producing more goods and services.Answer (D) is correct. An increase in available resources expands the economy's capacity to produce. More goods and services will actually be produced only if demand is sufficient and society can employ the resources.

All of the following are correct statements regarding velocity-based value except A.The computing power required to quickly process huge volumes and varieties of data can overwhelm a single server or multiple servers. Organizations must apply adequate computer power to big data tasks to achieve the desired velocity. B.The faster businesses can inject data into their data and analytics platform, the more time they will have to ask the right questions and seek answers. C.Rapid analysis capabilities provide businesses with the right decision in time to achieve their customer relationship management objectives. D.The more data businesses have on the customers, both recent and historical, the greater the insights.

D.The more data businesses have on the customers, both recent and historical, the greater the insights.Answer (D) is correct. The statement that the more data businesses have on the customers, both recent and historical, the greater the insights describes volume-based value, not velocity-based value.

