ITM 209 Final Exam SG

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

Which of the following is a technology challenges for big data?

- Managing huge volumes of data - managing streams at an extremely fast and variable pace - Managing a variety of forms and functions of data - processing data at a huge speed

When considering the colors to use in a visualization, which of the following should be considered?

-Whether the color adds value to the visualization or is just decorative in nature - The manner is which certain color schemes may be interpreted based upon the culture(s) of the audience - accessibility / readability by individuals with color blindness

Which of the following are properties of primary keys?

-each tuple must have a unique primary key - several distinct attributes could be used together to form a primary key - is is the candidate key that is chosen as the principal means of identifying tuples within a relation

All organizations need to understand and govern PII through which of the following:

-identifying all sources of created, received, maintained or transmitted PII - evaluating all external sources of PII - identifying all human, natural and environmental threats to PII

Which of the following examples could cause a 'butterfly effect' (as defined in the test) in an organizations data?

-inaccurate customer records -incomplete purchasing history -a cascading spelling mistake

Which of the following is explained as the reason that humans retain comparative advantage over artificial intelligence when addressing uncertainty and equivocality in decision making?

-superior intuition - imagination - creativity

By what year does Ray Kurzwell predict that machines will ne able to achieve the intelligence of human beings?

2029

Which of the following describes what Trend Lines are?

A feature Tableau to know a line that represents the relationships between a set of data points that have been plotted (i.e. regression)

Which of the following does not describe unstructured data? A) A defined length, type, and format. B) Emails, twitter tweets, and text messages. C) Does not follow a specified format. D) Free-form text.

A) A defined length, type, and format.

Which of the following WOULD NOT be considered part of the ACCURATE characteristic of high-quality information? A) Is aggregate information in agreement with detailed information? B) Is the email address invalid? C) Does the name and the phone values have the exact same information? D) Is the name spelled correctly?

A) Is aggregate information in agreement with detailed information?

Joey is creating a model based upon past stock trading information. The purpose is to indicate to management management of the best stock derivative arrangements and when to enter into them. This would be an example of why type of analysis? A) Prescriptive B) Critical Path C) Predictive D) Descriptive E) None of the above

A) Prescriptive

In a full outer join, all the records from the right and left tables that meet the criteria of the query will appear. This would include records from each table where there are no related records (tuples) in the other table. A) True B) False

A) True

True or False: Organizations may have inconsistent data definitions between their production systems / databases. This may be a reason for the organization to utilize a data warehouse. A) True B) False

A) True

Using the 'as' statement in the select clause for a query will label the column or attributes header in the results with the specified text. For example: select people.personName as 'Name' would return 'Name' as the column header rather than personName. A) True B) False

A) True

Which of the following would be a reason to utilize a one-to-one (1-1) relationship? A) When you have attributes about tuples (records) for which not every tuple may have information for the attribute. For example, if you were recording information about people and did not record physical characteristics such as height for every person, you may create a 1-1 relationship. B) When a tuple needs to have multiple attributes concatenated to create the primary key. C) When the number of tuples for a table exceeds the allowable amount. D) None of the above

A) When you have attributes about tuples (records) for which not every tuple may have information for the attribute. For example, if you were recording information about people and did not record physical characteristics such as height for every person, you may create a 1-1 relationship.

Considering the following, which would be the correct inner join clause to use in the query: - The two tables being joined are prescriptions and patients. A patient may have multiple prescriptions. A prescription can only relate to a single patient. - The select clause is selecting the following fields: patients.name, patients.dateOfBirth, prescriptions.rxNumber, prescriptions.medication, prescriptions.dosage - The query contained 'from pharmacy.patients' for the from clause - The primary key of patients is patients.patientID - The primary key of prescriptions is prescriptions.rxNumber A) inner join pharmacy.prescriptions on prescriptions.patientID = patients.patientID B) inner join pharmacy.patients on prescriptions.patientID = patients.patientID C) inner join pharmacy.patients on patients.patientID = prescriptions.patientID D) inner join pharmacy.prescriptions on prescriptions.rxNumber = patients.rxNumber E) inner join pharmacy.prescriptions on prescriptions.rxNumber = patients.patientID

A) inner join pharmacy.prescriptions on prescriptions.patientID = patients.patientID

AJ wants to send Bryan a small message securely. He wants to make sure that only Bryan can read the message, thus ensuring confidentiality. Which of the following encryption methods would he use?

Asymmetric encryption, with the message encrypted using Bryan's public key.

What is a data lake? A) A technique for establishing a match, or balance, between the source data and the target data warehouse. B) A storage repository that holds a vast amount of raw data in its original format until the business needs it. C) An approach to business governance that values decisions that can be backed up with verifiable data. D) A business that collects personal information about consumers and sells that information to other organizations.

B) A storage repository that holds a vast amount of raw data in its original format

Which of the following is the collection of data from various sources for the purpose of data processing? A) Data granularity B) Data aggregation C) Data purposing D) Data scrubbing E) Data cleansing

B) Data aggregation

Bill runs a report of all the sales for the past quarter and puts it into a visualization to show his boss the results. This is an example of what type of analysis? A) Prescriptive B) Descriptive C) Critical Path D) Predictive

B) Descriptive

Data within a view is a duplicate copy of the data that is in the underlying tables related to the view. A) True B) False

B) False

True or False: Based upon the manner the tables are designed, a traveler could fly from the same departing airport to the same arriving airport multiple times on the same date. A) True B) False

B) False

True or False: In most organizations, the managers in the operational areas (such as the manufacturing plant level) would be more interested in less granular information, whereas the executive A) True B) False

B) False

True or False: The concatenated primary key of Table 3 could be replaced by a unique value that represented a ticket number for each time a passenger flew from one airport to another (if it exists). A) True B) False

B) False

Unstructured data extracts information from data and uses it to predict future trends and identify behavioral patterns. A) True B) False

B) False

A database containing data about reviews of restaurants by users contains the following tables: results of the query: SELECT restaurants.restName as 'Restaurant', restaurants.priceRange, reviews.numberOfStars, reviews.reviewText FROM schema.reviews RIGHT JOIN schema.restaurants ON reviews.restaurantID = restaurants.restaurantID ORDER BY reviews.numberOfStars, restaurants.restName

C

What would be the output from a query if the following wildcard pattern were used? select locations.cityName where locations.cityName like %or% from schema.locations; A) Finds cities that are four characters with "or" in the middle B) Finds cities that do not contain "or" in the middle, but can begin or end with "or" C) Finds any cities that have "or" in any position D) None of the above

C) Finds any cities that have "or" in any position

What is the role of a foreign key? A) It is a field that uniquely identifies a given record in a table B) It is a unique way to identify each attribute C) It is an attribute that is the primary key of one table that appears as an attribute in another table. It acts to provide a logical relationship between the two tables D) All of the above

C) It is an attribute that is the primary key of one table that appears as an attribute in another table. It acts to provide a logical relationship between the two tables

What uses techniques that create models indicating the best decision to make or course of action to take? A) Descriptive analytics. B) Critical Path analytics. C) Prescriptive analytics. D) Predictive analytics.

C) Prescriptive analytics.

What do the tables above show? A) Airports and the employees / pilots that work at them B) Traveler frequent flyer information C) Primary airports for a set of travelers D) Travelers and flights they have taken or will take

C) Primary airports for a set of travelers

Table 1: meals Fields: - mealID (PK) - mealName - numberOfServings - caloriesPerServing Table 2: ingredients Fields: - ingredientID (PK) - ingredientName - ingredientType Table 3: ???????? Fields: - ingredientID (PK) - mealID (PK) - quantity - unitOfMeasure 23) If you had the tables above, what would the third table (labeled above with ????????) best represent? A) A food order from a table of customers at a restaurant B) The bill of materials for manufacturing a car C) Recipes for meals D) The ingredients that a person has in their pantry E) The products in inventory at a grocery store

C) Recipes for meals

Which of the following best represents a SQL statement showing the ages of the employees within each department of a college campus who are over 50 or under 40, sorted numerically starting with the youngest? A) SELECT departments.deptcode, departments.department, employees.age FROM employees INNER JOIN departments ON departments.deptCode = employees.employeeID WHERE employees.age > 50 and employees.age < 40 ORDER BY employees.age ASC; B) SELECT departments.deptcode, departments.department, employees.age FROM employees INNER JOIN departments ON departments.deptCode = employees.deptCode WHERE employees.age > 50 and employees.age < 40 ORDER BY employees.age ASC; 26) C) SELECT departments.deptcode, departments.department, employees.age FROM employees INNER JOIN departments ON departments.deptCode = employees.deptCode WHERE employees.age > 50 or employees.age < 40 ORDER BY employees.age ASC; D) SELECT departments.deptcode, departments.department, employees.age FROM employees INNER JOIN departments ON departments.deptCode = employees.deptCode WHERE employees.age > 50 or employees.age < 40 ORDER BY employees.age DESC;

C) SELECT departments.deptcode, departments.department, employees.age FROM employees INNER JOIN departments ON departments.deptCode = employees.deptCode WHERE employees.age > 50 or employees.age < 40 ORDER BY employees.age ASC;

Which of the following applies to many to many relationships but not to one to many relationships? A) Only one inner join clause is needed for a many to many B) Primary keys are used as foreign keys in corresponding tables in the relationship C) You need a third table to create the relationship D) All of the above E) None of the above

C) You need a third table to create the relationship

Details about the data is referred to as: A) primary key B) data lake C) metadata D) big data E) entity

C) metadata

The primary key of the table 'ownership' is: A) stateIDnumber B) hullID C) stateIDnumber and hullID combined D) There is no primary key for the table 'ownership'

C) stateIDnumber and hullID combined

The assurance that messages and information remain only to those authorized to view them

Confidentiality

Scott has data that contains a field showing the high temperature in degrees Fahrenheit (ex. 65, 70, 73, 40, ect.) in his town by day. He wants to be able to show the temperature for each day in one of two categories:

Create the following calculated field: IF (temperature) < 80 THEN "Normal" ELSE "Above Normal" END

Which of the following would be an example of predictive analytics?

Creating an analysis of the number of cars that passed through a segment of freeway each day of the past two years to attempt to determine future traffic patterns

If the concatenated primary key of Table 3 did not include departingAirportCode and arrivingAirportCode, which of the following would become TRUE about the design of tables? A) Only one traveler could fly each day from one airport to another B) A traveler could fly from the same departing airport to the same arriving airport multiple times per day C) All travelers could only travel between two airports D) A traveler could only travel once a day E) None of the above

D) A traveler could only travel once a day

Which of the following is TRUE about a view: A) It can be used within a database to store table relationships (i.e. a query) for users to access B) It conceptually contains the results of a query C) If the underlying data in the tables and relations changes, so will the results of the query D) All of the above E) None of the above

D) All of the above

What is the relationship between meals and ingredients? A) Many to one B) Multiple to one C) One to many D) Many to many E) One to one

D) Many to many

Using the tables shown above, what does the data in the tables appear to represent? (Select the best answer) A) Boats from a brochure that people have inquired about. B) A wish list of the boats that people are interested in buying. C) Boats that people have driven. D) The history showing the buying and selling of boats by individuals.

D) The history showing the buying and selling of boats by individuals.

Which of the following IS NOT one of the five common characteristics of quality data? (as described in the text and in class) A) Complete B) Valid C) Accurate D) Unique E) Timely

D) Unique

The tables 'people' and 'boats' have what kind of relationship? A) one-to-one B) many-to-one C) one-to-many D) many-to-many

D) many-to-many

Bee works for a large auto dealership in their service department. She has a data set that constrains information on services provided, which includes the date a vehicle came in for service (field dateIn) and the date service was completed. (field dateComplete). Which of the following calculated fields in Tableau would identify the number of days that it took to complete a service?

DATEDIFF(day',[dateIn),[dateComplete])

Eric was asked to setup a visualization summarizing data on patients staying at the hospital based around the number of days they have been there. He has a data set that contains information on patients, which includes the data the patient was admitted (fieldadmittedData). Doing some research, he found that Tableau uses TODAY() to represent the current date. Which of the following calculated fields in Tableau would identify the number of days that patients have been at the hospital?

DATEIFF('day', [admittedDate], TODAY())

Which of the following is the process of analyzing data to extract information not offered by the raw data alone?

Data Mining

Collecting information from many sources and storing them together into a single location is referred to as:

Data aggregation

Tools used to find patterns and relationships in large volumes of information that predict future behavior and guide decision making are referred to as:

Data mining tools

Which of the following is a type of visualization in which you are presenting finding to an audience?

Declarative visualization

Which of the following is a type of visualization in which you are presenting findings to an audience?

Declarative visualization

Which of the following fields in a data set usually be found in the Dimensions area in Tableau?

Departments

Which of the following keywords when used in a SQL select statement will remove duplicate records from the results?

Distinct

Which of the following may be indicators of big data? A) Velocity B) Veracity C) Variety D) Volume E) All of the above

E) All of the above

During which of the following processes does information cleansing usually occur?

ETL processes

Companies use data warehouses for each of the following except:

Enter and process invoices real-time as they are recieved

The principles and standards that guide our behavior toward other people

Ethics

Which of the following can be described as using if then statements to capture human knowledge?

Expert systems

A data set is a collection of organized or unorganized data.

False

Box and whisker plots are used for identifying correlation between two variables

False

Contemporary database systems provide a three-level hierarchy for naming relations. The top level of the hierarchy consists of schemas, each of which contain catalogs.

False

Data models show the details of the physical view of information for a database.

False

Discrete example of continuous data is height

False

IBM's Watson can only analyze structured data.

False

Intuitive approaches to decision making rely on depth of information, analytical approaches focus on breadth by engaging a problem with a holistic and abstract views.

False

The intersection operation does not remove duplicates. To remove duplicates, intersect all must be utilized.

False

The only cause of poor quality of data is human error

False

The problem solving ability of Ai is more useful for supporting intuitive rather than analytical decision making.

False

The technique of organizing data into distinct segments that are defined before the analysis begins is referred to as cluster analysis.

False

The validation set of training data for an advanced neural network is used only to test the final solution in order to confirm the actual predictive power of the network

False

PKE (Public Key Encryption) uses a single common key between the sender and recipient of a message to encrypt and decrypt the message

Fase

Which of the following would violate a foreign-key constraint?

Having a value in the attribute for a foreign key that does not correspond to a value in the table which the foreign key is coming from

HIPAA is a regulation that applies to which industry?

Healthcare

Governance of the ethical and moral issues arises from the development and use of informational technologies as well as the creation, collection, duplication, distribution, and processing of information

Information Ethics

Which of the following is decreased when using a relational database?

Information Refundancy

Which of the following refers to the measure of the quality of information?

Informational Integrity

Which of the following is NOT a component of Artificial Intelligence

Intuition Engine

Jill is creating a visualization in Tableau that is plotting points on a map. She decides to use the 'size' mark in her visualization. What does this accomplish?

It differentiates the points based upon the values of the measures used by making larger values visually bigger points

Jill is creating a visualization in Tableau that is plotting points on a map. She decides to use their 'size' mark in her visualization. What does this accomplish?

It differentiates the points based upon the values of the measures used by making larger values visually bigger points.

Which of the following describes a full outer join?

It preserves tuples in both relations

Which of the following charts are good for showing data changes over time?

Line chart

The type of qualitative data that cannot be ranked, but can used to count, group and take a proportion is:

Nominal

Which of the following is the first line of defense in securing information?

People

What does PII stand for?

Personally Identifiable Information

Which of the following is used to uniquely identify a row (or tuple) in a table?

Primary Key

Normal Data and Ordinal Data both are types of:

Qualitative Data

What feature of Tableau would you utilize to label the percent of total that a slice of a pie chart makes up? (such as 5.5%)

Quick table calculation

When using diverging colors on a diagram, which of the following is one of the least desirable color schemes when considering the ability for those with color blindness to be able to effectively read / use the visualization?

Red-Green Diverging

Which of the following is referred to as the use of social skills to trick people into revealing access credentials or other valuable information?

Social Engineers

The pattern of reading that was originally based upon eye tracking behavior on websites but is applied to visualizations in general when determining the best layout for a dashboard is referred to as:

The F Pattern

What should be your focus when designing your visualization?

The audience

What should be your primary focus when designing your visualization?

The audience

Artificial Neural Network are designed after which of the following:

The human brain

Size, color, label and detail are all examples of Tableau features that are found where?

The marks card

Early systems of AI used deterministic hard coded logic. Which of the following describes why this method of creating AI became tenuous?

The worlds store of information kept growing

When Artificial Neural Networks are referred to as black boxes, which of the following is being referred to?

They provide little guidance on the intuitive logic behind their predictions

Information itself has no ethics. Therefore who is responsible for developing ethical guidelines about how to manage it?

Those who own the information

When creating a histogram, what is the purpose of using the 'create bins' feature within Tableau?

To group together bands of values into buckets for measures that represent continuous data

What is the where clause in a SQL statement used for?

To select only those rows in the result relation of the from clause that satisfy a specified predicate

Which of the following sets of data are used in machine learning to adjust the weights on the neural network?

Training Set

What chart type would be the best to show the hierarchical nature of data (i.e. how sub- components build up to their parents components)?

Tree Map

What chart type would be the best to show the hierarchical nature of data (i.e. how sub-components build up to their parent components)?

Tree map

Which function would be used in Tableau to show a line that represents the relationships between a set of data points that have been plotted (i.e. regression)?

Trend Lines

A person can act legally but not be acting ethically

True

A schema diagram is a pictorial depiction of the schema of a database that shows the relations in the database, their attributes, and primary keys and foreign keys.

True

Big data is growing at an exponential rate

True

Companies that are using analytics to automate processes in the business are gaining benefits through employees having more time to work on higher-value-added tasks.

True

Deep Learning is a subset of machine learning.

True

Dumpster diving is a method of obtaining information from users by going through discarded items (e.g. trash)

True

Human-AI symbiosis is effective because it allows for a blend of both analytic and intuitive approaches to decision making

True

In a SQL statement, union is used to join two queries together.

True

In order to preform any actions on a database, a user (or a program such as MySQL Workbench) must first connect to a database.

True

One example of continuous data is distance.

True

One example of continuous data is height

True

Qualitative data is categorical data

True

Tableau allows for connections to live data in a database for purposes of having dashboards that can be refreshed periodically at a predetermined frequency.

True

Text in a novel is an example of unstructured data

True

The select clause of the statement is used to the list the attributes desired in the result of a query

True

The use of the and logical connective is to find tuples that meet two or more criteria

True

Wish enough training through machine learning, a neural network can learn enough to begin to match the predictive accuracy o fa human expert

True

Which describes prescriptive analytics?

Uses techniques that create models indicating the best decision to make or course of action to take.

What is having used instead of where?

When groups are present through the use of an aggregate function (such as avg, count, ect.) and conditions need to be applied to the groups

When should you use multiple colors?

When you need to differentiate types of data

Which of the following charts is described in the chapter as functioning well for showing proportions (vs. Quantitative Data)?

Word Clouds

Which of the following use of predictive analytics has variables that are changed due to factors outside the data-generating process and are independent of all other variables?

active prediciton

Joe is doing an analysis of his investment portfolio. His data contains variables that are change due to factors outside the data generating process and are independent of all other variables in the data. Which of the following predictive analytics uses describes the type of prediction he is doing?

active prediction

The options for order by when writing a SQL statement are:

asc, desc

If the following join statement is used to join two tables in a query, which of the following tables would all of the tuples in the relation appear in results? full outer join schema.customers on invoices.customerID = customers.customerID

both customers and invoies

The as clause:

can be used to rename attributes in the results of the query

When using colors to visually distinguish between on a visualization in Tableau, which of the following use of color is most commonly applied?

categorical

Joe is working with accounts receivable data. The database he is getting data from lists the balance in accounts receivable as $2250,654. However, when he adds the accounts receivables from all the sub-accounts (i.e. customer accounts), the he gets a value of $251,928. The database may be suffering from integrity issues due to which of the following quality characteristics?

consistency

The ZN functions in Tableau can be used to do which of the following?

convert null values for a field in a data set to a value of 0

Which aggregation function shows the number of records that meet a set of criteria?

count

If the following join statement is used to join two tables in a query, which of the following tables would all of the tuples in the relation appear in the results? right outer join schema.customers on invoices.customerID = customers.customerID

customers

Which of the following describes the phenomenon where there is an incentive to record everything?

datafication

A summary or interpretation of a data set is an example of:

description analytics

Color and formatting should be used in Tableau to:

draw attention to relevant data

The purpose of integrity constraints is to:

ensure that changes made to the database do not result in a loss of data consistency

Regression models are used to:

estimate the relationships among variables

The three factors of the variety of data are:

form, function, source

Which of the following are better at making decisions when there is uncertainty

humans

Which of the following is the human capacity to analyze alternative with deep perception, transcending ordinary-level function based on simple rational thinking?

intuitive intelligence

If the following join statement is used to join two tables in a query, which of the following tables in the relation appear in results? left outer join schema.customers on invoiceses.customerID = customers.customerID

invoices

A digital certificate:

is a data file that identifies individuals or organizations online

Which of the following is used in a SQL statement where clause to show all records where a particular attribute has null values.

is null

When creating a view, the data that is returned from query the view:

is stored in the tables that the view queries

Encryption:

is used to scramble information into an alternative form that requires a key to read it

What is needed to train an neural network?

large amounts of data

Which of the following are all the same value in a normal distribution?

median, mean, and mode

The integrity constraint that requires that an attribute in a tuple not be blank ( i.e. no value) is:

not null

The operator like in a SQL statement is used for:

pattern matchting

The right to be left alone when you want to be:

privacy

Which of the following describes the basic premise of how an Artificial Neural Network works?

receive inputs , process the inputs, provide an output

The concept that a value that appears in one relation for a given set of attributes must appear for a set of attributes in another relation is:

referential integrity

Lauren is querying a data set and the results she keeps getting has a lot of duplicate rows returned. She would like to remove duplicates form the results and only display unique rows of data. What function in SQL would she use in her query?

select distinct

The three basic clauses of a SQL statement to select data are:

select, from, where

Three maps and heat maps use which of the following to show proportional size of values?

size and color

Tree maps and heat maps use which of the following to show proportional size of values?

size and color

Pattern discovery is:

the process of identifying distinctive relationships between observations in a data set

A null value means:

the value is unknown or does not exist

Which of the following is characterized as a lack of information about all alternatives or their consequences?

uncertainty

Which of the following describes the veracity characteristics of big data?

uncertainty and or untrustworthiness of data

The integrity constraint that requires that no two tuples can have the same value for an attribute is:

unique

Big data is mostly, over 90%:

unstructured data

Donovan is creating a chart that utilizes a map. He wants to have the map show the borders of the different countries within each state. Where would he go to enable this on the map?

use the Map Styles menu option

Which of the following describes the speed of data?

velocity

Which of the following charts functions well for showing proportions (vs. quantitative data)?

word clouds


Kaugnay na mga set ng pag-aaral

POLITICAL SCIENCE 100 FINAL EXAM REVIEW

View Set

chap 19 - technology in business

View Set

medical terminology usf final exam, medical terminology final exam ch 8-14

View Set

Chapter 1 Pretest and Appendix-B Test

View Set