INFS MIDTERM STUDY GUIDE

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

Which of the following is associated with Iconic Memory...

Automatic and unconscious

Visual distortion is typically not caused by:

Bright colors

Which of the following pre-attentive attributes should be used to represent quantitative data?

. Length 2D position

Consider the following calculated field TripCat. How many different values can TripCat have? IF [Tripsize] <=5*60 THEN "FastCat" ELSEIF [Tripsize] <=10 THEN "QuickCat" ELSEIF [Tripsize] <=30 THEN "SlowCat" ELSE "Dog?" END

B. 2

Which element in Tableau is specifically designed (independent of the data source), to allow user input in order to change vizzes without modifying any code.

B. Parameter

Contain measures related to the subject of analysis and the foreign keys

Fact table

You are given a table called CUSTOMER with the following fields/Columns: CustomerID, CustomerName, ContactName, Address, City, PostalCode, Country1) Write an SQL statement that displays all records and all columns2) Write an SQL statement that displays CustmorName, City, and Adress for customers from Germany only.3) Write an SQL statement that displays a list of the number of cities by country and orders the list by the number of cities in a descending order

1SELECT * FROM Customers; 2 SELECT CustomerName, City, Address FROM CUSTOMER WHERE Country = 'Germany'; 3) SELECT Country, COUNT(City) FROM CUSTOMER GROUP BY Country ORDER BY COUNT(City) DESC;

Spacing in tables and direct labeling can be effective because of the Gestalt Law of

Proximity

You are asked to create visualization with following requirements (using Divvy Trips dataset for Q4 2019): Create an interactive viz that displays the ranking of most popular hours for Divvy rides (by number of trips). Segment 'hour' into following segments Morning (all hours before noon), Afternoon (all hours from noon until before 6 pm), Night (all hours after 6 pm). Add color based on those three segments. Parameterize the number of hours displayed so that user can select a range between 10 and 20. Explain in detail steps you would do, including identifying what calculated fields, parameters, and filters you need to create, formulas you would use for each, any table calculations and/or parameter options, as well as identification for each dimension/measure/parameter its location on the worksheet (row, column, filter, mark, page display) Hint - you will need to calculate 'hour' dimension from Start Time (you can call it 'Start Time (Hours)'. You will then use this new dimension to calculate three segments (you can call this new dimension 'HourSegment'). See attached visual hint how your end product will look like. Yellow rectangles were used to hide items you will need to create/move.

First we will create the Start Time(Hours) dimension from creating a calculated field and using the DATEPART('hours', [Start Time]) function, and then convert it into discrete and a dimension. After that we will drag the already created Divvy_Trips_2019_Q4.csv(Count) measure to the columns shelf and the Start Time (Hours) dimension to the rows shelf. This will create a horizontal bar chart of all hours and counts in that chart. Next we will create a calculated field for the hour segments. The calculated field will be in this format: IF [Start Time(hours)] < 12 THEN 'Morning' ELSEIF [Start Time(hours)] < 18 THEN 'Afternoon' ELSE 'Night' END This will create the hours segment dimension and we will drag that to the color mark in the marks card. Next we will create the row_num index by creating a calculated field that only holds the formula INDEX() and then convert it to discrete. This will create an index that we will place in the columns shelf before Start Time(hours). Next we will do a table calculation on the row_num index and select specific dimension, custom, trip id, count, descending. Now the index is connected to the count of the data in descending order. Now we will create a parameter named TopK, we will select Range and set minimum to 10 and maximum to 20 and hit ok. Next we have to create a TopKFlag variable by making a calculated field with the formula: row_num<=[TopK] After making this variable we will drag it to the filters card and once the filters window comes up we select "True" to display only values that are less than or equal to what the TopK parameter slider is set at. To relist the pills. Within the dimensions we created Start Time(hours) and Hours Segments fields, and for measures we created the discrete row_num variable and the TopKFlag. In the filters card there is TopKFlag set to True. In the color mark card there is the hours segments. in parameters there is the Top K parameter which we select "Show Parameter" and that is why it shows to the right of the viz with a slider. In columns we have the automatically generated count of trips measure which is also the x axis title. In rows we have row_num on the left and Start Time(hours) on the right (which are also the row titles). And finally the cards to the right of the viz are titled as TopK and Hours Segments.

See the attachment for your reference. Consider the following scatter plot. For each of the 4 pills (in Color, Columns, Detail, and Rows), identify the most likely dimension, measure, or calculated field that was used to generate it by identifying the name of the element and its corresponding pills. Describe how we can limit the scatter plot to include only trips to the station "Shedd Aquarium" by identifying appropriate dimension to the appropriate shelf or card.

For each of the 4 pills (in Color, Columns, Detail, and Rows), identify the most likely dimension, measure, or calculated field that was used to generate it by identifying the name of the element and its corresponding pills.Gender: the gender dimension would be used to generate the corresponding pill. We can assume the orange and red pills are representing these types. You would place gender through the color filter (Marks box down at the bottom) to assign the colors.Day of Start Time : You would use a custom date to in order to display the day of start time variable. You do this by clicking on the existing variable Start time and clicking "create... custom date"You select date value and chose day. This is then placed in the marks box. The dates will then show in blueAvg Tripduartion: Average Trip Duration is placed in the row shef of tableau. This is created by placing the variable trip duration into the row shelf which displaces the average trip duration on the left y axis.# of Trips: number of trips (created in previous question) will be placed in the columns shelf and will display the number on the x axis.Describe how we can limit the scatter plot to include only trips to the station "Shedd Aquarium" by identifying appropriate dimension to the appropriate shelf or card.We can limit the scatter plot to only include trips taken to station Shedd Aquarim by adding the To Station Name to Columns shelf and filtering to Shedd AquarminThis can also be done by placing to Shedd Aquarmin in the filter box and selecting Shedd Aquarmin.

Select the false statement A. Blue pills represent discrete variables B. Tableau can connect to both databases and data warehouses as well as Excel files. C. Default behavior and choices differ for discrete and continuous variables D. Tableau metadata is inferred automatically from the data source E. Green pills represent discrete variables

Green pills represent discrete variables

See the attachment for your reference. Consider the following blank Tableau worksheet that uses the Divvy Trips dataset for Q3 2017. Your goal is to create a horizontal bar chart that shows, for every different To Station Name, the total number of trips that end at that station. In your description make sure to identity what dimensions and measures would be used and their appropriate locations in the worksheet (Rows, Columns, Marks, Filter, Page, etc). If any measure or dimension will be aggregated in the Columns or Rows shelves, show the aggregate function (e.g. COUNT, CNTD, SUM, AVG, etc.)

In order to create a horizontal bar chart that shows, for every different To Station Name, the total number of trips that end at that station, you would need to use:1) existing variable "To Station Name"2) Variable you create titled "# of Trips"To create the number of trips, you need to create a Calculated Field. Once this is open you would use the COUNTD function to measure Trip IDthe following would result in : COUNTD ([Trip ID])You would place the Number of Trips variable in the column shelf, and the To Station Name Variable in the Rows shelf.The result would display the station name on the left hand side, and the number of trips on the bottom x axis. The x axis will measure the number of trips.

Which of the following variables are likely to be considered quantitative?

Student GPAs

A characteristic of Analytical data

Subject Oriented


Ensembles d'études connexes

Chapter 5: The nurse as Manager of Care

View Set

ChFC Module 4 - Lesson 8: SIMPLE, 403(b), and 457 Plans

View Set

3 steps of Hemostasis (blood clot formation)

View Set

Second Year / Semester Two / 17BSB562 / The Marketing Mix / Essay Question Cue Card Plans

View Set

Mastering A&P II- Chapter 25 Practice Urinary System

View Set

Chapter 16: Documenting, Reporting, Conferring, and Using Informatics

View Set

History (Mass Politics and Imperial Domination)

View Set

Learning English for Spanish speakers C

View Set

Hand Fingertip Amputations, Dupuytren's, Vascular

View Set

Switching, Routing, and Wireless Essentials Chapters 10-13

View Set

nur 116 - Davis Advantage / Edge - Renal Failure

View Set