Tableau Conceptual
What function can you use to return the current date and time?
NOW()
If you need to modify the data connection, what should you do?
Riight click on the data source, options appear from there
Totals
Adds totals to the view. When you add totals, the drop options are Subtotals, Column Grand Totals, and Row Grand Totals.
In what ways can you change parallelism settings?
1) Adding a global limit on the number of parallel queries for Tableau Server 2) Limits for a particular data source type such as SQL Server 3)Limits for a particular data source type on a specific Server 4)Limits for a particular data source type on a specific server when connecting to a specific database
Situations in which extracts may not be appropriate:
1) Real-Time Data - because extracts are a point of time snapshot of data they would not be appropriate if you need realtime data in your analysis. It is possible to automatically refresh extracts using Tableau Server and many customers do this at intraday frequencies but true real time data access would require a live connection. 2) Massive Data - If the dataset you need to work with is massive (millions to billions of records) then extracting may not always be practical. Resulting extract file may be excessively large or extract process may take many hours. 3) Pass-through functions - if the workbook uses pass-through functions these will not work with a data extract. 4) Robust user-level security - if you have a requirement for robustly enforced, user level security then this needs to be implemented in the data source. If you have user level filters applied at the workbook level, these can always be removed by a user allowing them access to all data in the extract.
What are some possible issues encountered in splits?
1)auto and custom splits can sometimes be missing for a supported data type. 2)Null values or empty cells ( occasionally nulls will appear when there are no values for all expected new fields) 3)Data has been removed( sometimes tableau can use portions of the fields values as separator
Pareto Chart -
A Pareto chart is a type of chart that contains both bars and a line graph, where individual values are represented in descending order by bars, and the ascending cumulative total is represented by the line. most commonly associated with the 80 20 rule
What is a flat file?
A data file with no structured hierarchy
Histogram -
A histogram is a frequency plot: it will illustrate the number of times a particular result occurred in a data set and does so in the form of a bar chart. It is always a good idea to have an overview of the entire distribution of a variable, and histograms are another great way to gain that perspective.
Standard Deviation
A measure of data spread around its average, measured in the same units as the data itself. The sample standard deviation is an unbiased estimate of the population standard deviation given a slight correction. This standard deviation includes the correction.
First Quartile
A measure of location that is commonly used with other quartiles to provide a robust measure of spread. Robust in this case means not as sensitive to outliers as the standard deviation. The first quartile is the 25th percentile, typically the lower line in a boxplot.
Third Quartile
A measure of location that is commonly used with other quartiles to provide a robust measure of spread. Robust in this case means not as sensitive to outliers as the standard deviation. The third quartile is the 75th percentile, typically the upper line in a boxplot.
Skewness
A measure of the tendency of your data to have extreme values to one side. Positive skewness means the extreme values are to the right, while negative skewness means the extreme values are to the left.
Excess Kurtosis
A measure of the tendency of your data to have more extreme or outlying values than a normal distribution. A normal distribution has a kurtosis of 3 so this value is kurtosis minus three.
Data blending is?
A method for combining data that supplements a table of data from one data source with columns of data from another data source.
What is and how do you apply a Quick table calculation?
A quick table calculation is a one-step process where you choose a common table calculation type from a list. Tableau automatically applies the most typical settings for that calculation type. To apply a quick table calculation, do the following: Click a measure in the view and choose Quick Table Calculation from the context menu. Choose a Calculation type from the sub-menu.
What is the performance application of placing a filter on a data source?
A reduction in the amount of data you are dealing with. (Allowing quicker processing)
What is the data highlighter and where can it be used?
A simple tool that allows you to highlight specific data or data aspects to appear more pronounced to the user. Highlighter can be used in worksheets, dashboards, and stories
Sparklines -
A sparkline is a compact trend line chart without axis that fits in a small area and shows values over time. The key is that sparklines show trends that means that they visualize timeseries data, sparklines are line charts, they are without axis which means they do not start at zero, and they are compact which means that they are intended to fit in a small area and are mostly used for indicative purpose of the trend and changes over the period of time.
How do you build an absolute value calculation?
ABS(number) ABS(-7) = 7 or ABS([Budget Variance}) the above example returns the absolute value for all the numbers contained in the Budget Variance field
How do you build an arc cosine calculation?
ACOS(number) this result is in radians, example ACOS(-1) = 3.141592
What function in a calculated field returns the the ASCII code for the first character in the string?
ASCII("Alabama") = 65 - ASCII()
How do you build an arc sine calculation?
ASIN(number) the result is in radians example ASIN(1) = 1.57079
how do you build an arc tangent calculation?
ATAN(number) the results is in radians ATAN(180) = 1.565240
How do you build an arc tangent calculation for 2 given numbers?
ATAN2(y number,x number) the result is in radians ATAN2(2,1) = 1.1071487
What are Ad-hoc Calculations?
Ad-hoc calculations are calculations that you can create and update as you work with a field on a shelf in the view. Ad-hoc calculations are also known as type-in or in-line calculations.
Forecast
Adds a forecast to the view. Forecasting is only possible when there is at least one date and one measure in the view. Forecasting is not supported for views based on multidimensional data sources. In addition, the view cannot contain any of the following: Table calculations Disaggregated measures Percent calculations Grand Totals or Subtotals Date values with aggregation set to Exact Date A time series containing null values also imposes constraints.
Average Line
Adds one or more average lines to the view. You can add an average line for a specific measure or for all measures. You can click on a resulting average line and choose a different aggregation, such as Total or Sum. You can also choose Edit, Format, or Remove.
Constant Line
Adds one or more constant lines to the view. You can add a constant line for a specific measure, for all measures, or for date dimensions.
Average with 95% CI
Adds one or more sets of average lines with distribution bands; the distribution bands are configured at a 95% confidence interval. You can add these items for a specific measure or for all measures. The confidence interval distribution bands shade the region in which the population average will fall 95% of the time.
Median with Quartiles
Adds one or more sets of median lines and distribution bands to the view. You can add a median with quartiles for a specific measure or for all measures. The distribution bands are computed as quartiles; the middle two quartiles are shaded. You can click on a resulting median line or distribution and choose Edit, Format, or Remove. Median lines and distributions must be edited, formatted, or removed separately.
Median with 95% CI
Adds one or more sets of median lines with distribution bands; the distribution bands are configured at a 95% confidence interval. You can add these items for a specific measure or for all measures.The confidence interval distribution bands shade the region in which the population median will fall 95% of the time.
Trend Line
Adds one or more trend lines to the view. When you add trend lines, the drop options identify the trend line model types available in Tableau: Linear, Logarithmic, Exponential, and Polynomial. For some views, only a subset of these options is available.
If you want Tableau to fill in a lack of data with zeros (such as when looking looking at number of calls throughout a day) how do you do that? what's the advantage?
Advantage is a smooth, continuous line To do it: Click on the pill to edit in shelf; type zn, then type an open parentheses at the beginning of the line, and close parentheses at the end of the line
What's one way to describe continuous values?
All the shades between the discrete numbers or "buckets"
Cross-Tab -
Also known as a text table, essentially the same view you would see from excel through view data. The mark type is text and the data is organized into rows and columns.
What happens when you do a blend, and miss a link?
Among other things, it can create duplicate records because they're not properly merged
What is an example of a data source that cannot be extracted?
An OLAP database
How many seasons and data points are needed for a forecast?
At least two seasons, or at least five data points if it's not seasonal
In what ways can you split columns in a datasource?
Automatic and custom split can be used to split a column on a separator.
Bar-in-bar charts -
Bar in bar charts provide a fantastic way to compare a measure against a goal or to display two measures against one another.
What do the boxes represent in box plots?
Boxes indicate the middle 50 percent of the data (that is, the middle two quartiles of the data's distribution).
Are text dimensions continuous or discrete?
By definition it has to be discrete
How do you build a calculation that rounds to the nearest integer of equal or greater value?
CEILING(number) Ceiling(3.1415)=4
What function in a calculated field converts an integer ASCII code into a character?
CHAR(65) = 'A' - CHAR()
What function will return a TRUE value if the string contains the substring and a FALSE if it does not?
CONTAINS(string, substring)
How do you build a calculation for the cosine of an angle.
COS(Number) specify the angle in radians COS(PI()/4) = .707106
How do you build a calculation that returns the cotangent of an angle?
COT(number) specify the angle in radians COT(PI()/4) = 1
What are dimensions?
Categorical fields; the fields we want to slice and dice our data by
If you want to see the details of what the data interprter did, what should you do?
Click "review results" - headers will be in red, data will be in green
How do you change the range of values on your axis?
Click edit axis (right click?)
If you want to filter your data source as the data is brought in, how do you do that?
Click on add in the upper right hand corner to add filters
How do you export data to a spreadsheet, particularly when working with a table and this is not obvious?
Click on analysis, view data, then export all; It will be a csv file so you will separate on the commas when importing into the excel sheet
What are four primary use cases of LOD expressions?
Cohort analysis - comparing data for different subgroups Totals or averages across segments Aggregates of aggregates Binning aggregates
What location types can you search for in the Map?
Continent Country State or province County City Postcode
Continuous vs. Discrete
Continuous generally means forming an unbroken whole, without interruption; discrete means individually separate and distinct.
If you are trying to make a pie chart, and it won't give you the option or it shows up as a bunch of different pie charts, what do you do?
Create a new sheet. Move a measure pill to the column shelf, and a dimension field to the rows shelf. From there it should give you the pie chart option.
Union
Creates a new table that is a combination of the tables that have been union-ed. To union data, it must come from the same connection For best results the combined tables must have the same structure, # of fields, and related fields must have matching field names and data types
Sets
Custom fields that define a subset of data based on some conditions. can be based on rules and are dynamic. can be constant with fixed values as well
What function in a calculated field allows you to specify a portion of a date and increase it?
DATEADD(date_part, interval, date) - The number that you use for the interval will change the date by increasing the date-part
What function in a calculated field allows you to return the difference between date1 and date2 expressed in units determined by date_part?
DATEDIFF(date_part, date1, date2, [start_of_week]) - The start_of_week parameter is optional, and if it is not defined, then the start of the week is determined by the associated data source
What function do you use to return the date_part parameter of the date as a string?
DATENAME(date_part, date, [start_of_week] - start_of_week is optional
What function in a calculated field allows you to convert a string into a date/time with a specified format?
DATEPARSE(format, string) - (If the string does not match the date/time format, then it returns a value of Null) Only available for MS Excel, MSSQL, Oracle, PostgreSQL, tableau data extract and text file data sources
What function in a calculated field truncates the date to the accuracy of the date_part that you specify in the function?
DATETRUNC(date_part, date, start_of_week)
What function in a calculated field returns the day of the specified date as an integer?
DAY(date)
How do you build a calculation to divide two numbers and return with the closest integer?
DIV(integer1, integer2) Returns the integer part of a division operation, in which integer1 is divided by integer2 DIV(11,2) = 5
What is external query caching in Tableau and what is its purpose?
External query caching is a process in which tableau saves the query cache as part of the workbook (Desktop and Server). This allows for near instant load times for applicable workbooks as well as less database clogging.
What is Tableau's filter order in regards to LOD expressions?
Extract Filters Data source filters Context Filters______Fixed Dimension Filters_____Include Exclude Measure filters Table Calc Filters
How do you build a calculation that converts radians to degrees?
Degrees(number) DEGREES(PI()/4)=45.0
How can you ensure a left outer join is happening?
Drag both to rows, then see that there are nulls in the column for the second filed to show it did not match with the first field.
How do you create a hierarchy?
Drag the subordinate field onto the primary field
What function tests if the string ends with the selected substring?
ENDSWITH(string, substring)
How do you build a calculation that returns e raised to the power of the given number
EXP(number) EXP(2) = 7.389
What function would you use if you needed to find the beginning of a substring within another string?
FIND(string, substring, [start])
How do you build a calculation that rounds a number to the nearest integer of equal or lesser value?
FLOOR(number) Floor(3.1415) = 3
If kerberos fails you will not be prompted for username and password to login to Tableau.
False
T/F- Split and custom split options are supported for sets, groups, parameters and bins.
False they are not supported
Bar-in-Bar charts are only for showing a single level of detail.
False. Bar in Bar charts are great for adding another level of detail inside of a horizontal bar chart. You can do this by adding another dimension to your horizontal bar chart that will further divide the measure into sub-groups. The sub-groups are then color-coded on each bar
T/F - The fixed LOD computes values using the specified dimensions in addition to whatever dimensions are in the view.
False. FIXED - computes values using the specified dimensions without reference to the view level of detail - that is without reference to any other dimensions in the view Fixed LOD ignore all the filters in the view other than context filters, datasource filters, and extract filters
T/F - The include LOD explicitly removes dimensions from the expression, subtracts dimensions from the view level of detail.
False. INCLUDE - computes values using the specified dimensions in addition to whatever dimensions are in the view Most useful when including a dimension that isn't in the view
T/F- custom split allows you to detects common separators and split automatically.
False. In a custom split you specify on what separator(s) you want to split a column and then confirm to split the column.
Parallelism allows for significant improvements in data connection time.
False. Query parallelism allows for drastic improvements in visualization rendering times.
T/F - When you combine two sets they can be based on any two dimensions.
False. To combine two sets, they must be based on the same dimensions. That is, you can combine a set containing the top customers with another set containing the customers that purchased last year.
T/F - When using obscenely large amounts of data running aggregate data extracts can increase the time it takes to extract
False. When using obscenely large amounts of data running aggregate data extracts can reduce the time it takes to extract. (in this way the total information is condensed)
T/F - you cannot combine two different sets.
False. You can combine two sets. When you combine sets you create a new set containing either the combination of all members, just the members that exist in both, or members that exist in one set but not the other. Combining sets allows you to answer complex questions to understand cohorts of data.
T/F - You cannot use measure values and measure names at the same time.
False. You can use Measure Values and Measure Names to display the values for all measures in you data source, using their default aggregations, simultaneously.
Level of Detail Expressions have a positive effect on viz performance.
False. this is only partly true, as level of detail expressions can have both positive and negative impacts on performance depending on how they are used. With large data sources, this can be a huge performance gain (excluding a dimension). On the minus side, this can cause Tableau to run more complicated queries (for example, containing many joins), and if the underlying data source is slow, performance can suffer. (many times when using include or fixed LOD)
T/F - You cannot change or edit a quick table calculation when used.
False. you can continue working with the calculation by clicking the measure again and choosing Edit table calculation.
What are the three types of LOD expressions?
Fixed, Include, and Exclude {[FIXED | INCLUDE | EXCLUDE] <dimension declaration > : <aggregate expression>}
What is a tool that can be used to track time to completion for subunits of a project, or collection of data?
Gantt Chart
Combined axis chart
Generally, a combined axis chart is purely a matter of how you put your chart together rather than a defined chart type. Allows you to show two different values on a single chart. When working with multiple measures in a view, you can customize the mark type for each distinct measure in your worksheet. For example, you can create a view with a line showing a target amount across several months and a bar chart showing the actual attainment for the months. These measures can be displayed as individual axes, blended axes, or dual axes.
How do you create groups?
Go to the dimension within which you want groups; right click to create group; click on the things you want; click on group; name group; note that field name is the overall category
How do you make your viz fill the whole screen?
Go to the tool bar at the top, under "help" and click on drop down menu to fit to width
What color are continuous fields?
Green
How do you build a calculation to map an (x,y) coordinate to the x-coordinate of the nearest hexagonal bin?
HEXBINX(number,number) HEXBINX([Longitude],[Latitude])
How do you build a calculation to map an (x,y) coordinate to the y-coordinate of the nearest hexagonal bin?
HEXBINY(number,number) HEXBINY([Longitude],[Latitude])
What are Hexagonal Bins used for?
Hexagonal bins are an efficient and elegant option for visualizing data in an x/y plane such as a map. Because the bins are hexagonal, each bin closely approximates a circle and minimizes variation in the distance from the data point to the center of the bin. This makes the clustering both more accurate and informative.
What function would you use to determine if a string date value is a valid date value?
ISDATE(string)
Which function in a calculated field is a logical test that is included in the list of logical functions, tests a string to determine if it is a valid date(T/F)?
ISDATE(string)
What's an easy calculation that defines positive profits
If >=0 then "positive" else "negative"
In a union, what happens when two tables have the same record?
It goes on there twice
How can you quickly identify the primary data source in Tableau?
It has a blue check mark.
How do you know a data source is an extract?
It will appear as two cylinders with an arrow
What is the best way of bringing a pill over, other than a drag and drop?
Just start typing its name in the shelf
What function would you use to get a certain number of characters in a string from the left side of the target string?
LEFT(string, num_chars)
What function would you use to count the number of characters in a given string field?
LEN(string)
How do you build a calculation that returns the natural log of a number?
LN(number)
How do you build a calculation that returns the log of a number for the given base?
LOG(number,[base]) if the base is omitted then a base of 10 is used.
How would you convert a string to lower case in a calculated field?
LOWER(string)
What is a level of detail expression?
Level of detail expressions support aggregation at dimensionalities other than the view level. With level of detail expressions, you can attach one or more dimensions to any aggregate expression. Unlike table calculations, totals, or reference lines, level of detail expressions are computed in the data source .
What are some limitations and constraints for LOD expressions?
Level of detail expressions that reference floating-point measures can behave unreliably when used in a view that requires comparison of the values in the expression. Level of detail expressions are not shown on the Data Source page. When referencing a parameter in a dimensionality declaration, always use the parameter name, and not the parameter value. With data blending, the linking field from the primary data source must be in the view before you can use a level of detail expression from the secondary data source.
What function in a calculated field can you use to return the maximum of two dates?
MAX(expression) or MAX(expr1, expr2) This function exists in several categories with multiple uses
How do you build a calculation that returns the max of two arguments?
MAX(number, number) Returns nun if either argument is null. MAX can also be applied to a single field in an aggregate calculation. MAX(4,7) MAX(Sales, Profit)
If you needed to pull characters from the middle of a string field what function would you use?
MID(string, start, [length]) The start argument is where the returned value will begin and the length argument is how many characters will be returned. If not included then all remaining characters after the start position will be included.
What function can you use to return the minimum of two dates?
MIN(expression) or MIN(expr1, expr2) This function exists in several categories with multiple uses
How do you build a calculation that returns the minimum of two arguments?
MIN(number,number) Returns null if either argument is null. MIN can also be applied to a single field in an aggregate calculation. MIN(4,7) Min(Sales,Profit) MIN([First Name],[Last Name])
What function can you use to return the month of the specified date as an integer?
MONTH(date)
What are two examples of cubes (OLAPS)?
Microsoft analysis services, oracle s base; they process and transform data from a relational database into preaggregated results with defined hierarchical strucures for standardization and quick access to results
Motion Charts -
Motion charts show data using the x and y-axes, displaying changes over time by showing movement of data points within the defined space as well as changes in the color of the lines.
Does sorting within a hierarchy rearrange the headers of the fields that appear before the sorted fields?
No
Can fields generated from a split or custom split be used as keys to join tables?
No, but these fields can be used to blend data sources.
Can fields from a split be used in a pivot?
No, fields split fields cannot be used in a pivot.
What does OLAP stand for?
Online Analytical Processing
How do you build a calculation that raises the specified number provided to a specified power
POWER(number,power) Power(5,2) = 5^2 = 25
Pivoting
Pivot allows the move from crosstab format to a columnar format to more easily perform analysis.
hierarchy
Product, product family, product department. Or something like time such as years, quarters, and months.
Describe Query Fusion in Tableau.
Query fusion is a process whereby Tableau looks at all the queries in the dashboard and attempts to simplify them into fewer queries, often this results in less work for the database and faster response times for the user.
If you needed find any occurrence of a substring in another string and replace those characters with the replacement string, what function would you utilize?
REPLACE()
What function would you use to get a certain number of characters in a string from the left side of the target string?
RIGHT(string, num_chars)
Do you know how to build a calculation that rounds numbers to a specified number of digits?
ROUND(number, [decimals]) if decimals is omitted then the number is rounded to the nearest integer. ROUND(sales)
How would you remove blank spaces at the end of the following string? "Alabama "
RTRIM(string) returns the string with any trailing spaces removed
Do you know how to build a calculation that converts the given number from degrees to radians
Radians(number) Radians(180) = 3.1415
Reference bands
Reference bands are shaded areas behind the marks in the view between two constant or computed values on the axis. You can add reference bands to any continuous axis in the view. To add a reference band: You generally follow the same steps as creating a reference line just select band instead of line.
Reference Distribution
Reference distributions add a gradient of shading to indicate the distribution of values along the axis. Distribution can be defined by percentages, percentiles, quantiles (as in the following image), or standard deviation. Reference distributions can also be used to create bullet charts.
How can you share a datasource with others?
Right click on data source and use "publish to server"
How do you refresh an extract?
Right click on data source, go to extract, refresh extract
How can you replace a data source?
Right click on it and click replace data source
If you want to save a data source for future use, what should you do?
Right click on the data source, and click on "add to saved data sources" it will save as .tds, which just tells Tableau where to find that data; it will be saved in the "datasources" in My Tableau Repository
How do you replace a data source with another?
Right click on what you want top replace. Click "edit data source"; click on data; click on edit connection
Do you know how to build a calculation that returns the sign of a number?
SIGN(number) The possible return values are -1 if the number is negative, 0 if the number is zero, or 1 if the number is positive. If the average of the profit field is negative, then SIGN(AVG(Profit)) = -1
What function returns the sign of a number?
SIGN(number) the possible return values are -1 if negative, 0 if the number is zero, and 1 if the number is positive
What function returns the sine of an angle?
SIN(number) this angle is in radians example SIN(0) = 1.0
If you needed return a string with a particular number of spaces, what function would you utilize?
SPACE(number)
How many split fields does SQLServer allow?
SQLServer allows up to four split fields, to get around this you can split on fields that have already been split.
What function returns the square root of a number
SQRT(number) SQRT(25) = 5
Which function returns the square of a number?
SQUARE(number) SQUARE(5) = 25
If you wanted to find out if the field name starts with a particular substring such as Mr., what function would you utilize?
STARTWITH(string, substring)
Sets -
Sets are custom fields that define a subset of data based on some conditions. A set can be based on a computed condition, for example, a set may contain customers with sales over a certain threshold. Computed sets update as you data changes. Alternatively a set can be based on specific data points in your view. You can use sets to compare and ask questions about a subset
How can you quickly clear a sheet in Tableau?
Shift, Alt, Backspace at the same time
Heat map -
Similar to a crosstab but more visual. A heat map is a great way to compare categories using color and size. In this, you can compare two different measures.
what are the situations in which an extract is appropriate?
Slow query execution - if your data system is slow to process the queries being generated by Tableau, extracts can be a simple way to improve performance. offline anlysis packaged workbooks for public/reader/generally if youre planning on sharing workbooks with others additional fuctionality such as JET drivers Data security allows you to limit visible data
What an example of a data source that must be extracted?
Some cloud-based databases "must" be extracted
Sort-
Sorting in its most general sense is a way of arranging dimension members in a specified order
When sorting do you need to be concerned about breaking a pre-existing dimension hierarchy? (As in rearranging header that occur before the sorted fields)
Sorts do not break the dimension hierarchy. Sorted fields are always displayed within the ordered context already set forth by the fields on the Rows and Columns shelves. This means that Tableau will not rearrange any of the headers of the fields that appear before (to the left of) the sorted field. If you want to break the dimension hierarchy when sorting a multidimensional data source, place only the hierarchy level that you want sorted on the Rows or Columns shelf.
TF- Queries in the same LOD can be fused.
T- Tableau recognizes this and pulls the aggregations together into a single query. NOTE: fusion does not occur in TDE sources.
Which function returns the tangent of an angle?
TAN(number) answer is in radians
What function can you use to return the current date?
TODAY()
How would you remove the blank spaces around the following string? " Alabama "
TRIM(string)
Gantt Bar Charts -
The Gantt chart was invented back in the 1910s by Mr. Henry Gantt as a way to visualize a schedule or progression of time.
Rectangular Selection Tool
The Rectangular tool selects marks within a rectangular shape.
What is the difference between a regular dual axis chart and a scatter chart?
The big difference with a scatter plot is that both axes in the chart are measures rather than dimensions
When might you use a bullet graph?
The bullet graph is generally used to compare a primary measure to one or more other measures in the context of qualitative ranges of performance such as poor, satisfactory, and good. You can create a bullet graph by adding two reference lines: a distribution to indicate the qualitative ranges of performance, and a line to indicate the target.
what are the main calculation restrictions in multi-connection data sources?
The calculations must be supported by all the connections in the multi-connection data source. They must also be supported by Tableau extracts.
Data interpreter
The data interpreter detects these sub-tables so that you can work with subset of your data independently of the other data. It also removes extraneous data to help prep for analysis
What is the main advantage of a motion chart?
The main advantage of motion chart is to view the entire trail of how the data has changed over time and not just a snapshot of the data. Motion charts introduce the time-dimension element to a chart that would otherwise only represent a snapshot of what was occurring at a given time. They demonstrate the ability to see big picture patterns that develop from point to point while also stressing the individual elements being presented. (involves animation)
What are measures?
The numbers we want to analyze
Pan & Zoom
The pan tool and zoom controls help you interact with the view and inspect your data. They are located in the upper-left corner of the view, on the view toolbar.
Describe the result of a full outer join.
The result is a table that contains all values from both tables. those without a match produce a null value in the grid.
Describe the result of a left join.
The result of a left join is a table that contains all the values from the left table and the corresponding matches in the right table
Describe the result of a right join.
The result of a right join is a table that contains all values from the right table and corresponding matches in the left tables.
Describe the result of an inner join.
The result of an inner join is a table that contains values that have matches in both tables.
Dual axis charts -
The technique is something we apply when we'd like to show the relationship between two measures across a single dimension. The concept here is that instead of having a single row for each measure, we'd like to see both measures on the same set of rows, so that we can better evaluate differences between the two or trends that appear over time. Dual axes are useful when you have two measures that have different scales.
What are the two types of sets and the difference between them?
The two types of sets are constant and computed. -Members of a constant set are fixed and do not change. To create a constant set you need to select the members you want to include - a constant set can be based on a single dimension or multiple dimensions. - The members of a computed set are dynamic and change when the underlying data changes. Computed sets can only be based on a single dimension.
Are dimensions discrete, or continuous?
They can be both
When you create a calculated field for a workbook, how will that field show up on the other sheets when you're on data source?
They will show up as null. Don't try to delete or change them! This causes problems when multiple sheets have fields with similar names.
Tree maps -
Treemaps are very powerful visualizations, particularly for illustrating hierarchical (tree-structured) data and part-to-whole relationships. Because of their visual nature, treemapping is ideal for legibly showing hundreds or even thousands of items in a single visualization simultaneously.
Tableau Server saves each connection from session to session if you don't sign out.
True
You can run Tableau Server along side Kerberos.
True
T/F- Automatic split tableau detects a separator and splits the column on that separator.
True.
T/F- blending is essentially a simulation of a left join.
True.
T/F- when working with multiple tables in tableau the preferred approach is to define the joins in the connection window.
True. Doing this does not define a specific query but rather defines how the tables relate to each other.
T/F - It is possible to define a level of detail expression at the table level without using any of the scoping keywords.
True. For example, the following expression returns the minimum (earliest) order date for the entire table: {MIN([Order Date])}
T/F - The exclude LOD explicitly removed dimensions from the expression, subtracting dimensions from the view level of detail.
True. Most useful for eliminating a dimension in the view
T/F - Sets can be automatically generated as a result of an action.
True. Sets can be automatically generated as a result of an action. There are many ways to create a set, select marks or headers in the view or right-click (control-click on Mac) a field in the Data pane. You can also create a set that is based on a filter you've already defined.
Parallel aggregation enables running aggregations in parallel and in effect splitting the work load across multiple cores.
True. The mathematical concept for the max degree of parallelism is (number of available logical processors)/2. This enables query operations on data extracts to run up to N times faster, where N is the number of cores.
T/F-Filter can be created on data sources, which reduces the amount of data in the data source.
True. This is especially important for performance applications
Tableau takes advantage of SIMD instructions on the local machine to perform low level operations in parallel.
True. This is known as data engine vectorization.
T/F - when you create groups in Tableau you can include an other group
True. When you include an other category, the groups you have defined are shown and all other dimension members are combined into an "Other" category. This option is useful for highlighting certain groups or comparing specific groups against everything else
If you needed to convert all values of a field to uppercase what function would you utilize?
UPPER(string)
How can you go back to the live connection?
Uncheck the "use extract" option
Scatter plots -
Use scatter plots to visualize relationships between numerical variables. The scatter plot, also known as a scatter diagram, scatter chart, scattergram or scatter graph, is useful to compare two different measures for patterns. Like the circle view and the side-by-side circle chart, the scatter plot also uses symbols to visualize data.
When is the best circumstance in which a live connection is warranted?
When we want to use rapidly changing data, or a high performance database
Map Layering
When you create a map view there are several options to help you control the appearance of the map. These options are available in the Map Layers pane.
When is it best to use a data blend in Tableau 10?
When you want to combine data from data sources that are not supported by cross data base joins such as cubes or a kind of extract only source. Another reason is if the data is at different levels of detail
What are the disadvantages of using cross tabulations?
While text tables are applicable to most data sets, they do not tell a very compelling story or highlight important pieces of data without additional formatting on your part. Text tables are not normally recommended as the primary visualizations in your dashboard or presentation. Consider using them as an appendix inclusion for those report consumers that want more detail in a traditional format instead.
Lasso Selection Tool
With the Lasso tool, you can select multiple marks by drawing a freehand shape around them. This tool is useful when you want to include only certain marks, and exclude others around them.
Logarithmic model type
With the Logarithmic model type, the explanatory variable is transformed by the natural log before estimation of the model. So the formula is Y = b0 + b1 * ln(X) + e ******* Because a logarithm is not defined for number less than zero, any marks for which the explanatory variable is negative are filtered before estimation of the model. Avoid using a model that discards some data unless you know that the data being filtered out is invalid. The trend line description reports how many marks were filtered before model estimation.
Exponential model type
With the exponential model type, the response variable is transformed by the natural log before estimation of the model. So the formula is ln(Y) = b0 + b1 * X + e ********** With an exponential model, your response axis does not become logarithmic. Instead, the marks plotted in your view are found by plugging in various explanatory values to find values of ln(Y). These values are then exponentiated to plot the trend line. What you see is the exponential model: Y = e ^ (b0 + b1 * X + e) Because a logarithm is not defined for numbers less than zero, any marks for which the response variable is negative are filtered before model estimation.
Linear type model
With the linear model type, no transformations are performed on either the explanatory or response variable. So the formula is Y = b0 + b1 * X + e
Polynomial
With the polynomial model type, the response variable is transformed into a polynomial series of the specified degree. So the formula is Y = b0 + b1 * X + b2 * X^2 + ... + e *********** With a polynomial model type, you also select a Degree between 2 and 8. The higher polynomial degrees exaggerate the differences between the values of your data. So if your data increases very rapidly, the lower order terms may have almost no variation compared to the higher order terms, rendering the model impossible to estimate accurately. Also, more complicated higher order polynomial models require more data to estimate. So check the model description of the individual trends line for a red warning message indicating that an accurate model of this type is not possible.
What function in a calculated field can you use to get the year of the specified date as an integer?
YEAR()
When doing a union, do columns have to have the same name?
Yes - if they don't, Tableau will consider them separate columns. To make them one column after you've done the union, highlight them and click merge
Distribution Band
You can add reference distributions for a specific measure or for all measures in the view. After you drag a reference distribution from the Analytics pane and drop it on a target, Tableau automatically opens Edit Reference Line, Band, or Box dialog box.
Reference Line
You can add reference lines for a specific measure or for all measures in the view. After you drag a reference line from the Analytics pane and drop it on a target, Tableau automatically opens the Edit Reference Line, Band or Box dialog box.
Where are all the places you can change a data type?
You can do it when looking at the sheet by right clicking, or do it in the grid or metadata view under the data source
When are you unable to add trend lines?
You cannot add a trend line to a view that has the Product Category dimension, which contains strings, on the Columns shelf and the Profit measure on the Rows shelf. However, you can add a trend line to a view of sales over time because both sales and time can be interpreted as numeric values.
Which function returns the expression if it is not null, otherwise it returns 0?
ZN(expression) Use this function to use zero values instead of null values. Example: ZN([Profit]) = [Profit]
Group
a combination of dimension members that make higher level categories
Group-
a combination of dimension members that make higher level categories. For example, if you are working with a view that shows average test scores by major, you may want to group certain majors together to create major categories. Groups are often useful for correcting data errors as well as answering "What If" type questions
Is a join horizontal or vertical, and why?
a join is horizontal, because it adds more columns
box plots
also known as box-and-whisker plots, are used to show the distribution of values along an axis
What is a table calculation?
an additional computation done on top of results set within Tableau
Measure name field
appears at the bottom of the Dimensions area in the Data pane and contains the names of all measures in your data, collected into a single field with discrete values.
Measure value field
appears at the bottom of the Measures area in the Data pane and contains all the measures in your data, collected into a single field with continuous values.
Statistical Summary Card
available on the Show/Hide Cards toolbar menu , provides a quick view of information about a selection or the entire data source.
what color are discrete pills?
blue
What color are discrete fields?
blue; they create labels in the chart
Bullet Graphs -
bullets graphs are used to compare data against historical performance or pre-assigned thresholds. A lot of information can be derived from a small space with this type of chart that is also Tableau's answer to those looking for a gauge or meter visualization. A bullet graph is similar to a standard bar graph except that there is a distribution showing progress towards a goal behind the bar. Like a standard bar graph, a bullet graph can be presented either horizontally or vertically.
If you want to connect to an extract when you open a new workbook, how do you do that?
click on "other files" on source home screen
What are the two types of sorting in Tableau?
computed sorting & manual sorting
what creates an headers when dragged to the worksheet
dimensions or discrete fields
what color are continuous pills?
green
Box plots -
is a standardized way of displaying the distribution of data, or a quantitative variable, based on the five number summary: minimum, first quartile, median, third quartile, and maximum.
What creates an axis when dragged to the worksheet
measures or continuous values
Can Tableau write changes back to the data source?
no
What is the Metadata grid?
part of the data source page and is used to review the fields and the first 1000 rows of the data contained in the Tableau data source.
what are the features that tableau provides to help load messy data?
pivot, union, data interpreter, and merge columns
What is the security application of using data source filters?
restricting the data users can see when you publish a workbook for data source.
How can you change the default aggregation?
right click on default properties, then click on aggregation for options
Once you replace a data source, what do you do if Tableau can't find the data for a field that was named different things?
right click on the old field, click replace references, and click on where the refreshed data is.
Radial Selection Tool
selects marks within a circular area.
What is the resulting datatype from a custom split?
string
What is the general limit for parallel queries in Tableau?
typically the limit is 16 with certain exceptions for Text files(1), excel files(1), SAPBW(1), and Amazon Redshift(2)
What does pivoting refer to?
using varying levels of detail including skipping levels.
Describe the process of using a cross-database join.
utilizing a cross database join requires that you first set up a multi-connection data source - that is you create a new connection to each database before you join tables.
At what level are dates automatically aggregated?
year
Should the number of tables be minimized to only the tables needed for a specific worksheet/ viz?
yes. depending on the complexity of the data connections and the worksheet, it may also be worth separating out data connections for each worksheet and creating specific join patterns for those sheets.
What do you do when Tableau doesn't recognize your location information
you can custom geocode your data so that you can use it to create map views in Tableau with ease. Custom geocoding your data allows you to create custom geographic roles that you can use as you create map views in Tableau.