BUAD 342- Final
discount
- [Quantity sold]*[Sales price per]*[Item Discount]
Smart home devices
- smart Tvs, smart appliances, smart cameras, smart windows blinds
SAS explains that there are three predominant types of analytics in use today.
-Descriptive statistics. -Predictive analytics -Prescriptive analytics
workbook
. Tableau uses a _____ and sheet file structure, much like Microsoft
You want only the measurements of interest in the Measurements section.
. This data set includes Sales data, so most of these will be financial measurements.
Why do we need you to focus on critical thinking more than the specific tool features? If you think about the steps you are performing and focus on
1-interpreting what the data says about a business, and 2-your objectives in creating a series of outputs for your user, you can leverage your learning in this class to more quickly learn how to use other visualization tools.
dashboard
A Tableau ___________ is best described as a collection of views from multiple worksheets that can be viewed simultaneously (at the same time).
A.Dashboard B.Story C.Sheet D.All of the above- ANS E.None of the above
A Tableau workbook contains sheets. A sheet can be a
80 percent and 20 percent
A recent Harvard Business Review article reports that people ____ of their time cleaning and shaping data, and only ______ of their time analyzing it
true
According to the video from ESET, a botnet is a network of computers infected with malware that can be controlled by a hacker known as a bot master.
Internet of Things
An organization has decided to use devices on shopping carts and shopping baskets to track a consumer's path through a store and improve store layout and place premium products in high traffic areas. This is an example of
Microsoft Power BI
BI is a collection of online, mobile and desktop services and features that enables you to find and visualize data, share discoveries, and collaborate in intuitive new ways.
Bluetooth trackers
Beeping medicine caps and automated phone calls reminding patients it's time to take the next dose. Smart locks
false
Blocks in a blockchain can be deleted if the information is proven to be false or misleading after they have been added.
Why did we start with database queries first?
Familiarity with databases, queries and SQL are viewed as an added advantage when someone is learning visualization tools. -You will find that some questions are easier to answer using a database query than a visualization tool.
false
General Data Protection Regulation (GDPR) was created to protect consumers' credit card information collected by companies.
Where are your files in Windows?
Go File Explorer and Find My Documents. You should see My Tableau Repository folder. Unless you change your defaults, files are likely stored in Documents>My Tableau Repository>Workbooks and Documents>My Tableau Repository>Datasources
üCreate three calculated fields -
Gross Sales Net Sales Discounts
.Inner Join
I am using Tableau and pulling in from two data sources. I need to include only records from the two sources that have a data in common and a matching primary key. I should choose a (n)
IaaS
I have decided not to buy my own servers, virtual machines, networks, etc. but to rent them on a pay-as-you-go basis. My best option is
Types of cloud services
IaaS, PaaS, SaaS
story
If you would like to convey information using a controlled sequence of visualizations, which capability within Tableau would you use?
FIRST steps of any data analysis project.
Importing your data and cleaning/formatting data are the most important
Artificial Intelligence (AI)
Just as the cloud enables the Internet of Things, the Internet of Things devices are enabled by advances in
desktop
Power BI _____is designed for authoring (data modeling, merging different data sources, visualization, etc.). Desktop application runs only on Windows.
US Currency
Set default properties, number formats for three computations to
are not supported
Some features (such as count distinct) ______ in Tableau when you are using Microsoft Access data sources, and Microsoft Excel and Text File data sources that have a legacy connection. In some cases, you can correct this by choosing the Extract option -However, when you do, you will need to save the extracted data
drivers
Sometimes you need additional software called ____ to enable a connection to Tableau (and other software):
When you import data,
Tableau initially tags each field as a dimension or a measure. Typically, fields with text (nominal or other text) values are tagged as Dimensions and fields with numeric values are tagged as Measurements. OFTEN THIS IS NOT ACCURATE.
cloud service
The Power BI _____ is designed for sharing and creating dashboards.
side bar.
The Story and Layout panes are available in its
1048576 rows by 16384 columns
The absolute maximum total number of rows and columns on a worksheet:
Infrastructure-as-a-service (IaaS)-
The most basic category of cloud computing services. With IaaS, you rent IT infrastructure—servers and virtual machines (VMs), storage, networks, operating systems—from a cloud provider on a pay-as-you-go basis.
Why should you learn about these technologies?
They are impacting your future workplace, your potential clients, and the very nature of your future careers in significant ways.
false
When Tableau determines that field is a Dimension instead of a Measurement, there is only one way to change its properties.
They can contain formulas and formatting
Which of the following is false about .csv files?
A.Deep Learning
Which of the following technological advances attempts to teach machines to learn on their own by enabling them with layers of processing power?
Gross Sales & Price per Item
Which of the following would be considered a measurement in Tableau?
A.3D Printing B.Uber C.Netflix D.Amazon Prime E.None of the above (Ans)
Which of these is not an example of a disruptive technology?
joining data
You need to think about how you combine the data, and your choices are
net sales
[Quantity sold]*[Sales price per]*(1-[Item Discount])
Tableau Prep
a brand new data preparation product designed to empower people to quickly and confidently combine, shape, and clean their data, further reducing the time from data to insight.
Sense Mother
a family of incredibly smart sensors that you can set to monitor whatever you care about
Cloud Computing
a general term for the delivery of hosted services over the internet. Such as: •Create new apps and services •Store, back up, and recover data •Host websites and blogs •Stream audio and video •Deliver software on demand •Analyze data for patterns and make predictions
A story is
a good way to "control" the order in which a user looks at worksheets. You can create stories to tell a data narrative, provide context, demonstrate how decisions relate to outcomes, or to simply make a compelling case.
machine learning
a subset of artificial intelligence (AI) that trains machines how to learn. Machine learning allows software applications to become more accurate in predicting outcomes without being explicitly programmed. The basic premise of machine learning is to build algorithms that can receive input data and use statistical analysis to predict an output value within an acceptable range
Kilobree
a toothbrush that helps improve brushing but also sends data to your phone -- --reminds you to brush, or rats out your children when they don't.
A sheet can be
a worksheet, a dashboard, or a story
A Tableau story
allows to designate a sequence of visualizations (stored as worksheets or dashboards) that work together to convey information.
Tableau Prep
application features a direct and visual experience for data preparation giving customers a deeper understanding of their data, smart features to automate complex tasks, and integration with the Tableau analytical workflow for faster speed to insight
Dimensions
are fields that are qualitative and usually cannot be aggregated to produce a meaningful sum. Dimension fields are usually used for row or column headings. Examples include sales region, employee, location, or category.
Public clouds
are owned and operated by a third-party cloud service provider, which deliver their computing resources like servers and storage over the Internet. Microsoft Azure is an example of a public cloud. With a public cloud, all hardware, software, and other supporting infrastructure is owned and managed by the cloud provider. You access these services and manage your account using a web browser.
Measures
are those fields that can be measured, aggregated (used for mathematical operations). Measures are usually used for plotting or giving values to the sizes of markers. Examples include sales revenue or times.
Changes at "Mickey's House"- Disney Has invested $1 Billion to Boost Customer Experience
by Leveraging Big Data, IoT and Machine Learning
From Microsoft - The main difference between a Free or Pro user is
centered around sharing and collaboration. Only Pro users can publish content to app workspaces, consume apps without Premium capacity, share dashboards and subscribe to dashboards and reports.
With SaaS,
cloud providers host and manage the software application and underlying infrastructure, and handle any maintenance, like software upgrades and security patching. Users connect to the application over the Internet, usually with a web browser on their phone, tablet, or PC.
Hybrid clouds
combine public and private clouds, bound together by technology that allows data and applications to be shared between them. By allowing data and applications to move between private and public clouds, hybrid cloud gives businesses greater flexibility and more deployment options.
The Tableau workspace
consists of menus, a toolbar, the Data pane, cards and shelves, and one or more sheets.
Tableau worksheet
contains a single view along with shelves, cards, legends, and the Data and Analytics panes in its side bar
A workbook
contains sheets
PaaS is
designed to make it easier for developers to quickly create web or mobile apps, without worrying about setting up or managing the underlying infrastructure of servers, storage, network, and databases needed for development.
Power BI mobile
enables you to view your reports and dashboards.
Tableau Prep
has customized visual experiences to make common yet complex tasks - such as joins, unions, pivots and aggregations - simple. -These intuitive experiences help customers get desired results faster and with greater confidence.
Predictive analytics
has surged in popularity. The desire to predict customer behavior has been a main driver. Increased computing power with the ability to run hundreds or thousands of models quickly - and widespread adoption of predictive techniques like support vector machines, neural networks and random forests - are bringing predictive analysis to the forefront of many organizations. These models use past data and predictive algorithms to help you determine the probability of what will happen next.
Descriptive statistics.
have been around the longest. Remember the Swedes in 1749? Tabulating population counts was an early foray into descriptive analysis - the summary of collected data points. These are the models that will help you understand what happened and why. There are still plenty of descriptive analytics in use today - everything from how many clicks a page receives to how many units are produced vs. how many are sold.
Tableau dashboard
is a collection of views from multiple worksheets. - ___ and layout panes are available in its side bar.
Visualization
is a general term that describes any effort to help people understand the significance of data by placing it in a visual context
Software-as-a-service (SaaS)
is a method for delivering software applications over the Internet, on demand and typically on a subscription basis.
Analytics
is an encompassing and multidimensional field that uses mathematics, statistics, data mining, visualizations, predictive modeling and machine learning techniques to find meaningful patterns and knowledge in recorded data.
union
is another method for combining two or more tables by appending rows of data from one table to another. Ideally, the tables that you union have the same number of fields, and those fields have matching names and data types.
left join
is asking for all the rows from the table displayed or listed on the left and, if there's a match in the table on the right match it up. When a value in the left table doesn't have a corresponding match in the right table, you see a null value in the output.
right join
is asking for all the rows from the table displayed or listed on the right and, if there's a match in the table on the left , match it up. When a value in the right table doesn't have a corresponding match in the left table, you see a null value in the data grid.
Full outer join
is asking for every row from the table on the left and every row from the table on the right. If they have matching values, the computer matches them up. If they don't have matching values, the computer will still display each row, but show nulls where they don't match.
A disruptive technology
is one that displaces an established technology and shakes up the industry or a ground-breaking product that creates a completely new industry.
Robotic process automation (RPA)
is the use of software with artificial intelligence (AI) and machine learning capabilities to handle high-volume, repeatable tasks that previously required humans to perform. These tasks can include queries, calculations and maintenance of records and transactions
as you update worksheets used in dashboards and stories,
it also updates them in the dashboard and story views.
Gartner Research--
major cloud providers, there are three top contenders, and then everyone else.
Inner join
matches up records between two tables based on designated values. The result set that you get contains the rows from the table on the left that match the table on the right. If there are rows in either table that don't match, they aren't returned in the result set.
a union.
merging/appending data via
The cloud service is
multi-platform - use Microsoft Edge, Internet Explorer, Chrome, Safari, Firefox or mobile apps (iOS, Android and Windows 10).
Employees table includes
one record per Employee.
Types of cloud deployments:
public, private, hybrid
private cloud
refers to cloud computing resources used exclusively by a single business or organization. A private cloud can be physically located on the company's on-site datacenter. Some companies also pay third-party service providers to host their private cloud. A private cloud is one in which the services and infrastructure are maintained on a private network.
Platform-as-a-service (PaaS)
refers to cloud computing services that supply an on-demand environment for developing, testing, delivering, and managing software applications.
artificial intelligence
simulation of human intelligence processes by machines (training machines to perform human tasks).
Sales table
table includes one record per Sale
deep learning
the subset machine learning composed of algorithms that permit software to train itself to perform tasks, like speech and image recognition, by exposing multilayered neutral networks to vast amounts of data
Technology trends/issues transforming the way we conduct business (and as a result,
the way we manage and leverage information)
You will get so called clean data
with instructions for how to merge and/or join separate files -realizing that you typically must perform data clean up before starting your analyses is very important
Unions are used when
you have the same data set - data values of the same fields - broken into separate files and you want to recombine them.)
To decide about the joins you need, let's review the nature of the data and the relationships:
üEach (one) employee is linked to zero, one, or many sales. ü üEach (one) customer is linked to zero, one, or many sales. ü üEach (one) product is linked to zero, one or many salelines. üEach (one) product is linked to one or many suppliers. üEach (one) shipper is linked to zero, one or many sales. ü üEach (one) sale is linked to one employee. üEach (one) sale is linked to one customer. üEach (one) sale is linked to one or many shippers. üEach (one) sale is linked to one or many salelines. üEach (one) saleline is linked to one or many product. üEach (one) saleline is linked to one order.
Which fields do you use for the joins?
•Access will ask you for the field to join on SOMETIMES - Why does it ask you?
What are the technology trends/issues transforming the way we conduct business (and as a result, the way we manage information)?
•Analytics and Big Data •Migration to Cloud Computing •The Internet of Things •AI/Machine Learning/Rise of Business Bots/RPA/Augmented Reality and Virtual Reality •Blockchain •Cryptocurrencies •Social Media/BYOD/BYOD MESSAGING (e.g. Slack, Facebook messenger, GroupMe, Whatsapp) •Compliance with Information Management Regulations and Standards (e.g., PCI-DSS, EMV technology, GDPR) •Cyber Security
Only link to a subset of the tables -
•Sales •Salelines •Customers •Employees •Shippers •Products
We are including 6 tables:
•Sales •Salelines •Customers •Employees •Shippers •Products
Why not use Excel only???
•Worksheet and workbook specifications and limits •Open Excel workbooks are limited by available memory and system resources •Spreadsheets such as Excel are not interactive by nature. They are static. •By its very design, a spreadsheet is a table view of data. Data visualizations are not based on the concept of a table view. Trends, relationships, and patterns are easier to communicate and interpret with data visualization.
Create (concatenated/combined) Calculated Fields for employee and customer names. An example of how you do that:
•[First Name]+" "+[Last Name] or [First Name]+", "+[Last Name]
gross sales
•[Quantity sold]*[Sales price per]
üMove the following from Measure to Dimensions (why??):
•empReportsTo •2 instances of Sales Invoice ID •Cust Income Bracket
Prescriptive analytics.
•is the newest kid on the block. Knowing what will happen and knowing what to do are two different things. Prescriptive analytics answers the question of what to do by providing information on optimal decisions based on the predicted future scenarios. The key to prescriptive analytics is being able to use big data, contextual data and lots of computing power to produce answers in real time.