U1 - Tableau
U1 - Tableau
What is Tableau?
Tableau is a leading data visualization tool used for data analysis and business
intelligence. Gartner’s Magic Quadrant classified Tableau as a leader for analytics and
business intelligence. This Tableau tutorial will cover a wide range of how-to topics,
including how to create different charts and graphs, in addition to visualizing data
through reports and dashboards in Tableau to derive meaningful insights.
What is Tableau?
Tableau is an excellent data visualization and business intelligence tool used for
reporting and analyzing vast volumes of data. It is an American company that started in
2003—in June 2019, Salesforce acquired Tableau. It helps users create different charts,
graphs, maps, dashboards, and stories for visualizing and analyzing data, to help in
making business decisions.
Tableau has a lot of unique, exciting features that make it one of the most popular tools
in business intelligence (BI). Let’s learn more about some of the essential Tableau
Desktop features. Now that we know what tableau is exactly, let us understand some of
its salient features.
Tableau Features
● Tableau supports powerful data discovery and exploration that enables users
to answer important questions in seconds
● No prior programming knowledge is needed; users without relevant
experience can start immediately with creating visualizations using Tableau
● It can connect to several data sources that other BI tools do not support.
Tableau enables users to create reports by joining and blending different
datasets
● Tableau Server supports a centralized location to manage all published data
sources within an organization
Next up in the what is tableau tutorial, we will learn about the various file types. In
Tableau, you can save your work using various Tableau-specific file types. The following
are the different types of files that Tableau supports:
Before you can build a view and analyze your data, you must first connect
Tableau to your data. Tableau supports connecting to a wide variety of data,
stored in a variety of places. For example, your data might be stored on your
computer in a spreadsheet or a text file, or in a big data, relational, or cube
(multidimensional) database on a server in your enterprise. Or, you might
connect to public domain data available on the web such as U.S. Census Bureau
information, or to a cloud database source, such as Google Analytics, Amazon
Redshift, or Salesforce.
Tableau Desktop
When you launch Tableau Desktop, the data connectors that are available to
you are listed on the Connect pane, which is the left pane on the Start page.
Under Search for Data, select Tableau Server to find data using Tableau Server
or Tableau Online. File types are listed next, then common server types, or
servers that you've recently connected to. Click More to see the complete list of
data connectors you can use.
For supported files and databases, Tableau provides native connectors that are
built for and optimized for those types of data. If your file or database type is
listed under Connect, use this native connector to connect to your data. If your
file or database type is not listed, you might have the option of creating your
own connection using Other Databases (JDBC), Other Databases (ODBC), a
Web Data Connector, or a Connector Plugin built using the Tableau Connector
SDK. Tableau provides limited support for connections that you create using
these options.
After you've connected to data, you can save the connections to have them
show up under the Saved data sources section on the Connect pane.
Generated Values
Tableau has built-in fields that make difficult tasks easier. These are found on the left side
of the screen at the bottom of the dimensions list and at the bottom of the measures list.
When you perform an operation (such as double-clicking on a geographic field) these
Tableau generated fields are automatically added to the design window. Generated values
include:
Measure Names, Measure Values and Number of Records are always present. If your
dimensions include standard geographic place names, Tableau will also automatically
generate center-point geocodes.
Tableau Geocoding
If your data includes standard geographic fields like country, state, province, city, or postal
codes – denoted by a small globe icon – Tableau will automatically generate the longitude
and latitude values for the center points of each geographic entity displayed in your
visualization.
If Tableau failed to recognize any location, a small gray pill would appear in the lower right
of the map. Clicking on that pill would expose a menu that would help you identify and
correct the geocoding.
Number of Records
When you build a view using Measure Names as a filter to allow the user to select from a
subset of measures. You want “Number of Records” in the tooltip, but NOT in the filter
values.
The final generated value provided is a calculated field near the bottom of the measures
shelf called Number of Records. Any icon that includes an equals sign denotes a calculated
field. The number of the records calculation formula includes only the number one.
Common issues
● To view, edit, or create joins, you must open a logical table in the
relationship canvas—the area you see when you first open or create a
data source—and access the join canvas.
● Published Tableau data sources cannot be used in joins. To combine
published data sources, you must edit the original data sources to
natively contain the join or use a data blend.
● When joining tables, the fields that you join on must be the same data
type. If you change the data type after you join the tables, the join will
break.
● Fields used in the join clause cannot be removed without breaking the
join. To join data and be able to clean up duplicate fields, use Tableau
Prep Builder instead of Desktop
Create a join
These can be in the same data source (such as tables in a database or sheets in
an Excel spreadsheet) or different data sources (this is known as a cross-
database join). If you combine tables using a cross-database join, Tableau
colors the tables in the canvas and the columns in the data grid to show you
which connection the data comes from.
1. Drag the first table to the canvas.
2. Select Open from the menu or double-click the first table to open the join
canvas (physical layer).
3. Double-click or drag another table to the join canvas.
If your next table is from another data source entirely, in the left pane,
under Connections, click the Add button ( in web authoring) to add a
new connection to the Tableau data source. With that connection
selected, drag the desired table to the join canvas.
4. Click the join icon to configure the join. Add one or more join clauses by
selecting a field from one of the available tables used in the data source,
choosing a join operator, and a field from the added table.
Note: You can delete an unwanted join clauses by clicking the "x" that
displays when you hover over the right side of the join clause.
5. When finished, close the join dialog and join canvas.
There are several ways to combine data, each with their own strengths and
weaknesses.
Relationships are the default method and can be used in most instances,
including across tables with different levels of detail. Relationships are flexible
and are adaptable to the structure of the analysis on a sheet by sheet basis.
However, you can't create relationships between tables from published data
sources.
Joins combine tables by adding more columns of data across similar row
structures. This can cause data loss or duplication if tables are at different
levels of detail, and joined data sources must be fixed before analysis can
begin.
Blends, unlike relationships or joins, never truly combine the data. Instead,
blends query each data source independently, the results are aggregated to the
appropriate level, then the results are presented visually together in the view.
Because of this, blends can handle different levels of detail and working with
published data sources. Blends are also established individually on every sheet
and can never be published, because there is no true “blended data source”,
simply blended results from multiple data sources in a visualization.
1. Ensure that the workbook has multiple data sources. The second data
source should be added by going to Data > New data source.
2. Drag a field to the view. Whichever data source this first field comes from
will become the primary data source.
3. Switch to another data source and make sure there is a blend
relationship to the primary data source.
○ If there is an orange linking field icon ( ), the data sources are
automatically linked. As long as there is at least one active link, the
data can be blended.
○ If there are gray, broken link icons ( ), click the icon next to the
field that should link the two data sources. It will turn orange,
representing an active link.
4. Drag a field into the view from the secondary data source.
As soon as this second data source is used in the same view, a blend is
established. In the example below, our primary data source is Sales Targets
and the secondary data source is Sample - Superstore
● The primary data source is indicated with a blue check mark on the data
source. Fields from the primary data source used in the view have no
indication.
● The secondary data source is indicated with an orange check mark on the
data source and an orange bar down the side of the Data pane. Fields
from the secondary data source used in the view have an orange check
mark.