Unit 5
Unit 5
DATA ANALYTICS
Unit: 5
Data Visualization
Mr.Ravi Pandey
B Tech VIIth Sem Assistant Professor
ECE
12/12/2023 2
Evaluation schemeLEARNING TASK
THE CONCEPT
5 Lab – I 0 0 2 25 25 50 1
6 Internship Assessment 0 0 2 50 50 1
Course objective:
The objective of this course is to understand the fundamental concepts of Data Science,
learn about various types of data formats and its manipulations. It helps students to
learn exploratory data analysis and visualization techniques in addition to R
programming language.
CO 1 Understand the fundamental concepts of data analytics in the areas that plays major role K1
within the realm of data science.
CO 2 Explain and exemplify the most common forms of data and its representations. K2
CO 5 Illustrate various visualization methods for different types of data sets and application K3
scenarios.
Text books:
1) Glenn J. Myatt, Making sense of Data: A practical Guide to Exploratory Data Analysis
and Data Mining, John Wiley Publishers, 2007.
2) Data Analysis and Data Mining, 2nd Edition, John Wiley & Sons Publication, 2014.
Reference Books:
• Security.
•Transportation.
•Risk detection.
•Risk Management.
•Delivery.
•Fast internet allocation.
•Reasonable Expenditure.
•Interaction with customers.
•Planning of cities
Course Outcomes
Course outcome: After completion of this course students will be able to:
CO5 Understand and analyze the I/O management and File systems K2, K4
12 December 2023 11
THE CONCEPT LEARNING TASK
Program Outcomes
1. Engineering knowledge
2. Problem analysis
3. Design/development of solutions
4.Conduct investigations of complex problems
5. Modern tool usage
6. The engineer and society
7. Environment and sustainability
8. Ethics:
9. Individual and team work
10. Communication
11. Project management and finance
12. Life-long learning
Course PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
Outcome
1 3 2 2 - - - - - - - - 1
3 3 3 - - - - - - - - 1
2
3 3 3 - - - - - - - - 1
3
3 2 1 - - - - - - - - 1
4
3 2 2 - - - - - - - - 1
5
Average
3 2.4 2.2 - - - - - - - - 1
1 3 - -
3 2 -
2
3 2 -
3
3 2 2
4
3 2 -
5
Average
3 2 2
•Solve real-time complex problems and adapt to technological changes with the ability of
lifelong learning.
•Work as data scientists, entrepreneurs, and bureaucrats for the goodwill of the society
and pursue higher education.
•Exhibit professional ethics and moral values with good leadership qualities and effective
interpersonal skills.
• NA
Prerequisite:-
Recap-
Features of Tableau
•Data Blending: Data blending is the most important feature in
Tableau. It is used when we combine related data from multiple
data sources, which you want to analyze together in a single view,
and represent in the form of a graph.
•Real-time analysis: Tableau can help extract valuable
information from fast moving data with interactive analytics.
•The Collaboration of data: Team members can share data, make
follow up queries, and forward easy-to-digest visualizations to
others who could gain value from the data.
12/12/2023 Sanchi Kaushik UNIT 05 Data Analytics 30
THE CONCEPT LEARNING TASK
Topic Objective
Step 2) This will start downloading the .exe file for Windows
by default, and you can see the downloading process in the
bottom left corner of the website.
Line Chart
The line chart, or line graph, is another familiar method for
displaying data. It connects several distinct data points,
presenting them as one continuous evolution. The result is a
simple, straightforward way to visualize changes in one value
relative to another.
•Go to a new Worksheet
•Drag Order Date into Columns.
•Drag Sales into Rows.
Line Chart
Step 1: Add Dimension Field and Measure Field
Step 2: Add Profit Measure Field to Rows
Step 3: Drag SUM Measure Field to Y-Axis
Step 4: Find the Details of Measure Fields
Step 5: Manage the Appearance of Tableau Line Chart
Bar Chart
A bar chart or graph is used to represent category wise data of a
dataset. The length of the bars is proportional to the value that
each category represents. This gives instant insight into the data
pictorially. In Tableau, we can create several variants like a simple
vertical bar graph (also known as a line graph), segmented bar
graph, stacked bar graph and side by side bar chart.
Tree Map
The tree map displays data in nested rectangles. The dimensions
define the structure of the tree map and measures define the
size or color of the individual rectangle. The rectangles are easy
to visualize as both the size and shade of the color of the
rectangle reflect the value of the measure.
A Tree Map is created using one or more dimension with one or
two measures.
Show Me in Tableau
Show Me window holds the most commonly used charts
in Tableau. You can use any of these charts to view the
data. Click Show Me on the toolbar to open this window,
as we have shown below.
Side-by-Side Circle
Highlight Table Pie Charts Side-by-Side Bars
Views
Dual Combination
Discrete Area Chart Scatter Plot Histograms
Chart
Box and Whisker Plots Gantt Chart Bullet Graph Packed Bubbles Chart
Aggregate Functions
Let's see some essential examples of aggregate functions:
•AVG (expression): It returns the average of all the values in the
expression. AVG is used only with numeric fields. And the Null
values are ignored.
•COUNT (expression): It returns the number of items in a
group. And the Null values are not counted.
Aggregate Functions
Let's see some essential examples of aggregate functions:
•MEDIAN (expression): It returns the median of an expression
over all records. Median can only be used with numeric fields,
and Null values are ignored.
•STDEV (expression): It returns the statistical standard
deviation of all values in the given expression based on a
sample of the population.
Aggregate Functions
Let's see some essential examples of aggregate functions:
•MEDIAN (expression): It returns the median of an expression
over all records. Median can only be used with numeric fields,
and Null values are ignored.
•STDEV (expression): It returns the statistical standard
deviation of all values in the given expression based on a
sample of the population.
Data Interpreter
If your data source is Excel, CSV, PDF, or Google Sheets
and you are facing data-related challenges like stacked
headers, additional information in the adjoining header or
footer, sub-tables etc., Tableau has a really useful in-built
utility to do the cleaning and identification of the tables in
your dataset.
Data Interpreter
If your data source is Excel, CSV, PDF, or Google Sheets
and you are facing data-related challenges like stacked
headers, additional information in the adjoining header or
footer, sub-tables etc., Tableau has a really useful in-built
utility to do the cleaning and identification of the tables in
your dataset.
Data Manipulation
Multiple data manipulation operations can be carried out to
make data fit for visualization in the data preview pane or
the data pane in the worksheet view. The data manipulation
options vary by the data type of the columns. For example,
the split option only comes up for string datatype columns,
the create bins option only comes up for numeric datatype
columns, etc.
Hide
Right-click on the column header and select hide to hide an unwanted
column from the data view. It is best practice to hide unwanted columns
before extract creation in order to reduce the data extract size and reduce
latency in load time. Unwanted columns can also be hidden just to prevent
any unwanted clutter in the data pane.
Group
Groups can be created in Tableau for string datatype column values in
order to assign the same alias for multiple values or reduce the
granularity of values for a column. To create a group for a column,
right click on its header and choose create group.
Split
Columns consisting of string values can be easily split into multiple columns
using the split or custom split options. For example, the name field can be
split into two columns consisting of the first name and last name using split.
Split automatically determines the space delimiter in the string value and
splits the strings accordingly.
Bins
The values of a numeric column can be grouped into bins. Bins are
used to convert continuous numeric values into discrete sequences of
range values. The bins are then used to visualize the trends in
histograms.
Calculated Fields
Calculated fields can be used if you need to create customized logic
for manipulating certain data types or data values. There are a large-
range of functions available in Tableau that can used individually or
collectively for data manipulation.
Pivot
Consider the pivoted dataset shown below. Since the dimension
values for the year are the column headers, a chart based on years
cannot be plotted. In order to convert this dataset from the pivoted, or
wide, shape to the narrow shape, which is preferred for visualization,
use the pivot option, as shown below
Tableau Filters
Tableau lets you filter individual views or entire data sources based on
dimensions, measures, or sets. In databases, filtering is a useful way to see
only the data you want to see. When the data is visualized in a readable,
actionable format, Tableau filters are required to highlight any underlying
insights that can be derived from it.
Tableau filters are based on a condition that can be used to limit the
number of records in the database. Tableau filters support both simple
cases based on field values and advanced calculations or context-based
filters.
Extract Filter
The Tableau filter that extracts a small subset of data from the
original data source is the extract filter. If the user extracts data
from the data source, this Tableau filter is used.
After connecting the text file to Tableau, you’ll see two options
in the top right corner of the data source tab: Live and Extract.
A direct connection to a data source is referred to as a live
connection.
Context Filter
A Context Filter is a standalone Tableau filter that can create a
new dataset from the original data set and compute the
worksheet selections. Context Filters are used to boost the
performance of data source views, filters, and queries. You
could run a context filter before any other Tableau Filters, and
then apply the
Dimension Filter
In Tableau, dimensions are independent fields, typically
any field that contains categorical or qualitative data.
Dimension Filters are those Tableau Filters that are
applied to dimensional data. It’s a non-aggregated filter
that allows you to add a dimension, group, sets, and bins.
The top or bottom conditions, wildcard match, and
formula can all be used to apply a Dimension Filter.
Dimension Filter
In Tableau, dimensions are independent fields, typically
any field that contains categorical or qualitative data.
Dimension Filters are those Tableau Filters that are
applied to dimensional data. It’s a non-aggregated filter
that allows you to add a dimension, group, sets, and bins.
The top or bottom conditions, wildcard match, and
formula can all be used to apply a Dimension Filter.
Measure Filter
The Tableau Filter that can be used to filter data based on the
values in a measure is called the Measure Filter. Fields
containing quantitative data are commonly referred to as
measures. The data can be modified using the aggregated
measure values in a Measure Filter.
User Filter
The Tableau filter that protects the row-level data in a dataset
is the User Filter. When the workbook is published on a server,
it can be used. For different users, different filtering conditions
can be used. It’s the process of limiting what data a user can
see depending on who’s looking at the dashboard.
12.Now, click OK and notice the color selection of your entire view.
Assignment 1
1. What is the Difference Between Tree-maps and Heat Maps?
2. What is a Parameter in Tableau? Give an Example.
3. Write steps for installation of Tableau.
4. Write Overview of SUM, AVR, and Aggregate features ,Creating custom
calculations and fields.
5. Explain How to apply Applying new data calculations to your visualization.
6. What Are the Different Joins in Tableau?
7. What is the Difference Between a Live Connection and an Extract?
8. What is a Calculated Field, and How Will You Create One?
9. How Can You Display the Top Five and Bottom Five Sales in the Same View?
10.Is There a Difference Between Sets and Groups in Tableau?
12/12/2023 Sanchi Kaushik UNIT 05 Data Analytics 122
Faculty VideoTHE
Links,CONCEPT
You tube &LEARNING
NPTEL Video TASK
Links and Online
Courses Details
https://www.youtube.com/watch?v=6mBtTNggkUk
https://www.youtube.com/watch?v=HGMrIZq5dq0
https://www.youtube.com/watch?v=ttCDqyfrcEc
1.What is Tableau
2.Calculations in Tableau
a) Numerical
b) String
c) Date
d) All of above
a) Table Calculation
b) Basic Expression
c) LoD Expression
a) 2
b) 8
c) 16
d) 32
a) Horizontal
b) Vertical
c) Image Extract
a) Dimensions
b) Parameters
c) Sets
d) Measures
a) Number (whole)
b) String
c) Boolean
d) Float
9.)Views in Tableau
a) Sheet
b) Dashboard
c) Story
d) All of above
a) .twb
b) .twbx
c) .hyper
d) .tde