0% found this document useful (0 votes)
17 views6 pages

Unit 5

Data visualization is the graphical representation of quantitative information, utilizing elements like graphs, charts, and maps to make data easier to understand. It plays a crucial role in analyzing large datasets, revealing trends, outliers, and patterns, and can be achieved through various tools and techniques. Effective data visualization involves understanding the audience, simplifying complexity, using relevant charts, telling a story, and testing the visualization for clarity and accuracy.

Uploaded by

lemel52930
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views6 pages

Unit 5

Data visualization is the graphical representation of quantitative information, utilizing elements like graphs, charts, and maps to make data easier to understand. It plays a crucial role in analyzing large datasets, revealing trends, outliers, and patterns, and can be achieved through various tools and techniques. Effective data visualization involves understanding the audience, simplifying complexity, using relevant charts, telling a story, and testing the visualization for clarity and accuracy.

Uploaded by

lemel52930
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

Data Visualization;is a graphical representation of quantitative information and data by using visual

elements /context like graphs, charts, and maps;It converts large and small data sets into visuals, which is easy
to understand and process for humans;It provides tools an accessible way to see and understand trends,
outliers, and patterns in data. In the world of Big Data, the data visualization tools and technologies are
required to analyze vast amounts of information; These are common in your everyday life, but they always
appear in the form of graphs and charts. The combination of multiple visualizations and bits of information are
still referred to as Info graphics;Data visualizations are used to discover unknown facts and trends. You can
see visualizations in the form of line charts to display change over time;Visual Elements;Charts and
Graphs;Bar charts, line graphs, scatter plots, and pie charts are common forms of visual data representations
;Tables;Display raw data in a structured manner, often used when precision is more important than visual
appeal;Maps;Geographic data can be visualized through heat maps, choropleth maps, or point-based maps to
show patterns across locations;Effective Data Visualization in 5 Steps;Identify the Audience;The first step in
creating a data visualization is to understand your audience, including their knowledge, technical skills,
attention span, and interest in the topic;Remove Unnecessary Complexity; Keep your data visualization
simple by eliminating any elements that could confuse or distract the audience;Use Relevant Charts;To create
effective data visualizations, use the right charts for the data. For example;Pie Chart;Bar Chart;Tape
Diagram;Pictograph;Scatter plot;Time
Series;Area Chart;Bubble Graph;Line
Chart;Radar Chart;Venn diagram;Heat
map;Tell a Story;Data visualization should
tell a clear story, not just present
numbers. Provide relevant background
information, highlight key points, and
guide the audience towards the
conclusion;Test Your Data Visualization;
The final step is to test the data
visualization to ensure it's ready for
presentation. Check that the key points
are clear, the data is accurate, and the
charts are easy to understand. Ask a colleague to review it for errors and provide feedback on the design and
content;Tools;Tableau; A popular tool that helps creates interactive and shareable visualizations like charts
and graphs. It's easy to use and works well with large datasets;Power BI;A Microsoft tool that allows you to
create interactive reports and dashboards;Google Data Studio;A free tool by Google that helps creates
customizable reports and dashboards using data from various sources like Google Analytics and Google
Sheets;Excel;A well-known spreadsheet tool that includes basic charting and graphing features, making it easy
to create simple visualizations from data;Plotly;A tool for creating interactive plots and graphs, often used for
web-based data visualizations;Qlik;A tool that helps create interactive visualizations and dashboards, similar
to Tableau and Power BI, often used for data discovery and analytics;D3.js;A JavaScript library used for
creating custom and complex data visualizations on websites; R(ggplot2);A programming language and library
for creating detailed and customized statistical visualizations, especially used by data scientists and
analysts;Infogram;A simple tool for creating charts, infographics, and reports;Looker;A business intelligence
tool used to create data reports and dashboards. It integrates well with databases and helps in real-time data
analysis;Uses;Understanding Trends;Identifying Patterns and Relationships;Simplifying Complex Data;Making
Decisions;Communicating Insights;Applications; Business Intelligence and Reporting;Financial
Analysis;Healthcare;Marketing and Sales ;Human Resources.
Pixel-Oriented Visualization Technique;These are a way to display very large datasets by using pixels as a
visual representation of data values. Each data point is shown as a
tiny dot (a pixel) on the screen, and its color represents the value of
the data;If the data has many dimensions,a separate section of the
screen is created for each dimension. The data is arranged in the
same order across all these windows, so you can compare patterns
easily;This method makes it possible to show massive amounts of
data at once while still being able
to spot trends, patterns, or unusual
values.It uses every pixel on the
screen efficiently to give a clear and
compact view of the data;(Diagram
); For a data set of m dimensions,
create m windows on the screen,
one for each dimension; The m
dimension values of a record are
mapped to m pixels at the
corresponding positions in the
windows;The colors of the pixels
reflect the corresponding values;
(Diagram);To save space and show the connections among multiple dimensions, space filling is often done in a
circle segment;Advantages;High Data Density;Efficient Pattern Recognition;Scalability;Dimensional Flexibility;
Drawback;Hard to Understand;Color Confusion;Limited Detail;Screen Limitations;Applications; Financial
Market Analysis;Medical and Genomic Research; Network Security;Climate Science.

Geometric Projection Visualization Techniques:It simplify multi-dimensional data by projecting it into a lower-
dimensional space (2D or 3D) using mathematical
transformations, while preserving the original structure and
relationships;Line Plot;This is the plot that you can see in the
nook and corners of any sort of analysis between 2 variables.
;The line plots are nothing but the values on a series of data
points will be connected with straight lines;The plot may seem
very simple but it has more applications not only in machine
learning but in many other areas;Used to analyze the
performance of a model using the ROC- AUC curve;(Diagram);
Bar Plot;This is one of the widely used plots, that we would
have seen
multiple
times not just in data analysis, but we use this plot
also wherever there is a trend analysis in many
fields; We can visualize the data in a cool plot and
can convey the details straight forward to others
;This plot may be simple and clear but it’s not much
frequently used in Data science applications;
(Diagram);
;Stacked Bar Graph;Unlike a Multi-set Bar Graph which displays their bars side-by-side, Stacked Bar Graphs
segment their bars. Stacked Bar Graphs are used to show how a larger category is divided into smaller
categories and what the relationship of each part has on the total amount. There are two types of Stacked Bar
Graphs;(Diagram);Simple Stacked Bar Graphs place each value for the segment after the previous one. The
total value of the
bar is all the
segment values
added together.
Ideal for comparing
the total amounts
across each group
/segmented bar;
100% Stack Bar
Graphs show the
percentage-of-the-
whole of each group
and are plotted by
the percentage of each value to the total amount in each group. This makes it easier to see the relative
differences between quantities in each group;One major flaw of Stacked Bar Graphs is that they become
harder to read the more segments each bar has. Also comparing each segment to each other is difficult, as
they're not aligned on a common baseline;Scatter Plot;(Diagram);It is one of the most commonly used
plots used for visualizing simple data in Machine
learning and Data Science;This plot describes us
as a representation, where each point in the
entire dataset is present with respect to any 2 to
3 features(Columns);Scatter plots are available in
both 2-D as well as in 3-D. The 2-D scatter plot is
the common one, where we will primarily try to
find the patterns, clusters, and separability of the
data;The colors are assigned to different data
points based on how they were present in the
dataset i.e, target column representation;We can
color the data points as per their class label given
in the dataset;Box and Whisker Plot;This
plot can be used to obtain more statistical details

about the data;The straight lines at the maximum and


minimum are also called whiskers;Points that lie outside the
whiskers will be considered as an outlier;The box plot also
gives us a description of the 25th, 50th,75th quartiles;With
the help of a box plot, we can also determine the
Interquartile range(IQR) where maximum details of the data
will be present;These box plots come under univariate
analysis, which means that we are exploring data only with
one variable;(Diagram);
;Pie Chart;A pie chart shows a static number and how categories represent part of a whole the
composition of something. A pie chart represents numbers in
percentages, and the total sum of all segments needs to equal
100%;Extensively used in presentations and offices, Pie Charts
help show proportions and percentages between categories, by
dividing a circle into proportional segments. Each arc length
represents a proportion of each category, while the full circle
represents the total sum of
all the data, equal to 100%;
(Diagram);Donut Chart;
A donut chart is essenti ally a
Pie Chart with an area of the
centre cut out. Pie Charts are
sometimes criticised for focusing readers on the proportional areas of the
slices to one another and to the chart as a whole. This makes it tricky to see
the differences between slices, especially when you try to compare
multiple Pie Charts together;A Donut Chart somewhat remedies this
problem by de-emphasizing the use of the area. Instead, readers focus
more on reading the length of the arcs, rather than comparing the
proportions between slices ;Also, Donut Charts are more space-
efficient than Pie Charts because the blank space inside a Donut Chart
can be used to display information inside it; (Diagram);
Marimekko Chart;Also known as a Mosaic Plot;Marimekko
Charts are used to visualise categorical data over a pair of variables. In
a Marimekko Chart, both axes are variable with a percentage scale,
that determines both the width and height of each segment. So
Marimekko Charts work as a kind of twoway 100% Stacked Bar Graph.
This makes it possible to detect relationships between categories and
their subcategories via the two axes;The main flaws of Marimekko
Charts are that they can be hard to read, especially when there are
many segments. Also, it’s hard to accurately make comparisons
between each segment, as they are not all arranged next to each other
along a common baseline. Therefore, Marimekko Charts are better
suited for giving a more general overview of the data ;(Diagram);Applications;Data Exploration;Machine
Learning; Bioinformatics; Customer Segmentation;Psychology and Social Sciences;Finance ;Advantages; Data
Simplification;Pattern Recognition;Clustering and Outlier Detection; Versatility;Insight Generation;
Challenges;Information Loss;Interpretation;Computational Cost;Choice of Parameters.
Hierarchical Data Visualizations;Hierarchical Visualizations (or Trees) are collections of items, where each
item connects to one parent (except the root), and both items and connections can have multiple attributes
;Circle Packing;(Diagram);Circle Packing is a variation of a Treemap that uses circles instead of
rectangles. Containment within each circle represents a level in the hierarchy:
each branch of the tree is represented as a circle and its sub-branches are
represented as circles inside of it. The area of each circle can also be used to
represent an additional arbitrary value, such as quantity or file size. Colour may
also be used to assign categories or to represent another variable via different
shades;As beautiful as Circle Packing appears, it's not as space-efficient as a
Treemap, as there's a lot of empty space within the circles. Despite this,Circle
Packing actually reveals hierarchal structure better than Treemap;Sunburst
Diagram ;(Diagram);As known as a
Sunburst Chart, Ring Chart, Multi-level
Pie Chart, Belt Chart, Radial Treemap
;This type of visualisation shows
hierarchy through a series of rings, that
are sliced for each category node. Each
ring corresponds to a level in the
hierarchy, with the central circle
representing the root node and the hierarchy moving outwards from
it;Rings are sliced up and divided based on their hierarchical
relationship to the parent slice. The angle of each slice is either divided
equally under its parent node or can be made proportional to a
value;Colour can be used to highlight hierarchal groupings or specific
categories;Treemap; (Diagram); Treemaps are an alternative way
of visualising the hierarchical structure of a Tree Diagram while also
displaying quantities for each category via area size. Each category is
assigned a rectangle area with their subcategory rectangles nested
inside of it;When a quantity is assigned to a category, its area size is displayed in proportion to that quantity
and to the other quantities within the same parent category in a part-to-whole relationship. Also, the area size
of the parent category is the total of its subcategories. If no quantity is assigned to a subcategory, then it's
area is divided equally amongst the other subcategories within its
parent category;The way rectangles are divided and ordered into sub-
rectangles is dependent on the tiling algorithm used. Many tiling
algorithms have been developed, but the "squarified algorithm" which
keeps each rectangle as square as possible is the one commonly
used;Ben Shneiderman originally developed Treemaps as a way of
visualising a vast file directory on a computer, without taking up too
much space on the screen. This makes Treemaps a more compact and
space-efficient option for displaying hierarchies, that gives a quick
overview of the structure. Treemaps are also great at comparing the
proportions between categories via their area size;The downside to a
Treemap is that it doesn't show the hierarchal levels as clearly as other
charts that visualise hierarchal data (such as a Tree Diagram or
Sunburst Diagram) ;Applications; Organizational Structures;Biological
Classification;File Systems;Product Categories;Knowledge Organization; Family Trees and Genealogy;Project
Management;Website Structures;Advantages;Clarity/ Simplified Complex Data;Spatial Organization;Detail
Exploration;Intuitive Structure ; Highlight Proportions; Efficient Navigation;Drawback;Overcrowding;Limited
Detail;Scalability.
Icon-Based Visualization Techniques;It uses small icons to represent multidimensional data values
;Visualization of the data values as features of icons;Typical visualization methods o Chernoff Faces ,Stick
Figures;Chernoff Faces;(Diagram);A way to display variables on a two-dimensional surface, e.g., let x be
eyebrow slant, y be eye size, z be nose length, etc;The figure shows faces
produced using 10 characteristics–head eccentricity,eye size, eye spacing, eye
eccentricity, pupil size, eyebrow slant, nose size, mouth shape, mouth size, and
mouth opening. Each assigned one of 10 possible values. Stick Figure;(Diagram)
A census data figure showing age, income, gender, education  A 5-piece stick
figure (1 body and 4 limbs w. different angle/length)  Age, income are indicated
by position of the figure;Gender,education are indicated by angle/length;
Visualization can show a texture pattern;2 dimensions are mapped to the display
axes and the remaining dimensions are mapped to the angle and/ or length of the
limbs.

Visualizing Complex Data and Relations;For a large data set of high dimensionality, it would be difficult to
visualize all dimensions at the same time; Hierarchical visualization techniques partition all dimensions into
subsets (i.e., subspaces);The subspaces are visualized in a hierarchical manner;“Worlds-within-Worlds,” also
known as n-Vision, is a representative hierarchical visualization method;To visualize a 6-D data set, where the
dimensions are F,X1,X2,X3,X4,X5. ;We want to observe how F changes w.r.t. other dimensions. We can fix
X3,X4,X5 dimensions to selected values and visualize changes to F w.r.t. X1, X2 Most visualization techniques
were mainly for numeric data;Word Cloud;(Diagram);A visualisation method that displays how frequently
words appear in a given body of text, by making the size of each word
proportional to its frequency. All the words are then arranged in a cluster or
cloud of words. Alternatively, the words can also be arranged in any format:
horizontal lines, columns or within a shape;Word Clouds can also be used to
display words that have meta-data assigned to them. For example, in a Word
Cloud with all the World's country's names, the population could be assigned to
each name to determine its size;Colour used on Word Clouds is usually meaningless and is primarily aesthetic,
but it can be used to categorise words or to display another data variable;Typically, Word Clouds are used on
websites or blogs to depict keyword or tag usage;Word Clouds can also be used to compare two different
bodies of text together;Although being simple and easy to understand, Word Clouds have some major
flaws;Long words are emphasised over short words;Words whose letters contain many ascenders and
descenders may receive more attention.;They're not great for analytical accuracy, so used more for aesthetic
reasons instead.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy