0% found this document useful (0 votes)
82 views10 pages

Data Science

Uploaded by

rajputsandesh726
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
82 views10 pages

Data Science

Uploaded by

rajputsandesh726
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Name: Sandesh Lilesh Patil

Class: TYBcs Div: B Roll No. 203

Activity Name: Detail Process applied in Data Analysis phase


wise

College Name: Dr. D. Y. Patil ACS College Pimpri Pune 411018


 What is the Data Analysis?

• Data analysis inspects, cleans, transforms, and models


data to extract insights and support decision-making. As a
data analyst, your role involves dissecting vast datasets,
unearthing hidden patterns, and translating numbers into
actionable information.
• Data and analysis together form the backbone of
evidence-based decision-making, enabling organizations and
individuals to understand complex phenomena, predict outcomes,
and derive actionable conclusions for improved outcomes and
efficiency.
 Steps For Data Analysis Process:

1.Define the Problem or Research Question


2.Collect Data
3.Data Cleaning
4.Analyzing the Data
5.Data Visualization
6.Presenting Data
 Define The Problem Or Research Question:

• In the first step of process the data analyst is


given a problem/business task. The analyst has to understand
the task and the stakeholder’s expectations for the solution. A
stakeholder is a person that has invested their money and
resources to a project. The analyst must be able to ask
different questions in order to find the right solution to
their problem. The analyst has to find the root cause of the
problem in order to fully understand the problem. Communicate
effectively with the stakeholders and other colleagues to
completely understand what the underlying problem.
 Collect Data:

• The second step is to Prepare or Collect the Data. This step


includes collecting data and storing it for further analysis. The analyst has
to collect the data based on the task given from multiple sources. The data
has to be collected from various sources, internal or external sources.
Internal data is the data available in the organization that you work for
while external data is the data available in sources other than your
organization. The data that is collected by an individual from their own
resources is called first-party data. The data that is collected and sold is
called second-party data. Data that is collected from outside sources is
called third-party data. The common sources from where the data is collected
are Interviews, Surveys, Feedback, Questionnaires. The collected data can be
stored in a spreadsheet or SQL database.
 Data Cleaning:

• The third step is Clean and Process Data. After the data is
collected from multiple sources, it is time to clean the data. Clean data
means data that is free from misspellings, redundancies, and irrelevance.
Clean data largely depends on data integrity. There might be duplicate data
or the data might not be in a format, therefore the unnecessary data is
removed and cleaned. There are different functions provided by SQL and
Excel to clean the data. This is one of the most important steps in Data
Analysis as clean and formatted data helps in finding trends and solutions.
The most important part of the Process phase is to check whether your data
is biased or not.
 Analyzing The Data:

• The fourth step is to Analyze. The cleaned data is


used for analyzing and identifying trends. It also performs
calculations and combines data for better results. The tools
used for performing calculations are Excel or SQL. These tools
provide in-built functions to perform calculations or sample
code is written in SQL to perform calculations. Using Excel, we
can create pivot tables and perform calculations while SQL
creates temporary tables to perform calculations. Programming
languages are another way of solving problems.
 Data Visualization:

• The fifth step is visualizing the data. Nothing is more


compelling than a visualization. The data now transformed has to be made
into a visual (chart, graph). The reason for making data visualizations is
that there might be people, mostly stakeholders that are non-technical.
Visualizations are made for a simple understanding of complex data. Tableau
and Looker are the two popular tools used for compelling data
visualizations. Tableau is a simple drag and drop tool that helps in
creating compelling visualizations. Looker is a data viz tool that directly
connects to the database and creates visualizations. Tableau and Looker are
both equally used by data analysts for creating a visualization. R and
Python have some packages that provide beautiful data visualizations. R has
a package named ggplot which has a variety of data visualizations.
 Presenting Data:

• Presenting the data involves transforming raw information into a


format that is easily comprehensible and meaningful for various
stakeholders. This process encompasses the creation of visual
representations, such as charts, graphs, and tables, to effectively
communicate patterns, trends, and insights gleaned from the data analysis.
The goal is to facilitate a clear understanding of complex information,
making it accessible to both technical and non-technical audiences.
Effective data presentation involves thoughtful selection of visualization
techniques based on the nature of the data and the specific message
intended. It goes beyond mere display to storytelling, where the presenter
interprets the findings, emphasizes key points, and guides the audience
through the narrative that the data unfolds.
THANK YOU

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy