Data Science

This document discusses data science and its importance. It defines data science as reflecting how data is discovered, processed, analyzed, modeled and visualized regardless of size. Data science employs techniques like machine learning, data mining and visualization to turn large amounts of structured and unstructured data into valuable business insights. It discusses the data science process, need for data science, algorithms used, properties of data, benefits, tools and techniques in data science like mathematics, statistics, programming and modeling.

Uploaded by

Pranav Sharma

DATA SCIENCE
Pranav Sharma
Introduction
Data is everywhere, and is found in huge and exponentially increasing
quantities. Data science as a whole reflects the ways in which data is
discovered, conditioned, extracted, compiled, processed, analysed, interpreted,
modelled, visualized, reported on, and presented regardless of the size of the
data being processed. Data science is the study of where information comes
from, what it represents and how it can be turned into a valuable resource for
a business. Mining large amounts of structured and unstructured data
to identify patterns can help an organization rein in costs, increase efficiencies,
recognize new market opportunities and increase the organization's competitive
advantage. The data science field employs mathematics, statistics and computer
science disciplines, and incorporates techniques like machine learning, cluster
analysis, data mining and visualization.

“The art of uncovering the insights and trends that are hiding behind
the data.”

Data science (discovery of data insight)


This aspect of data science is all about uncovering findings from data: diving in
at a basic level to mine and understand complex behaviours, trends, and inferences.
It is about surfacing hidden insight that can help companies make
smarter business decisions. For example:

• Netflix mines movie viewing patterns to understand what drives user
interest, and uses that to decide which Netflix original series to
produce.
• Procter & Gamble uses time series models to understand future demand
more clearly, which helps it plan production levels more optimally.
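
To make the time series idea concrete, here is a minimal sketch of a moving-average forecast. It is only an illustration: the demand figures and the function name `moving_average_forecast` are hypothetical, and nothing here describes the models any of the companies above actually use.

```python
from statistics import fmean

def moving_average_forecast(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    if window < 1 or len(history) < window:
        raise ValueError("need at least `window` observations")
    return fmean(history[-window:])

# Hypothetical monthly demand, in units.
monthly_demand = [100, 110, 120, 130]
next_month = moving_average_forecast(monthly_demand)
print(next_month)  # 120.0
```

A moving average is about the simplest time series model there is; real demand planning would normally also account for trend and seasonality.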
Data Science Process
Step 1: Organize Data
It covers the physical storage and formatting of data, and incorporates best
practices in data management.
Step 2: Package Data
In this step prototypes are created, visualizations are built and statistics are
computed. It involves logically joining and manipulating the raw data into a
new representation and package.
Step 3: Deliver Data
In this step the data is delivered to those who need it.
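
The three steps can be sketched as a tiny pipeline. This is an illustrative sketch only: the CSV contents and the function names `organize`, `package` and `deliver` are assumptions made for the example, and a real project would read from files or a database rather than an inline string.

```python
import csv
import io
import json
from statistics import fmean

# Hypothetical raw export; the column names are assumptions for this sketch.
RAW_CSV = """region,units
north,120
south,95
north,130
south,105
"""

def organize(raw_text):
    """Step 1: parse raw text into consistently formatted, typed records."""
    reader = csv.DictReader(io.StringIO(raw_text))
    return [{"region": row["region"], "units": int(row["units"])} for row in reader]

def package(records):
    """Step 2: group the records and compute summary statistics per region."""
    by_region = {}
    for rec in records:
        by_region.setdefault(rec["region"], []).append(rec["units"])
    return {
        region: {"total": sum(units), "mean": fmean(units)}
        for region, units in by_region.items()
    }

def deliver(summary):
    """Step 3: hand the packaged result to its consumer, here as JSON text."""
    return json.dumps(summary, sort_keys=True)

report = deliver(package(organize(RAW_CSV)))
print(report)
```

Keeping each step as its own function mirrors the process: storage and formatting concerns stay in `organize`, analysis stays in `package`, and the output format can change in `deliver` without touching the rest.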
Need for Data Science:
Data science helps answer the following kinds of questions:

1. Descriptive Analytics: What happened? (Hindsight)

2. Diagnostic Analytics: Why did it happen? (Insight)

3. Predictive Analytics: What will happen?

4. Prescriptive Analytics: How can we make it happen? (Foresight)
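
As a rough illustration, the four kinds of analytics can be applied to one small data set. The figures below are invented for the example, and the helper `fit_line` is a hypothetical name for a plain least-squares fit; a strong correlation here is only a hint about the "why", not proof of cause.

```python
from statistics import fmean

# Hypothetical monthly figures: advertising spend and the sales that followed.
ad_spend = [10, 20, 30, 40, 50]
sales = [24, 46, 64, 87, 104]

def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept, correlation)."""
    mx, my = fmean(xs), fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    slope = sxy / sxx
    return slope, my - slope * mx, sxy / (sxx * syy) ** 0.5

slope, intercept, corr = fit_line(ad_spend, sales)

# 1. Descriptive: what happened? Average sales over the period.
average_sales = fmean(sales)
# 2. Diagnostic: why did it happen? How closely sales track ad spend.
spend_sales_correlation = corr
# 3. Predictive: what will happen at a spend of 60?
predicted_sales = intercept + slope * 60
# 4. Prescriptive: what spend would be needed to reach sales of 150?
required_spend = (150 - intercept) / slope

print(average_sales, round(spend_sales_correlation, 3))
print(round(predicted_sales, 1), round(required_spend, 1))
```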


Five Basic Questions that Data Science
Answers (Algorithms):
1. Classification Algorithms: Is this A or B? (yes/no type)
2. Regression Algorithms: How much, or how many?
3. Anomaly Detection Algorithms: Is this normal or weird? E.g. used in
credit card fraud detection.
4. Clustering Algorithms: How is this organized?
5. Reinforcement Learning Algorithms: What should be done next, given the
possible future outcomes?
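
Three of these question types can be sketched in a few lines each. The code below is a simplified illustration, not a production approach: the function names, the sample data and the z-score threshold of 2.0 are all assumptions chosen for the example, and it uses nearest-centroid classification, a z-score rule and a basic two-group k-means loop as stand-ins for each family of algorithms.

```python
import math
from statistics import fmean, pstdev

def classify(point, labelled_examples):
    """Classification: return the label whose centroid is nearest to `point`."""
    centroids = {
        label: tuple(fmean(axis) for axis in zip(*points))
        for label, points in labelled_examples.items()
    }
    return min(centroids, key=lambda label: math.dist(point, centroids[label]))

def find_anomalies(values, threshold=2.0):
    """Anomaly detection: flag values far from the mean, measured in standard deviations."""
    mean, spread = fmean(values), pstdev(values)
    if spread == 0:
        return []
    return [v for v in values if abs(v - mean) / spread > threshold]

def two_clusters(values, iterations=10):
    """Clustering: split 1-D values into two groups with a basic k-means loop (k = 2)."""
    low, high = min(values), max(values)
    if low == high:
        raise ValueError("need at least two distinct values")
    for _ in range(iterations):
        near_low = [v for v in values if abs(v - low) <= abs(v - high)]
        near_high = [v for v in values if abs(v - low) > abs(v - high)]
        low, high = fmean(near_low), fmean(near_high)
    return near_low, near_high

# Hypothetical sample data for each question.
examples = {"A": [(1.0, 1.2), (0.8, 1.0)], "B": [(4.0, 4.2), (4.4, 3.8)]}
label = classify((1.1, 0.9), examples)                         # Is this A or B?
odd_charges = find_anomalies([20, 25, 22, 24, 21, 23, 500])    # Is this normal or weird?
groups = two_clusters([1, 2, 3, 10, 11, 12])                   # How is this organized?
print(label, odd_charges, groups)
```

With very small samples a single extreme value inflates the standard deviation, which is why the sketch uses a threshold of 2.0 rather than the more common 3.0.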

Properties of Data for Data Science:

• Is your data relevant?
• Is your data connected?
• Is your data accurate?
• Is it enough to work with?
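
Some of these questions can be partly checked in code before analysis begins. The sketch below is illustrative only: the function `check_readiness`, the field names and the plausible ranges are assumptions, "connected" is interpreted here as having no missing values, and the range test is just a proxy for accuracy. Relevance still has to be judged by a person who knows the problem.

```python
def check_readiness(rows, required, min_rows, ranges):
    """Return simple yes/no checks for a list of dict records."""
    # Connected: every required field is present and non-empty in every row.
    connected = all(
        row.get(field) not in (None, "") for row in rows for field in required
    )
    # Accurate (proxy): every numeric value falls inside its plausible range.
    accurate = all(
        low <= row[field] <= high
        for row in rows
        for field, (low, high) in ranges.items()
        if isinstance(row.get(field), (int, float))
    )
    # Enough: there are at least `min_rows` records to work with.
    enough = len(rows) >= min_rows
    return {"connected": connected, "accurate": accurate, "enough": enough}

# Hypothetical records with deliberate problems.
rows = [
    {"age": 34, "income": 52000},
    {"age": 29, "income": None},     # missing value, so not connected
    {"age": 240, "income": 61000},   # implausible age, so not accurate
]
report = check_readiness(
    rows, required=["age", "income"], min_rows=100, ranges={"age": (0, 120)}
)
print(report)
```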
Benefits of Data science:
1. The main advantage of enlisting data science in an organization is the
empowerment and facilitation of decision-making.
2. Organizations with data scientists can factor in quantifiable, data-based
evidence into their business decisions. These data-driven decisions can
ultimately lead to increased profitability and improved operational
efficiency, business performance and workflows.
3. In customer-facing organizations, data science helps identify and refine
target audiences.
4. Data science can also assist recruitment: internal processing of
applications and data-driven aptitude tests and games can help an
organization's human resources team make quicker and more accurate
selections during the hiring process.
5. Specific benefits of data science vary depending on the company's goals
and the industry. Sales and marketing departments, for example, can mine
customer data to improve conversion rates or create one-to-one marketing
campaigns.
Tools and Techniques used in Data Science
Data science incorporates:

1. mathematics,
2. statistics,
3. computer science and programming,
4. statistical modeling,
5. database technologies,
6. data modeling,
7. artificial intelligence and learning,
8. natural language processing, visualization, predictive analytics,
and so on.

Tools such as SQL are used for data warehousing. Non-programming
tools like Excel are also used, and SAS (previously "Statistical Analysis System")
is also popular. As technology has advanced, cheaper tools for data storage,
analytics and visualization have become available.
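
As a small illustration of SQL at work, the sketch below runs an aggregate query against an in-memory SQLite database using Python's built-in `sqlite3` module. The `sales` table and its contents are hypothetical, and SQLite stands in for whatever warehouse a real organization would use.

```python
import sqlite3

# In-memory database with a hypothetical sales table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, units INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 120), ("south", 95), ("north", 130), ("south", 105)],
)

# Total units per region, in alphabetical order of region.
totals = conn.execute(
    "SELECT region, SUM(units) FROM sales GROUP BY region ORDER BY region"
).fetchall()
conn.close()

print(totals)  # [('north', 250), ('south', 200)]
```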
Summary:
Companies have never before collected as much varying data as they do today,
nor have they needed to handle it as quickly. The variety and amount of data
that they collect through many different mechanisms is growing exponentially.
This growth requires new strategies and techniques by which the data is
captured, stored, processed, analysed, and visualized.
Data science is an umbrella term that encompasses all of the techniques and
tools used during the life cycle stages of useful data.
