Data Science
Pranav Sharma
Introduction
Data is everywhere, and its volume is growing exponentially. Data science
covers the ways in which data is discovered, conditioned, extracted, compiled,
processed, analysed, interpreted, modelled, visualized, reported on, and
presented, regardless of the size of the data being processed. It is the study
of where information comes from, what it represents, and how it can be turned
into a valuable resource for a business. Mining large amounts of structured and
unstructured data to identify patterns can help an organization rein in costs,
increase efficiency, recognize new market opportunities, and strengthen its
competitive advantage. The field draws on mathematics, statistics, and computer
science, and incorporates techniques such as machine learning, cluster
analysis, data mining, and visualization.
“The art of uncovering the insights and trends that are hiding behind
the data.”
Netflix mines movie-viewing patterns to understand what drives user interest,
and uses the results to decide which Netflix original series to produce.
Procter & Gamble uses time series models to understand future demand more
clearly, which helps it plan production levels.
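As a rough illustration of what a time series model does, the sketch below applies simple exponential smoothing to a short demand history. The technique, the function name `exponential_smoothing`, the `alpha` value, and the demand figures are all assumptions chosen for this example; the source does not say which models Procter & Gamble actually uses.

```python
def exponential_smoothing(series, alpha):
    """Return the exponentially smoothed series.

    The last smoothed value can be read as a one-step-ahead forecast.
    """
    if not series:
        raise ValueError("series must not be empty")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = [series[0]]  # start from the first observation
    for value in series[1:]:
        # Blend the new observation with the previous smoothed level.
        smoothed.append(alpha * value + (1 - alpha) * smoothed[-1])
    return smoothed


# Hypothetical monthly demand figures, made up for illustration.
monthly_demand = [100, 120, 110, 130]
forecast = exponential_smoothing(monthly_demand, alpha=0.5)[-1]
print(forecast)  # 120.0
```

A larger `alpha` makes the forecast react faster to recent demand, while a smaller one smooths out short-term noise.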
Data Science Process
Step 1: Organize Data
This step covers the physical storage and formatting of data, and incorporates
best practices in data management.
Step 2: Package Data
In this step, prototypes are created, visualizations are built, and statistical
analysis is performed. It involves logically joining and manipulating the raw
data into a new representation and package.
Step 3: Deliver Data
In this step, the data is delivered to those who need it.
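The three steps above can be sketched as a small pipeline. This is a minimal sketch under stated assumptions, not a prescribed implementation: the function names `organize`, `package`, and `deliver`, the CSV layout, and the sample values are all hypothetical.

```python
import csv
import io
import statistics

# Hypothetical raw export; the column names and values are made up.
RAW_CSV = """region,units
north,10
south,14
north,6
south,
"""


def organize(raw_text):
    """Step 1: parse raw text into typed records, dropping incomplete rows."""
    rows = csv.DictReader(io.StringIO(raw_text))
    return [
        {"region": row["region"], "units": int(row["units"])}
        for row in rows
        if row["units"]  # skip rows with a missing units value
    ]


def package(records):
    """Step 2: group the records and compute summary statistics per region."""
    by_region = {}
    for record in records:
        by_region.setdefault(record["region"], []).append(record["units"])
    return {
        region: {"total": sum(units), "mean": statistics.mean(units)}
        for region, units in by_region.items()
    }


def deliver(summary):
    """Step 3: render the summary as a plain-text report for its readers."""
    return "\n".join(
        f"{region}: total={stats['total']}, mean={stats['mean']}"
        for region, stats in sorted(summary.items())
    )


report = deliver(package(organize(RAW_CSV)))
print(report)
```

Keeping each step in its own function means the storage format, the analysis, and the delivery channel can each change without touching the other two.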
Need for Data Science:
Data science helps in finding insights and answering questions by drawing on
the following fields:
1. mathematics,
2. statistics,
3. computer science and programming,
4. statistical modeling,
5. database technologies,
6. data modeling,
7. artificial intelligence and learning,
8. natural language processing, visualization, predictive analytics, and so
on.
Tools such as SQL are used for data warehousing. Non-programming tools such as
Excel are also used, and SAS (previously "Statistical Analysis System") is
also popular. As technology advances, cheaper tools for data storage,
analytics, and visualization are becoming available.
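To give a sense of the kind of question SQL answers, the sketch below runs an aggregate query against an in-memory SQLite database from Python's standard library. The `sales` table, its columns, and its rows are hypothetical, and SQLite stands in here for a real data warehouse only to keep the example self-contained.

```python
import sqlite3

# An in-memory database, used here as a stand-in for a warehouse.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, units INTEGER)")

# Hypothetical rows, made up for illustration.
conn.executemany(
    "INSERT INTO sales (region, units) VALUES (?, ?)",
    [("north", 10), ("south", 14), ("north", 6)],
)

# Total units sold per region, in alphabetical order.
totals = conn.execute(
    "SELECT region, SUM(units) FROM sales GROUP BY region ORDER BY region"
).fetchall()
conn.close()

print(totals)  # [('north', 16), ('south', 14)]
```

The same `GROUP BY` query is what a spreadsheet user would build with a pivot table in Excel.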
Summary:
Companies have never before collected as much varied data as they do today,
nor have they needed to handle it as quickly. The variety and amount of data
that they collect through many different mechanisms are growing exponentially.
This growth requires new strategies and techniques for capturing, storing,
processing, analysing, and visualizing the data.
Data science is an umbrella term that encompasses all of the techniques and
tools used during the life cycle stages of useful data.