Unit
INTRODUCTION
Understanding Data
Data can be text or numbers written on paper, bits and bytes in the memory of an
electronic device, or facts stored in a person's mind.
Data is factual information used as a basis for reasoning, discussion, calculation,
and decision making.
Types of Data
Categories of Data
Structured Vs Unstructured Data
Big Data
Big Data is a collection of data that is huge in volume and keeps growing exponentially with
time. Its size and complexity are so great that traditional data management tools cannot
store or process it efficiently. In short, Big Data is still data, only at a very large scale.
Some examples of Big Data:
• The New York Stock Exchange generates about one terabyte of new trade data per day.
• Social media: more than 500 terabytes of new data are ingested into the databases of the
social media site Facebook every day. This data comes mainly from photo and video uploads,
message exchanges, comments, and so on.
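To make these daily figures concrete, the short sketch below scales them to a year. It assumes the quoted rates stay constant for 365 days and uses decimal units (1 petabyte = 1000 terabytes). Both are simplifying assumptions, not claims from the sources, and the constant and function names are illustrative only.

```python
# Daily volumes quoted in the examples above (terabytes per day).
NYSE_TB_PER_DAY = 1
SOCIAL_MEDIA_TB_PER_DAY = 500

# Assumptions: a constant daily rate and decimal units (1 PB = 1000 TB).
DAYS_PER_YEAR = 365
TB_PER_PB = 1000


def yearly_volume_pb(tb_per_day):
    """Return the yearly data volume in petabytes for a constant daily rate."""
    return tb_per_day * DAYS_PER_YEAR / TB_PER_PB


nyse_pb = yearly_volume_pb(NYSE_TB_PER_DAY)
social_pb = yearly_volume_pb(SOCIAL_MEDIA_TB_PER_DAY)

print(f"Stock exchange: about {nyse_pb} PB per year")
print(f"Social media:   about {social_pb} PB per year")
```

Under these assumptions, the social media example alone adds up to well over a hundred petabytes per year, which is the scale at which traditional tools stop being practical.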
Types of Big Data
The types of Big Data are:
1. Structured Data
2. Unstructured Data
3. Semi-structured Data
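The three types can be illustrated with a minimal sketch that uses only the Python standard library. All sample values below (the trade rows, the user records, and the comment text) are made up for illustration and do not come from any real dataset.

```python
import csv
import io
import json

# Structured data: a fixed schema of rows and columns (hypothetical trade records).
structured_csv = "symbol,price,quantity\nABC,10.5,100\nXYZ,20.0,50\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))
total_value = sum(float(r["price"]) * int(r["quantity"]) for r in rows)

# Semi-structured data: self-describing keys, but fields may differ per record.
semi_structured_json = '[{"user": "a", "likes": 3}, {"user": "b", "tags": ["x", "y"]}]'
records = json.loads(semi_structured_json)
likes = [record.get("likes", 0) for record in records]  # missing field defaults to 0

# Unstructured data: free text with no predefined fields.
unstructured_text = "Great photo! Loved the video too."
word_count = len(unstructured_text.split())

print(total_value, likes, word_count)
```

Structured data can be queried directly by column name, semi-structured data needs a fallback for fields that may be absent, and unstructured data has to be processed (here, simply split into words) before anything can be measured.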
3Vs
Data science
Data science is the field of study that combines domain expertise, programming
skills, and knowledge of mathematics and statistics to extract meaningful insights
from data.
Data science practitioners apply machine learning algorithms to numbers, text,
images, video, audio, and more to produce artificial intelligence (AI) systems that
perform tasks which ordinarily require human intelligence.
• Data science is the in-depth study of massive amounts of data. It involves
extracting meaningful insights from structured, semi-structured, and
unstructured data using the scientific method, a range of technologies,
and algorithms.
• It is a multidisciplinary field that uses tools and techniques to manipulate
data in order to find something new and meaningful.
Data Science Components