Module 1
Module 1
Example
DATA
Broadly, there are two kinds of data – structured &
unstructured data.
Structured data is clearly defined data types with patterns
that make them easily searchable. In simpler terms, when a
layman looks at structured data, she can easily make out
that this is data.
Structured data typically contains data types that are
combined in a way to make them easy to search for in their
data set. This means that structured data is easily
detectable via search because it is highly organized
information. The image on your screen is an example of
structured data. It's neatly segregated into fixed fields. It’s
the data that most of us are used to and like working with to
analyze largely quantitative problems.
Semi-Structured
Semi-structured data is a form of structured data that does
not conform to the tabular structure of data but does
contain tags or other markers to differentiate elements and
enforce hierarchies within the data.
Email messages are a good example of semi-structured
data. While the actual content of the email is unstructured,
this format does contain structured data such as the name
and email address of the sender and recipient, and the time
sent among others.
Another example is a digital photograph. The image itself is
unstructured, but if the photo was taken on a smartphone,
for example, it would be date and time stamped, geo-
tagged, and would have a device ID. Once stored, the photo
could also be given tags that would provide a structure,
such as ‘dog’ or ‘pet.’
Making Decisions
As a business analyst, your primary job would be to make
decisions based on your knowledge, understanding, and
insights generated through data analytics.
Hence, it becomes really important for you to understand
data analytics as an understanding of that domain can act
as a competitive edge in your decision-making.
Data Analytics
Data analytics is the extraction of insights from data using
statistical techniques and technologies.
Descriptive analytics
Descriptive analytics answers the question: What has
happened and what is happening right now?
Descriptive analytics uses historical and current data from
multiple sources to describe the present state by identifying
trends and patterns. Descriptive analytics uses basic
arithmetic like average, sum, and percentage changes
among others. Many times in data analysis, descriptive
analytics is the first step that helps organizations
understand the facts of what has already happened.
Example:
Diagnostics Analysis
Diagnostic analytics answers the question: Why did this
happen?
Diagnostic analytics uses data, which is often built on
descriptive analytics to discover the reasons or causes for
past performance. It is oftentimes referred to as root cause
analysis.
Example:
Let’s say that an unusually high number of people come to
the emergency room of a hospital complaining of similar
symptoms. After testing them, it is found that about 70% of
them have Disease A. Descriptive analytics helps us get to
the final number of people having Disease A. However, it is
Diagnostic analysis that can help to make correlations
between the symptoms and determine what symptoms
point to Disease A.
Predictive Analysis
Predictive analytics answers the question: What is likely to
happen in the future?
Predictive analytics applies techniques such as statistical
modelling and forecasting to the output of descriptive and
diagnostic analytics to make predictions about future
outcomes. Predictive analytics is often considered a type of
“advanced analytics,” and frequently depends on
technology.
Example:
Prescriptive Analysis
Prescriptive analytics answers the question: What do we
need to do?
Prescriptive analytics is a type of advanced analytics that
involves the application of testing and other techniques to
recommend specific solutions that will deliver desired
outcomes. In business, prescriptive analytics uses machine
learning, business rules, and algorithms.
Example: