0% found this document useful (0 votes)
12 views39 pages

N 3. Classification of Digital Data

Uploaded by

newt67710
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views39 pages

N 3. Classification of Digital Data

Uploaded by

newt67710
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 39

Road Map

• Evolution of Technology
• Types of Data
• Big Data- Definition Aspect
• Big data Vs Not Big data
• Challenges of big data
Evolution of Technology

Reference : https://www.youtube.com/watch?v=zez2Tv-bcXY
Internet of Things

Reference : https://www.edureka.co/blog/big-data-tutorial
Social Media Usage
Classification of Digital Data
Digital Data
Structured data
• When do we say that the data is structured??
• When data conforms to a predefines schema/structure.
• Sources of structured data
Working with structured data
• Insert/update/delete
• Indexing
• Transaction processing
• Security
• Scalability
Semi-structured data
• It does not conform to the data models that one typically associates
with relational databases or any other form of data tables
• It uses tags to segregate semantic elements
Sources of semi-structured data
Unstructured data
• Does not conform to any predefined data model
• The structure can be unpredictable.
Sources of unstructured data
How to deal with unstructured data?
Inclass#exercise
Solution
Let’s Discuss
• Why email in unstructured category?
• Where should we put CCTV footage?
You are at city shopping mall. You see few people are browsing
the items. Some of them are looking for discounts. Some of them
are filling feedback form. Few people are at billing counter. You
may consider other things and events happening in this
scenario. Think for while on the different types of data
generated. Mention each of them with proper logic
You are at university library. You see few students browsing through the
library catalog on kiosk. You see the working of librarians and other
staff to issue/return books, magazines, and journals. Few students are
using the e-library service, too. Which type of data is generated in this
scenario? Support your answer by considering big data
Big Data – Definitional
Aspects
Characteristics of Big data
Gartner’s 3V casted by Douglas Laney in 2001
Volume , Velocity and Variety

IBM’s 4V casted by Zikopoulos


Volume , Velocity , Variety and Veracity

Yuri Demchenko’s 5V
Volume , Velocity , Variety , Veracity and Value

Microsoft’s 6V
Volume , Velocity , Variety , Veracity , Value and Visibility
Volume
Velocity

Taken from : Hewlett-Packard Development Company “truths and myths about big data”,2013
Veracity
Value
What is big data about?

Answers are often “too big to ….”


• Load into memory……..…Store on a hard drive…….…Fit in a standard database
• “Fast changing”………..Not just relational
• “Digital breadcrumbs” left behind (communication transactions..)—Hard little data
particles left behind as people go about their daily lives
• Open web data/social media data (facebook, twitter, blogs, online news, videos….)
• Remote sensing (satellite, meters…)
What is big data about - and not about?

“Big Data is not about the data” (Gary King)


Institute for social science ,Harvard university

• It’s about the analytics—the insights gleaned from the data; and the
necessary capacities to do so—human, technological
• One step further: it’s about knowledge: getting near to the ‘true’ meaning
of a facebook status update;
• It’s about sharing and diffusion – visualizations
Big data Definition
Challenges with Big data
The problem is storing the colossal amount of data.
A big data analytics cycle can be described by the following stage −
Business Problem
Definition

1. Business Problem Definition Analysis of Results Data Identification

2. Data Identification

3. Data Acquisition & Filtering


Data Acquisition
Data Visualization
& Filtering
4. Data Extraction

5. Exploratory Data Analysis

6. Data Preparation for Modeling and Assessment Data Preparation


for Modeling and Data Extraction
Assessment
7. Data Visualization Exploratory Data
Analysis

8. Analysis of Results
Classification of Data Analytics
Big data Analytics-Case studies
• Healthcare
Traditional Vs Big data Approach
OLTP: Online Transaction Processing
• DBMSs
OLAP: Online Analytical Processing
• Data Warehousing
RTAP: Real-Time Analytics Processing
• Big Data Architecture & Technology

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy