Slide 1 Big Data Introduction
S3Lab
Smart Software System Laboratory
“Without big data, you are blind and deaf
and in the middle of a freeway.”
– Geoffrey Moore
Big Data
Evolution of Technology
IoT
Social media
Other factors
What is Big Data?
● Big data is the term for a collection of data sets so large and complex
that it becomes difficult to process using on-hand database
management tools or traditional data processing applications.
● Challenges: Capture, Curation, Storage, Search, Sharing, Transfer,
Analysis, and Visualization.
Big Data: 3V’s
Volume (scale)
Variety (Complexity)
● Semi-Structured, NoSQL
Velocity (Speed)
Big Data: 5V’s
Value
Veracity
Big Data: 4V’s
Big Data: 5V’s
Big Data: NV’s
● The five V’s of Big Data are shown above, but as data keeps evolving, so will
the V’s. Five more V’s have developed gradually over time:
○ Validity: correctness of data
○ Variability: dynamic behaviour
○ Volatility: tendency to change over time
○ Vulnerability: exposure to breaches or attacks
○ Visualization: presenting data in a meaningful, usable form
Big Data: Applications
Weather forecast
Media and entertainment
Health care
Logistics
Travel and tourism
Government and law enforcement
Big Data: Scale
Big Data: Evolution
● The model of generating and consuming data has changed
○ Old model: a few companies generate data; all others consume it
○ New model: all of us generate data, and all of us consume it
● Big data is more real-time in nature than traditional data warehouse (DW)
applications
● Traditional DW architectures (e.g. Exadata, Teradata) are not well suited
for big data apps
● Shared-nothing, massively parallel processing (MPP), scale-out
architectures are well suited for big data apps
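The shared-nothing, scale-out idea in the bullets above can be illustrated with a minimal Python sketch. It is not tied to any particular product: the "nodes" are simulated with threads on one machine, and the names `partition`, `local_aggregate`, `scale_out_sum`, and the sample `records` are hypothetical, chosen only for illustration. Each record is hash-partitioned to exactly one node, each node aggregates only its own shard, and the partial results are merged at the end.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
import zlib


def partition(records, num_nodes):
    """Hash-partition (key, value) records so each simulated node owns a disjoint shard."""
    shards = [[] for _ in range(num_nodes)]
    for key, value in records:
        # crc32 gives a stable hash across runs (unlike the built-in hash() for str).
        node = zlib.crc32(key.encode("utf-8")) % num_nodes
        shards[node].append((key, value))
    return shards


def local_aggregate(shard):
    """Work done by one node: it reads only its own shard and shares no state."""
    totals = Counter()
    for key, value in shard:
        totals[key] += value
    return totals


def scale_out_sum(records, num_nodes):
    """Sum values per key by partitioning, aggregating per node, then merging."""
    shards = partition(records, num_nodes)
    # Threads stand in for separate machines; adding nodes adds parallel workers.
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partials = list(pool.map(local_aggregate, shards))
    merged = Counter()
    for partial in partials:
        merged.update(partial)  # every key lives on exactly one node, so partials are disjoint
    return dict(merged)


# Hypothetical sample data: (sensor id, reading count)
records = [("sensor-a", 3), ("sensor-b", 5), ("sensor-a", 4), ("sensor-c", 1)]
result = scale_out_sum(records, num_nodes=3)
print(result)
```

Because every key is owned by a single node, the per-key totals do not depend on how many nodes are used; only the amount of work per node changes. That property is what lets such a design scale out by adding machines rather than by buying a bigger one.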
Big Data: Landscape
Big Data: Landscape (Open source)
In this course:
Projects
Install Cloudera Quickstart VM
Choose “cloudera-quickstart-vm-5.13.0-0-virtualbox.ovf”
VMware
Q&A