BDA Assignment 1,2
BDA Assignment 1,2
insigbts
Dpliceians of 8iq Data;
OHea th canet DiseSe predicfi ao, patieot
monitoiDg, oncd persO0cli2ed oedici ne.
QE-cgomee ! CUstoer
CuStorner behcNiIr n elysi
ecornroendaiaSysteros.
0Hodoap far
Ap open -soure Fameoork Used
etorno nel poeSn big cdato.
storrng
"Ex Pacebook USes Hadsop to )Chage
USer daa.
MopRedoce:
-A pruqteooiDq rodel that poe9Seg
tatye cdoasefs by divici ha tashe jo
Syallerr chUnle
Ey Grouqle uses Mopbecuo for idering
web pgyes
3)NGS9L Daabages;
Dtebases clesiqned for hobdiing
(ate-scal e o0sttucued o Semi
stiuctued deta.
"Ex, MongoD6, Casscund ra.
Apache Spare
"A fagt big data procegsinc too used
for real- tioe
chalytics.C
Ex. S-fteaning aralyHes for
hmarkct predic+ians.
data sCience.
Dact Science is a elol that Combines
S}atiSti csS, progaming cund olgrsoun
noledaeoextta ct meaningot
Tnsights Proo Srutured nd UOstruC4Ured
doa. Uses techoues nachiDe(Pke
learning, clatq cunaysis, and visu aliealion
fd. solve real crolDYobjern.
Ex. Precic fing CUSfosyen bebawir in ecoomere
USing ecornrsendtian Sysers.
9 Data Andys)s:
ApPlyioq 9achine lecunjro, statsticat hefbods,
Card clqatitbo3to ndpof ferns
"Rx Nesflix analyzeg Viewing habits to
oy wha is opping analytic Pa) i bg dofa sar
to c big dGe staclc in
aping cntlyfics Ploco td
pder Stonding the sBages o analyiCS procees
hd bdo thou can be iroplernnfed usrna
VnicUS f big dota tack.
O Datq Inqegtion Layer:
(ollets taw cke fa Froro differet
SOUrCeg ike loT clevi cos, S0cil med',
databoseg ard log.
*Ex. Apa che Ploroe CHn Kafka fo real
+irne clata sthearminq
o Dato Storage Layer :
Stores rnagsivE doutasetS io distibUted
Bxcursple : Hadoop HDFS, No S) L oetabeses.
9 Dota Processinq Lauyer:
"Proesses cuhd analys dala usn g
batch or keal-time pYOLSSing
En Ppache Spatk. tools
Datg Pralytica Layet:
Ppphes achire
Jearoinq, Al,
statishcal technfyoeg to oxtractadfnsigh ts
behdwigr predicop: Pos custome
9 Dasa V'sudiztian Layer:
Convetts iosehls s0ta reprts, dcshboarels,
qrapbs for decisjarn -makinq.
"En. Pocuer B far î0teractive
vfsUlizaionS.
PVeSing
ettlected
but
6atches at MterVols.
Ry foatees :
Lotgc Scale Date
O PoceSsinq:
Handes massiVe slcet ses eCercienHy
OTire- Deluyet Pracssing:
af Eined in4ervcls.
3Cost EPfecive: Suifabe For non-Urgent
data andlysis
OHgh Acayrecy: Since
cotlected before processing, the reguffs
accurate.
Exaple
-coserce: Analyzing cauly cales
frends nd
and geneteing business repas
" Banking: ocegsinq erd-of-doy ttenschy
to deect teudtet ns.
Social tedia: Agqregaing User
activity daßa st erqdgemot insiqhts
E)Eplcuo
D
Ftlousing bi
olotetKOusple.
0eb-(uebtes (Ghe Cgoqleend rocaeat)
qenercero)a9Sive cYDUDtc
arctes, cicts, and
inera chons
qUeries daily to iopove search
egolts
advertising gitotcgies
2) Pinane- Banks CUc Stoc k (sctkotS procOSS
buge inaneil 4tansocions every
Se cond Par er Poud letectiofy
bants Chalyze ttapctian
patterns in real tie to ioeotify
SUspiciUS aetivi fy.
3) ]ntetoet of Things((9T)Smt deiCOg
Ike 1toess bands
therrs 0Slaps COD ioU OUShp cutlecf arc
tian so) datg.Por ex, a sart
horme Sysiem halyaes Usagc
paflen9 ta aptipiz0
Consumption
energy
)nvisonoeot- Senstrs Cnd atellHeg
callec- choade abd poll ufian
ex ASA
USes Saehe cloufa
global Jeppereture chahges
and predjct weathar pattens.