BDA Assignment 1,2

The document discusses the concept of Big Data, its applications, and challenges in data storage, processing, quality, security, and privacy. It highlights the importance of Big Data analytics in improving decision-making, customer experience, and fraud detection across various industries. Additionally, it outlines the components of a Big Data stack, including data ingestion, storage, processing, and visualization layers.

Uploaded by

Kailas Rahane
Copyright
© All Rights Reserved

Assignment 1

1) What is Big Data and its applications?


Big Data refers to large and complex datasets that cannot be processed using traditional data management tools. It is characterized by Volume, Velocity, and Variety (the 3 Vs) and is used for analysis to extract valuable insights.

Applications of Big Data:
1. Healthcare: Disease prediction, patient monitoring, and personalized medicine.
2. E-commerce: Customer behavior analysis, recommendation systems.

2) Explain various Big Data challenges with examples.


1. Data Storage & Management:
   Challenge: Storing and managing massive amounts of structured and unstructured data.
   Example: Social media platforms like Facebook generate petabytes of data daily, requiring advanced storage solutions like Hadoop and cloud storage.

2. Data Processing in Real Time:
   Challenge: Analyzing data in real time for quick decision-making.
   Ex: Stock market trading platforms need to process transactions within milliseconds to avoid financial losses.

3. Data Quality & Accuracy:
   Challenge: Ensuring data is clean, consistent, and free from errors.
   Ex: In healthcare, incorrect records due to duplicate or missing data can lead to misdiagnosis.

4. Security & Privacy:
   Challenge: Protecting sensitive data from breaches and unauthorized access.
   Ex: Banking systems must secure customer data to prevent cyberattacks and fraud.
3) What is big data? Explain with its classification.

Big Data refers to extremely large and complex datasets that traditional data processing tools cannot handle efficiently. It is characterized by the 3 Vs:
- Volume: Huge amounts of data.
- Velocity: High-speed data generation.
- Variety: Different types of data (text, images, videos, etc.).
Classification of Big Data:
1. Structured Data:
   Organized in rows and columns (like databases).
   Ex. SQL databases, banking transaction records.
2. Unstructured Data:
   No fixed format, difficult to organize.
   Ex. Social media posts, emails, images, videos.
3. Semi-Structured Data:
   Partially organized but lacks a strict format.
   Ex. JSON, XML, sensor data.
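
The three classes above can be illustrated with a minimal Python sketch. The sample records, field names, and variable names here are invented for illustration and are not from the assignment; the point is only how much schema each kind of data carries.

```python
import csv
import io
import json

# Structured: fixed rows and columns, every record has the same fields.
# (Hypothetical banking-style sample data.)
structured_csv = "txn_id,account,amount\n1,ACC-1,250.00\n2,ACC-2,75.50\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))

# Semi-structured: self-describing keys, but records may differ in shape.
semi_structured = [
    json.loads('{"sensor": "s1", "temp": 21.5}'),
    json.loads('{"sensor": "s2", "temp": 19.0, "humidity": 40}'),
]

# Unstructured: free text with no schema; any structure must be inferred.
unstructured = "Loved the new phone, battery lasts all day!"
words = unstructured.lower().split()

# Structured data can be queried by column name directly.
total_amount = sum(float(r["amount"]) for r in rows)

# Semi-structured records must be inspected to learn which fields exist.
fields_per_record = [sorted(rec) for rec in semi_structured]
```

Structured rows can be aggregated by column right away, while the JSON records have to be inspected per record and the free text needs further processing before any analysis.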
4) Explain various terminology used in big data.

1. Hadoop:
   An open-source framework used for storing and processing big data.
   Ex. Facebook uses Hadoop to manage user data.
2. MapReduce:
   A programming model that processes large datasets by dividing tasks into smaller chunks.
   Ex. Google uses MapReduce for indexing web pages.
3. NoSQL Databases:
   Databases designed for handling large-scale unstructured or semi-structured data.
   Ex. MongoDB, Cassandra.
4. Apache Spark:
   A fast big data processing tool used for real-time analytics.
   Ex. Streaming analytics for stock market predictions.
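
The MapReduce idea of dividing a job into smaller chunks can be sketched in plain Python as a word count. This is a single-process illustration under the assumption that each chunk would be handled by a separate node in a real cluster; the function names are hypothetical and this is not the Hadoop API.

```python
from collections import defaultdict

def map_phase(chunk):
    """Emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]

def shuffle_phase(pairs):
    """Group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Combine the values of each key into a single count."""
    return {key: sum(values) for key, values in grouped.items()}

def word_count(chunks):
    pairs = []
    # In a real cluster each chunk would be mapped on a different node;
    # here the chunks are simply processed one after another.
    for chunk in chunks:
        pairs.extend(map_phase(chunk))
    return reduce_phase(shuffle_phase(pairs))

counts = word_count(["big data big", "data tools"])
```

Because the map step looks at one chunk at a time and the reduce step looks at one key at a time, both steps can be spread across many machines.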
5) Data science.

Data Science is a field that combines statistics, programming, and domain knowledge to extract meaningful insights from structured and unstructured data. It uses techniques like machine learning, data analysis, and visualization to solve real-world problems.
Ex. Predicting customer behavior in e-commerce using recommendation systems.

6) Why is big data analytics important?


1. Better Decision-Making:
   Analyzing large datasets helps businesses make data-driven decisions.
   Ex. Netflix analyzes user preferences to recommend personalized shows.
2. Improved Customer Experience:
   Understanding customer behavior helps companies enhance services.
   Ex. Big Data is used to suggest relevant products.
3. Fraud Detection & Security:
   Detecting unusual patterns helps prevent cyber fraud.
   Example: Banks use Big Data to identify suspicious transactions.
4. Real-Time Processing & Efficiency:
   Helps industries optimize operations and reduce costs.
   Ex. Healthcare uses real-time data for patient monitoring.
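
As a rough illustration of detecting unusual patterns in transactions, the sketch below flags amounts that are far above an account's typical spending. The rule (more than five times the median), the function name, and the sample amounts are assumptions made for this example; real fraud detection uses far richer signals.

```python
from statistics import median

def flag_suspicious(amounts, factor=5.0):
    """Return the indexes of amounts greater than `factor` times the median."""
    typical = median(amounts)
    return [i for i, amount in enumerate(amounts) if amount > factor * typical]

# Hypothetical transaction history for one account.
history = [40.0, 55.0, 38.0, 2500.0, 47.0]
suspicious = flag_suspicious(history)
```

The median is used rather than the mean so that one very large transaction does not raise the baseline it is compared against.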
Assignment 2

1) Explain analytical flow for big data.
1. Data Collection:
   Gathering data from various sources: social media, sensors, databases, and logs.
   Ex. E-commerce sites collect customer purchase history.
2. Data Storage:
   Storing large amounts of data using Hadoop, NoSQL, or cloud storage.
   Ex. Google stores user search data in distributed databases.
3. Data Processing:
   Cleaning, filtering, and transforming raw data into useful formats.
   Ex. Removing duplicate entries from customer records.

4. Data Analysis:
   Applying machine learning, statistical methods, and algorithms to find patterns.
   Ex. Netflix analyzes viewing habits.

2) What is mapping analytics flow to a big data stack?

Mapping analytics flow to a big data stack involves understanding the stages of the analytics process and how they can be implemented using the various layers of the big data stack.
1. Data Ingestion Layer:
   Collects raw data from different sources like IoT devices, social media, databases, and logs.
   Ex. Apache Flume and Kafka for real-time data streaming.
2. Data Storage Layer:
   Stores massive datasets in distributed systems.
   Example: Hadoop HDFS, NoSQL databases.
3. Data Processing Layer:
   Processes and analyzes data using batch or real-time processing tools.
   Ex. Apache Spark.
4. Data Analytics Layer:
   Applies machine learning, AI, and statistical techniques to extract insights.
   Ex. Customer behavior prediction.
5. Data Visualization Layer:
   Converts insights into reports, dashboards, and graphs for decision-making.
   Ex. Power BI for interactive visualizations.
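
The stages that these layers implement (collect, store, process, analyze) can be sketched as a chain of small Python functions. Everything here is a stand-in chosen for illustration: the function names, the in-memory "storage", and the sample purchase records are assumptions, not part of any real big data tool.

```python
from collections import Counter

def collect():
    """Stand-in for the ingestion layer; returns invented purchase records."""
    return [
        {"customer": "a", "item": "book"},
        {"customer": "a", "item": "book"},  # duplicate entry
        {"customer": "b", "item": "pen"},
        {"customer": "c", "item": None},    # missing value
        {"customer": "c", "item": "book"},
    ]

def store(records):
    """Stand-in for the storage layer; a real system would use HDFS or NoSQL."""
    return list(records)

def process(records):
    """Processing layer: drop records with missing items and exact duplicates."""
    seen = set()
    cleaned = []
    for rec in records:
        key = (rec["customer"], rec["item"])
        if rec["item"] is None or key in seen:
            continue
        seen.add(key)
        cleaned.append(rec)
    return cleaned

def analyze(records):
    """Analytics layer: find the most frequently purchased item."""
    return Counter(rec["item"] for rec in records).most_common(1)[0]

cleaned = process(store(collect()))
top_item = analyze(cleaned)
```

Each function corresponds to one layer, so swapping a stand-in for a real tool changes only that function and leaves the rest of the flow untouched.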

3) Explain big data stack in detail.


Layer             Examples                      Description
Data Ingestion    MySQL, Oracle, FB, Twitter    Various sources
Data Storage      SQL or NoSQL                  Type of database
Data Processing   MapReduce or Real Time        Type of job
Data Access       Hive, Spark SQL, Pig          Query language
User Experience   Analytics, ad-hoc queries     Applications

1. Data Ingestion: Tools like Apache Kafka bring data in and land it in Hadoop Distributed File System (HDFS) or Amazon S3.
2. Storage: Data is stored in distributed storage systems like Hadoop Distributed File System (HDFS).
3. Analysis: Big data processing frameworks analyze large datasets in parallel across a cluster.
4. Security: Various security measures such as access control, data security, and encryption.

4) Explain batch analytics in detail.

Batch analytics means processing data that is collected over time and processed in batches at intervals.
Key features:
1. Large-Scale Data Processing: Handles massive datasets efficiently.
2. Time-Delayed Processing: Data is processed at fixed intervals.
3. Cost-Effective: Suitable for non-urgent data analysis.
4. High Accuracy: Since data is collected before processing, the results are accurate.
Examples:
- E-commerce: Analyzing daily sales trends and generating business reports.
- Banking: Processing end-of-day transactions to detect fraud patterns.
- Social media: Aggregating user activity data for engagement insights.
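
A daily sales report like the e-commerce example can be sketched as a batch job in Python: records are collected first, then processed together in one run. The function name, dates, and amounts are invented for illustration.

```python
from collections import defaultdict

def run_daily_batch(sales):
    """Aggregate a collected batch of (date, amount) records into per-day totals."""
    totals = defaultdict(float)
    for day, amount in sales:
        totals[day] += amount
    # Sort by date so the report reads chronologically.
    return dict(sorted(totals.items()))

# Records accumulated before the batch run (hypothetical sample data).
collected = [
    ("2024-01-01", 120.0),
    ("2024-01-01", 80.0),
    ("2024-01-02", 50.0),
    ("2024-01-02", 25.0),
    ("2024-01-02", 25.0),
]
report = run_daily_batch(collected)
```

Nothing is computed while records arrive; the whole batch is processed when the job runs, which is why batch analytics suits non-urgent reporting.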

5) Explain the following big data sources with examples.
1. Web: Websites (like Google) generate massive amounts of articles, clicks, and interactions. Search queries are analyzed daily to improve search results and advertising strategies.
2. Finance: Banks and stock markets process huge numbers of financial transactions every second. For example, for fraud detection, banks analyze transaction patterns in real time to identify suspicious activity.
3. Internet of Things (IoT): Smart devices like fitness bands and thermostats continuously collect and transmit data. For example, a smart home system analyzes usage patterns to optimize energy consumption.
4. Environment: Sensors and satellites collect climate and pollution data. Ex. NASA uses satellite data to track global temperature changes and predict weather patterns.
