Big Data Analytics Unit Test-I Answers Bank
Big Data Analytics Unit Test-I Answers Bank
Q3) Define Big data and explain different challenges of big data?
Big Data is a collection of data that is huge in volume, yet growing exponentially with time. It is
a data with so large size and complexity that none of traditional data management tools can store
it or process it efficiently. Big data is also a data but with huge size.
OR
Big data is a term for data sets that are so large or complex that traditional data processing
application softwares are inadequate to deal with them.
This was the era of the Enterprise Data Warehouse; used to capture information, and of Business
Intelligence Software; used to present and report it.
Statements:
Statements:
Statements:
• Volume
• Variety
• Velocity
• Variability
(i) Volume – The name Big Data itself is related to a size which is enormous. Size of data plays a
very crucial role in determining value out of data. Also, whether a particular data can actually be
considered as a Big Data or not, is dependent upon the volume of data. Hence, ‘Volume’ is one
characteristic which needs to be considered while dealing with Big Data solutions.
Variety refers to heterogeneous sources and the nature of data, both structured and unstructured.
During earlier days, spreadsheets and databases were the only sources of data considered by
most of the applications. Nowadays, data in the form of emails, photos, videos, monitoring
devices, PDFs, audio, etc. are also being considered in the analysis applications. This variety of
unstructured data poses certain issues for storage, mining and analyzing data.
(iii) Velocity – The term ‘velocity’ refers to the speed of generation of data. How fast the data is
generated and processed to meet the demands, determines real potential in the data.
Big Data Velocity deals with the speed at which data flows in from sources like business
processes, application logs, networks, and social media sites, sensors, Mobile devices, etc. The
flow of data is massive and continuous.
(iv) Variability – This refers to the inconsistency which can be shown by the data at times, thus
hampering the process of being able to handle and manage the data effectively.
NoSQL RDBMS
Q14) How is the traditional BI environment different from the big data environment?
Q15) Share your experience as a customer on an e-commerce site. Comment on the big data
that gets created on a typical e-commerce site.