BIG DATA ANALYTICS Sem r20
(AUTONOMOUS)
IV B. Tech I Semester Regular Examinations, December – 2023
(Regulations: VCE-R20)
BIG DATA ANALYTICS
(Common to Computer Science and Engineering & Computer Science and Engineering (AI&ML))
Date: 13 December, 2023 FN Time: 3 hours Max Marks: 75
Answer All Questions
PART – A
1. a) Write about the three characteristics of data. L1 CO1 2M
b) Differentiate between traditional relational databases (RDBMS) and Hadoop. L2 CO2 2M
c) Define what a Mapper and Reducer are in the context of MapReduce programming. L1 CO3 2M
d) What are the key features of Cassandra that make it suitable for distributed data storage and management? L1 CO4 2M
e) Name the three kinds of metastore present in Hive. L1 CO5 2M
f) Discuss the challenges with Big Data. L2 CO1 3M
g) Is Hadoop a Database or not? Justify. L3 CO2 3M
h) Discuss the partitioner in MapReduce. L2 CO3 3M
i) Compare and contrast MongoDB with RDBMS. L2 CO4 3M
j) What are the key components of Hive's architecture, and how do they interact to process data? L2 CO5 3M
PART – B
2. a) Discuss in detail the sources of big data. L2 CO1 5M
b) Explain in detail about the categories of big data. L1 CO1 5M
(OR)
c) Describe the storage dilemma in terms of big data analytics. L2 CO1 5M
d) Explain briefly about building the big data team. L2 CO1 5M
3. a) Explore the impact of NoSQL databases on modern web applications and their ability to handle large-scale, real-time data and list the advantages. L2 CO2 5M
b) Discuss the key aspects and high-level architecture of Hadoop. L2 CO2 5M
(OR)
c) Write about how to work with HDFS commands. L2 CO2 5M
d) Write short notes on HDFS daemons. L2 CO2 5M
6. a) Describe the Hive Query Language (HQL) and its categories, including DDL and DML. Provide a brief example of any one DDL and one DML query. L2 CO5 5M
b) Describe the use of Hive File Formats. Compare and contrast common Hive file formats, highlighting their advantages and suitable use cases. L2 CO5 5M
(OR)
c) Develop a WordCount program using Pig Latin statements in the Hadoop Pig framework and explain with a suitable example. L3 CO5 5M
d) Explore the anatomy of Pig and the respective roles of its components in the Hadoop ecosystem. Explain how they complement each other in processing and analyzing large datasets. L3 CO5 5M