KDS601 Big Data and Analytics
KDS601 Big Data and Analytics
BTECH
(SEM VI) THEORY EXAMINATION 2023-24
BIG DATA AND ANALYTICS
TIME: 3 HRS M.MARKS: 100
Note: 1. Attempt all Sections. If require any missing data; then choose suitably.
SECTION A
1. Attempt all questions in brief.
a. List the differences between structured, semi-structured, and unstructured data. 2
b. Write short note on Drivers of Big Data. 2
c. List the core functionalities of Apache Hadoop. 2
d. Explain how the data format of Hadoop is important. 2
e. List the steps involved in using Sqoop to import data from RDBMS into Hadoop. 2
f. Define how file system Works. 2
g. List the key differences between the fair scheduler and the capacity scheduler in Hadoop. 2
h. List the Data Type used in Mango DB. 2
i. Define HiveQL and its key features. 2
j. List out the Data Processing Operators used in Pig. 2
SECTION B
2. Attempt any three of the following:
2
13
a. Discuss why Big Data is crucial for modern businesses and industries. 10
90
b. Discuss Hadoop Distributed File System (HDFS), and how does it work. 10
2.
_2
Discuss the benefits and challenges of using HDFS for big data storage and processing.
24
c. 10
P1
Differentiate NoSQL databases from traditional RDBMS. Also list the key characteristics 10
5.
d.
and use cases for NoSQL databases.
4E
.5
e. Discuss the role of Zookeeper in monitoring a Hadoop cluster. 10
17
P2
SECTION C
3. Attempt any one part of the following:
|1
Q
Distinguishes Big Data analytics from traditional data analytics. Also list the techniques
a. 10
AM
Discuss the roles of additional components like HBase, Pig, and Hive in the Hadoop 10
a.
22
ecosystem.
b. Discuss the Working of Map Reduce and its Characteristics 10
9:
a. Explain how HDFS implement data replication and list its advantages. 10
b. Discuss the Security issues in Hadoop and why it is important for Data analysis. 10
20
b. Explain the Various Ecosystem Components used in Hadoop with proper example. 10
5-
a. Explain the IBM's overall strategy for Big Data, and its key components. 10
b. Discuss Big SQL, and how it extends SQL capabilities to Big Data environments. 10
1|Page
QP24EP1_290 | 15-Jun-2024 9:22:01 AM | 117.55.242.132