Winter 2024

This document outlines the examination details for the Big Data Analytics subject at Gujarat Technological University, including instructions, question structure, and marks allocation. It covers various topics such as Hadoop, NoSQL systems, HDFS architecture, and real-time data mining applications. The exam is scheduled for December 2, 2024, with a total of 70 marks available.

Uploaded by

jemin.mavani.k

Enrolment No./Seat No.: _______________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VI (NEW) EXAMINATION – WINTER 2024
Subject Code: 3161607 Date: 02-12-2024
Subject Name: Big Data Analytics
Time: 02:30 PM TO 05:00 PM Total Marks: 70
Instructions:

1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.
MARKS
Q.1 (a) Define Big Data. Explain various types of Big Data. 03
(b) Explain the following terms in Hadoop: 04
i) Scaling out ii) Hadoop Streaming
(c) i) Enlist and explain Big Data characteristics. 03
ii) Discuss two scheduling policies of YARN. 04

Q.2 (a) Explain Hadoop in the cloud. 03


(b) Enlist and explain four ways that NoSQL systems handle big data problems. 04
(c) Define HDFS. Draw HDFS architecture and explain its components. 07
OR
(c) Draw the architectural diagram for Physical Organization of Computer Nodes. Explain the MapReduce framework in detail. 07

Q.3 (a) Enlist the differences between NoSQL and RDBMS. 03


(b) Explain Key-Value Stores and Column Family (Bigtable) stores as NoSQL data architecture patterns. 04
(c) Explain the following in detail. 07
i) Decaying Window
ii) Define RTAP. Describe RTAP applications.
OR
Q.3 (a) Enlist the differences between master-slave and peer-to-peer distribution models. 03
(b) Explain Graph Stores and Document Stores as NoSQL data architecture patterns. 04
(c) Explain the stream data model and its architecture in detail with a neat diagram. 07

Q.4 (a) Explain interactive Spark with PySpark in detail. 03


(b) Explain HBase in detail. 04
(c) Explain real time “stock market prediction” using streaming data mining. 07
OR
Q.4 (a) How does implicit filtering differ from explicit filtering? 03
(b) Enlist and explain the key features of Pig. 04
(c) Explain real time “Sentiment Analysis” using streaming data mining. 07

Q.5 (a) Explain the following commands of HDFS: 03
i) copyFromLocal ii) setrep iii) checksum
(b) Explain Spark components in detail. 04
(c) Write a short note on ZooKeeper. 07
OR
Q.5 (a) Explain the reasons why Spark is more suitable for streaming data analytics. 03
(b) Discuss Machine Learning with MLlib in Spark. 04
(c) Explain the working of Hive with proper steps and a diagram. 07

*****************************************
