0% found this document useful (0 votes)
36 views1 page

Gujarat Technological University

This document contains exam questions for a Big Data Analytics course. It asks students to answer questions about big data concepts like the four V's of big data, HDFS, RDDs, Hadoop ecosystem components, YARN, Spark, HBase, Hive, Pig and MapReduce. Students are asked to discuss architectures, transformations, failures and more in detail for topics relating to big data frameworks. The exam is out of 70 total marks and contains multiple choice and written response questions.

Uploaded by

Jigar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views1 page

Gujarat Technological University

This document contains exam questions for a Big Data Analytics course. It asks students to answer questions about big data concepts like the four V's of big data, HDFS, RDDs, Hadoop ecosystem components, YARN, Spark, HBase, Hive, Pig and MapReduce. Students are asked to discuss architectures, transformations, failures and more in detail for topics relating to big data frameworks. The exam is out of 70 total marks and contains multiple choice and written response questions.

Uploaded by

Jigar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Seat No.: ________ Enrolment No.

___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) - EXAMINATION – SUMMER 2017
Subject Code: 2171607 Date: 29/04/2017
Subject Name: Big Data Analytics(Departmental Elective - II)
Time: 02.30 PM to 05.00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.

Q.1 (a) What is Big data? Discuss it in terms of four dimensions, volume, velocity, 07
variety and veracity.
(b) Define HDFS. Discuss the HDFS Architecture and HDFS Commands in brief. 07
Q.2 (a) What is RDD? Explain about transformations and actions in the context of 07
RDDs. State and explain RDD operations in brief.
(b) Discuss big data in healthcare, transportation and medicine. 07
OR
(b) What is Hbase? Discuss in detail the data model and Implementation Aspect. 07
Q.3 (a) Write Short note on Hadoop Ecosystem also explain various elements of 07
hadoop.
(b) Discuss Hadoop YARN in detail with failures in classic Map-reduce 07
OR
Q.3 (a) What is Spark? State the advantages of using Apache Spark over Hadoop 07
MapReduce for big data processing with example.
(b) Elaborate on HivQL data manipulation queries in detail 07

Q.4 (a) Discuss the concept of regions in HBase and Storing Big data with HBase. 07
(b) How does HDFS ensure data Integrity in a Hadoop Cluster? 07
OR
Q.4 (a) Explain how HBase uses Zookeeper to Build Applications with Zookeeper. 07
(b) What are the Components of Spark? Also state the features of Spark. 07
Q.5 (a) Explain Pig data Model in detail and Discuss how it will help for effective data 07
flow.
(b) Explain Map-reduce framework in detail. Draw the architectural diagram for 07
Physical Organization of Compute Nodes.
OR
Q.5 (a) Draw and discuss the architecture of Hive in detail. 07
(b) Write a brief short note on: Spark Unified Stack 07

*************

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy