21cs71BDA Question Bank
Module 1: Introduction to Big Data Analytics.
1. Define Big Data. Explain the evolution of Big Data and its characteristics.
2. What is grid computing? List and explain the features and drawbacks of grid computing.
3. Discuss the functions of each of the five layers in Big Data architecture design.
4. Illustrate the various phases involved in Big Data Analytics with a neat diagram.
5. Discuss the evolution of Big Data.
6. Explain the characteristics of Big Data.
7. With a neat block diagram, explain data architecture design.
8. Write a note on analytical scalability to Big Data and Massive Parallel Processing
Platforms.
9. Highlight Big Data Analytics with one case study.
10. Define Big Data. Explain the classification of Big Data.
11. Define scalability and explain its types with examples.
12. Explain the functions of each layer in Big Data architecture design with a diagram.
13. Define data preprocessing. Explain in brief the need for preprocessing.
14. Explain the following terms: i. Scalability & Parallel Processing ii. Grid & Cluster
Computing.
15. What is Cloud Computing? Explain the different cloud services.
16. Explain any two different Big Data applications.
17. How does the Berkeley Data Analytics Stack help in analytics tasks?
Module 2: Introduction to Hadoop (T1), Hadoop Distributed File System Basics (T2), Essential
Hadoop Tools (T2).
1. Illustrate the Hadoop core components with a neat diagram.
2. Discuss the Hadoop system and ecosystem components in four layers.
3. Illustrate the YARN-based execution model and its functions with a neat diagram.
4. Discuss the Apache Sqoop import and export methods with a neat diagram.
5. What are the core components of Hadoop? Briefly explain each of its components.
6. Explain the Hadoop Distributed File System.
7. Define the MapReduce framework and explain its functions.
8. Write down the steps for a request to MapReduce and the types of processes in
MapReduce.
9. Write a short note on the Flume Hadoop tool.
10. What is HDFS? Highlight the important design features of HDFS.
11. Bring out the concept of HDFS block replication with an example.
12. Explain the Apache Sqoop import and export methods with a neat diagram.
13. Demonstrate any six HBase commands with output.
14. Write a short note on Apache Hive.
15. Explain Apache Oozie with a neat diagram.
16. Explain the YARN application framework.