BDA Question Bank AY 2023-24
BDA Question Bank AY 2023-24
Question Bank
Unit-1 Getting an Overview of Big Data
1. What is Big Data? List and explain the common types of data and their sources.
2. List and discuss the four element of Big Data.
3. What is Big Data Analytics? Describe the three types of analytics.
4. Describe the advantages of Big Data Analytics.
5. Describe the challenges faced during the evolution of Big Data.
6. Write note on the following
a. Big Data Companies
b. Job Titles in Big Data
c. Skills required for Big Data Professionals
7. Write the complete syntax of SELECT command in Hive and write the Hive queries for
the following:
i. Retrieve the all the columns and rows from student table.
ii. Retrieve the sales records that have an amount greater than 15000 from the US
region
iii. Calculate the average marks obtained by students from all semester. Result should
display semester and average marks.
iv. Display the only ten records of students
v. For table Sales (product, category, salesvalue), retrieve the categories of product
with total salesvalue greater than 300.
8. What is the use of Pig? Explain the benefits of Pig.
9. Discuss the two modes used for running the Pig scripts.
10. What are the main reasons for developing Pig Latin?
11. Describe the Pig Latin Application flow with the different types statements used in it.
12. Describe the use of the following operators in the Pig Latin:
i. FOREACH
ii. ASSERT
iii. FILTER
13. Describe the use of the following operators in the Pig Latin:
i. GROUP
ii. ORDER BY
iii. DISTINCT
14. Describe the use of the following operators in the Pig Latin:
i. JOIN
ii. SAMPLE
iii. SPLIT
Unit 5: Understanding Analytics, Analytical Approaches and Tools to Analyze
Data
1. Compare the reporting with analysis.
2. Describe the phases of the Analytic Process
3. Describe the basic analytics in detail.
4. Describe the advanced analytics in detail.
5. Describe the ensemble method of analytical approaches.
6. What is Text Data Analysis? Give some examples of data sources and different data
structure types used in case of text data analysis.
7. List the analytical tools. Explain the features and limitations of R.
8. Describe the analytical tool IBM SPSS with its features.
9. Describe the Statistical Analysis System (SAS) as information delivery system with its
features.
10. Compare the different analytical tools such as R, IBM SPSS and SAS.
Unit 6: Exploring R
1. What is R? Explain the programming features of R.
2. Demonstrate the use of functions that allow users to handle the data in workspace :
a. ls () b) rm () c) save () d) load ()
3. Demonstrate the use of following commands used to import and export large amount of
data in R :
a) read.csv () b) read.table () c) write.csv () d) write.table ()
4. Demonstrate the different ways of combining data by using the merge () function.
5. Demonstrate the use of sort () and order () functions.
6. Demonstrate the use of melt () and dcast () functions.