0% found this document useful (0 votes)
46 views9 pages

CCBD Question Bank

The document outlines the course objectives and outcomes for a Cloud Computing and Big Data course, detailing key concepts such as cloud virtualization, Big Data platforms, and Apache Hadoop. It includes a structured question bank divided into units covering various topics like cloud infrastructure, virtualization, and data processing with HDFS and MapReduce. Additionally, it provides a range of questions categorized by marks to assess students' understanding of the material.

Uploaded by

ashwins1306
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views9 pages

CCBD Question Bank

The document outlines the course objectives and outcomes for a Cloud Computing and Big Data course, detailing key concepts such as cloud virtualization, Big Data platforms, and Apache Hadoop. It includes a structured question bank divided into units covering various topics like cloud infrastructure, virtualization, and data processing with HDFS and MapReduce. Additionally, it provides a range of questions categorized by marks to assess students' understanding of the material.

Uploaded by

ashwins1306
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 9

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Cloud Computing and Big Data(U20CST615)

Question Bank

Course Objectives

• To define the fundamental ideas behind Cloud Computing.

• To classify the basic ideas and principles in cloud information system.

• To understand the cloud virtualization concepts and vm ware.

• To understand the Big Data Platform and its Use cases

• To provide an overview of Apache Hadoop, Provide HDFS Concepts and Interfacing with HDFS

Course Outcomes
After completion of the course, the students should be able to:
CO1 – Explain the core concepts of the cloud computing paradigm: how and why this paradigm
shift came about, the characteristics, advantages and challenges brought about by the various
models and services in cloud computing. (K3)
CO2 – Apply fundamental concepts in cloud infrastructures to understand the tradeoffs in
power, efficiency and cost, and then study how to leverage and manage single and multiple data
centers to build and deploy cloud applications that are resilient, elastic and cost-efficient. (K2)
CO3 – Illustrate the fundamental concepts of cloud virtualization. (K4)
CO4 – Identify Big Data and its Business Implications. (K2)
CO5 – List the components of Hadoop and Hadoop Eco-System, Access and Process Data on
Distributed File System.(K3)

UNIT I INTRODUCTION (9Hrs)

Introduction to Cloud Computing – The Evolution of Cloud Computing – Hardware Evolution –


Internet Software Evolution – Server Virtualization – Web Services Deliver from the Cloud –
Communication-as-a-Service –Infrastructure-as-a-Service – Monitoring-as-a-Service – Platform-
as-a-Service – Software-as-a-Service. Federation in the Cloud - Presence in the Cloud – Privacy
and its Relation to Cloud-Based Information Systems – Security in the Cloud – Common
Standards in the Cloud – End-User Access to the Cloud Computing.

UNIT II CLOUD INFRASTRUCTURE (9 Hrs)

Introduction – Advancing towards a Utility Model – Evolving IT infrastructure – Evolving


Software Applications – Continuum of Utilities – Standards and Working Groups – Standards
Bodies and Working Groups – Service Oriented Architecture – Business Process Execution
Language – Interoperability Standards for Data Center Management – Utility Computing
Technology.

UNIT III CLOUD VIRTUALIZATION (9 Hrs)

Virtualization – Hyper Threading – Blade Servers – Automated Provisioning – Policy Based


Automation – Application Management – Evaluating Utility Management Technology – Virtual
Test and Development Environment - Data Center Challenges and Solutions - Automating the
Data Center – Basics of VMWare, Advantages of VMware Virtualization, Using Vmware
Workstation, Creating Virtual Machines – understanding Virtual Machines.

UNIT IV INTRODUCTION TO BIG DATA AND HADOOP (9 Hrs)

Types of Digital Data, Introduction to Big Data, Big Data Analytics, History of Hadoop, Apache
Hadoop, Analyzing Data with Unix Tools, Analyzing Data with Hadoop, Hadoop Streaming,
Hadoop Echo System, IBM Big Data Strategy, Introduction to Info Sphere Big Insights and Big
Sheets.

UNIT V HDFS(HADOOP DISTRIBUTED FILE SYSTEM) & MAP REDUCE (9 Hrs)

The Design of HDFS, HDFS Concepts, Command Line Interface, Hadoop File System Interfaces,
Data Flow, Data Ingest with Flume and Scoop and Hadoop Archives, Hadoop I/O: Compression,
Serialization, Avro and File-Based Data Structures. Anatomy of a Map Reduce Job Run, Failures,
Job Scheduling, Shuffle and Sort, Task Execution, Map Reduce Types and Formats, Map Reduce
Features.

Unit I

2Marks

1.
2. Define Cloud Computing.
3. Define Parallel Computing and Centralized computing.
4. List out the cluster design issues.
5. Describe the applications of high performance and high throughput systems.
6. Tabulate the difference between the high performance computing and high throughput
computing
7. Name the essential characteristics of cloud computing.
8. Give the advantages of cloud computing.
9. Highlight the importance of the term “cloud computing.”
10. Identify any two advantages of distributed computing.
11. Bring out the differences between private cloud and public cloud.
12. Illustrate the evolutionary trend towards distributed and cloud computing.
13. What are the characteristics of cloud architecture that separates it from traditional one?
14. Interpret the cloud resource pooling.
15. Outline elasticity in cloud.
16. Mention what is the difference between elasticity and scalability in cloud computing?
17. List few drawbacks of grid computing.
18. How is On Demand provisioning of resources applied in cloud computing?
19. Assess properties of Cloud Computing.
20. Formulate the technologies on which cloud computing relies.
21. Investigate how can a company benefit from cloud computing.
22. Define public clouds.
23. Write a short note on community cloud.
24. Define IaaS.
25. State the differences between PaaS and SaaS
26. Why do we need a hybrid cloud
27. State the role of cloud auditor in cloud.
28. What are the different layers available in cloud architecture design?
29. What are the various components of NIST Cloud computing reference architecture?
30. Differentiate cloud consumer and provider.

5Marks

1. Identify and explain in detail about evolutionary trend of computer technology.


2. Explain the three paradigms in detail.
3. Define and examine in detail about the multi core CPUs and multithreading technologies.
4. Demonstrate in detail about trends towards distributed systems.
5. Illustrate in detail about parallel and distributed programming models
6. Describe the infrastructure requirements for Cloud computing.
7. What are the issues in cluster design? How can they be resolved.
8. Summarize in detail about the degrees of parallelism.
9. Discuss the application of high performance and high throughput system.
10. Describe in detail the Peer to Peer network families.
11. Express in detail about cloud computing architecture over the Internet?
12. Illustrate the cloud architecture in detail
13. Describe the architecture of a cluster with suitable illustrations.
14. Explain evolution of cloud computing.
15. Explain in detail underlying principles of Parallel and Distributed Computing .
16. Explain the trends towards Cloud Computing
17. Outline the similarities and differences between distributed computing, grid computing and
cloud computing.
18. Outline the architecture of cluster cooperative computers with a diagram
19. Give the importance of cloud computing and elaborate the different types of services offered
by it.
20. Explain in detail about Elasticity in Cloud and On-demand Provisioning.

10Marks

1. Discuss about various dimensions of scalability and performance laws in distributed system.
2. It is said, ‘cloud computing can save money’. What is your view? Can you name some open
source cloud computing platform databases? Explain any one database in detail.
3. Create and justify Cloud architecture application design with neat sketch.
4. Briefly explain each of the cloud computing services. Identify two cloud providers by
company name in each service category.
5. I am starting a new company to analyse videos. I’ll need a lot of storage as videos consume
quite a bit of disk. Additionally, I’ll need ample computational power, possibly running
applications concurrently. I have discovered some very good tools to facilitate development in
Windows but the deployment will be more effiicently handled in the Linux environment. All
the pointers say that I need to move to cloud. I have found that SaaS is the most attractive
service, followed by PaaS and IaaS, in that order. Given the above information, which service
do you recommend? Why?
6. Evaluate and contrast the merits and demerit of Cloud deployment models: public, private,
hybrid, community.
7. Evaluate about the architectural design of compute and storage clouds.
8. Under what circumstances should you prefer to use PaaS over IaaS? Formulate it with an
example

UNIT II

2Marks

1. List the essential principles of SOA architecture


2. Define REST and its working.
3. State the most relevant technologies supporting service computing.
4. Identify the role of Web services in cloud technologies.
5. Write the name of Web services tools.
6. Distinguish between physical and virtual clusters.
7. Define Utility Computing Technology.
8. List the cloud Differences in the perspectives of providers, vendors, and users.
9. Define security governance.
10. Differentiate the Physical and Cyber Security Protection at Cloud/Data Centers
11. Generalize about the IAM
12. 5Marks
1. Generalize the ideas of software environments for distributed systems and clouds.
2. List the cloud deployment models and give a detailed note about them.
3. Discuss in detail about the categories of cloud computing.
4. Describe service and deployment models of a cloud computing
5. Explain in detail about Business Process Execution Language.
6. Discuss about the Layered Cloud Architecture Design
7. Summarize about the NIST Cloud Computing Reference Architecture.
8. Discuss the Infrastructure-as-a-Service, Platform as a service and Software as a service.
9. Write the short notes on cloud infrastructure Continuum of Utilities
10. Discuss the features of software as a Service and explain in detail about SaaS with example
11. Explain the software distribution model in which applications are hosted by a vendor or
service provider and made available to customers over a network, typically the Internet
12. Illustrate the features of Platform as a Service
13. Demonstrate in detail about PaaS with example.

10marks

1. Give the diagram Cloud Computing Reference Architecture.


2. Illustrate in detail about The Conceptual Reference Model of cloud
3. Describe the Interoperability Standards for Data Center Management
4. Analyze the challenges in architectural design of cloud
5. Compare: Public. Private and Hybrid clouds
6. Evaluate in detail about Cloud Storage and Storage-as-a-Service – with advantages of Cloud
Storage
7. Explain with neat diagram about the Cloud Storage Providers and Amazon Simple Storage
Service .
8. Describe in detail about SOA and Web services.

UNIT III

2Marks

1. Can one access the files of one VM from another?


2. Explain hypervisor architecture
3. Define para-virtualization?
4. Identify the major players involved in cloud computing.
5. Demonstrate the need of private cloud.
6. Show the interaction between the Actors in the cloud computing.
7. Demonstrate the difference between software as a service and software plus service.
8. Why do we need cloud storage?
9. Analyze the storage as a service.
10. What is VMware Fault Tolerance?
11. What is the usage of VMware tools?
12. Compare service aggregation and service arbitrage.
13. Summarize the benefits and drawbacks of using “Platform as a Service.
14. What is the difference between type 1 and type 2 hypervisior?
15. Write the services in EaaS
16. What are the Advantages of VMware Virtualization
17. What are the benefits of virtualization in the context of cloud computing?
18. Demonstrate the need of virtualization need of multi-core processor.
19. List the five application areas in SaaS applications.

5Marks

1. Explain in details about Virtualization in Cloud Computing andTypes?


2. Write a short note on Hyper Threading
3. What is virtualization? List its benefits and drawbacks.
4. Explain the different approaches used to achieve virtualization with a neat diagram.
5. Differentiate full virtualization, paravirtualization, and hardware assisted virtualization
techniques
6. Explain in details about Automated Provisioning .
7. What is the role of hypervisor in virtualization? Briefly explain the different types of
hypervisors with a neat diagram.
8. Differentiate type 1 and type 2 hypervisors
9. Write short notes on data virtualization and application virtualization
10. Explain in detail about Data Center Virtualization for Cloud Computing
11. Write the short on Virtual Test and Development Environment

10marks

1. Write a short note on Desktop virtualization?


2. Describe the Interactions among VM managers for cloud creation and management; the
manager provides a public API for users to submit and control the VMs
3. Explain implementation levels of virtualization in details.
4. Explain in detail about VMware. How to use VMware workstation.
5. Write the short on
i) Creating VMware
ii) Understanding VMware

UNIT IV

2marks

1. Define MapReduce.
2. What is the role of Reduce function?
3. List out the Hadoop core fundamental layers
4. Compare Reporting and Analysis with its process.
5. Explain the following.
a.Advanced analytics
b. Operationalized analytics
c. Monetized analytics \
6. How to develop an analytical team and what is the skill required for an analyst?
7. Distinguish statistical significance and business importance.
8. What are the roles of analytical team and IT team with a detailed note on text analysis?
9. Explain in detail the commonly used analytical approaches?

10. Discuss in detail the history of analytical tools.

11.How analytical tools have evolved from graphical user interfaces to point solutions to data

visualization tools?

9. Give a detailed note on features and limitations of R programming and IBM SPSS.

10. Explain in detail the following

a. SAS

b. Compare various analytical tools.

5Marks

1. (a) List the main feature of Map Reduce.


(b) Explain working of the following phases of Map Reduce with one common example

(i) Map Phase

(ii) Shuffle and sort phase

(iii) Reducer Phase

2. Describe the working of Map reduce with a relevant example.

3. Discuss the techniques which is used to optimize the map reduce jobs.

4. Discuss the points to be considered while designing a file system in mapreduce.

5. What is HBASE? Give detailed note on features of HBASE.

6. Write a short note on the Hadoop ecosystem and HDFS architecture.

7. How does HDFS ensure data integrity in a Hadoop cluster?

8. Discuss the following terms

a. Streaming information access.

b. Low latency information access.

c. Rest and thrift

d. Org.apcahe.hadoop.io.package

9. What is Meta data? What information does it provide and explain the role of Name node in a HDFS

clusters?

10. Define Command line interface using HDFS files and give a brief note on Hadoop-specific file

system types and HDFS commands.

10Marks

Write the definition of “big data” and under what conditions it is given that name.

2.Demonstrate the differences between Big data and conventional data.

3. Examine the various dimensions of growth of big data.

4. Differentiate between data analysis and data reporting.

5. List the risks involved in using big data.

6. Explain the role of big data analytics.

7. Identify the sources of big data.

8. Analyze the challenges in big data.


9. Summarize the reasons for the domain expertise for any type of data analytics.

10. Define the reason behind the phrase “Web data is the most popular big data” .

UNIT V

2marks

1. Why the accuracy in big data is beneficial ?


2. Analyze the list of data analytical tools.
3. Predict about the list of reporting tools.
4. Discuss about the trends in data analytics tools.
5. Generalize the role of analytical tools in big data.
6. Show the points of similarities between the data mining and data analysis.
7. Differentiate between data analytics and big data analytics.
8. Create a scenario where big Data Analytics be used as a Decision Making tool.
9. Summarize the technologies used to handle big data.
10. State the definition of “Sand Box”.

5Marks

1. Define Big Data . Describe the main features of big data in detail.

2. (i) Explain the main characteristics features and structure of Big data in detail.

(ii) Analyze big data architecture with a neat schematic

diagram.

3. Describe the risks involved in handling Big data.

4. (i) Evaluate ways in which the big data is represented ?

(ii) Assess the structure of big data representation

5. (i) What are the features of Massive parallel processing system?

(ii) Describe the use of Massive Parallel Processing system in big data analytics

6. Discuss the challenges faced by the traditional system.

7. Point out in detail the analysis tools and reporting tools used in Bigdata.

8. Discuss in detail about Analytical data set and the types of analytical data set.

9. Discuss in detail about web data and what does it reveal?

10. Illustrate in detail how big data are effectively filtered and mixed with the traditional one. (13)

11. (i) Describe the importance of tools in Big Data

(ii) Summarize in detail the trends and technology in big data.

12. Formulate the difficulties faced by the traditional systems.


13. Describe how the analytical scalability is handled in big data.

14. (i) Explain in detail about the web data in current action today.

(ii) Examine the modern tools for big data analytics.

10marks

1. Summarize in detail about the challenges of the Big Data in Modern Data Analytics.

2. Hypothesize the statement “Web Data is the Most Popular Big Data” with reference to data analytic
professional.”

3. Infer on the statement “Is the “Big” Part or the “Data” Part More Important “.

4. Formulate the role of analytic sandbox, its benefits and types. Give the definition of Hadoop.

2. Define how Map-Reduce computation is executed .

3. Show the key advantages in Hadoop.

4. Point out the meaning of the term “ Hadoop YARN”.

5. List the core concepts of Hadoop.

6. Define MAP REDUCE concepts.

7. How can a key value pair is formed?

8. Develop the importance of DFS.

9. Differentiate between Hadoop and Map Reduce. 10. Point out the characteristics of Hadoop.

11. Distinguish between Hadoop and Big data.

12. List the advantages of MaPR.

13. Classify the classical components of computer.

14. Express Shuffle and sort Algorithm.

15. Explain the goals of HDFS.

16. What are the list of Hadoop applications ?

17. Classify types of big data.

18. Define the partitions are shuffled in map reduce.

19. Explain the steps in map reduce algorithm.

20. Generalize matrix vector multiplication.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy