
ALL INDIA SHRI SHIVAJI MEMORIAL SOCIETY’S POLYTECHNIC, PUNE – 01

MODEL ANSWER
Class Test- I (2023-24)

Program Code :- AN5I        Semester :- V
Course Title :- Cloud Computing with Data Science        Course Code :- 22594
Marks :- 20        Time :- 1 Hr.
Program :- Artificial Intelligence and Machine Learning        Date :-

Course Outcomes :- C22594.4, C22594.5, C22594.6
Q.1 Attempt any FOUR (C22594.1) [2 marks each, total 8 marks]
(a) Define Data Pipeline. [2 marks]
Definition:
1. A data pipeline is a method in which raw data is ingested from various data sources and then ported to a data store, such as a data lake or data warehouse, for analysis.
2. A data pipeline is a series of processes that migrate data from a source to a destination database. An example of a technical dependency is that, after assimilating data from the sources, the data is held in a central queue before being subjected to further validations and finally loaded into the destination.
(b) State the uses of a modern data pipeline. [2 marks; 1 mark per use]
1. AWS Data Pipeline is a web service that helps you reliably process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals.
2. Data pipelines provide the foundation for a range of data projects; this can include exploratory data analyses, data visualizations, and machine learning tasks.
3. Automation of data pipelines allows organizations to extract data at its source, transform it, integrate it with other sources, and fuel business applications and data analytics.
(c) Describe any four characteristics of a data pipeline. [2 marks; 1/2 mark per characteristic]
Characteristics of a data pipeline:
1. Continuous, extensible data processing
2. Cloud-enabled elasticity and agility
3. Independent, isolated data processing resources
4. Widespread data access and the ability to self-serve
5. High availability and disaster recovery

(d) Enlist the process of designing a pipeline. [2 marks]

Process of designing a pipeline in the cloud (a small illustrative sketch follows the steps):
Step 1: Determine the goal
When designing a data pipeline, the priority is to identify the outcome or value the
data pipeline will bring to your company or product.
Step 2: Choose the data sources
We then consider the possible data sources that'll enter the data pipeline.
Step 3: Determine the data ingestion strategy
With the pipeline goal and data sources understood, we need to ask how the
pipeline will collect the data.
Step 4: Design the data processing plan
Once data has been ingested, it has to be processed and transformed for it to be
valuable to downstream systems.
Step 5: Set up storage for the output of the pipeline
Once the data has been processed, we must determine the final storage
destination for our data to serve various business use cases.
Step 6: Plan the data workflow
We then need to design the sequencing of processes in the data pipeline.
Step 7: Implement a data monitoring and governance framework
In this step, we establish a data monitoring and governance framework, which
helps us observe the data pipeline to ensure a healthy and efficient channel that’s
reliable, secure, and performs as required.
Step 8: Plan the data consumption layer
This final step determines the various services that’ll consume the processed data
from our data pipeline.
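
For illustration, a minimal Python sketch of a pipeline whose stages follow steps 2-6 above. The source URL, field names, and database file are hypothetical placeholders chosen only for this example:

    import json
    import sqlite3
    import urllib.request

    ORDERS_URL = "https://example.com/api/orders"   # Step 2: a hypothetical data source

    def ingest():
        # Step 3: batch ingestion over HTTP at a scheduled interval.
        with urllib.request.urlopen(ORDERS_URL) as resp:
            return json.load(resp)

    def process(records):
        # Step 4: validate records and derive a field useful to downstream systems.
        return [
            {"id": r["id"], "total": r["qty"] * r["unit_price"]}
            for r in records
            if r.get("qty") and r.get("unit_price")
        ]

    def store(rows):
        # Step 5: final storage destination (SQLite standing in for a warehouse).
        with sqlite3.connect("orders.db") as conn:
            conn.execute("CREATE TABLE IF NOT EXISTS orders (id TEXT, total REAL)")
            conn.executemany(
                "INSERT INTO orders VALUES (?, ?)",
                [(r["id"], r["total"]) for r in rows],
            )

    def run_pipeline():
        # Step 6: workflow sequencing - ingest, then process, then store.
        store(process(ingest()))

Monitoring (step 7) and the consumption layer (step 8) would sit around this core flow, for example by logging each run and exposing the orders table to reporting tools.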
(e) Explain the ETL process in detail. (U) [2 marks; any 2 points, 1 mark each]
1. ETL is a process in data warehousing and it stands for Extract, Transform and Load. It is a process in which an ETL tool extracts the data from various data source systems, transforms it in the staging area, and then finally loads it into the data warehouse system.
2. Extract, transform, and load (ETL) is the process of combining data from multiple sources into a large, central repository called a data warehouse. ETL uses a set of business rules to clean and organize raw data and prepare it for storage, data analytics, and machine learning (ML).
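
For illustration, a minimal ETL sketch in Python using pandas; the sales.csv file, its column names, and the SQLite file standing in for the warehouse are assumptions made only for this example:

    import sqlite3
    import pandas as pd

    # Extract: read raw data from the source system (a CSV export here).
    raw = pd.read_csv("sales.csv")

    # Transform: clean and reshape the data in the staging area (in memory here).
    raw["order_date"] = pd.to_datetime(raw["order_date"])
    raw = raw.dropna(subset=["customer_id"])
    daily_totals = raw.groupby("order_date", as_index=False)["amount"].sum()

    # Load: write the prepared data into the warehouse table.
    with sqlite3.connect("warehouse.db") as conn:
        daily_totals.to_sql("daily_sales", conn, if_exists="replace", index=False)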

Q.2 (A) Attempt any TWO (C22319.4) [4 marks each, total 8 marks]


(a) List any four common issues in Kubernetes. [4 marks; 1 mark per issue]
Issues in Kubernetes:
1. Complexity: Kubernetes has a steep learning curve. Setting up, configuring, and maintaining a Kubernetes cluster can be challenging, especially for those new to container orchestration.
2. Resource Costs: While Kubernetes can optimize resource usage, running Kubernetes clusters can still be expensive in terms of cloud infrastructure costs.
3. Management Overhead: Operating and managing Kubernetes clusters require ongoing
effort and expertise. This includes tasks like updates, monitoring, and troubleshooting.
4. Networking Complexity: Networking in Kubernetes can be complex, and setting up
communication between services or across clusters can be challenging, especially in a
multi-cloud setup.
5. Lack of Application Visibility: Kubernetes abstracts many infrastructure details, which
can make it difficult to get comprehensive insights into the performance and behavior of
applications.
6. Vendor Lock-In: While Kubernetes promotes portability, some cloud providers offer
managed Kubernetes services (e.g., Amazon EKS, Google GKE) with cloud-specific features,
potentially creating vendor lock-in.
7. Data Management: Handling stateful applications and managing persistent data in a
Kubernetes environment can be challenging.
8. Version Compatibility: Ensuring that your application and its dependencies are
compatible with the Kubernetes version you're using can be a concern, especially when
upgrading.
(b) Draw and explain Docker architecture in detail. [4 marks: diagram 1 mark, explanation 3 marks]

Docker architecture:
Docker uses a client-server architecture. The Docker client talks to the Docker
daemon, which does the heavy lifting of building, running, and distributing your
Docker containers. The Docker client and daemon can run on the same system, or
you can connect a Docker client to a remote Docker daemon. The Docker client
and daemon communicate using a REST API, over UNIX sockets or a network
interface. Another Docker client is Docker Compose, which lets you work with
applications consisting of a set of containers.

1. The Docker daemon
The Docker daemon (dockerd) listens for Docker API requests and manages Docker
objects such as images, containers, networks, and volumes. A daemon can also
communicate with other daemons to manage Docker services.
2. The Docker client
The Docker client (docker) is the primary way that many Docker users interact with
Docker. When you use commands such as docker run, the client sends these
commands to dockerd, which carries them out. The docker command uses the
Docker API. The Docker client can communicate with more than one daemon.
3. Docker Desktop
Docker Desktop is an easy-to-install application for your Mac, Windows or Linux
environment that enables you to build and share containerized applications and
microservices. Docker Desktop includes the Docker daemon (dockerd), the Docker
client (docker), Docker Compose, Docker Content Trust, Kubernetes, and
Credential Helper. For more information, see Docker Desktop.
4. Docker registries
A Docker registry stores Docker images. Docker Hub is a public registry that
anyone can use, and Docker looks for images on Docker Hub by default. You can
even run your own private registry. When you use the docker pull or docker
run commands, Docker pulls the required images from your configured registry.
When you use the docker push command, Docker pushes your image to your
configured registry.
5. Docker objects
When you use Docker, you are creating and using images, containers, networks,
volumes, plugins, and other objects. This section is a brief overview of some of
those objects.
6. Images
An image is a read-only template with instructions for creating a Docker container.
Often, an image is based on another image, with some additional customization.
For example, you may build an image which is based on the ubuntu image, but
installs the Apache web server and your application, as well as the configuration
details needed to make your application run.
7. Containers
A container is a runnable instance of an image. You can create, start, stop, move,
or delete a container using the Docker API or CLI.
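
A small sketch of the client/daemon interaction described above, using the Docker SDK for Python (pip install docker); it assumes a local Docker daemon is running and uses the public alpine image only as an example:

    import docker

    # The SDK acts as a Docker client: it sends REST API requests to dockerd.
    client = docker.from_env()

    # Pull an image from the configured registry (Docker Hub by default).
    client.images.pull("alpine", tag="latest")

    # Create and run a container from that image; the daemon does the actual work.
    output = client.containers.run(
        "alpine:latest", ["echo", "hello from a container"], remove=True
    )
    print(output.decode())

    # List containers known to the daemon, similar to `docker ps -a`.
    for container in client.containers.list(all=True):
        print(container.id, container.status)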
(c) Explain elastic resources in detail. [4 marks]
1. Elastic resources in cloud computing refer to the ability to dynamically and
automatically scale computing resources up or down based on the changing demands of
an application or workload.
2. This elasticity allows cloud users to efficiently allocate and de-allocate resources as
needed, which can help optimize performance, cost, and resource utilization.
3. Elasticity is a fundamental feature of cloud computing and is particularly valuable in
scenarios where workloads are unpredictable or vary over time.
4. It enables organizations to optimize their infrastructure in a cost-effective and
responsive manner, supporting the efficient use of cloud resources.
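
As one possible illustration, a sketch using boto3 to request elasticity from AWS EC2 Auto Scaling; the group name, launch template, and subnet ID are placeholders, and a suitable launch template is assumed to already exist:

    import boto3

    autoscaling = boto3.client("autoscaling")

    # Define the elastic pool: the fleet may shrink to 1 or grow to 10 instances.
    autoscaling.create_auto_scaling_group(
        AutoScalingGroupName="demo-asg",
        MinSize=1,
        MaxSize=10,
        DesiredCapacity=2,
        LaunchTemplate={"LaunchTemplateName": "demo-launch-template"},
        VPCZoneIdentifier="subnet-0123456789abcdef0",
    )

    # Target tracking: AWS adds or removes instances automatically so that
    # average CPU utilization stays near 50%, with no manual intervention.
    autoscaling.put_scaling_policy(
        AutoScalingGroupName="demo-asg",
        PolicyName="keep-cpu-near-50",
        PolicyType="TargetTrackingScaling",
        TargetTrackingConfiguration={
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "ASGAverageCPUUtilization"
            },
            "TargetValue": 50.0,
        },
    )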
Q.2 (B) Attempt any ONE (C22594.3) [4 marks]
(a) Explain AWS SageMaker with example. [4 marks]
AWS SageMaker:
1. Amazon SageMaker is a fully managed machine learning service. With SageMaker, data
scientists and developers can quickly and easily build and train machine learning models,
and then directly deploy them into a production-ready hosted environment.
2. It provides an integrated Jupyter authoring notebook instance for easy access to your
data sources for exploration and analysis, so you don't have to manage servers.
3. It also provides common machine learning algorithms that are optimized to run
efficiently against extremely large data in a distributed environment.
4. With native support for bring-your-own-algorithms and frameworks, SageMaker offers
flexible distributed training options that adjust to your specific workflows.
5. Deploy a model into a secure and scalable environment by launching it with a few clicks
from SageMaker Studio or the SageMaker console.
Features of SageMaker:
SageMaker geospatial capabilities
Build, train, and deploy ML models using geospatial data.
SageMaker Model Cards
Document information about your ML models in a single place for streamlined
governance and reporting throughout the ML lifecycle.
SageMaker Model Dashboard
A pre-built, visual overview of all the models in your account. Model Dashboard
integrates information from SageMaker Model Monitor, transform jobs,
endpoints, lineage tracking, and CloudWatch so you can access high-level model
information and track model performance in one unified view.
SageMaker Role Manager
Administrators can define least-privilege permissions for common ML activities
using custom and preconfigured persona-based IAM roles.
AutoML step
Create an AutoML job to automatically train a model in SageMaker Pipelines.
Collaboration with shared spaces
A shared space consists of a shared JupyterServer application and a shared
directory. All user profiles in a Domain have access to all shared spaces in the
Domain.
Data Wrangler data preparation widget
Interact with your data, get visualizations, explore actionable insights, and fix data
quality issues.
Inference shadow tests
Evaluate any changes to your model-serving infrastructure by comparing its
performance against the currently deployed infrastructure.
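
Example (an illustrative sketch with the SageMaker Python SDK; the IAM role ARN, S3 path, training script name, and instance types are placeholders, not values from a real account):

    import sagemaker
    from sagemaker.sklearn.estimator import SKLearn

    session = sagemaker.Session()
    role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder ARN

    # Build and train: SageMaker provisions managed training infrastructure for us.
    estimator = SKLearn(
        entry_point="train.py",          # our scikit-learn training script
        role=role,
        instance_count=1,
        instance_type="ml.m5.large",
        framework_version="1.2-1",
        py_version="py3",
        sagemaker_session=session,
    )
    estimator.fit({"train": "s3://my-bucket/train-data/"})

    # Deploy: host the trained model behind a real-time HTTPS endpoint.
    predictor = estimator.deploy(initial_instance_count=1, instance_type="ml.m5.large")
    print(predictor.predict([[5.1, 3.5, 1.4, 0.2]]))

    # Clean up the endpoint when it is no longer needed.
    predictor.delete_endpoint()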
(b) State any four ML systems available in the market. [4 marks; 1 mark each]
ML systems available in the market:

Chatbots: Chatbots are available on an organization's website and serve as
customer support. Intelligent bots are able to deduce customer queries by asking a
strategic set of questions. These are available as plug-and-play packages that fit
right into the web page via an endpoint.

Recommender Systems: Recommendation systems use clever statistics to profile
customers and provide them with relevant recommendations. These may take
some time to adjust to your customers' data but can be easily built.

Sentiment Analysis: Marketers use these systems to predict customer sentiments
regarding advertisements and products. These analyses can be used to improve
experience and customer retention.

Fraud detection: Another prominent use of machine learning in business is in
fraud detection, particularly in banking and financial services, where institutions
use it to alert customers of potentially fraudulent use of their credit and debit
cards.
