CCD CT 2 Model Answer 2022-23
MODEL ANSWER
Class Test- I (2023-24)
Que. No. | Sub. Que. | Stepwise Solution | Marks | Total Marks
1 Attempt any FOUR (C22594.1) 2 8
a Define Data Pipeline 02 02
Definition (2 marks):
1. A data pipeline is a method in which raw data is ingested from various data sources and
then ported to a data store, such as a data lake or data warehouse, for analysis.
2. A data pipeline is a series of processes that migrate data from a source to a destination
database. An example of a technical dependency is that, after data is collected from the
sources, it is held in a central queue before being subjected to further validations and
then finally loaded into the destination.
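The flow in the second definition (collect from sources, hold in a central queue, validate, then load) can be sketched in a few lines of Python. This is only an illustrative sketch: the record layout, the validation rule in `is_valid`, and the in-memory `warehouse` list standing in for the destination database are all assumptions, not part of the model answer.

```python
from collections import deque


def ingest(sources):
    """Collect raw records from every source into one central queue."""
    queue = deque()
    for source in sources:
        queue.extend(source)
    return queue


def is_valid(record):
    """Hypothetical rule: a record needs an id and a numeric amount."""
    return "id" in record and isinstance(record.get("amount"), (int, float))


def run_pipeline(sources, destination):
    """Ingest, hold in the queue, validate, then load into the destination."""
    queue = ingest(sources)
    rejected = []
    while queue:
        record = queue.popleft()
        if is_valid(record):
            destination.append(record)  # load step
        else:
            rejected.append(record)     # kept aside for inspection
    return rejected


# Two made-up sources and an in-memory stand-in for the destination store.
source_a = [{"id": 1, "amount": 10.5}, {"id": 2, "amount": "n/a"}]
source_b = [{"id": 3, "amount": 7}, {"amount": 4}]
warehouse = []
rejected = run_pipeline([source_a, source_b], warehouse)
```

Here only the records with ids 1 and 3 reach `warehouse`; the other two fail the assumed validation rule and are returned in `rejected`.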
b State the uses of a modern data pipeline. 02 02
(1 mark for each use)
1. AWS Data Pipeline is a web service that helps you reliably process and move data between
different AWS compute and storage services, as well as on-premises data sources, at
specified intervals.
2. Data pipelines provide the foundation for a range of data projects, including
exploratory data analysis, data visualization, and machine learning tasks.
3. Automating data pipelines allows organizations to extract data at its source,
transform it, integrate it with other sources, and fuel business applications and data
analytics.
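The third use (extract data at its source, transform it, integrate it with other sources) can be illustrated with a small Python sketch. The function names, the `customer` and `amount` fields, and the two sample sources are hypothetical and chosen only for illustration.

```python
def extract(rows):
    """Extract: copy raw rows out of a source (here, an in-memory list)."""
    return [dict(row) for row in rows]


def transform(rows):
    """Transform: normalise customer names and convert amounts to float."""
    return [
        {"customer": row["customer"].strip().title(), "amount": float(row["amount"])}
        for row in rows
    ]


def integrate(*datasets):
    """Integrate: combine all sources and total the amount per customer."""
    totals = {}
    for rows in datasets:
        for row in rows:
            name = row["customer"]
            totals[name] = totals.get(name, 0.0) + row["amount"]
    return totals


# Two made-up sources with inconsistent formatting.
web_orders = [{"customer": " alice ", "amount": "20.0"}, {"customer": "bob", "amount": "5"}]
store_orders = [{"customer": "ALICE", "amount": "12.5"}]

totals = integrate(transform(extract(web_orders)), transform(extract(store_orders)))
```

After the run, `totals` holds one combined figure per customer, which is the kind of integrated result a business application or analytics report would consume.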
c Describe any four characteristics of a data pipeline. 02 02
Characteristics of a data pipeline (1/2 mark for each characteristic):
1. Continuous, extensible data processing
2. Cloud-enabled elasticity and agility
3. Independent, isolated data processing resources
4. Widespread data access and the ability to self-serve
5. High availability and disaster recovery