UNIT-1 Distributed System
models – types of networks – network principles – internet protocols – the API for internet
protocols – external data representation and marshalling – client-server communication
– group communication
All data and computational resources in a centralized system are kept and controlled in a single central place, such as a server. Applications and users connect to this hub in order to access and process data. Although this configuration is easy to maintain and secure, the central server can become a bottleneck when too many users access it simultaneously, and a single point of failure if it malfunctions.
A distributed system, on the other hand, spreads data and resources across several servers or sites, often in different physical locations. This configuration enables better scalability and reliability, since the system can keep functioning even when a component fails. However, because they have many points of interaction, distributed systems can be more difficult to secure and administer.
Common distributed system architectures include:
• Client-Server Architecture:
o In this setup, servers provide resources or services, and clients request them.
Clients and servers communicate over a network.
o Examples: Web applications, where browsers (clients) request pages from web
servers.
• Peer-to-Peer (P2P) Architecture:
o Each node, or “peer,” in the network acts as both a client and a server, sharing resources directly with other peers.
o Examples: File-sharing networks such as BitTorrent.
• Three-Tier Architecture:
o This model has three layers: presentation (user interface), application (business
logic), and data (database). Each layer is separated to allow easier scaling and
maintenance.
o Examples: Many web applications use this to separate user interfaces, logic
processing, and data storage.
• Microservices Architecture:
o The application is split into small, independent services, each handling specific
functions. These services communicate over a network, often using REST APIs
or messaging.
• Event-Driven Architecture:
o Components interact by publishing and responding to events rather than making direct requests. An event triggers specific actions or processes in various parts of the system (see the sketch after this list).
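As a rough Python illustration of the event-driven style (the event name and handlers are invented for the example), components register handlers and react to published events instead of calling each other directly:

# Minimal event-driven dispatcher: handlers subscribe to named events.
handlers = {}

def subscribe(event_name, handler):
    handlers.setdefault(event_name, []).append(handler)

def emit(event_name, payload):
    # An event triggers every handler registered for it.
    for handler in handlers.get(event_name, []):
        handler(payload)

subscribe("order_placed", lambda order: print("billing saw:", order))
subscribe("order_placed", lambda order: print("shipping saw:", order))
emit("order_placed", {"id": 42})  # both components react to one event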
The most common forms of distributed systems today operate over the internet, handing off
workloads to dozens of cloud-based virtual server instances that are created as needed, and
then terminated when the task is complete.
Example of a Distributed System
Consider a social media platform: its headquarters hosts the centralized computer network, while the computer systems that users access to consume its services act as the autonomous systems in the distributed system architecture.
• Database: It stores the data processed by each node/system of the distributed system connected to the centralized network.
• Each autonomous system runs a common application and can hold its own data, which is shared through the centralized database system.
• Middleware services provide capabilities that are missing by default from the local systems or the centralized system, acting as an interface between the centralized system and the local systems. Using middleware components, the systems communicate and manage data.
• The data transferred through the database is divided into segments or modules and shared with the autonomous systems for processing.
• The processed data is then transferred back to the centralized system over the network and stored in the database.
Characteristics of Distributed System
• Resource Sharing: It is the ability to use any Hardware, Software, or Data anywhere
in the System.
• Openness: It is concerned with extensions and improvements to the system (i.e., how openly the software is developed and shared with others).
• Concurrency: It is naturally present in distributed systems, where the same activity or functionality can be performed by separate users in remote locations. Every local system has its own independent operating system and resources.
• Fault tolerance: It concerns the reliability of the system: if there is a failure in hardware or software, the system continues to operate properly without degrading its performance.
• Transparency: It hides the complexity of the distributed system from users and application programs, so that the system appears as a single coherent whole.
Advantages of Distributed System
• Scalability: Distributed systems can easily grow by adding more computers (nodes), allowing them to handle increased demand without significant reconfiguration.
• Reliability and Fault Tolerance: If one part of the system fails, others can take over,
making distributed systems more resilient and ensuring services remain available.
• Performance: Workloads can be split across multiple nodes, allowing tasks to be
completed faster and improving overall system performance.
• Resource Sharing: Distributed systems allow resources like data, storage, and
computing power to be shared across nodes, increasing efficiency and reducing costs.
Disadvantages of Distributed System
• Security poses a problem because resources are shared across multiple systems, making data easier to access improperly.
• Network saturation may hinder data transfer; if there is lag in the network, users will face problems accessing data.
• In comparison to a single-user system, the database associated with a distributed system is much more complex and challenging to manage.
• If every node in a distributed system tries to send data at once, the network may become
overloaded.
Distributed systems and microservices are related concepts but not the same. Let’s break down
the differences:
1. Distributed Systems: a broad class of systems whose components run on multiple networked computers and coordinate their actions to achieve a common goal.
2. Microservices: an architectural style that structures a single application as a collection of small, independently deployable services.
While microservices can be implemented on a distributed system, they are not the same.
Microservices focus on architectural design principles, emphasizing modularity, scalability,
and flexibility, whereas distributed systems encompass a broader range of concepts, including
communication protocols, fault tolerance, and concurrency control, among others.
I. Physical Model
1. Nodes
Nodes are the end devices that can process data, execute tasks, and communicate with the other
nodes. These end devices are generally the computers at the user end or can be servers,
workstations, etc.
• Nodes provide the distributed system with an interface in the presentation layer that enables the user to interact with other back-end devices (nodes), which can be used for storage and database services, processing, web browsing, etc.
• Each node has an operating system, an execution environment, and different middleware requirements that facilitate communication and other vital tasks.
2. Links
Links are the communication channels between different nodes and intermediate devices.
These may be wired or wireless. Wired links or physical media are implemented using copper
wires, fiber optic cables, etc. The choice of the medium depends on the environmental
conditions and the requirements. Generally, physical links are required for high-performance
and real-time computing. Different connection types that can be implemented are as follows:
• Point-to-point links: Establish a connection and allow data transfer between only two
nodes.
• Multi-access links: Multiple nodes share the same communication channel to transfer data; these links require protocols to avoid interference during transmission.
3. Middleware
Middleware is the software installed and executed on the nodes. By running middleware on each node, the distributed computing system achieves decentralised control and decision-making. It handles various tasks such as communication with other nodes, resource management, fault tolerance, synchronisation of different nodes, and security against malicious and unauthorised access.
4. Network Topology
This defines the arrangement of nodes and links in the distributed computing system. The most common network topologies are bus, star, mesh, ring, and hybrid. The topology is chosen by examining the exact use cases and requirements.
5. Communication Protocols
Communication protocols are the set of rules and procedures for transmitting data over the links. Examples of these protocols include TCP, UDP, HTTPS, and MQTT. These allow the nodes to communicate and to interpret the data they exchange.
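As a small illustration, this Python sketch (standard library only; the address and port are placeholders) sends a single UDP datagram of the kind such protocols carry:

import socket

# UDP is connectionless: each sendto() is an independent datagram.
sender = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sender.sendto(b"sensor-reading: 21.5", ("127.0.0.1", 9999))
sender.close()

# A receiving node would bind to the same port and read datagrams:
# receiver = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
# receiver.bind(("0.0.0.0", 9999))
# data, addr = receiver.recvfrom(1024)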
II. Architectural Model
The architectural model in a distributed computing system is the overall design and structure of the system: how its different components are organised to interact with each other and provide the desired functionalities. It gives an overview of the system and of how its development, deployment, and operations will take place. Constructing a good architectural model is required for efficient cost usage and highly improved scalability of the applications.
1. Client-Server model
It is a centralised approach in which clients initiate requests for services and servers respond by providing those services. It works mainly on the request-response model: the client sends a request to the server, and the server processes it and responds to the client accordingly.
• This is mainly used in web services, cloud computing, database management systems
etc.
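A minimal request-response sketch in Python (standard library; the loopback address and port are arbitrary) that mirrors this model:

import socket, threading

srv = socket.create_server(("127.0.0.1", 8080))  # server: bind and listen

def serve_one():
    # Server side: accept one connection, read the request, send a response.
    conn, _ = srv.accept()
    with conn:
        request = conn.recv(1024)
        conn.sendall(b"response to: " + request)

threading.Thread(target=serve_one, daemon=True).start()

# Client side: initiate the request and wait for the server's reply.
with socket.create_connection(("127.0.0.1", 8080)) as client:
    client.sendall(b"GET /resource")
    print(client.recv(1024).decode())
srv.close()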
2. Peer-to-peer model
It is a decentralised approach in which all the distributed computing nodes, known as peers,
are all the same in terms of computing capabilities and can both request as well as provide
services to other peers. It is a highly scalable model because the peers can join and leave the
system dynamically, which makes it an ad-hoc form of network.
• The resources are distributed and the peers need to look out for the required resources
as and when required.
• The communication is directly done amongst the peers without any intermediaries
according to some set rules and procedures defined in the P2P networks.
3. Layered model
It involves organising the system into multiple layers, where each layer provisions a specific service. Each layer communicates with the adjacent layers using certain well-defined protocols without affecting the integrity of the system. A hierarchical structure is obtained in which each layer abstracts the underlying complexity of the lower layers.
4. Micro-services model
In this model, a complex application or task is decomposed into multiple independent services, each running on different servers. Each service performs only a single function and is focussed on a specific business capability. This makes the overall system more maintainable, scalable, and easier to understand. Services can be independently developed, deployed, and scaled without affecting the other running services.
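A toy sketch of one such service in Python (standard library only; the port, path, and payload are invented for the example), exposing a single business capability over HTTP:

from http.server import BaseHTTPRequestHandler, HTTPServer

class InventoryService(BaseHTTPRequestHandler):
    # One microservice, one business capability: report stock levels.
    def do_GET(self):
        if self.path == "/stock":
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.end_headers()
            self.wfile.write(b'{"item": "widget", "count": 7}')
        else:
            self.send_error(404)

# Other services would call GET /stock over the network instead of
# importing this code directly.
HTTPServer(("127.0.0.1", 8001), InventoryService).serve_forever()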
III. Fundamental Model
1. Interaction Model
Distributed computing systems are full of processes interacting with each other in highly complex ways. The interaction model provides a framework for understanding the mechanisms and patterns used for communication and coordination among the various processes. Important components of this model are:
• Message Passing – It deals with passing messages, which may contain data, instructions, a service request, or process-synchronisation information, between different computing nodes. It may be synchronous or asynchronous depending on the type of task and process (a small sketch follows this list).
• Publish/Subscribe Systems – Also known as pub/sub systems. A publishing process publishes a message on a topic, and the processes subscribed to that topic receive it and act on it. This pattern is especially important in event-driven architectures.
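To make message passing concrete, here is a small Python sketch using an in-process queue as a stand-in for a network channel (asynchronous: the sender does not wait for the receiver):

import queue, threading

channel = queue.Queue()  # stand-in for a network link between two nodes

def worker_node():
    # Receiver: blocks until a message arrives, then processes it.
    message = channel.get()
    print("worker received:", message)

threading.Thread(target=worker_node).start()

# Sender: puts the message on the channel and continues immediately
# (asynchronous message passing).
channel.put({"op": "sync_state", "term": 3})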
• Remote Procedure Call (RPC) – It lets a process invoke a procedure on a remote node as if it were a local call; the underlying middleware marshals the arguments, transmits them, and unmarshals the result.
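A brief sketch using Python's built-in XML-RPC modules (one of many RPC mechanisms; the port number is arbitrary):

from xmlrpc.server import SimpleXMLRPCServer
import threading, xmlrpc.client

server = SimpleXMLRPCServer(("127.0.0.1", 8002), logRequests=False)
server.register_function(lambda a, b: a + b, "add")  # the remote procedure
threading.Thread(target=server.serve_forever, daemon=True).start()

# The caller invokes add() as if it were local; the library marshals the
# arguments, sends them to the server, and unmarshals the result.
proxy = xmlrpc.client.ServerProxy("http://127.0.0.1:8002/")
print(proxy.add(2, 3))  # prints 5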
2. Failure Model
This model addresses the faults and failures that occur in the distributed computing system. It
provides a framework to identify and rectify the faults that occur or may occur in the system.
Fault-tolerance mechanisms are implemented to handle failures through replication and through error detection and recovery methods. Different failures that may occur are:
• Timing failures – The process deviates from its expected time quantum and may lead
to delays or unsynchronised response times.
• Byzantine failures – The process may send malicious or unexpected messages that
conflict with the set protocols.
3. Security Model
Distributed computing systems may suffer malicious attacks, unauthorised access and data
breaches. Security model provides a framework for understanding the security requirements,
threats, vulnerabilities, and mechanisms to safeguard the system and its resources. Various
aspects that are vital in the security model are:
• Authentication: It verifies the identity of the users accessing the system, ensuring that only authorised and trusted entities get access. It typically involves credentials such as passwords, digital certificates, or biometrics.
• Encryption: It encodes data in transit and at rest so that only authorised parties holding the right keys can read it (a small sketch follows).
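A minimal symmetric-encryption sketch, assuming the third-party cryptography package is installed (pip install cryptography); the message contents are invented:

# Assumes the third-party "cryptography" package.
from cryptography.fernet import Fernet

key = Fernet.generate_key()   # shared secret between the two nodes
cipher = Fernet(key)

token = cipher.encrypt(b"user=alice;balance=100")  # unreadable in transit
print(cipher.decrypt(token))  # only a key holder can recover the data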
Distributed systems are networks of interconnected computers that work together to solve complex problems or perform tasks, using shared resources and communication protocols to achieve efficiency, scalability, and fault tolerance. From the fundamentals of distributed computing to the challenges of scalability, fault tolerance, and consistency, the sections below give a concise overview of the key principles for building resilient and efficient distributed systems.
Important Topics for Distributed System Principles
To build good distributed systems, you need to follow some important principles:
1. Decentralization
• Each node in a decentralized system works on its own but also works together with
others to get things done. So, if one node stops working, it does not affect the whole
system much because the others can still work independently.
2. Scalability
Scalability means how well a distributed system can handle growing workloads and resource demands. If more people start using a service or there is more data to process, a scalable system can handle it without slowing down much.
• There are two types: horizontal and vertical. Horizontal scalability means adding more
computers to the system, while vertical scalability means making each computer more
powerful.
• Techniques like spreading the work evenly, dividing it into parts, and sharing the load help make sure the system runs smoothly even as it gets bigger (a toy load-balancing sketch follows this list).
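A toy round-robin load balancer in Python (the node names are made up):

import itertools

# Round-robin: requests are spread evenly across the available nodes.
nodes = itertools.cycle(["node-1", "node-2", "node-3"])

def route(request):
    return next(nodes)  # each call picks the next node in turn

for i in range(6):
    print(f"request {i} -> {route(i)}")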
3. Fault Tolerance
Fault tolerance is about how well a distributed system can handle things going wrong. It means
the system can find out when something's not working right, fix it, and keep running smoothly.
• Since problems are bound to happen in complex systems, fault tolerance is crucial for
making sure the system stays reliable and available.
• Techniques like copying data or tasks onto different computers, keeping extra resources
just in case, and having plans to detect and recover from errors help reduce the impact
of failures.
• Also, there are strategies for automatically switching to backups when needed and for
making sure the system can still work even if it's not at full capacity.
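A simplified failover sketch in Python (the replica list and the fetch function are hypothetical placeholders for real network calls):

# Try each replica in order; fall back to the next one on failure.
replicas = ["primary:9000", "backup-1:9000", "backup-2:9000"]

def fetch_from(address):
    # Placeholder for a real network call that may raise on failure.
    if address.startswith("primary"):
        raise ConnectionError("primary is down")
    return f"data served by {address}"

def fetch_with_failover(replicas):
    for address in replicas:
        try:
            return fetch_from(address)
        except ConnectionError:
            continue  # failure detected; switch to the next backup
    raise RuntimeError("all replicas failed")

print(fetch_with_failover(replicas))  # served by backup-1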
4. Consistency
Consistency means making sure all parts of a distributed system have the same information
and act the same way, even if lots of things are happening at once. If things are not consistent,
it can mess up the data, break rules, and cause mistakes.
• Distributed systems keep things consistent by using methods like grouping multiple operations so they all finish together, or using locks to stop different parts from changing shared data at the same time (a lock sketch follows this list).
• There are different levels of consistency, like strong consistency where everything is
always the same, eventual consistency where it might take time but will get there, and
causal consistency which is somewhere in between. These levels depend on how
important it is for the system to work fast, be available, and handle problems.
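The locking technique mentioned above, sketched in Python with many threads updating one shared value:

import threading

balance = 0
lock = threading.Lock()  # serialises updates to the shared value

def deposit(amount):
    global balance
    with lock:  # without this, concurrent read-modify-write updates can be lost
        current = balance
        balance = current + amount

threads = [threading.Thread(target=deposit, args=(10,)) for _ in range(100)]
for t in threads: t.start()
for t in threads: t.join()
print(balance)  # always 1000 because the lock keeps updates consistent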
5. Performance Optimization
Performance optimization means making a distributed system work faster and better by
improving how data is stored, how computers talk to each other, and how tasks are done.
• For example, using smart ways to store data across many computers and quickly find what's needed, such as caching frequently used results (a small caching sketch follows this list).
• Also, using efficient ways for computers to communicate, like sending messages in a
smart order to reduce delays. And, using clever ways to split up tasks between
computers and work on them at the same time, which speeds things up.
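A minimal caching sketch in Python; the remote query is a made-up placeholder, and the cache means repeated lookups avoid a second round trip:

from functools import lru_cache

@lru_cache(maxsize=1024)
def lookup(user_id):
    # Placeholder for an expensive remote query; the decorator caches each
    # result so repeated calls for the same key are answered locally.
    print(f"querying remote store for {user_id}")
    return (user_id, f"user-{user_id}")

lookup(7)  # hits the remote store
lookup(7)  # served from the cache, no second query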
What is Distributed Coordination?
Distributed coordination is important for making sure all the parts of a distributed system work together smoothly to achieve the same goals. In a distributed setup where lots of independent computers are working, coordination is crucial for making sure everyone is on the same page, managing resources fairly, and keeping everything running smoothly. Let's break down the main parts of distributed coordination:
• Raft: This consensus algorithm makes it simpler for a group of computers to agree on shared decisions by breaking the agreement process down into smaller steps.
• Semaphore Locks: They let a limited number of computers use something together, but not too many at once (a small sketch follows).
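The semaphore idea in Python, here limiting access to three concurrent holders (the worker count and sleep are arbitrary):

import threading, time

slots = threading.Semaphore(3)  # at most three holders at once

def use_shared_resource(worker_id):
    with slots:  # blocks if three workers already hold the resource
        print(f"worker {worker_id} using the resource")
        time.sleep(0.1)

for i in range(6):
    threading.Thread(target=use_shared_resource, args=(i,)).start()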
Messaging protocols help computers talk to each other so they can share information and coordinate what they're doing. They make sure messages get where they need to go and that everything keeps working even if there are problems.
• MQTT: It's good for sending messages in situations where there might be slow or weak connections, like in Internet of Things devices (sketched below).
• AMQP: This protocol is strong and reliable, perfect for big business systems where
messages need to get through no matter what.
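A hedged MQTT publishing sketch using the third-party paho-mqtt package (1.x-style API; the broker address and topic are placeholders):

# Assumes paho-mqtt (pip install paho-mqtt) and a reachable broker.
import paho.mqtt.client as mqtt

client = mqtt.Client()                               # paho-mqtt 1.x constructor
client.connect("broker.example.com", 1883)           # hypothetical broker
client.publish("sensors/room1/temperature", "21.5")  # fire-and-forget message
client.disconnect()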
Fault Tolerance in Distributed Systems
Fault tolerance is super important in designing distributed systems because it helps keep the
system running even when things go wrong, like if a computer breaks or the network has
problems. Here are some main ways to handle faults in distributed systems:
• Redundancy: Keeping extra copies of important stuff like hardware, software, or data
so if something breaks, there's a backup ready to take over. This helps avoid downtime
and keeps the system running smoothly.
• Error Detection and Recovery: Having tools in place to spot when something goes
wrong and fix it before it causes big problems. This might involve checking if
everything's okay, diagnosing issues, and taking steps to get things back on track.
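A toy heartbeat-based detector in Python (the timeout and node names are invented):

import time

last_heartbeat = {"node-1": time.time(), "node-2": time.time() - 30}
TIMEOUT = 10  # seconds without a heartbeat before a node is suspected

def suspected_failures(now=None):
    now = now or time.time()
    # A node that has not sent a heartbeat recently is flagged for recovery.
    return [n for n, t in last_heartbeat.items() if now - t > TIMEOUT]

print(suspected_failures())  # ['node-2']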
Data Management in Distributed Systems
Managing data in distributed systems is very important. It means handling data across many
computers while making sure it's consistent, reliable, and can handle a lot of work. In these
systems, data is spread across different computers to make things faster, safer, and able to
handle more work. Now, let's look at the main ways we do this and the technologies we use.
• Sharding: Splitting a big dataset into smaller parts and spreading them across different computers. Each computer handles its own part, which helps speed things up and avoids overloading any single computer (a hash-based sketch follows this list).
• Replication: Making copies of data and storing them on different computers. This
ensures that even if one computer fails, there are backups available. It also helps data
get to where it's needed faster.
• Consistency Models: These are rules that decide how data changes are seen across
different computers.
• Distributed Databases: These are databases spread across many computers. They use
techniques like sharding and replication to make sure data is available, consistent, and
safe. Examples: Cassandra, MongoDB.
• Distributed File Systems: These are like big digital storage spaces spread across many
computers. They break data into chunks and spread them out for faster access and
backup. Examples: HDFS, Amazon S3.
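The hash-based sharding idea in Python (the shard count is arbitrary):

import hashlib

NUM_SHARDS = 4  # arbitrary number of database nodes

def shard_for(key):
    # A stable hash (unlike Python's built-in hash(), which is salted per
    # process) so every node maps a given key to the same shard.
    digest = hashlib.sha256(key.encode()).hexdigest()
    return int(digest, 16) % NUM_SHARDS

for user in ["alice", "bob", "carol"]:
    print(user, "-> shard", shard_for(user))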
Security in Distributed Systems
Security is important in distributed systems because they are complicated and spread out across
many computers. We need to keep sensitive data safe, make sure our messages are not tampered
with, and protect against hackers. Here are the main ways we do this:
• Encryption: This means making data unreadable to anyone who shouldn't see it. We
do this when data is moving between computers or when it's stored somewhere. It keeps
sensitive information safe even if someone tries to snoop.
• Authentication: This is about making sure that the people, devices, or services trying
to access the system are who they say they are. We use things like passwords, fingerprint
scans, or special codes to check their identity.
• Access Control: This is like having locked doors that only certain people can open. We
decide who can see or change things in the system and make sure nobody else can get
in where they shouldn't.
• Audit Logging: This means keeping a record of everything that happens in the system
so we can check if something bad has happened or if someone tried to break in. It's like
having security cameras everywhere.
• DDoS Mitigation: Sometimes bad actors try to overwhelm the system with too much
traffic to shut it down. We use special tools to filter out this bad traffic and keep the
system running smoothly.
Real-World Examples of Distributed Systems
1. Google's Infrastructure
Google's setup is a big example of how distributed systems can work on a large scale. They use
stuff like Google File System (GFS), Bigtable, and MapReduce to manage huge amounts of
data. This helps them offer services like search, cloud computing, and real-time analytics
without any hiccups.
• Google File System (GFS):
o GFS is a special way of organizing and handling big amounts of data across
many computers. It's made to work even if some of those computers stop
working.
o GFS copies the data in different places to keep it safe, and it makes sure we can
still get to the data even if something goes wrong with one of the computers.
• Bigtable:
o Bigtable is a special kind of storage system that can hold huge amounts of
organized data across many computers. It's great for storing lots of information
and quickly finding what you need.
o Bigtable is used in things like Google Search, Gmail, and Google Maps because
it's so good at handling massive amounts of data efficiently.
• MapReduce:
o MapReduce is a way of programming and handling big amounts of data spread
across many computers. It's like having lots of people working on different parts
of a big project at the same time.
o This helps to get things done faster and handle really huge amounts of data. It's
great for jobs like analyzing data or doing tasks in big batches.
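The MapReduce idea in miniature, in plain Python (a word count over fake document chunks; real MapReduce runs the map and reduce phases on many machines):

from collections import Counter
from functools import reduce

chunks = ["to be or not to be", "to thine own self be true"]  # fake input

def map_phase(chunk):
    # Map: each chunk is processed independently (in parallel on a cluster).
    return Counter(chunk.split())

def reduce_phase(left, right):
    # Reduce: partial counts from every mapper are merged into one result.
    return left + right

print(reduce(reduce_phase, map(map_phase, chunks)))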
2. Twitter
Twitter uses a bunch of fancy computer systems to handle all the people who use it and the
messages they send in real-time. They use things like Apache Mesos and Apache Aurora to
make sure everything works smoothly even when there are millions of tweets happening every
day. It's like having a really strong foundation to support a huge building - it keeps everything
running smoothly and reliably.
• Microservices Architecture:
o Twitter's setup is a puzzle where each piece does its own job. They've divided
their system into smaller parts, called microservices, and each one takes care of
a different thing, like sending tweets or handling notifications.
o By doing this, Twitter can adjust things easily when lots of people are using it,
making sure it runs smoothly no matter what.
• Apache Mesos:
o Mesos acts like a boss for a bunch of computers, helping them share and use their power better. It handles things like how much memory or space each computer has and makes sure everything runs smoothly.
o For Twitter, Mesos is super helpful because it helps them run lots of little
programs more efficiently, saving time and making things easier to manage.
• Apache Aurora:
o Aurora is a smart manager for computer systems. It helps organize and run different tasks and services on a bunch of machines.
o It's designed to make sure everything runs smoothly, even if something goes
wrong with one of the machines.
o With Aurora, Twitter can easily set up and manage its services, making sure
they're always available and working well.
The API for Internet Protocols
This programmer's reference describes an interface to the transport layer of the Basic Reference
Model of Open Systems Interconnection (OSI). Although the API is capable of interfacing to
proprietary protocols, the Internet open network protocols are the intended providers of the
transport service. This document uses the term "open" to emphasize that any system
conforming to one of these standards can communicate with any other system conforming to
the same standard, regardless of vendor. These protocols are contrasted with proprietary
protocols that generally support a closed community of systems supplied by a single vendor.
External Data Representation and Marshalling
The information stored in running programs is represented as data structures – for example, by
sets of interconnected objects – whereas the information in messages consists of sequences of
bytes. Irrespective of the form of communication used, the data structures must be flattened
(converted to a sequence of bytes) before transmission and rebuilt on arrival.
The individual primitive data items transmitted in messages can be data values of many
different types, and not all computers store primitive values such as integers in the same order.
The representation of floating-point numbers also differs between architectures. To support communication, any data type that can be passed as an argument or returned as a result must be able to be flattened, and the individual primitive data values must be represented in an agreed format.
External data representation – an agreed standard for the representation of data structures and primitive values.
Marshalling – the process of taking a collection of data items and assembling them into a form suitable for transmission in a message.
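A minimal marshalling sketch in Python's standard library, flattening a small (id, temperature) record (the field choice is invented) into network byte order and rebuilding it on arrival:

import struct

# Marshal: flatten the record into bytes using an agreed format string.
# "!" selects network byte order, so both ends interpret the values alike.
record = (42, 21.5)
message = struct.pack("!if", *record)   # int32 + float32

# Unmarshal: the receiver rebuilds the data items from the byte sequence.
node_id, temperature = struct.unpack("!if", message)
print(node_id, temperature)  # 42 21.5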
Client-Server Communication
Client and server communication takes place when both are connected to each other via a network. The client and the server are two individual computing systems, each having its own operating system, applications, and functions. When connected via a network, they are able to share their applications with each other.
It is not necessary that the client and server use the same operating system platform; many varied operating systems can be connected with each other for advanced communication using a communication protocol. The responsibility of implementing the communication protocol lies with an application known as communication software.
Using the features of communication software, the client and server can exchange files and data for effective communication. The process of communication between client and server can be explained as follows:
• The client establishes a connection to the server over the network.
• The client sends a request for a service or a resource.
• The server processes the request and prepares a response.
• The server sends the response back to the client, which then uses the result.