
CSE211 Computer Architecture
Modules 14 to 21
Multi-threading
Multithreading allows multiple threads to share a processor, enhancing
parallelism and resource utilization. It can be categorized into
fine-grain multithreading, which switches between threads every cycle,
and coarse-grain multithreading, which switches only on long stalls
such as cache misses.
Simultaneous multithreading (SMT) enables issuing instructions from
different threads into various functional units at the same time,
maximizing the use of processor resources.
SMT is a hardware technique that allows multiple threads to share the
execution resources of a single processor core. This is achieved by
interleaving the instruction execution of different threads.
By allowing multiple threads to share the execution resources, SMT can
increase the utilization of the processor and improve overall
performance.
While increasing the number of threads in SMT can enhance parallelism,
it is crucial to balance the number of threads with the architecture's
ability to manage resources effectively.
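Below is a minimal sketch, assuming a POSIX environment, of the kind of software threads whose instructions an SMT core can interleave in hardware; the thread count and workload are illustrative choices, not part of the lecture material.

```c
/* Minimal pthreads sketch: four software threads doing independent
 * work. On an SMT core, the hardware can issue instructions from
 * several runnable threads like these to keep functional units busy. */
#include <pthread.h>
#include <stdio.h>

#define NUM_THREADS 4

static void *worker(void *arg) {
    long id = (long)arg;
    long sum = 0;
    for (long i = 0; i < 1000000; i++)   /* independent compute work */
        sum += i * id;
    printf("thread %ld done (sum=%ld)\n", id, sum);
    return NULL;
}

int main(void) {
    pthread_t threads[NUM_THREADS];
    for (long i = 0; i < NUM_THREADS; i++)
        pthread_create(&threads[i], NULL, worker, (void *)i);
    for (int i = 0; i < NUM_THREADS; i++)
        pthread_join(threads[i], NULL);
    return 0;
}
```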
Parallelism vs Synchronization
Parallel programming allows multiple programs or threads to run
simultaneously, which is essential for improving performance in
modern computer architectures.
Synchronization is crucial for coordinating communication
between concurrent processes, ensuring that shared resources
are accessed safely.
The producer-consumer model illustrates how one entity
produces data while another consumes it, highlighting the need
for effective communication and resource management.
Mutual exclusion ensures that only one processor accesses a
shared resource at a time, preventing conflicts and ensuring data
integrity. To implement it, we use strategies such as:
Exclusive Access
Lock Mechanisms
Avoiding Race Conditions
Synchronization
Producer-consumer problem
In a producer-consumer scenario, a producer generates
values while consumers read and process those values.
When there are two consumers, issues can arise if they
access shared data simultaneously.
Sequential consistency ensures that operations appear to
occur in a specific order, preventing reordering of reads
and writes, which is beneficial for maintaining data
integrity.
Producer:
Generates a data item.
Adds the item to the buffer.
If the buffer is full, the producer may be blocked until space becomes available.

Consumer:
Checks if the buffer is empty.
If the buffer is not empty, removes an item from the buffer and processes it.
If the buffer is empty, the consumer may be blocked until a new item is added.
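A minimal sketch of this bounded-buffer pattern using a POSIX mutex and condition variables; the buffer size and int item type are illustrative assumptions, not part of the problem statement.

```c
/* Bounded-buffer producer-consumer: the producer blocks when the
 * buffer is full, the consumer blocks when it is empty. */
#include <pthread.h>

#define BUF_SIZE 8

static int buffer[BUF_SIZE];
static int count = 0, in = 0, out = 0;
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t not_full  = PTHREAD_COND_INITIALIZER;
static pthread_cond_t not_empty = PTHREAD_COND_INITIALIZER;

void produce(int item) {
    pthread_mutex_lock(&lock);
    while (count == BUF_SIZE)            /* buffer full: producer blocks */
        pthread_cond_wait(&not_full, &lock);
    buffer[in] = item;
    in = (in + 1) % BUF_SIZE;
    count++;
    pthread_cond_signal(&not_empty);     /* wake a waiting consumer */
    pthread_mutex_unlock(&lock);
}

int consume(void) {
    pthread_mutex_lock(&lock);
    while (count == 0)                   /* buffer empty: consumer blocks */
        pthread_cond_wait(&not_empty, &lock);
    int item = buffer[out];
    out = (out + 1) % BUF_SIZE;
    count--;
    pthread_cond_signal(&not_full);      /* wake a waiting producer */
    pthread_mutex_unlock(&lock);
    return item;
}
```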
Mutual exclusion
Understanding Mutual Exclusion
Mutual exclusion is essential for preventing multiple processes from
accessing shared resources simultaneously, which can lead to
inconsistencies.
Atomic operations are crucial for implementing mutual exclusion,
allowing operations to be completed without interruption from other
processes.
Atomic Operations and Their Implementation
The test-and-set operation is a fundamental atomic operation that
reads a memory location and sets it in a single indivisible step,
ensuring that no other operation interferes during this process.
More advanced atomic operations, such as compare-and-swap, enhance
functionality by allowing conditional updates based on the current value
in memory.
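A sketch of both primitives using C11 atomics; the spinlock and counter names are illustrative.

```c
/* Test-and-set spinlock and compare-and-swap update with C11 atomics. */
#include <stdatomic.h>

static atomic_flag lock_flag = ATOMIC_FLAG_INIT;

void spin_lock(void) {
    /* atomic_flag_test_and_set reads and sets the flag in one
     * indivisible step; spin until the previous value was clear. */
    while (atomic_flag_test_and_set(&lock_flag))
        ;  /* busy-wait */
}

void spin_unlock(void) {
    atomic_flag_clear(&lock_flag);
}

/* Compare-and-swap: update `counter` only if it still holds the
 * value we last read, retrying otherwise. */
void atomic_increment(atomic_int *counter) {
    int old = atomic_load(counter);
    while (!atomic_compare_exchange_weak(counter, &old, old + 1))
        ;  /* on failure, `old` is refreshed with the current value */
}
```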
Sequential consistency
Sequential Consistency ensures that the execution sequence of
instructions from all processors appears as a valid interleaving of
their individual instruction orders.
It is a strong model that guarantees that all processors see the
same order of operations, which is not typically implemented in
modern computers due to performance constraints.
Examples of Valid and Invalid Orders
Valid sequentially consistent orders can include various
interleavings, such as executing instructions from different
processors in a way that respects their individual order.
An invalid order occurs when the relative order of operations from
a single processor is violated, leading to inconsistencies in the
observed results.
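The classic store-buffering litmus test makes this concrete. The sketch below uses plain C as pseudocode for two processors' instruction streams; it is an illustration of the model, not portable synchronization code.

```c
/* Store-buffering litmus test. Under sequential consistency the
 * outcome r1 == 0 && r2 == 0 is impossible, because any valid
 * interleaving that respects each thread's program order must run
 * at least one store before both loads. Weaker hardware models can
 * produce it. */
int x = 0, y = 0;
int r1, r2;

void thread_1(void) {   /* runs on processor 1 */
    x = 1;
    r1 = y;
}

void thread_2(void) {   /* runs on processor 2 */
    y = 1;
    r2 = x;
}
```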
Issues in Sequential Consistency
Performance Overhead
Hardware Complexity
Programming Complexity
Practical Limitations
Distributed Systems
True sequential consistency is challenging to achieve,
especially with caches, as data visibility between
processors becomes a concern.
Race conditions and sequential consistency
Definition of Race Conditions: A race condition occurs when two or more threads or
processes access shared data and try to change it at the same time. The final outcome
depends on the timing of their execution, which can lead to unpredictable results.
Role of Sequential Consistency: Sequential consistency provides a model that ensures
all memory operations appear to occur in a specific order. This means that if a program
adheres to sequential consistency, the operations from different threads will be
interleaved in a way that respects the order of operations from each individual thread.
Prevention of Race Conditions: By enforcing a sequentially consistent memory model,
the likelihood of race conditions is reduced. Since all threads see the same order of
operations, it becomes easier to reason about the state of shared data and avoid
conflicts.
Simplified Reasoning: With sequential consistency, programmers can assume that
operations will execute in a predictable manner, making it easier to identify potential
race conditions and implement appropriate synchronization mechanisms.
Weak Models and Race Conditions: In contrast, weaker memory models may allow for
out-of-order execution and different visibility of operations, increasing the risk of race
conditions. Programmers must be more cautious and implement additional
synchronization to ensure correctness.
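A minimal race-condition sketch, assuming a POSIX environment; the counter and iteration counts are illustrative.

```c
/* Two threads increment a shared counter with no synchronization.
 * `counter++` is a load, an add, and a store; interleavings can lose
 * updates, so the final value is often less than 2000000. */
#include <pthread.h>
#include <stdio.h>

static long counter = 0;   /* shared, unprotected */

static void *work(void *arg) {
    (void)arg;
    for (int i = 0; i < 1000000; i++)
        counter++;         /* not atomic: read-modify-write */
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, work, NULL);
    pthread_create(&t2, NULL, work, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("counter = %ld (expected 2000000)\n", counter);
    return 0;
}
```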
Locks
Locks (mutexes) allow mutual exclusion,
ensuring that only one process can execute a
critical section of code at any given time.
Mutual Exclusion: Locks ensure that only one
thread can access a critical section of code at
a time. This prevents race conditions where
multiple threads might try to read or write
shared data simultaneously.
Synchronization: By locking a resource, a
thread can safely perform operations without
interference from other threads, ensuring
data integrity.
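A sketch of lock-based mutual exclusion with a POSIX mutex; the shared counter is an illustrative stand-in for any critical section.

```c
/* Only one thread at a time can hold `counter_lock`, so the
 * increment in the critical section cannot be interleaved. */
#include <pthread.h>

static long shared_counter = 0;
static pthread_mutex_t counter_lock = PTHREAD_MUTEX_INITIALIZER;

void increment(void) {
    pthread_mutex_lock(&counter_lock);   /* enter critical section */
    shared_counter++;                    /* safe: no other thread here */
    pthread_mutex_unlock(&counter_lock); /* leave critical section */
}
```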
Semaphores provide a more flexible approach, allowing a specified
number of processes to enter a critical section concurrently, which
is useful in scenarios with multiple resources.

Semaphores
Controlled Access: Semaphores allow a
specified number of threads to access a
resource concurrently.
Flexibility: Unlike locks, which only allow
one thread at a time, semaphores can be
configured to permit a certain number of
threads (N) to enter a critical section.
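A sketch using a POSIX counting semaphore; the value of N is illustrative.

```c
/* A counting semaphore initialized to N lets up to N threads hold
 * the resource at once; further threads block in acquire(). */
#include <semaphore.h>

#define N 4
static sem_t resource_sem;

void init(void)    { sem_init(&resource_sem, 0, N); }
void acquire(void) { sem_wait(&resource_sem); }  /* blocks when N in use */
void release(void) { sem_post(&resource_sem); }
```

The third argument to sem_init is the initial count N, which is exactly what distinguishes a counting semaphore from a binary lock.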
Memory fences and models
Memory Fences and Their Importance
Memory fences (or barriers) are introduced to ensure that
certain memory operations are completed before others
begin, helping to maintain order and consistency.
Different types of memory fences exist, such as load fences and
store fences, which provide varying levels of control over the
ordering of memory operations.
Weak Memory Models
Most modern processors implement weaker memory models
rather than strict sequential consistency, allowing for
performance optimizations through reordering.
Examples of memory ordering models include total store
ordering, partial store ordering, and weak ordering, each with
specific rules about how loads and stores can be reordered.
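A sketch of a release/acquire handoff with explicit C11 fences on a weakly ordered machine; the flag-and-data pattern is an illustrative assumption.

```c
/* Fences order the data store before the flag store, and the flag
 * load before the data load, so the subscriber never reads stale data. */
#include <stdatomic.h>

static int data;
static atomic_int ready = 0;

void publisher(void) {
    data = 42;
    /* Release fence: the store to `data` completes before the
     * following store to `ready` becomes visible. */
    atomic_thread_fence(memory_order_release);
    atomic_store_explicit(&ready, 1, memory_order_relaxed);
}

void subscriber(void) {
    while (atomic_load_explicit(&ready, memory_order_relaxed) == 0)
        ;  /* spin until published */
    /* Acquire fence: later loads cannot be reordered before the
     * load of `ready`, so `data` is guaranteed to be 42. */
    atomic_thread_fence(memory_order_acquire);
    int value = data;
    (void)value;
}
```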
Memory Bus
The memory bus is a type of computer bus, usually in the form of a
set of wires or conductors, which connects electrical components and
allows transfers of data and addresses from main memory to the
central processing unit (CPU) or a memory controller.
Bus-based multiprocessor
A bus-based multiprocessor system is a type of
parallel computing architecture where
multiple processors share a common bus to
communicate with each other and access
shared memory.
Key Components of a Bus-Based
Multiprocessor:
Processors: Multiple processors, each with
its own registers and local cache.
Shared Memory: A common memory area accessible to all processors.
Bus: A communication channel that connects the processors and the
shared memory.
Cache Memory: High-speed memory that stores frequently accessed data
for each processor.
Message passing
Shared Memory Architecture
In shared memory systems, one core can write data to a
memory address, and another core can read from that
address without needing to know which core will read it in
the future.
This model allows for implicit communication, but it often
requires locking mechanisms to ensure data consistency
between writes and reads.
Explicit Message Passing
Explicit message passing requires a sender to specify a
destination when sending data, using an API that includes
send and receive functions.
The receive function can be designed to accept data from any
source or a specific source, allowing for more controlled
communication.
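A sketch of explicit message passing in MPI; the ranks and tag are illustrative. As the section notes, the receiver can also pass MPI_ANY_SOURCE to accept a message from any sender.

```c
/* Rank 0 sends one int to rank 1: the sender names an explicit
 * destination, and the receiver names a specific source. */
#include <mpi.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int value;
    if (rank == 0) {
        value = 42;
        /* Send to destination rank 1, tag 0. */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        /* Receive from specific source rank 0, tag 0. */
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}
```

Under an MPI implementation this would be built and launched with the usual wrappers, e.g. mpicc followed by mpirun with two processes.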
Memory in Multiprocessor Systems
Multi-core bus systems
By placing two or more processor cores on the same device, the system
can use shared components, such as common internal buses and processor
caches, more efficiently.
Shared memory vs message passing
Shared Memory
Communication Method: Implicit communication through loads and stores to
shared memory addresses.
Destination Knowledge: The sender does not need to know which core or process
will read the data.
Synchronization: Requires explicit synchronization mechanisms (like locks and flags)
to prevent race conditions.
Memory Access: Memory is shared among all cores or processes, allowing for easy
access to shared data structures.
Explicit Message Passing
Communication Method: Explicit communication using send and receive functions.
Destination Knowledge: The sender must specify the destination when sending
data.
Synchronization: Synchronization is built into the messaging model, as sending and
receiving messages inherently creates a producer-consumer relationship.
Memory Access: Memory is typically private to each process or core, meaning data
must be sent explicitly between them.
