0% found this document useful (0 votes)

53 views52 pages

Exploring The Oracle Latches

This document discusses exploring Oracle latches using Solaris DTrace. It begins with background on the author and an introduction to Oracle performance improvements over time. It then discusses how DTrace can be used as a "stroboscopic light" to investigate Oracle latches in real time by counting latch spins, tracing waits, and measuring times and distributions. Key routines for acquiring and freeing latches are identified. Fields for instrumenting latch gets are also described.

Uploaded by

quispatdotanla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views52 pages

Exploring The Oracle Latches

Uploaded by

quispatdotanla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Andrey Nikolaev

RDTEX, Russia

Exploring Oracle RDBMS latches

(spinlocks)
using Solaris DTrace

MEDIAS - 2011
May 8-15
Who am I

• Andrey.Nikolaev@rdtex.ru

• http://andreynikolaev.wordpress.com

• Graduated from MIPT in 1987

• 1987-1996 at COMPAS group, IHEP, Protvino

• Currently at RDTEX, Oracle First Line Support company

• Specialize in Oracle performance tuning

• Over 20 years of Oracle related experience as a research

scientist, developer, DBA, performance consultant, trainer …
Introduction

for non-Oracle

auditory
Oracle RDBMS
performance improvements timeline:
v. 2 (1979): the first commercial SQL RDBMS
v. 3 (1983): the first database to support SMP
v. 4 (1984): read-consistency, Database Buffer Cache
v. 5 (1986): Client-Server, Clustering, Distributing Database, SGA
v. 6 (1988): procedural language (PL/SQL), undo/redo, latches
v. 7 (1992): Library Cache, Shared SQL, Stored procedures, 64bit
v. 8/8i (1999): Object types, Java, XML
v. 9i (2000): Dynamic SGA, Real Application Clusters
v. 10g (2003): Enterprise Grid Computing, Self-Tuning, mutexes
v. 11g (2008): Results Cache, SQL Plan Management, Exadata
v. 12c (2011): ?Cloud? Not yet released … to be continued
Oracle Database Architecture: Overview

Oracle SMON PMON RECO Others

instance: SGA: Locks Shared pool

Database Redo log Library cache
buffer cache buffer Mutexes
PGA Data dictionary cache
Server Latches Latches
process
Latches
DBWn CKPT LGWR ARCn

Control Online Archived

User Data files redo logs log files
files
process
Oracle Database Locks
Why Oracle needs Performance Tuning?

• More then 100 books on Amazon. Need for mainstream science support!
• Complex and variable workloads. Every database is unique.
• Complex internals. 344 "Standard" / 2665 "Hidden" tunable parameters.
• Complicated physical database and schema design decisions.
• Concurrency and Scalability issues.
• Insufficient developers education.
• "Database Independence" issues.
• Self-tuning anomalies. SQL plan instabilities.
• OS and Hardware issues.
• More than 10 million bug reports on MyOracleSupport.
Oracle is well instrumented software:

• Oracle Statistics. "What sessions have done?". 628 statistics in 11.2.0.2

• Oracle Wait Interface. "How Oracle sessions have waited?". 1142 Wait
events
• AWR/ASH/ADDM, Advisors, MyOracleSupport diagnostics and tuning
tools, …
• Visualization challenge. Oracle Enterprise Manager, Quest Spotlight,
Embarcadero DB Optimizer, private tools, etc…
• More than 2000 internal "dynamic performance" X$ tables:
• Needed for advanced diagnostics
• Lack of documentation
• Constantly changing.
Episode of latch contention:

Oracle instance hangs due to heavy "cache buffers chains" latch contention
The presentation goals:

The goals of this work are:

• Explore one of Oracle serialization mechanisms: latches
(spinlocks)
• Explore latch efficiency and possibilities of diagnostics and
performance tuning.
• Explore how to interpret latch related performance counters.
• Explore latch spinning and waiting policies.
• Explore influence of Oracle parameters and adjustment of the
number of spins for the latch before waiting
Review of serialization mechanisms in Oracle
• Latches are simple, low-level serialization mechanisms that coordinate
multiuser access to shared data structures, objects, and files. … Oracle®
Database Concepts 11.2

• Latch uses atomic hardware instructions for Immediate Get

• If missed, latch spins by polling location during Spin Get
• In spin get not succeed, latch sleeps for wait get.
• KGX Mutexes appeared in latest Oracle versions inside Library Cache only
Locks Latches Mutexes
Access Several Modes Types and Modes Operations
Acquisition FIFO SIRO (spin) + FIFO SIRO
SMP Atomicity No Yes Yes
Timescale > Milliseconds Microseconds SubMicroseconds
Life cycle Dynamic Static Dynamic
Classic spinlocks
• Wiki: "… spinlock … waits in a loop repeatedly checking until the lock
becomes available …"
• Introduced by Edsger Dijkstra in “Solution of a Problem in Concurrent
Programming Control” CACM. 1965
• Have been thoroughly investigated since that time. See "The Art of
Multiprocessor Programming", M. Herlihy and N. Shavit, Chapter 07
Spin Locks and Contention
• Many sophisticated spinlock realizations were proposed and evaluated
(TS, TTS, Delay, MCS, Anderson,...) for high bus utilization ~100%
• Two general types:
• System spinlock. Kernel OS threads cannot wait. Major metrics:
atomic operations frequency. Shared bus utilization.
• User spinlock. Oracle latch and mutex. Average lock holding time ~
10 musec. It is more efficient to poll a lock rather than pre-empt the
thread doing 1 msec context switch. Metrics: CPU and elapsed times.
Spinlock realizations
Spinlock: Pseudocode: Problems:

TS while(Test_and_Set(lock)); Bus saturation by atomic

operations
pre-11.2 mutex
TTS while(lock||Test_and_Set(lock)); Invalidation storms
(“open door”, “thundering
Oracle latch herds”).
Delay Adjustable delay after noticing Higher elapsed time
under contention
Mutex with patch that lock was released
6904068
Anderson, MCS, Queues. Widely used in Java, CPU and memory
etc. overhead, preemption
Linux kernel … not in Oracle
issues
Anderson (1990) system spinlocks tests:

T.E. Anderson, “The Performance of Spin-Lock Alternatives for Shared-Memory Multiprocessors,”

IEEE Trans. Parallel and Distributed Systems, Vol. 1, No. 1, Jan. 1990, pp. 6-16.
DTrace. Solaris 10 Dynamic Tracing framework:
• Event-driven, kernel-based instrumentation allows to see all OS activity
• Dynamically interpreted C-like language to customize profiling
• No application changes needed to use DTrace
• Define the probes (triggers) to trap and write the handlers (actions).
• A lot of probes in Solaris kernel and ability to instrument every user
instruction:
provider:module:function:name
pid1910:oracle:kslgetl:entry
• A provider is a methodology for instrumenting the system: pid, fbt,
syscall, sysinfo, vminfo …
• Action is D routine to execute when a probe is hit
• Predicates define criteria for actions.
DTrace as a stroboscopic light:

DTrace allows us to investigate how Oracle latches perform in real time:

• Count the latch spins
• Trace how the latch waits
• Measure times and distributions
• Compute additional latch statistics
DTrace reveals latch interface routines:
Oracle calls the following functions to acquire the latch:
• kslgetl(laddr, wait, why, where) - get exclusive latch
• kslgetsl (laddr,wait,why,where,mode) - get shared latch
• …
• kslfre(laddr) - free the latch
Oracle give us possibility to do the same by oradebug call
Function arguments meaning:
• laddres – address of latch in SGA
• wait – flag for no-wait or wait latch acquisition
• where – integer code for location from where the latch is acquired.
• why - integer context of why the latch is acquiring at this “where”.
• mode – requesting state for shared lathes. 8 – SHARED mode. 16 –
EXCLUSIVE mode
Latch is holding by process, not session:

Process fixed array: List of all latches:

v$process -> x$ksupr v$latch ->x$ksllt

Struct ksllt{
Struct ksupr {
…
…
}
Struct kslla{
ksllt *ksllalat[14];
}
…}

Each process has an array of references to the latches it is holding

Process latching info is the kslla structure embedded in the process state object
The latch get instrumentation:

X$KSUPR.KSLLA% fields instrument the latch get:

• ksllalaq – address of latch acquiring. Populated during immediate get
(and spin before 11g)
• ksllawat - latch being waited for. This is v$process.latchwait
• ksllawhy – “why” for the latch being waited for
• ksllawere – “where” for the latch being waited for
• ksllalow – bit array of levels of currently holding latches
• ksllaspn - latch this process is spinning on. v$process.latchspin. Not
populated since 8.1
• ksllaps% - inter-process post statistics
The latch structure – ksllt:

struct ksllt {
<Latch>

“where” and “why”

Level, latch#, class, other attributes
Statistics
Latch wait list header
…
Latch size by version:
x$ksmfsv – list of all fixed SGA variables:
SELECT DISTINCT ksmfssiz
FROM x$ksmfsv
WHERE ksmfstyp = 'ksllt';
*nix 32bit *nix 64bit Windows 32bit

7.3.4 92 - 120
8.0.6 104 - 104
8.1.7 104 144 104
9.0.1 ? 200 160
9.2.0 196 240 200
10.1.0 ? 256 208
10.2.0 - 11.2.0.2 100 160 104

Latch structure was bigger in 10.1 due to additional latch statistics

Oracle latch is not just a single memory
location:
 Before 11g. Value of first latch byte (word for shared latches) was
used to determine latch state:

0x00 – latch is free

0xFF – exclusive latch is busy. Was 0x01 in Oracle 7

0x01,0x02,… - shared latch holding by 1,2, … processes simultaneously

0x20000000 | pid - shared latch holding exclusively

 In 11g first latch word show the pid of the latch holder

0x00 – latch free

0x12 – Oracle process with pid 18 holds the exclusive latch

Latch attributes

Each latch have at least the following attributes in kslldt :

 Name Latch name as appeared in V$ views
 SHR. Is the latch Shared? Shared latch is “Read-Write” spinlock.
 PAR. Is the latch Solitary or Parent for the family of child latches?
 G2C. Can two child latches be simultaneously requested in wait mode
 LNG. Is wait posting used for this latch? Obsolete since Oracle 9.2.
 UFS. Is the latch Ultrafast? It will not increment miss statistics when
STATISTICS_LEVEL=BASIC. 10.2 and above
 Level. 0-14. To prevent deadlocks latches can be requested in only in
increasing level order
 Class. 0-7. Spin and wait class assigned to the latch. 9.2 and above.
Latches by Oracle version
Oracle Number of latches PAR G2C LNG UFS SHARED
version
7.3.4.0 53 14 2 3 - -
8.0.6.3 80 21 7 3 - 3
8.1.7.4 152 48 19 4 - 9
9.2.0.8 242 79 37 - - 19
10.2.0.2 385 114 55 - 4 47
10.2.0.3 388 117 58 - 4 48
10.2.0.4 394 117 59 - 4 50
11.1.0.6 496 145 67 - 6 81
11.1.0.7 502 145 67 - 6 83
11.2.0.1 535 149 70 - 6 86
Latch trees

“Rising level” rule leads to “trees” of processes waiting for and holding the
latches:
ospid: 28067 sid: 1677 pid: 61
holding: 3800729f0 'shared pool' (156) level=7 child=1 whr=1602 kghupr1
waiter: ospid: 129 sid: 72 pid: 45
holding: a154b7120 'library cache' (157) level=5 child=17 whr=1664 kglupc: child
waiter: ospid: 18255 sid: 65 pid: 930
waiter: ospid: 6690 sid: 554 pid: 1654
waiter: ospid: 4685 sid: 879 pid: 1034
…
waiter: ospid: 29749 sid: 180 pid: 155
holding: a154b7db8 'library cache' (157) level=5 child=4 whr=1664 kglupc: child
waiter: ospid: 13104 sid: 281 pid: 220
waiter: ospid: 24089 sid: 565 pid: 636
waiter: ospid: 25002 sid: 621 pid: 1481
waiter: ospid: 16930 sid: 1046 pid: 783

Direct SGA access program output for 9.2.0.6 instance with too small shared pool.
Waiting for the latch

S G A
Latch

Process Process Process A

holds a
B A latch
Process B waits
(spins and
sleeps)

CPU 1 CPU 2
Latch Acquisition in Wait Mode

Version from contemporary 11.2 documentation. Was really

used ten years ago in Oracle 7.3-8.1

Latch wait get (kslgetl(laddress,1,…)):

• One fast Immediate get, no spin
• Spin get: check the latch upto _SPIN_COUNT times
• Sleep on "latch free" event with exponential backoff
• Repeat
8i Latch get code flow using Dtrace

kslgetl(0x200058F8,1,2,3) - KSL GET exclusive Latch# 29

kslges(0x200058F8, ...) - wait get of exclusive latch
skgsltst(0x200058F8) ... call repeated 2000 times = SPIN_COUNT
pollsys(...,timeout=10 ms,...) - Sleep 1
skgsltst(0x200058F8) ... call repeated 2000 times
pollsys(...,timeout=10 ms,...) - Sleep 2
skgsltst(0x200058F8) ... call repeated 2000 times
pollsys(...,timeout=10 ms,...) - Sleep 3
skgsltst(0x200058F8) ... call repeated 2000 times
pollsys(...,timeout=30 ms,...) - Sleep 4 …

• … Event 10046 trace:

• WAIT #0: nam='latch free' ela= 0 p1=536893688 p2=29 p3=0
• WAIT #0: nam='latch free' ela= 0 p1=536893688 p2=29 p3=1
• WAIT #0: nam='latch free' ela= 0 p1=536893688 p2=29 p3=2
Exponential backoff was inefficient

• 0.01-0.01-0.01-0.03-0.03-0.07-0.07-0.15-0.23-0.39-0.39-
0.71-0.71-1.35-1.35-2.0-2.0-2.0-2.0...sec
[( N wait + 1) / 2 ]
• timeout = 2 −1
• Typical latch holding time is 10 musec!

• Most waits were for nothing – latch already was free

• Latch utilization could not be more 70%

• Lot of unnecessary spins – provokes CPU thrashing

9.2-11g exclusive latch get flow using Dtrace

Semop – infinite wait until posted!

kslgetl(0x50006318, 1)
-> sskgslgf(0x50006318)= 0 -immediate latch get
-> kslges(0x50006318, ...) -wait latch get
-> skgslsgts(...,0x50006318, ...) -spin latch get
->sskgslspin(0x50006318)
... - repeated 20000 cycles = 10*_SPIN_COUNT!
-> kskthbwt(0x0)
-> kslwlmod() - set up Wait List
-> sskgslgf(0x50006318)= 0 -immediate latch get
-> skgpwwait -sleep latch get
semop(11, {17,-1,0}, 1)
Contemporary latch spins and waits

• Hidden latch wait revolution. In Oracle 9.2-11.2, all the latches in

default class 0 rely on wait posting. Latch is sleeping without any
timeout.
• If wakeup post is lost in OS, waiters will sleep infinitely.
• Latches assigned to non-default class wait until timeout.
• By default process spin 20000 cycles. Latch is TTS spinlock
• The _SPIN_COUNT parameter (by default 2000) is effectively
static for exclusive latches.
• _LATCH_CLASS_0 initialization parameter determine exclusive
latch wait and spin.
Nonstandard class latches
• Latch can be assigned to one of eight classes having different spin and
wait policies. Standard class 0 latch use wait posting.
• _LATCH_CLASS_X = “Spin Yield Waittime Sleep0 Sleep1 … Sleep7"
• Nonstandard class latch loops upto “Spin” cycles, then yields CPU. This
is repeated “Yield” times. Then the process sleeps for “SleepX”
microseconds using pollsys() (not semtimedop()) system call.
• If “Yield” !=0 repeat “Yield” times:

Loop up to “Spins” cycles

Yield CPU using yield() (or sched_yield())
• Sleep for “SleepX” usecs
• Then spin again …
Shared latch acquisition

• Shared latch spin in Oracle 9.2-11g is governed by

_SPIN_COUNT value and can be dynamically tuned

• X mode shared latch get spins by default up to 4000 cycles.

• S mode does not spin at all (or spins in unknown way)

S mode get X mode get

Held in S mode Compatible 2*_SPIN_COUNT

Held in X mode 0 2*_SPIN_COUNT

Blocking mode 0 2*_SPIN_COUNT

Latch Release

• Free the latch – kslfre(laddr)

• Oracle process releases the latch nonatomically
• Then it sets up memory barrier – perform atomic operation on
address individual to each process.
• This requires less bus invalidation and ensures propagation of
latch release to other local caches.
• Not fair policy - spinners on the local CPU board have the
preference.
• Then process posts first process in the list of waiters
The latch contention
Raw latch statistic counters
Statistics: x$ksllt Comments:
GETS kslltwgt “++” after wait mode latch get
MISSES kslltwff “++” after wait get if it was missed
SLEEPS kslltwsl “+number_of_sleeps” during get
SPIN_GETS ksllthst0 “++” if get was missed but not slept
WAIT_TIME kslltwtt “+wait_time” after latch get
IMMEDIATE_GETS kslltngt “++” after nowait mode latch get. Is not
protected by latch
IMMEDIATE_MISSES kslltnfa “++” if nowait mode get was missed
Wait queue length L Sampling of x$ksupr.ksllawat
N of spinning processes
Ns Sampling of x$ksupr.ksllalaq
Differential (point in time) latch statistics
Latch requests arrival rate ∆ gets
λ =
∆ time
Immediate gets efficiency ∆ misses
ρ =
∆ gets
Latch sleeps ratio ∆ sleeps
κ =
∆ misses
Latch wait time per second ∆ wait _ time
W=
∆ time
Latch spin efficiency ∆ spin _ gets
σ =
∆ misses
Should be calculated for each child latch. V$LATCH averaging distorts statistics
Derived latch statistics
Latch utilization: (PASTA)
∆ latch _ holding _ time
ρ ≈U=
∆ time
Average holding time: ρ " Pct _ Get _ Miss"∗ " Snap _ Time"
S= =
λ 100*" Get _ Re quests"
Length of latch wait list:
L =W
Recurrent sleeps ratio: σ +κ −1
κ
Latch acquisition time:
Taq = λ − 1 ( N s + W )
Latch statistics vs direct measurement

Latch statistics for: Latch acquisition time distribution

0x380007358 "session allocation" measured by DTrace:
--------- Distribution --------
Requests rate: lambda= 1350 Hz
2048 |
Miss /get: rho= .022 4096 |@@@@@@
Sampled Utilization: U= .013 8192 |@@@@@@@@
Slps /Miss: kappa= .28 16384 |@@@@@@@@@@@@@@@@@@@@@@@
32768 |@@@
Wait_time/sec: W= .021
65536 |
Sampled queue length Lw= .017 ns
Spin_gets/miss: sigma= .72 Average acquisition time=21 usec
Sampled spinning procs:Ns= .013
Secondary sleeps ratio = .002
Avg holding time= 16.3 usec
sleeping time = 15.9 usec
acquisition time = 25.8 usec
Latch contention diagnostics in 9.2-11g

• Latch contention should be suspected if the latch wait events are

observed in “Top 5 Timed Events” AWR section
• Look for the latch with highest W
• Symptoms of contention for the latch:
• W > 0.1 sec/sec
• Utilization ρ > 10%
• Acquisition (or sleeping) time sufficiently greater then holding time
• Latchprofx.sql script invented by Tanel Poder greatly simplifies
diagnostics.
• Script and v$latch_misses reveal “where” the contention arise
• Contention for a high-level latch frequently exacerbates contention for
lower-level latches
Treating the latch contention:

• "Right" method: tune the application and reduce the latch demand. Tune
the SQL, bind variables, schema, etc… Many brilliant books exist on this
topic. Out of scope for this work.

• It may be too expensive and require complete application rewrite.

• Nowadays the CPU power is cheap. We may already have enough free
CPU resources. The spin count tuning may be beneficial.

• Processes spin for exclusive latch spin upto 20000 cycles, for shared
latch upto 4000 cycles and infinitely for mutex. Tuning may find more
optimal values for your application.

• Oracle does not explicitly forbid spin count tuning. However, change of
undocumented parameter should be discussed with Support.
Spin count adjustment

Shared latches:
• Spin count can be adjusted dynamically by _SPIN_COUNT parameter.
• Good starting point is the multiple of default 2000 value.
• Setting _SPIN_COUNT parameter in initialization file, should be
accompanied by _LATCH_CLASS_0="20000". Otherwise spin for
exclusive latches will be greatly affected by next instance restart.
Exclusive latches:
• Spin count adjustment by _LATCH_CLASS_0 parameter needs the
instance restart.
• Good starting point is the multiple of default 20000 value.
• It may be preferable to increase the number of "yields" for class 0 latches.
Tuning spin count efficiently

• First, the root cause of latch contention must be diagnosed.

• Spin count tuning will only be effective if the latch holding time S is
in its normal microseconds range

• The number of spinning processes should remain far less then the
number of CPUs. Analyze AWR and latch statistics before and after each
change.

• It is a common myth that CPU time will raise infinitely while we increase
spin count. Actually the process will spin up to "residual latch holding
time"

• Elapsed time to acquire the latch will decrease while the latch "holding
time" is less then OS "context switch time"
Latch spin CPU time
The spin probes latch holding time distribution. The spin time distribution is
discontinuous at _SPIN_COUNT: Ps
1

0.8

0.6

0.4

0.2

tђdelta
0.5 1 1.5 2

According to renewal theory distribution of time until the release is the

transformed latch holding time distribution:
1 1
pl (t ) = (1 − P (t )) = Q (t )
< t> < t>

Spin efficiency and

average spin time are:
Spin count tuning when spin efficiency is low
To estimate effect of spin count tuning, we can use the approximate scaling
rules depending on the value of:
σ = "spin efficiency"=“Spin gets/Miss”
If the spin is inefficient σ < <1 then spin probes the latch holding time
distribution around the origin:

If processes do not release latch immediately:

Therefore:
In this region doubling the spin count will double "spin
efficiency" and also double the CPU consumption
Spin count tuning when efficiency is high
In high efficiency region sleep cuts off the tail of latch holding time distribution:

Oracle normally operates in this region of small latch sleeps ratio κ = 1 − σ < 0.1
Here spin count is greater than number of instructions protected by latch
The spin time is bounded by the "residual latch holding time" and spin count:

Sleep prevents latch from waste CPU for spinning for heavy
tail of holding time distribution
Exponential tail spin scaling

• Experiments showed that normally latch holding time distribution has

exponential tail:
k

• Compare this to Guy Harrison experimental data

• If "sleep ratio" is small κ = 1 − σ < < 0.1 then:

Doubling the spin count will square the “sleep ratio” coefficient.
This will only add part of order κ to spin CPU consumption
Oracle DBA paraphrase: If "sleep ratio" for exclusive latch is 10% than
increase of spin count to 40000 may results in 10 times decrease of
"latch free" wait events, and only 10% increase of CPU consumption.
If the spin is already efficient, it is worth to increase the spin count.
Long distribution tails: CPU thrashing

• Latch contention can cause CPU starvation. Processes contending for a

latch, also contend for CPU.
• Once CPU starves, OS runqueue length raise and loadaverage exceeds
the number of CPUs. Some OS may shrink the time quantum. Latch
holders will not receive enough time to release the latch.
• Due to priority decay, latch acquirers may preempt latch holders. This
leads to priority inversion. The throughput falls.
• Transition to this stable state is more likely if workload of your system
approaches ~100% CPU
• Due to preemption, latch holding time S will raise to the CPU
scheduling scale.
• To prevent CPU thrashing use fixed priority OS scheduling classes.
Latch SMP scalability

• If latch utilization is ρ 1 in single CPU environment.

• Then in N CPU server latch utilization will be ρ N ≈ Nρ 1 . This can be
problematic:
• If single CPU system held latches only for 1% of time
• 48 CPU server with the same per-CPU load will hold latches for 50%
• 128 CPU Cores server will suffer huge latch (and mutex) contention

• This is also known as "Software lockout". It may substantially affect

contemporary multi-core servers.

• NUMA should overcome this intrinsic spinlock scalability restriction

Spinlock SMP scalability estimations
−1
 N  ρ  k
N! 
ρN = 1 −  ∑  1
 
 k = 0  1 − ρ 1  ( N − k )!
r r1=0.01
1 Responce time r1=0.01
14
0.8 12
10
0.6
8
0.4 6
4
0.2
2
Ncpu Ncpu
20 40 60 80 100 120 20 40 60 80 100

B. Sinharoy, et al. , Improving Software MP Efficiency for Shared Memory Systems.

Proc. of the 29th Annual Hawaii International Conference on System Sciences – 1996
Q/A?

• Questions?

• Comments?
Acknowledgements

• Thanks to Professor S.V. Klimenko for kindly inviting me to

MEDIAS 2011 conference

• Thanks to RDTEX CEO I.G. Kunitsky for financial support

• Thanks to RDTEX Technical Support Centre Director S.P.

Misiura for years of encouragement and support of my
investigations
Thank you!

Andrey Nikolaev

http://andreynikolaev.wordpress.com

Andrey.Nikolaev@rdtex.ru

RDTEX, Moscow, Russia

www.rdtex.ru

3 - Introduction To DRRMIS
No ratings yet
3 - Introduction To DRRMIS
9 pages
DSI405 Instance Tuning
No ratings yet
DSI405 Instance Tuning
444 pages
Transiting To A Student-Managed Maker Space
No ratings yet
Transiting To A Student-Managed Maker Space
9 pages
Latches and Mutexes in Oracle 12c
No ratings yet
Latches and Mutexes in Oracle 12c
80 pages
MFF
No ratings yet
MFF
402 pages
Akruti Software Details
No ratings yet
Akruti Software Details
2 pages
Oracle DB
No ratings yet
Oracle DB
65 pages
3D Secure
No ratings yet
3D Secure
31 pages
f1 v1 Lect 01
No ratings yet
f1 v1 Lect 01
35 pages
ACN Microrproject 1
No ratings yet
ACN Microrproject 1
19 pages
Mechanical Engineering Research Paper Topics List
No ratings yet
Mechanical Engineering Research Paper Topics List
8 pages
Smart Parking Management System
No ratings yet
Smart Parking Management System
8 pages
2 - Hardware
No ratings yet
2 - Hardware
29 pages
rdbtf05 Locking
No ratings yet
rdbtf05 Locking
24 pages
Application of Graph Theory in Communication Networks
No ratings yet
Application of Graph Theory in Communication Networks
5 pages
Oracle DB Keywords
No ratings yet
Oracle DB Keywords
21 pages
RAC 12c Optimization
No ratings yet
RAC 12c Optimization
65 pages
Dovado UMR Mobile Broadband Router - Manual PDF
No ratings yet
Dovado UMR Mobile Broadband Router - Manual PDF
42 pages
Qvproperties
No ratings yet
Qvproperties
6 pages
Docker Compose
No ratings yet
Docker Compose
3 pages
Exploring Latches
No ratings yet
Exploring Latches
14 pages
Exploring Mutexes
No ratings yet
Exploring Mutexes
12 pages
TRF Format
No ratings yet
TRF Format
13 pages
Atharvaved-I in Hindi by Sri Ram Sharma Acharya
No ratings yet
Atharvaved-I in Hindi by Sri Ram Sharma Acharya
470 pages
ILC Manual
No ratings yet
ILC Manual
110 pages
Lect 19
No ratings yet
Lect 19
43 pages
Linux Container Management with LXD: Definitive Reference for Developers and Engineers
From Everand
Linux Container Management with LXD: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Hydrogen in Box - Concept Note
No ratings yet
Hydrogen in Box - Concept Note
5 pages
Lock and Latch
No ratings yet
Lock and Latch
4 pages
Advanced Log Management and System Monitoring: Mastering the ELK Stack
From Everand
Advanced Log Management and System Monitoring: Mastering the ELK Stack
Adam Jones
No ratings yet
En Data Sheet 2227
No ratings yet
En Data Sheet 2227
3 pages
Understanding Locks and Enqueues
100% (1)
Understanding Locks and Enqueues
26 pages
Use of Leakage Currents of Insulators To Determine The Stage Characteristics of The Flashover Process and Contamination Level Prediction
No ratings yet
Use of Leakage Currents of Insulators To Determine The Stage Characteristics of The Flashover Process and Contamination Level Prediction
12 pages
Part II: Waits Events and The Geeks Who Love Them: Kyle Hailey
No ratings yet
Part II: Waits Events and The Geeks Who Love Them: Kyle Hailey
44 pages
Mastering the Art of Linux Kernel Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Art of Linux Kernel Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
09 Indexconcurrency
No ratings yet
09 Indexconcurrency
3 pages
Project 4 Student Book Third Edition : Download Now
No ratings yet
Project 4 Student Book Third Edition : Download Now
1 page
Part II: Waits Events and The Geeks Who Love Them: Kyle Hailey
No ratings yet
Part II: Waits Events and The Geeks Who Love Them: Kyle Hailey
41 pages
Tuning Ws 1b1 Locks
No ratings yet
Tuning Ws 1b1 Locks
38 pages
Wireshark Cookbook: Packet Analysis Bible
From Everand
Wireshark Cookbook: Packet Analysis Bible
Rob Botwright
No ratings yet
Internal Locks
No ratings yet
Internal Locks
4 pages
TQM & TM PDF
No ratings yet
TQM & TM PDF
14 pages
Taramps 26 Septiembre
No ratings yet
Taramps 26 Septiembre
10 pages
Linux Container Essentials with LXC: Definitive Reference for Developers and Engineers
From Everand
Linux Container Essentials with LXC: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Cybersecurity Interviews - 200 Must-Know Questions!
No ratings yet
Cybersecurity Interviews - 200 Must-Know Questions!
28 pages
AVH-200EX AVH-201EX: DVD Rds Av Receiver
No ratings yet
AVH-200EX AVH-201EX: DVD Rds Av Receiver
60 pages
Calculating OS CPU Util From Views
No ratings yet
Calculating OS CPU Util From Views
25 pages
Resolving Oracle Latch Contention: by Guy Harrison
No ratings yet
Resolving Oracle Latch Contention: by Guy Harrison
12 pages
Understanding Oracle Locking
No ratings yet
Understanding Oracle Locking
9 pages
Tuning Database Locks & Latches: Hamid R. Minoui
No ratings yet
Tuning Database Locks & Latches: Hamid R. Minoui
60 pages
Tuning Database Locks & Latches: Hamid R. Minoui
No ratings yet
Tuning Database Locks & Latches: Hamid R. Minoui
60 pages
SA Forum Extended Training Materials: Lock Service
100% (1)
SA Forum Extended Training Materials: Lock Service
32 pages
Or Acl Eser Verar Chi T Ect Ur E
No ratings yet
Or Acl Eser Verar Chi T Ect Ur E
11 pages
Session Level Yapp Handout PDF
No ratings yet
Session Level Yapp Handout PDF
27 pages
Oracle Architecture Diagram and Notes
No ratings yet
Oracle Architecture Diagram and Notes
9 pages
Service: Audi 100 1991
No ratings yet
Service: Audi 100 1991
256 pages
SRS Library Cache Locks Report: Service Response Guide
No ratings yet
SRS Library Cache Locks Report: Service Response Guide
8 pages
Puranmal Lahoti Government Polytechnic Latur: Name of The Students
No ratings yet
Puranmal Lahoti Government Polytechnic Latur: Name of The Students
11 pages
Cyber Security-updated-R01
No ratings yet
Cyber Security-updated-R01
3 pages
Systemd-nspawn in Practice: Definitive Reference for Developers and Engineers
From Everand
Systemd-nspawn in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Owcph2002 Engsig Statspack Paper
No ratings yet
Owcph2002 Engsig Statspack Paper
11 pages
Exploring Oracle Rdbms Latches Using Solaris Dtrace
No ratings yet
Exploring Oracle Rdbms Latches Using Solaris Dtrace
14 pages
Access To The Unknown Vehicle Into The Apartments Through The Automatic Password Code Generator
No ratings yet
Access To The Unknown Vehicle Into The Apartments Through The Automatic Password Code Generator
4 pages
Understanding Locks Semaphores Latches Mutex and Conditions
No ratings yet
Understanding Locks Semaphores Latches Mutex and Conditions
6 pages
Oracle Diagnostics: Hemant K Chitale
No ratings yet
Oracle Diagnostics: Hemant K Chitale
19 pages
Latch and Mutex Contention Troubleshooting in Oracle: Tanel Põder
No ratings yet
Latch and Mutex Contention Troubleshooting in Oracle: Tanel Põder
20 pages
Understanding Locking in Oracle
No ratings yet
Understanding Locking in Oracle
64 pages
You Probably Dont Need RAC
No ratings yet
You Probably Dont Need RAC
10 pages
Total Productive Maintenance (TPM)
No ratings yet
Total Productive Maintenance (TPM)
27 pages
Oracle Architecture Interview Questions
100% (2)
Oracle Architecture Interview Questions
11 pages
Performance Testing DB Performance
No ratings yet
Performance Testing DB Performance
19 pages
Fast Data Processing Systems with SMACK Stack
From Everand
Fast Data Processing Systems with SMACK Stack
Raúl Estrada
No ratings yet
Ict Assignment For G-12: Ethio Parent High SC H OOL
No ratings yet
Ict Assignment For G-12: Ethio Parent High SC H OOL
12 pages
DBA Interview Questions With Answers Part8
No ratings yet
DBA Interview Questions With Answers Part8
13 pages
Cache Fusion Oracle Rac
No ratings yet
Cache Fusion Oracle Rac
25 pages
Latch Lock and Mutex Contention Troubleshooting
100% (1)
Latch Lock and Mutex Contention Troubleshooting
20 pages
Mastering Proxmox - Second Edition
From Everand
Mastering Proxmox - Second Edition
Wasim Ahmed
No ratings yet
Oracle Detect and Resolve Deadlocks - Sqls
No ratings yet
Oracle Detect and Resolve Deadlocks - Sqls
15 pages
Oracle Latch and Mutex Contention Troubleshooting
No ratings yet
Oracle Latch and Mutex Contention Troubleshooting
20 pages
06 Buffer Cache
No ratings yet
06 Buffer Cache
85 pages
Summary Oracle
No ratings yet
Summary Oracle
39 pages
Systematic Oracle Tuning
No ratings yet
Systematic Oracle Tuning
29 pages
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
2009 06 02 Library-Cache-Lock
No ratings yet
2009 06 02 Library-Cache-Lock
9 pages
Advanced Research Techniques
No ratings yet
Advanced Research Techniques
35 pages
Oracle Database Internals FAQ
No ratings yet
Oracle Database Internals FAQ
9 pages
Wait Event Enhancements in Oracle 10g
No ratings yet
Wait Event Enhancements in Oracle 10g
32 pages
Rac Q&a
No ratings yet
Rac Q&a
51 pages
Questionaire
No ratings yet
Questionaire
106 pages
Brausse SIGNA 1050fi ENG
No ratings yet
Brausse SIGNA 1050fi ENG
12 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Exploring The Oracle Latches

Uploaded by

Exploring The Oracle Latches

Uploaded by

Andrey Nikolaev

Exploring Oracle RDBMS latches

• Graduated from MIPT in 1987

• 1987-1996 at COMPAS group, IHEP, Protvino

• Currently at RDTEX, Oracle First Line Support company

• Specialize in Oracle performance tuning

• Over 20 years of Oracle related experience as a research

Oracle SMON PMON RECO Others

instance: SGA: Locks Shared pool

Control Online Archived

• Oracle Statistics. "What sessions have done?". 628 statistics in 11.2.0.2

The goals of this work are:

• Latch uses atomic hardware instructions for Immediate Get

TS while(Test_and_Set(lock)); Bus saturation by atomic

T.E. Anderson, “The Performance of Spin-Lock Alternatives for Shared-Memory Multiprocessors,”

DTrace allows us to investigate how Oracle latches perform in real time:

Process fixed array: List of all latches:

Each process has an array of references to the latches it is holding

X$KSUPR.KSLLA% fields instrument the latch get:

“where” and “why”

Latch structure was bigger in 10.1 due to additional latch statistics

0x00 – latch is free

0xFF – exclusive latch is busy. Was 0x01 in Oracle 7

0x01,0x02,… - shared latch holding by 1,2, … processes simultaneously

0x20000000 | pid - shared latch holding exclusively

0x00 – latch free

0x12 – Oracle process with pid 18 holds the exclusive latch

Each latch have at least the following attributes in kslldt :

Process Process Process A

Version from contemporary 11.2 documentation. Was really

Latch wait get (kslgetl(laddress,1,…)):

kslgetl(0x200058F8,1,2,3) - KSL GET exclusive Latch# 29

• … Event 10046 trace:

• Most waits were for nothing – latch already was free

• Latch utilization could not be more 70%

• Lot of unnecessary spins – provokes CPU thrashing

Semop – infinite wait until posted!

• Hidden latch wait revolution. In Oracle 9.2-11.2, all the latches in

Loop up to “Spins” cycles

• Shared latch spin in Oracle 9.2-11g is governed by

• X mode shared latch get spins by default up to 4000 cycles.

• S mode does not spin at all (or spins in unknown way)

S mode get X mode get

Held in S mode Compatible 2*_SPIN_COUNT

Blocking mode 0 2*_SPIN_COUNT

• Free the latch – kslfre(laddr)

Latch statistics for: Latch acquisition time distribution

• Latch contention should be suspected if the latch wait events are

• It may be too expensive and require complete application rewrite.

• First, the root cause of latch contention must be diagnosed.

According to renewal theory distribution of time until the release is the

Spin efficiency and

If processes do not release latch immediately:

• Experiments showed that normally latch holding time distribution has

• Compare this to Guy Harrison experimental data

• If "sleep ratio" is small κ = 1 − σ < < 0.1 then:

• Latch contention can cause CPU starvation. Processes contending for a

• If latch utilization is ρ 1 in single CPU environment.

• This is also known as "Software lockout". It may substantially affect

• NUMA should overcome this intrinsic spinlock scalability restriction

B. Sinharoy, et al. , Improving Software MP Efficiency for Shared Memory Systems.

• Thanks to Professor S.V. Klimenko for kindly inviting me to

• Thanks to RDTEX CEO I.G. Kunitsky for financial support

• Thanks to RDTEX Technical Support Centre Director S.P.

RDTEX, Moscow, Russia

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.