asyncio: slight optimizations for run_until_complete and sleep_ms #17699

Open · wants to merge 1 commit into master

Conversation

greezybacon (Contributor) commented Jul 17, 2025

Summary

This is aimed at improving the loop timing of the asyncio core loop. It makes a few small optimizations to the core and yields roughly a 20% improvement in overall performance.

  • In the IO poll method, the POLLIN and POLLOUT constants are looked up in the local module scope rather than in the select module each time they are used.
  • In sleep_ms, max is no longer called on every invocation; a conditional expression instead handles the case where t is zero or negative.
  • In run_until_complete, a call to max is avoided in the same way.
  • In run_until_complete, the methods of the task and IO queues are looked up only once (see the sketch after this list).
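
These are standard MicroPython micro-optimization patterns. Below is a minimal, hypothetical sketch of the sleep_ms and method-lookup changes (it is not the PR's diff; `schedule_at`, `drain`, and the queue arguments are made up for illustration, though `peek` and `wait_io_event` echo names used by the asyncio internals):

```python
from time import ticks_ms, ticks_add

# Avoid max() on the hot path with a conditional expression (illustrative).
def schedule_at(t):
    now = ticks_ms()
    # Equivalent to ticks_add(now, max(0, t)) without the call to max().
    return ticks_add(now, t) if t > 0 else now

# Hoist bound-method lookups out of the loop so each iteration avoids
# repeated attribute lookups on the queue objects (illustrative).
def drain(task_queue, io_queue, dt=0):
    peek = task_queue.peek          # looked up once here...
    wait = io_queue.wait_io_event   # ...instead of once per iteration
    while peek() is not None:
        wait(dt)
```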

Testing

I ran two tests on three platforms; source code is given below. The tight-loop test just runs a single task as quickly as possible. The io-poll test uses a ThreadSafeFlag to run two tasks as quickly as possible, which requires IO polling between the tasks.

| test | platform | base (v1.25.0) | PR | change |
| --- | --- | --- | --- | --- |
| tight-loop | unix (Ubuntu 22.04 on Mac M2) | 1.45us | 1.05us | -28% |
| tight-loop | mimxrt (Teensy 4.1) | 49us | 32us | -34% |
| tight-loop | rp2 (W5100S EVB PICO @ 125MHz) | 621us | 476us | -23.3% |
| io-poll | unix | 2724us | 2724us | (none) |
| io-poll | mimxrt | 252us | 199us | -21% |
| io-poll | rp2 | 2107us | 1713us | -18.7% |

tight-loop test

import asyncio

async def count():
    global counter
    while True:
        await asyncio.sleep_ms(0)
        counter += 1

try:
    counter = 0
    # Spin the loop for 2 seconds; wait_for's TimeoutError lands in finally.
    asyncio.run(asyncio.wait_for(count(), timeout=2))
finally:
    # Report iterations and average microseconds per loop (2e6 us / count).
    print(counter, 2e6 / counter)

io-poll test

import asyncio

flag = asyncio.ThreadSafeFlag()

async def sender():
    while True:
        # Setting the flag wakes recv() via the IO queue, forcing a poll pass.
        flag.set()
        await asyncio.sleep_ms(0)

async def recv():
    global counter
    while True:
        await flag.wait()
        counter += 1

counter = 0
try:
    asyncio.create_task(sender())
    asyncio.run(asyncio.wait_for(recv(), timeout=2))
finally:
    if counter:
        # Report iterations and average microseconds per round trip.
        print(counter, 2e6 / counter)

Commit message:

Calculate ~POLLIN and ~POLLOUT as constants to remove the runtime cost of continuously recalculating them, and unpack the queue entry rather than using repeated item lookups.

Additionally, avoid the call to max() in sleep_ms. The wait time specified will generally not be negative, so the call to `max` is usually unnecessary. Instead, the code either calls `ticks_add` if `t` is positive or else uses the current ticks time.
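
As a rough sketch of the first part of that message (hedged; `_CLEAR_IN`, `_CLEAR_OUT`, and `_dispatch` are made-up names, and the entry layout is assumed), the complemented masks become module-level constants and the queue entry is unpacked once instead of being indexed repeatedly:

```python
import select

# Computed once at import time, rather than evaluating ~select.POLLIN /
# ~select.POLLOUT every time an event is handled.
_CLEAR_IN = ~select.POLLIN
_CLEAR_OUT = ~select.POLLOUT

def _dispatch(entry, ev):
    # Unpack the entry once instead of repeated entry[0] / entry[1] lookups.
    reader, writer = entry
    if ev & select.POLLIN and reader is not None:
        ev &= _CLEAR_IN   # clear the read-interest bit with the constant mask
    if ev & select.POLLOUT and writer is not None:
        ev &= _CLEAR_OUT  # clear the write-interest bit
    return ev
```
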
greezybacon changed the title from "asyncio: Make slight optimizations for IOQueue.wait_io_event" to "asyncio: slight optimizations for run_until_complete and sleep_ms" on Jul 17, 2025.

Code size report:

   bare-arm:    +0 +0.000% 
minimal x86:    +0 +0.000% 
   unix x64:   +80 +0.009% standard
      stm32:   +36 +0.009% PYBV10
     mimxrt:   +32 +0.009% TEENSY40
        rp2:   +32 +0.003% RPI_PICO_W
       samd:   +40 +0.015% ADAFRUIT_ITSYBITSY_M4_EXPRESS
  qemu rv32:    +0 +0.000% VIRT_RV32


codecov bot commented Jul 17, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.44%. Comparing base (f498a16) to head (b4a3017).
Report is 387 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #17699      +/-   ##
==========================================
- Coverage   98.54%   98.44%   -0.10%     
==========================================
  Files         169      171       +2     
  Lines       21890    22208     +318     
==========================================
+ Hits        21571    21863     +292     
- Misses        319      345      +26     


@@ -54,7 +55,8 @@ def __next__(self):
 # Use a SingletonGenerator to do it without allocating on the heap
 def sleep_ms(t, sgen=SingletonGenerator()):
     assert sgen.state is None
-    sgen.state = ticks_add(ticks(), max(0, t))
+    now = ticks()
+    sgen.state = ticks_add(now, t) if t > 0 else now
Member commented:
Does this give a measurable speed improvement? Is it worth it for the cost in code size?

I measure this change here as +5 bytes to the bytecode. The most-taken path will be when ticks_add() needs to be called, which goes from 12 opcodes previously to 16 opcodes now. It's usually the opcode overhead that's slow, rather than the actual call (e.g. out to max, which should be quick with two small int args). So I would guess that this change actually makes things a little slower.
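
One way to sanity-check that guess is a small timing loop on the target (a sketch, assuming MicroPython's time.ticks_us is available; the iteration count and the two function bodies are only illustrative stand-ins for the old and new forms):

```python
from time import ticks_us, ticks_diff, ticks_add, ticks_ms

N = 100000

def bench(label, fn):
    t0 = ticks_us()
    for _ in range(N):
        fn(5)
    print(label, ticks_diff(ticks_us(), t0) / N, "us/call")

def with_max(t):
    # Current form: always pays the call to max().
    return ticks_add(ticks_ms(), max(0, t))

def with_conditional(t):
    # Proposed form: more opcodes, but no call out to max().
    now = ticks_ms()
    return ticks_add(now, t) if t > 0 else now

bench("max", with_max)
bench("conditional", with_conditional)
```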

-dt = max(0, ticks_diff(t.ph_key, ticks()))
+dt = ticks_diff(t.ph_key, ticks())
+if dt < 0:
+    dt = 0
Member commented:
As above, does this change here actually make things faster?
