Skip to content

GH-91048: Add utils for printing the call stack for asyncio tasks #133284

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 33 commits into from
May 4, 2025
Merged
Show file tree
Hide file tree
Changes from 21 commits
Commits
Show all changes
33 commits
Select commit Hold shift + click to select a range
b11469a
GH-91048: Add utils for printing the call stack for asyncio tasks
pablogsal May 1, 2025
7f800e8
Maybe
pablogsal May 2, 2025
c5e4efe
Maybe
pablogsal May 2, 2025
1c982b1
Maybe
pablogsal May 2, 2025
2d94cde
fix configure
pablogsal May 2, 2025
0a9a496
fix configure
pablogsal May 2, 2025
db47ff3
fix configure
mgmacias95 May 2, 2025
6f8bd4c
some tests + fixes
mgmacias95 May 2, 2025
152b3d7
improve tests
mgmacias95 May 2, 2025
955ef27
dsf
pablogsal May 2, 2025
65aee3c
dsf
pablogsal May 2, 2025
51e689e
test fixes
pablogsal May 3, 2025
1d27348
test fixes
pablogsal May 3, 2025
1d1b0e9
test fixes
pablogsal May 3, 2025
edad4d1
test fixes
pablogsal May 3, 2025
199589c
Fix free threading offsets
pablogsal May 3, 2025
9e87032
Fix free threading offsets AGAIN
pablogsal May 3, 2025
69e9221
Debugging
pablogsal May 3, 2025
b6cb609
More tests
pablogsal May 3, 2025
2dd3452
Add news entry
pablogsal May 3, 2025
a84a171
Doc fixes
pablogsal May 3, 2025
0f75edc
Fix doc build
ambv May 3, 2025
c3a6bcb
Add Yury
ambv May 3, 2025
5e1cb87
fix: Show independent tasks in the table
mgmacias95 May 3, 2025
d92b520
Merge pull request #101 from mgmacias95/GH-91048-tasks
pablogsal May 3, 2025
af6a8bf
Temporarily skip test_async_global_awaited_by on free-threading
ambv May 3, 2025
8db5dbe
Drop the `tools`. It's cleaner.
ambv May 3, 2025
6f8aa6b
Satisfy the linting gods
ambv May 3, 2025
8d566c6
chore: Refactor
mgmacias95 May 4, 2025
977c15a
Merge pull request #103 from mgmacias95/GH-91048-tasks
pablogsal May 4, 2025
9dbe00d
Doc fixes
pablogsal May 4, 2025
c56782b
Type fixes
pablogsal May 4, 2025
293337f
Type fixes
pablogsal May 4, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
91 changes: 91 additions & 0 deletions Doc/whatsnew/3.14.rst
Original file line number Diff line number Diff line change
Expand Up @@ -508,6 +508,97 @@
.. seealso::
:pep:`741`.

.. _whatsnew314-asyncio-introspection

Check warning on line 511 in Doc/whatsnew/3.14.rst

View workflow job for this annotation

GitHub Actions / Docs / Docs

malformed hyperlink target. [docutils]

Asyncio introspection capabilities
----------------------------------

Added a new command-line interface to inspect running Python processes using
asynchronous tasks, available via:

.. code-block:: bash

python -m asyncio.tools [--tree] PID

This tool inspects the given process ID (PID) and displays information about
currently running asyncio tasks. By default, it outputs a task table: a flat
listing of all tasks, their names, their coroutine stacks, and which tasks are
awaiting them.

With the ``--tree`` option, it instead renders a visual async call tree,
showing coroutine relationships in a hierarchical format. This command is
particularly useful for debugging long-running or stuck asynchronous programs.
It can help developers quickly identify where a program is blocked, what tasks
are pending, and how coroutines are chained together.

For example given this code:

.. code-block:: python

import asyncio

async def sleeper(name, delay):
await asyncio.sleep(delay)
print(f"{name} is done sleeping.")

async def inner(name):
await asyncio.sleep(0.1)
await sleeper(name, 1)

async def task_group(name):
await asyncio.gather(
inner(f"{name}-1"),
inner(f"{name}-2"),
)

async def main():
# Start two separate task groups
t1 = asyncio.create_task(task_group("groupA"))
t2 = asyncio.create_task(task_group("groupB"))
await t1
await t2

if __name__ == "__main__":
asyncio.run(main())

Executing the new tool on the running process will yield a table like this:

.. code-block:: bash

python -m asyncio.tools 12345

tid task id task name coroutine chain awaiter name awaiter id
---------------------------------------------------------------------------------------------------------------------------------------
6826911 0x200013c0220 Task-2 main Task-1 0x200013b0020
6826911 0x200013c0620 Task-4 task_group Task-2 0x200013c0220
6826911 0x200013c0820 Task-5 task_group Task-2 0x200013c0220
6826911 0x200013c0c20 Task-6 task_group Task-3 0x200013c0420
6826911 0x200013c0e20 Task-7 task_group Task-3 0x200013c0420


and with the ``--tree`` option:

.. code-block:: bash

python -m asyncio.tools --tree 12345

└── (T) Task-1
└── main
└── (T) Task-2
└── task_group
├── (T) Task-4
└── (T) Task-5
└── (T) Task-3
└── task_group
├── (T) Task-6
└── (T) Task-7

If a cycle is detected in the async await graph (which could indicate a
programming issue), the tool raises an error and lists the cycle paths that
prevent tree construction.

(Contributed by Pablo Galindo, Łukasz Langa and Marta Gomez Macias in :gh:`91048`.)

.. _whatsnew314-tail-call:

A new type of interpreter
Expand Down
207 changes: 207 additions & 0 deletions Lib/asyncio/tools.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,207 @@
import argparse
from dataclasses import dataclass
from collections import defaultdict
from itertools import count
from enum import Enum
import sys
from _remotedebugging import get_all_awaited_by


class NodeType(Enum):
COROUTINE = 1
TASK = 2


@dataclass(frozen=True)
class CycleFoundException(Exception):
"""Raised when there is a cycle when drawing the call tree."""
cycles: list[list[int]]
id2name: dict[int, str]


# ─── indexing helpers ───────────────────────────────────────────
def _index(result):
id2name, awaits = {}, []
for _thr_id, tasks in result:
for tid, tname, awaited in tasks:
id2name[tid] = tname
for stack, parent_id in awaited:
awaits.append((parent_id, stack, tid))
return id2name, awaits


def _build_tree(id2name, awaits):
id2label = {(NodeType.TASK, tid): name for tid, name in id2name.items()}
children = defaultdict(list)
cor_names = defaultdict(dict) # (parent) -> {frame: node}
cor_id_seq = count(1)

def _cor_node(parent_key, frame_name):
"""Return an existing or new (NodeType.COROUTINE, …) node under *parent_key*."""
bucket = cor_names[parent_key]
if frame_name in bucket:
return bucket[frame_name]
node_key = (NodeType.COROUTINE, f"c{next(cor_id_seq)}")
id2label[node_key] = frame_name
children[parent_key].append(node_key)
bucket[frame_name] = node_key
return node_key

# touch every task so it’s present even if it awaits nobody
for tid in id2name:
children[(NodeType.TASK, tid)]

# lay down parent ➜ …frames… ➜ child paths
for parent_id, stack, child_id in awaits:
cur = (NodeType.TASK, parent_id)
for frame in reversed(stack): # outer-most → inner-most
cur = _cor_node(cur, frame)
child_key = (NodeType.TASK, child_id)
if child_key not in children[cur]:
children[cur].append(child_key)

return id2label, children


def _roots(id2label, children):
all_children = {c for kids in children.values() for c in kids}
return [n for n in id2label if n not in all_children]

# ─── detect cycles in the task-to-task graph ───────────────────────
def _task_graph(awaits):
"""Return {parent_task_id: {child_task_id, …}, …}."""
from collections import defaultdict
g = defaultdict(set)
for parent_id, _stack, child_id in awaits:
g[parent_id].add(child_id)
return g


def _find_cycles(graph):
"""
Depth-first search for back-edges.

Returns a list of cycles (each cycle is a list of task-ids) or an
empty list if the graph is acyclic.
"""
WHITE, GREY, BLACK = 0, 1, 2
color = defaultdict(lambda: WHITE)
path, cycles = [], []

def dfs(v):
color[v] = GREY
path.append(v)
for w in graph.get(v, ()):
if color[w] == WHITE:
dfs(w)
elif color[w] == GREY: # back-edge → cycle!
i = path.index(w)
cycles.append(path[i:] + [w]) # make a copy
color[v] = BLACK
path.pop()

for v in list(graph):
if color[v] == WHITE:
dfs(v)
return cycles


# ─── PRINT TREE FUNCTION ───────────────────────────────────────
def print_async_tree(result, task_emoji="(T)", cor_emoji="", printer=print):
"""
Pretty-print the async call tree produced by `get_all_async_stacks()`,
prefixing tasks with *task_emoji* and coroutine frames with *cor_emoji*.
"""
id2name, awaits = _index(result)
g = _task_graph(awaits)
cycles = _find_cycles(g)
if cycles:
raise CycleFoundException(cycles, id2name)
labels, children = _build_tree(id2name, awaits)

def pretty(node):
flag = task_emoji if node[0] == NodeType.TASK else cor_emoji
return f"{flag} {labels[node]}"

def render(node, prefix="", last=True, buf=None):
if buf is None:
buf = []
buf.append(f"{prefix}{'└── ' if last else '├── '}{pretty(node)}")
new_pref = prefix + (" " if last else "│ ")
kids = children.get(node, [])
for i, kid in enumerate(kids):
render(kid, new_pref, i == len(kids) - 1, buf)
return buf

result = []
for r, root in enumerate(_roots(labels, children)):
result.append(render(root))
return result


def build_task_table(result):
id2name, awaits = _index(result)
table = []
for tid, tasks in result:
for task_id, task_name, awaited in tasks:
for stack, awaiter_id in awaited:
coroutine_chain = " -> ".join(stack)
awaiter_name = id2name.get(awaiter_id, "Unknown")
table.append(
[
tid,
hex(task_id),
task_name,
coroutine_chain,
awaiter_name,
hex(awaiter_id),
]
)

return table

def _print_cycle_exception(exception: CycleFoundException):
print("ERROR: await-graph contains cycles – cannot print a tree!", file=sys.stderr)
print("", file=sys.stderr)
for c in exception.cycles:
inames = " → ".join(exception.id2name.get(tid, hex(tid)) for tid in c)
print(f"cycle: {inames}", file=sys.stderr)



if __name__ == "__main__":
parser = argparse.ArgumentParser(description="Show Python async tasks in a process")
parser.add_argument("pid", type=int, help="Process ID(s) to inspect.")
parser.add_argument(
"--tree", "-t", action="store_true", help="Display tasks in a tree format"
)
args = parser.parse_args()

try:
tasks = get_all_awaited_by(args.pid)
except RuntimeError as e:
while e.__context__ is not None:
e = e.__context__
print(f"Error retrieving tasks: {e}")
sys.exit(1)

if args.tree:
# Print the async call tree
try:
result = print_async_tree(tasks)
except CycleFoundException as e:
_print_cycle_exception(e)
sys.exit(1)

for tree in result:
print("\n".join(tree))
else:
# Build and print the task table
table = build_task_table(tasks)
# Print the table in a simple tabular format
print(
f"{'tid':<10} {'task id':<20} {'task name':<20} {'coroutine chain':<50} {'awaiter name':<20} {'awaiter id':<15}"
)
print("-" * 135)
for row in table:
print(f"{row[0]:<10} {row[1]:<20} {row[2]:<20} {row[3]:<50} {row[4]:<20} {row[5]:<15}")
Loading
Loading
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy