py/objtype: Add support for __set_name__. (hazard version) #15503

AJMansfield · 2024-07-19T23:39:40Z

Summary

This PR implements the feature described in #15501, adding support for the __set_name__ data model method.

Testing

This PR includes and passes the unit test originally submitted in #15500 to verify the feature's absence.

codecov · 2024-07-20T00:05:20Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.44%. Comparing base (df05cae) to head (4ba892a).
Report is 28 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master   #15503   +/-   ##
=======================================
  Coverage   98.44%   98.44%           
=======================================
  Files         171      171           
  Lines       22192    22217   +25     
=======================================
+ Hits        21847    21872   +25     
  Misses        345      345

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

github-actions · 2024-07-20T00:19:06Z

Code size report:

   bare-arm:    +0 +0.000% 
minimal x86:    +0 +0.000% 
   unix x64:  +168 +0.020% standard[incl +32(data)]
      stm32:   +92 +0.023% PYBV10
     mimxrt:   +96 +0.026% TEENSY40
        rp2:  +104 +0.011% RPI_PICO_W
       samd:  +120 +0.045% ADAFRUIT_ITSYBITSY_M4_EXPRESS
  qemu rv32:  +101 +0.022% VIRT_RV32

dpgeorge · 2024-07-23T04:50:25Z

Thanks for the contribution.

I think this is a good addition. It's important to support this __set_name__ special-method to properly make use of descriptors (which MicroPython does support, at least with get/set/delete).

I'm just wondering about the performance hit. It means that each class that's defined will need to iterate through all of its members. I wonder if there's a way to make use of MP_TYPE_FLAG_HAS_SPECIAL_ACCESSORS to decide whether to do this search or not. There's already check_for_special_accessors() that's run on class creation in a loop over all members. Maybe that can be reused instead of essentially duplicating the work?

AJMansfield · 2024-07-23T18:02:30Z

I'm just wondering about the performance hit.

I do think the extra loop is necessary, for reasons described below -- but also, this code only runs at class-creation time, something that in most cases should only happen a small fixed number of times during setup. (At least, outside of intractable runtime dynamic metaclassing scenarios for which performance is already a lost cause...)

I wonder if there's a way to make use of MP_TYPE_FLAG_HAS_SPECIAL_ACCESSORS to decide whether to do this search or not.

That could be done if we are only interested in addressing instance-attribute-like descriptors, but that would miss classattr- and classmethod-like descriptors that only use __set_name__ to bind to the containing class. We could try to special-case that out too, but it's much simpler to just check the thing itself and look up MP_QSTR___set_name__ directly.

There's already check_for_special_accessors() that's run on class creation in a loop over all members. Maybe that can be reused instead of essentially duplicating the work?

A check for a __set_name__ sub-member could be added as part of that loop, but there could be a sequencing hazard with invoking user code anywhere except right at the end -- and even then, invoking before attribute access is fully ready (incl. having the special accessor flag correctly set) would become a sequencing hazard if/when metaclasses become supported.

My opinion, is that it's better to just keep the loop separate. It does mean chasing down member pointers a second time, but the expensive part of the loop is looking up MP_QSTR___set_name__ on each member, which needs to be done whether or not it's part of the same loop with check_for_special_accessors.

AJMansfield · 2024-07-31T12:30:36Z

Update: I'm currently investigating some corner-case behavior I recently found around what happens (and what should happen) when a __set_name__ function inserts or removes elements from the class being initialized.

At the moment I'm still waiting on python/cpython#122381 to define what the exact semantics should be in that case -- but this PR is still valid for a merge.

Whatever the correct semantics are, they'll need at least a small code-size increase. And with how rarely that gets used, I feel like it'll probably be better gate them behind the MICROPY_PY_METACLASSES flag from #15511, and make that a separate PR.

@dpgeorge should I spend the time adding a cpydiff to this PR about what the exact mismatch is?

dpgeorge · 2024-08-02T02:38:13Z

should I spend the time adding a cpydiff to this PR about what the exact mismatch is?

Yes please. Then at least all this work/knowledge is encoded in a test and can be improved later on.

AJMansfield · 2025-02-21T20:16:59Z

Finally found some time to step back in to finish what I started here.

It looks like python/cpython#122381 is probably going nowhere and will just leave that behavior unspecified for now, or at best will just officially document CPython's current behavior.

Yes please. Then at least all this work/knowledge is encoded in a test and can be improved later on.

Unfortunately, I've not been able to find a way to actually cpydiff this :(.
Certainly, I've found test cases that fail, but I've simply found no way to make a test that fails because of the sequence hazard, rather than just due to differences in CPython vs Micropython iteration order.

I'll see if I can at least add a cpydiff that'll at least remain failing even when the iteration order happens to be the same, though, and other tests that at least verify that hazard-free class namespace editing scenarios work while I'm rebasing this to the current master.

AJMansfield · 2025-02-24T19:05:34Z

Not exactly proud of needing to resort to a stochastic test to reliably show off the bug, but I have a reliable cpydiff now for the specific sequence hazard I was worried about.

Fundamentally, it's a modify-while-iterating bug, of exactly the same kind that creates this diff:

d = {'a':1, 'b':2, 'c':3}
for k,v in d.items():
    d[k+k]=v+v
print(d)

(CPython errors with RuntimeError: dictionary changed size during iteration, while micropython just blithely proceeds and produces {'c': 3, 'b': 2, 'bbbbbbbb': 16, 'aaaa': 4, 'a': 1, 'aa': 2, 'bb': 4, 'bbbb': 8})

CPython avoids this bug in its __set_name__ implementation because it effectively iterates on dict.keys(), which creates a copy of the current set of keys instead of just a view. However, my proposed implementation for Micropython iterates through the raw allocated slots in locals_map (the same way the other steps that iterate its namespace do).

As a result, in my cpydiff/core_class_setname_hazard test case, involving a class with 26 descriptors with names a through z that each insert 10 additional attributes into the class at random, CPython ends up calling __set_name__ exactly once on only these original descriptors -- but my proposed micropython code calls some of them multiple times, and some not at all, as the order of the dict entries being iterated on shifts in response to the new insertions:

+-------------+-------------+
| CPy output: | uPy output: |
+-------------+-------------+
|     a 1     |     a 2     |
|     b 1     |     b 1     |
|     c 1     |     c 2     |
|     d 1     |     d 3     |
|     e 1     |     e 4     |
|     f 1     |     f 3     |
|     g 1     |     g 4     |
|     h 1     |     h 2     |
|     i 1     |     i 1     |
|     j 1     |     j 3     |
|     k 1     |     k 2     |
|     l 1     |     l 1     |
|     m 1     |     m 1     |
|     n 1     |     n 1     |
|     o 1     |     o 1     |
|     p 1     |     p 0     |
|     q 1     |     q 1     |
|     r 1     |     r 0     |
|     s 1     |     s 0     |
|     t 1     |     t 1     |
|     u 1     |     u 3     |
|     v 1     |     v 0     |
|     w 1     |     w 1     |
|     x 1     |     x 0     |
|     y 1     |     y 0     |
|     z 1     |     z 0     |
+-------------+-------------+

A similar effect can happen when __set_name__ removes instead of adds; but the worst case is still just descriptors having their __set_name__ missed or called multiple times.

There's no hazard in cases where there are no class-mutating descriptors, and this might not be a use-case it's worthwhile to support. I'll see if I can create an alternative pull-request for a version that exactly matches CPython, though, so we can compare the performance cost of creating that copy on more than just speculation.

AJMansfield · 2025-02-25T17:52:11Z

As one more potential alternative to leaving it an undefined behavior... we could temporarily set locals_map->is_fixed = 1 before the calls to __set_name__ to trigger an error if the user's code tries to mutate the class, and clear it after. Though of the four, this version is the least functional/useful, but also the simplest that avoids the potential for confusion created by the iteration hazard.

AJMansfield · 2025-02-26T19:22:09Z

I've done some benchmarking using a new suite of internalbench class creation benchmarks (see PR #16825) and have compared the benchmark times from the base branch to this branch.

There's a lot of noise in this (I'll see if I can run these on real hardware at some point), but overall this patch makes processing classes take about 12% longer:

internal_bench/class_create:
    0.349 -> 0.391 (+12%) internal_bench/class_create-0-empty.py
    0.476 -> 0.544 (+14%) internal_bench/class_create-1-slots.py
    0.490 -> 0.553 (+13%) internal_bench/class_create-1.1-slots5.py
    0.432 -> 0.483 (+12%) internal_bench/class_create-2-classattr.py
    0.776 -> 0.868 (+12%) internal_bench/class_create-2.1-classattr5.py
    0.461 -> 0.533 (+16%) internal_bench/class_create-3-instancemethod.py
    0.477 -> 0.535 (+12%) internal_bench/class_create-4-classmethod.py
    0.452 -> 0.511 (+13%) internal_bench/class_create-4.1-classmethod_implicit.py
    0.494 -> 0.537 (+09%) internal_bench/class_create-5-staticmethod.py
    0.458 -> 0.512 (+12%) internal_bench/class_create-6-getattribute.py
    0.474 -> 0.518 (+09%) internal_bench/class_create-6.1-getattr.py
    0.386 -> 0.447 (+16%) internal_bench/class_create-6.2-descriptor.py
    0.517 -> 0.625 (+21%) internal_bench/class_create-6.3-descriptor_setname.py
    0.419 -> 0.480 (+15%) internal_bench/class_create-6.4-property.py
    0.363 -> 0.420 (+16%) internal_bench/class_create-7-inherit.py
    0.368 -> 0.399 (+09%) internal_bench/class_create-7.1-inherit_initsubclass.py

There were also two other benchmark tests that got concerningly slower:

internal_bench/arrayop:
    0.174 -> 0.194 (+12%) internal_bench/arrayop-3-bytearray_inplace.py
internal_bench/loop_count:
    0.268 -> 0.310 (+16%) internal_bench/loop_count-2-range_iter.py

None of the other tests in their families seemed to be affected, but my tests for #16806 and #16816 both showed loop_count-2 slowed by near-enough the exact same margins as class creation.

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

AJMansfield · 2025-07-21T03:05:07Z

Closed in favor of #16806

This PR adds support for the `__set_name__` data model method specified by PEP487 - Simpler customisation of class creation. This includes support for methods that mutate the owner class, and avoids the naive modify-while-iterating hazard possible in a naive implementation like micropython#15503. Note that based on the benchmarks in micropython#16825, this is also as fast or faster than the naive implementation, thanks to clever data layout in setname_list_t, and the way this allows the capture step to run during an existing loop through the class dict. Other rejected approaches for dealing with the hazard include: - python/cpython#72983 During the implementation of this feature for MicroPython, it was discovered that some versions of CPython also have this naive hazard. CPython resolved this bug in BPO-28797 and now makes a complete flat copy of the class's dict to iterate. This design decision doesn't make much sense for a microcontroller though, even if it's perfectly reasonable in the desktop world where memcpy might actually be cheaper than a hard-to-branch-predict conditional; and it's also motivated in their case by error-tracing considerations. - micropython#16816 This is an equivalent implementation to CPython's approach that places this copy directly on the stack; however it is both slower and has larger code size than the approach taken here. - micropython#15503 The simplest implementation is to just not worry about it and let the user face the consequences if they mutate the owner class. That's not a very friendly behavior, though, and it's not actually much more performant than this implementation on either time or code size. - micropython#17693 Another alternative is to do the same as micropython#15503 but leverage MicroPython's existing `is_fixed` field in its dict type to convert attempted mutations of the owner dict into `AttributeError`s. This is safer than just leaving the open hazard, but there's still important use-cases for owner-mutating descriptors, and the performance ain is small enough that it isn't worth missing support for those cases. - combined micropython#17693 with this Another version of this feature used a new feature define, `MICROPY_PY_METACLASSES_LITE`, to control whether this algorithm or the naive version is used. This was rejected in favor of simplicity, based on the very limited performance margin the naive version has (which in some cases even goes _against_ it). Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

This PR adds support for the `__set_name__` data model method specified by PEP487 - Simpler customisation of class creation. This includes support for methods that mutate the owner class, and avoids the naive modify-while-iterating hazard possible in a naive implementation like micropython#15503. Note that based on the benchmarks in micropython#16825, this is also as fast or faster than the naive implementation, thanks to clever data layout in `setname_list_t`, and the way this allows the capture step to run during an existing loop through the class dict. Other rejected approaches for dealing with the hazard include: - python/cpython#72983 During the implementation of this feature for MicroPython, it was discovered that some versions of CPython also have this naive hazard. CPython resolved this bug in BPO-28797 and now makes a complete flat copy of the class's dict to iterate. This design decision doesn't make much sense for a microcontroller though, even if it's perfectly reasonable in the desktop world where memcpy might actually be cheaper than a hard-to-branch-predict conditional; and it's also motivated in their case by error-tracing considerations. - micropython#16816 This is an equivalent implementation to CPython's approach that places this copy directly on the stack; however it is both slower and has larger code size than the approach taken here. - micropython#15503 The simplest implementation is to just not worry about it and let the user face the consequences if they mutate the owner class. That's not a very friendly behavior, though, and it's not actually much more performant than this implementation on either time or code size. - micropython#17693 Another alternative is to do the same as micropython#15503 but leverage MicroPython's existing `is_fixed` field in its dict type to convert attempted mutations of the owner dict into `AttributeError`s. This is safer than just leaving the open hazard, but there's still important use-cases for owner-mutating descriptors, and the performance ain is small enough that it isn't worth missing support for those cases. - combined micropython#17693 with this Another version of this feature used a new feature define, `MICROPY_PY_METACLASSES_LITE`, to control whether this algorithm or the naive version is used. This was rejected in favor of simplicity, based on the very limited performance margin the naive version has (which in some cases even goes _against_ it). Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

AJMansfield force-pushed the set-name branch 2 times, most recently from 6b650a8 to 6194bb5 Compare July 19, 2024 23:47

AJMansfield force-pushed the set-name branch from 18fd784 to 0490417 Compare July 20, 2024 00:08

AJMansfield changed the title ~~py/objtype: add support for __set_name__~~ py/objtype: Add support for __set_name__. Jul 20, 2024

dpgeorge added the py-core Relates to py/ directory in source label Jul 20, 2024

AJMansfield force-pushed the set-name branch from 049799e to 89f23c2 Compare July 20, 2024 15:04

AJMansfield mentioned this pull request Jul 20, 2024

py/objtype: Add basic __init_subclass__ metaclass support. #15511

Closed

AJMansfield force-pushed the set-name branch 3 times, most recently from 4f808a1 to 89f23c2 Compare July 24, 2024 19:39

AJMansfield mentioned this pull request Jul 28, 2024

Class creation doesn't call __set_name__ when creating class members in another __set_name__ python/cpython#122381

Open

dhalbert mentioned this pull request Oct 4, 2024

Add __set_name__ adafruit/circuitpython#9685

Open

AJMansfield mentioned this pull request Feb 21, 2025

tests/cpydiff: Test for PEP487 __set_name__. #16787

Closed

AJMansfield force-pushed the set-name branch 2 times, most recently from d0cefd3 to 7475965 Compare February 24, 2025 18:09

AJMansfield force-pushed the set-name branch from 7475965 to b6b4c35 Compare February 24, 2025 21:07

This was referenced Feb 24, 2025

py/objtype: Add support for PEP487 __set_name__. #16806

Open

py/objtype: Add support for __set_name__. (dict-copy version) #16816

Closed

AJMansfield changed the title ~~py/objtype: Add support for __set_name__.~~ py/objtype: Add support for __set_name__. (hazard version) Feb 26, 2025

AJMansfield mentioned this pull request Feb 26, 2025

tests/internal_bench: Benchmarks for descriptor-related features. #16825

Open

AJMansfield added 5 commits July 16, 2025 13:31

tests/basics/class_descriptor: Test for __set_name__.

3173fa1

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

py/objtype: Implement __set_name__.

5da9c94

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

py/mpconfig: Add __set_name__ to feature flag description.

76718ba

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

tests/cpydiff/core_class_setname_hazard: Document __set_name__ hazard.

f1e9759

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

tests/basics/class_setname_hazard: Document __set_name__ hazard.

4ba892a

Signed-off-by: Anson Mansfield <amansfield@mantaro.com>

AJMansfield force-pushed the set-name branch from b6b4c35 to 4ba892a Compare July 16, 2025 17:32

AJMansfield mentioned this pull request Jul 16, 2025

py/objtype: Add support for __set_name__. (no-self-modification version) #17693

Closed

AJMansfield closed this Jul 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

py/objtype: Add support for __set_name__. (hazard version) #15503

py/objtype: Add support for __set_name__. (hazard version) #15503

Uh oh!

AJMansfield commented Jul 19, 2024 •

edited

Loading

Uh oh!

codecov bot commented Jul 20, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Jul 20, 2024 •

edited

Loading

Uh oh!

dpgeorge commented Jul 23, 2024

Uh oh!

AJMansfield commented Jul 23, 2024

Uh oh!

AJMansfield commented Jul 31, 2024

Uh oh!

dpgeorge commented Aug 2, 2024

Uh oh!

AJMansfield commented Feb 21, 2025

Uh oh!

AJMansfield commented Feb 24, 2025 •

edited

Loading

Uh oh!

AJMansfield commented Feb 25, 2025

Uh oh!

AJMansfield commented Feb 26, 2025 •

edited

Loading

Uh oh!

AJMansfield commented Jul 21, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Uh oh!

py/objtype: Add support for __set_name__. (hazard version) #15503

py/objtype: Add support for __set_name__. (hazard version) #15503

Uh oh!

Conversation

AJMansfield commented Jul 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

codecov bot commented Jul 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Jul 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dpgeorge commented Jul 23, 2024

Uh oh!

AJMansfield commented Jul 23, 2024

Uh oh!

AJMansfield commented Jul 31, 2024

Uh oh!

dpgeorge commented Aug 2, 2024

Uh oh!

AJMansfield commented Feb 21, 2025

Uh oh!

AJMansfield commented Feb 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AJMansfield commented Feb 25, 2025

Uh oh!

AJMansfield commented Feb 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AJMansfield commented Jul 21, 2025

Uh oh!

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

AJMansfield commented Jul 19, 2024 •

edited

Loading

codecov bot commented Jul 20, 2024 •

edited

Loading

github-actions bot commented Jul 20, 2024 •

edited

Loading

AJMansfield commented Feb 24, 2025 •

edited

Loading

AJMansfield commented Feb 26, 2025 •

edited

Loading