gh-132732: Automatically constant evaluate pure operations #132733

Open
wants to merge 21 commits into main

Conversation

@Fidget-Spinner Fidget-Spinner (Member) commented Apr 19, 2025

@python-cla-bot python-cla-bot bot commented Apr 19, 2025

All commit authors signed the Contributor License Agreement.

@brandtbucher brandtbucher (Member) left a comment

This is really neat!

Other than two opcodes I found that shouldn't be marked pure, I just have one thought:

Rather than rewriting the bodies like this to use the symbols-manipulating functions (which seems error-prone), would we be able to just use stackrefs to do this?

For example, _BINARY_OP_ADD_INT is defined like this:

PyObject *left_o = PyStackRef_AsPyObjectBorrow(left);
PyObject *right_o = PyStackRef_AsPyObjectBorrow(right);
// ...
res = PyStackRef_FromPyObjectSteal(res_o);

Rather than rewriting uses of these functions, could it be easier to just do something like this, since we're guaranteed not to escape?

if (sym_is_const(ctx, stack_pointer[-2]) && sym_is_const(ctx, stack_pointer[-1])) {
    // Generated code to turn constant symbols into stackrefs:
    _PyStackRef left = PyStackRef_FromPyObjectBorrow(sym_get_const(ctx, stack_pointer[-2]));
    _PyStackRef right = PyStackRef_FromPyObjectBorrow(sym_get_const(ctx, stack_pointer[-1]));
    _PyStackRef res;
    // Now the actual body, same as it appears in executor_cases.c.h:
    PyObject *left_o = PyStackRef_AsPyObjectBorrow(left);
    PyObject *right_o = PyStackRef_AsPyObjectBorrow(right);
    // ...
    res = PyStackRef_FromPyObjectSteal(res_o);
    // Generated code to turn stackrefs into constant symbols:
    stack_pointer[-1] = sym_new_const(ctx, PyStackRef_AsPyObjectSteal(res));
}

I'm not too familiar with the design of the cases generator though, so maybe this is way harder or something. Either way, I'm excited to see this get in!

@Fidget-Spinner (Member, Author)

> Rather than rewriting uses of these functions, could it be easier to just do something like this, since we're guaranteed not to escape?

Seems feasible. I could try to rewrite all occurrences of the variable with a stackref-producing const one. Let me try that.

@Fidget-Spinner (Member, Author)

I've verified there are no refleaks in test_capi.test_opt locally, apart from #132731, which is pre-existing.

@markshannon (Member)

There's a lot going on in this PR, probably too much for one PR.

Could we start with a PR to fix up the pure annotations so that they are on the correct instructions and maybe add the pure_guard annotation that Brandt suggested?

@markshannon (Member)

Could we have the default code generator generate a function for the body of the pure instruction and then call that from the three interpreters?

@brandtbucher (Member)

> Could we have the default code generator generate a function for the body of the pure instruction and then call that from the three interpreters?

Hm, I think I'd prefer not to. Sounds like it could hurt performance, especially for the JIT (where calls can't be inlined).

@brandtbucher (Member)

I think a good progression would be:

  • Implement the pure attribute, and the optimizer changes. Remove the pure attributes where they don’t belong (so nothing breaks) and leave the existing ones as proof that the implementation works. (This PR)
  • Audit the existing non-pure bytecodes and add pure where it makes sense. (Follow-up PR)
  • Implement the pure_guard attribute, and annotate any bytecodes that can use it. (Follow-up PR)

@Fidget-Spinner (Member, Author)

> Could we have the default code generator generate a function for the body of the pure instruction and then call that from the three interpreters?
>
> Hm, I think I'd prefer not to. Sounds like it could hurt performance, especially for the JIT (where calls can't be inlined).

I thought about this, and I think we can inline if we autogenerate a header file and include it directly. But then we're at the mercy of the compiler, in both the normal interpreter and the JIT, deciding whether or not to inline the body again, which I truly do not want.

@Fidget-Spinner (Member, Author)

@brandtbucher @markshannon what can I do to get this PR moving?

@tomasr8 if you'd like to review, here's a summary of the PR:

  1. If a bytecode operation is pure (no side effects), we can mark it as pure in bytecodes.c.
  2. In the optimizer, we automatically generate a body that constant-evaluates the symbolic inputs by copying the bytecodes.c definition into the optimizer's C code (sketched below). Of course, we check that the inputs are constants first.
  3. All changes to the cases generator are for the second point.
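
For example, for _BINARY_OP_ADD_INT the generated optimizer case would look roughly like this. This is a simplified sketch, not the PR's literal output; the ctx->done failure path and the sym_new_type fallback are assumptions on my part:

case _BINARY_OP_ADD_INT: {
    JitOptSymbol *left = stack_pointer[-2];
    JitOptSymbol *right = stack_pointer[-1];
    if (sym_is_const(ctx, left) && sym_is_const(ctx, right)) {
        // Turn the constant symbols into borrowed ("stackref immortal") stackrefs.
        _PyStackRef left_stackref = PyStackRef_FromPyObjectBorrow(sym_get_const(ctx, left));
        _PyStackRef right_stackref = PyStackRef_FromPyObjectBorrow(sym_get_const(ctx, right));
        _PyStackRef res_stackref;
        // Body copied from bytecodes.c, with the locals renamed to the stackrefs above:
        PyObject *left_o = PyStackRef_AsPyObjectBorrow(left_stackref);
        PyObject *right_o = PyStackRef_AsPyObjectBorrow(right_stackref);
        PyObject *res_o = _PyLong_Add((PyLongObject *)left_o, (PyLongObject *)right_o);
        if (res_o == NULL) {
            ctx->done = true;   // assumed failure path: stop optimizing this trace
            break;
        }
        res_stackref = PyStackRef_FromPyObjectSteal(res_o);
        // Turn the computed stackref back into a constant symbol.
        stack_pointer[-2] = sym_new_const_steal(ctx, PyStackRef_AsPyObjectSteal(res_stackref));
        stack_pointer += -1;
        break;
    }
    // Inputs aren't both constant: emit the ordinary abstract effect instead.
    stack_pointer[-2] = sym_new_type(ctx, &PyLong_Type);
    stack_pointer += -1;
    break;
}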

@tomasr8 tomasr8 (Member) commented May 8, 2025

Thanks for the ping! I actually wanted to try out and review this PR; I was just very busy this week with work :/ I'll have a look this weekend :)

@tomasr8 tomasr8 (Member) left a comment

Only had time to skim the PR; I'll do a more thorough review this weekend :)

@markshannon markshannon self-requested a review May 16, 2025 12:24
@markshannon markshannon (Member) left a comment

Unfortunately, this approach has a critical flaw: it is possible for the optimizer to see values that the executing code never would, for example through a combination of statistical branch profiling and global-to-constant conversion.

class Disaster:
    def __add__(self, other):
        halt_and_catch_fire()

We don't want to be evaluating Disaster() + 1 when optimizing BINARY_OP_ADD_INT.

Maybe consider the approach used for TO_BOOL, where we call optimize_to_bool for each family member, thus reducing the code duplication.

In addition, we could then optimize BINARY_OP. After all, 1 + 1 is always 2, not just for BINARY_OP_ADD_INT.
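
For reference, transplanting the TO_BOOL pattern to the add family could look something like this; optimize_binary_add_int and its exact signature are hypothetical, loosely modeled on optimize_to_bool:

static int
optimize_binary_add_int(JitOptContext *ctx, JitOptSymbol *left,
                        JitOptSymbol *right, JitOptSymbol **result)
{
    // Fold only when both operands are known constant ints. A shared helper
    // like this can serve BINARY_OP_ADD_INT and generic BINARY_OP alike.
    if (sym_matches_type(left, &PyLong_Type) &&
        sym_matches_type(right, &PyLong_Type) &&
        sym_is_const(ctx, left) && sym_is_const(ctx, right))
    {
        PyObject *res = _PyLong_Add((PyLongObject *)sym_get_const(ctx, left),
                                    (PyLongObject *)sym_get_const(ctx, right));
        if (res == NULL) {
            return 0;  // e.g. out of memory: just skip the optimization
        }
        *result = sym_new_const(ctx, res);
        Py_DECREF(res);  // assuming sym_new_const takes its own reference
        return 1;        // folded: the caller replaces the uop with a constant
    }
    return 0;  // not folded: the caller emits the normal abstract effect
}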

@bedevere-app bedevere-app bot commented May 16, 2025

When you're done making the requested changes, leave the comment: I have made the requested changes; please review again.

And if you don't make the requested changes, you will be poked with soft cushions!

@Fidget-Spinner (Member, Author)

> We don't want to be evaluating Disaster() + 1 when optimizing BINARY_OP_ADD_INT

In the first place, that's not possible. We only optimize what we specialize in the interpreter. The interpreter will never specialize that to BINARY_OP_ADD_INT. Furthermore, even if it did specialize Disaster() to that by some oddity, we check _GUARD_TOS_INT and _GUARD_NOS_INT first in the specializer. Disaster() would never pass these checks.

> Unfortunately, this approach has a critical flaw: it is possible for the optimizer to see values that the executing code never would, for example through a combination of statistical branch profiling and global-to-constant conversion.

If you want to be more assured, how about I merge #132968 to add type assertions to our optimizer? That should make things safer.

@Fidget-Spinner (Member, Author)

> We don't want to be evaluating Disaster() + 1 when optimizing BINARY_OP_ADD_INT
>
> In the first place, that's not possible. We only optimize what we specialize in the interpreter. The interpreter will never specialize that to BINARY_OP_ADD_INT. Furthermore, even if it did specialize Disaster() to that by some oddity, we check _GUARD_TOS_INT and _GUARD_NOS_INT first in the specializer. Disaster() would never pass these checks.

I forgot that the guards don't actually guard at optimization time. My bad. Yeah, it seems we need some sort of check.

@Fidget-Spinner (Member, Author)

@tomasr8 sorry, this is going to make your life harder for your PR removing sym_is_const and sym_matches_type. I think you can still remove sym_matches_type, but sym_is_const is now a little harder.

@Fidget-Spinner (Member, Author)

I have made the requested changes; please review again

@bedevere-app bedevere-app bot commented May 19, 2025

Thanks for making the requested changes!

@markshannon: please review the changes made to this pull request.

@Fidget-Spinner (Member, Author)

Note: I mark the object as stackref-immortal (but not truly immortal!) to simplify reference management. This means we do no refcounting in the optimizer when constant-evaluating, which makes things easier to reason about, since constants are effectively immortal for the lifetime of the optimizer anyway (the optimizer holds a single reference to every constant).
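
Spelled out, the lifetime story is roughly this (a sketch of the intended semantics, not the exact macro definitions):

PyObject *obj = sym_get_const(ctx, sym);               // strong ref held by the optimizer
_PyStackRef ref = PyStackRef_FromPyObjectBorrow(obj);  // no incref: tagged as borrowed
// ... the copied uop body uses `ref` like any other stackref ...
PyStackRef_CLOSE(ref);                                 // no decref: the tag makes this a no-op
// `obj` stays alive throughout, because the optimizer's own reference
// outlives the constant evaluation.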

@Fidget-Spinner (Member, Author)

Waiting for #134284 to be merged first; then I can use PyStackRef_FromPyObjectBorrow.

@brandtbucher brandtbucher (Member) left a comment

I didn't really review the cases generator too closely (since I'm still not very familiar with it), but based on the code it generates, everything at least seems correct.

Comment on lines +172 to +177
return (typ == &PyLong_Type) ||
       (typ == &PyUnicode_Type) ||
       (typ == &PyFloat_Type) ||
       (typ == &PyDict_Type) ||
       (typ == &PyTuple_Type) ||
       (typ == &PyList_Type);
@brandtbucher (Member):

We shouldn't constant-evaluate anything involving mutable containers. (Even tuple scares me a tiny bit, since it can contain arbitrary objects, but I'm pretty sure it's okay.)

Suggested change
- return (typ == &PyLong_Type) ||
-        (typ == &PyUnicode_Type) ||
-        (typ == &PyFloat_Type) ||
-        (typ == &PyDict_Type) ||
-        (typ == &PyTuple_Type) ||
-        (typ == &PyList_Type);
+ return (typ == &_PyNone_Type) ||
+        (typ == &PyBool_Type) ||
+        (typ == &PyLong_Type) ||
+        (typ == &PyFloat_Type) ||
+        (typ == &PyUnicode_Type) ||
+        (typ == &PyTuple_Type);
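
A predicate along these lines would then gate the generated folding code; a minimal sketch (the sym_is_safe_const wrapper and its name are my assumption, not quoted from the PR):

static bool
sym_is_safe_const(JitOptContext *ctx, JitOptSymbol *sym)
{
    // Only fold constants of known-immutable types, so constant evaluation
    // can never invoke arbitrary user code like Disaster.__add__ above.
    if (!sym_is_const(ctx, sym)) {
        return false;
    }
    PyTypeObject *typ = Py_TYPE(sym_get_const(ctx, sym));
    return (typ == &_PyNone_Type) ||
           (typ == &PyBool_Type) ||
           (typ == &PyLong_Type) ||
           (typ == &PyFloat_Type) ||
           (typ == &PyUnicode_Type) ||
           (typ == &PyTuple_Type);
}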

@@ -75,7 +75,6 @@ def write_header(
"""
)


@brandtbucher (Member):
Add this back?

Comment on lines +211 to +214
emitter.emit("/* Start of pure uop copied from bytecodes for constant evaluation */\n")
emitter.emit_tokens(uop, storage, inst=None, emit_braces=False, is_abstract=True)
out.start_line()
emitter.emit("/* End of pure uop copied from bytecodes for constant evaluation */\n")
@brandtbucher (Member):
Minor: maybe use // instead of /* ... */ for these comments, since they're not multi-line?

# All new stackrefs are created from new references.
# That's how the stackref contract works.
if not outp.peek:
    emitter.emit(f"{outp.name} = sym_new_const_steal(ctx, PyStackRef_AsPyObjectBorrow({outp.name}_stackref));\n")
@brandtbucher (Member):

It may just be a week of conference sleep schedule, but my brain hurts trying to reason about the refcounting here. Why are we stealing a borrow? Shouldn't we be stealing a steal, or borrowing a borrow? Currently:

  • If the tag bit is unset on the stackref, stealing a borrow will leave the refcount on the object unchanged and the tag bit unset. When the symbol is cleared after optimizing, the refcount on the object will be one less, which is correct.
  • If the tag bit is set on the stackref, stealing a borrow will leave the refcount on the object itself unchanged and the tag bit still set. When the symbol is cleared after optimizing, the refcount on the object will be one less, which seems incorrect.

(I haven't looked at the peek code yet.)

Another option, which I might like better, is making all of our constant symbols use stackrefs under the hood. Then we could avoid refcounting entirely. But that's a bigger change that could happen later if needed.
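
Something like this, perhaps (purely illustrative; the field names and layout are invented, not from the PR):

// Purely illustrative: if a constant symbol stored a stackref directly,
// the generated code could hand stackrefs to the copied uop body and take
// them back with no refcounting at all.
typedef struct {
    uint8_t tag;        // discriminates constant vs. type vs. unknown symbols
    _PyStackRef value;  // borrowed/immortal stackref to the constant object
} JitOptConstSymbol;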

@brandtbucher (Member)

This is also making me realize that we really should make it possible to detect refleaks/memory leaks on JIT builds soon. The problem is that new executors are allocated all over the place, leading to things like #120501.
