gh-106581: Split `CALL_PY_EXACT_ARGS` into uops #107760

gvanrossum · 2023-08-08T04:48:04Z

This is only the first step for doing CALL in Tier 2. The next step involves tracing into the called code object. After that we'll have to do the remaining CALL specialization. Finally we'll have to tweak various things like KW_NAMES, and possibly move the NULL (for method calls) above the callable (that's 107788). But those are things for future PRs.

Note: this moves setting frame->return_offset directly in front of DISPATCH_INLINED(), to make it easier to move it into _PUSH_FRAME.

Rebase on GH-105848: Simplify the arrangement of CALL's stack #107788
Redo using Mark's ideas
Add back a call to _Py_EnterRecursivePy
Add tests

Issue: Call design for Tier 2 (uops) interpreter #106581

brandtbucher

Thanks for tackling this, it definitely doesn't look easy. It's sort of a bummer that we need to special-case this much stuff, but I also don't see a nicer way of handling these issues than what you have here.

A few comments and questions, mostly for my own understanding:

Python/bytecodes.c

Python/executor.c

Python/optimizer.c

Tools/cases_generator/generate_cases.py

Tools/cases_generator/instructions.py

gvanrossum

I'll add that assert; then I'll review your PR, and hopefully you can then merge that, and I can handle the merge fallout.

Python/bytecodes.c

Python/executor.c

Python/optimizer.c

Tools/cases_generator/generate_cases.py

Tools/cases_generator/instructions.py

markshannon · 2023-08-09T11:27:35Z

Python/bytecodes.c

@@ -2955,18 +2954,35 @@ dummy_func(
            PyCodeObject *code = (PyCodeObject *)func->func_code;
            DEOPT_IF(code->co_argcount != argcount, CALL);
            DEOPT_IF(!_PyThreadState_HasStackSpace(tstate, code->co_framesize), CALL);
+        }
+
+        op(_INIT_CALL_PY_EXACT_ARGS, (method, callable, args[oparg] -- new_frame: _PyInterpreterFrame*)) {


As this makes a frame, perhaps rename it to _MAKE_FRAME?

I figured there will be other uops making frames once we try to split CALL_PY_WITH_DEFAULTS, CALL_NO_KW_ALLOC_AND_ENTER_INIT, as well as BINARY_SUBSCR_GETITEM, LOAD_ATTR_PROPERTY, LOAD_ATTR_GETATTRIBUTE_OVERRIDDEN.

Anyway, uop names are easily changed.

markshannon · 2023-08-09T12:33:59Z

Python/bytecodes.c

-            SKIP_OVER(INLINE_CACHE_ENTRIES_CALL);
+        }
+
+        op(_PUSH_FRAME, (new_frame: _PyInterpreterFrame* -- unused)) {


Since _PUSH_FRAME is just frame->return_offset = 0; DISPATCH_INLINED(new_frame);, it would make sense to spell out DISPATCH_INLINED to clarify which operations that need to be different for tier 1 and tier 2.
Something like:

op(_PUSH_FRAME, (new_frame: _PyInterpreterFrame* -- unused)) { SAVE_FRAME_STATE(); // Equivalent to frame->prev_instr = next_instr - 1; _PyFrame_SetStackPointer(frame, stack_pointer); frame->return_offset = 0; new_frame->previous = frame; frame = cframe.current_frame = new_frame; CALL_STAT_INC(inlined_py_calls); if (_Py_EnterRecursivePy(tstate)) { goto exit_unwind; } START_FRAME(); // Equivalent to next_instr = frame->prev_instr + 1; stack_pointer = stack_pointer = _PyFrame_GetStackPointer(frame); }

For example: 2b3e6f2

In which case the code generators needs to know to push the temporary stack values to the real stack before SAVE_FRAME_STATE()

I have to study this more. A problem is that the Tier 1 and Tier 2 versions of _PUSH_FRAME are so different. I am working on a mechanism to be able to say

#if TIER_ONE <code for Tier 1> #else <code for Tier 2> #endif

I'm not sure yet what you mean with your last remark about pushing temp stack values.

Comparing carefully the two versions of DISPATCH_INLINED (adding frame->return_offset = 0 which precedes it in both cases):

In Tier 1:

frame->return_offset = 0; assert(tstate->interp->eval_frame == NULL); _PyFrame_SetStackPointer(frame, stack_pointer); frame->prev_instr = next_instr - 1; (NEW_FRAME)->previous = frame; frame = cframe.current_frame = (NEW_FRAME); CALL_STAT_INC(inlined_py_calls); goto start_frame;

In Tier 2:

frame->return_offset = 0; assert(tstate->interp->eval_frame == NULL); _PyFrame_SetStackPointer(frame, stack_pointer); frame->prev_instr -= 1; (NEW_FRAME)->previous = frame; frame = tstate->cframe->current_frame = (NEW_FRAME); CALL_STAT_INC(inlined_py_calls); stack_pointer = _PyFrame_GetStackPointer(frame); ip_offset = (_Py_CODEUNIT *)_PyFrame_GetCode(frame)->co_code_adaptive;

Diff:

@@ -1,8 +1,9 @@ frame->return_offset = 0; assert(tstate->interp->eval_frame == NULL); _PyFrame_SetStackPointer(frame, stack_pointer); - frame->prev_instr = next_instr - 1; + frame->prev_instr -= 1; (NEW_FRAME)->previous = frame; - frame = cframe.current_frame = (NEW_FRAME); + frame = tstate->cframe->current_frame = (NEW_FRAME); CALL_STAT_INC(inlined_py_calls); - goto start_frame; + stack_pointer = _PyFrame_GetStackPointer(frame); + ip_offset = (_Py_CODEUNIT *)_PyFrame_GetCode(frame)->co_code_adaptive;

Note that the Tier 2 version must be preceded by a SAVE_IP, which does the equivalent of frame->prev_instr = next_instr. If we had a Tier 1 version of SAVE_IP we could include it in the macro definition:

macro(CALL_PY_EXACT_ARGS) = unused/1 + // Skip over the counter _CHECK_PEP_523 + _CHECK_FUNCTION_EXACT_ARGS + _CHECK_STACK_SPACE + _INIT_CALL_PY_EXACT_ARGS + SAVE_IP + // <-------------- added _PUSH_FRAME;

which would reduce the special-casing in the code generator a bit (it would still need to do something special for SAVE_IP to ensure that its oparg has the right value, different from the oparg of the macro (which is the argument count). This would take care of the first diff chunk (what to assign to frame->prev_inst), but it would still be pretty fragile. (Like my current version, it would entice the optimizer to incorrectly try to remove the SAVE_IP uop.)

The second diff chunk relates to how we set cframe.current_frame -- in Tier 2 we must access this through the tstate.

The third and final diff chunk relates to really start using the new frame. In Tier 1, this must actually do the following:

Check recursion

Load stack_pointer

Load next_instr

Dispatch to the next opcode.

This is done by the code at start_frame.

In Tier 2 there is no start_frame label (the only uop that can go to a label is EXIT_TRACE, and of course DEOPT_IF and ERROR_IF also jump). So we load stack_frame here. There is no direct equivalent to next_instr, but we have to set ip_offset, which SAVE_IP adds to its oparg to get the prev_instr value. (This variable is a cache for frame->code->co_code_adaptive, to save some memory loads, so whenever frame changes we must update it.)

(More later.)

There's another thing though, and I think that is what Mark meant. In Tier 1 the code generation for macros is special-cased for _PUSH_FRAME so that both the stack adjustment and the next_instr adjustment are emitted before the _PUSH_FRAME opcode. This is done so that the flushing of these variables to the frame in the DISPATCH_INLINED macro flush the correct values.

But this is really ugly and unprincipled, and the logic is much hairier than the other special cases for _PUSH_FRAME. One of Mark's ideas here is to make this special case look for uops using the SAVE_FRAME_STATE macro rather than for the specific uop _PUSH_FRAME. But detecting when to trigger the special case is only part of the problem -- IMO the worse problem is that the special case itself is so ugly:

dispatch_inlined_special_case = False if mgr is managers[-1] and mgr.instr.always_exits.startswith("DISPATCH_INLINED") and mgr.instr.name == "_PUSH_FRAME": dispatch_inlined_special_case = True temp = mgr.final_offset.clone() temp.deeper(StackEffect(UNUSED)) # Hack out.stack_adjust(temp.deep, temp.high) # Use clone() since adjust_inverse() mutates final_offset mgr.adjust_inverse(mgr.final_offset.clone()) if cache_adjust: out.emit(f"next_instr += {cache_adjust};")

The last 4 lines here, starting with # Use clone(), occur further down too, for the normal case (after the final uop). I don't even recall why the temp.deeper() call is needed!

I'll mull this over some more.

I think I have addressed this. @markshannon Please have another look. Assuming the tests look okay I'll un-draft this.

Python/bytecodes.c

gvanrossum · 2023-08-09T17:42:43Z

Made this back into a draft; I need to (a) wait for Brandt's gh-107788, then (b) redo the split and tooling changes using Mark's ideas.

brandtbucher · 2023-08-09T18:20:20Z

The CALL PR has been merged.

This is only the first step for doing `CALL` in Tier 2. The next step involves tracing into the called code object. After that we'll have to do the remaining `CALL` specialization. Finally we'll have to tweak various things like `KW_NAMES`, and possibly move the `NULL` (for method calls) *above* the callable. But those are things for future PRs. Note: this moves setting `frame->return_offset` directly in front of `DISPATCH_INLINED()`, to make it easier to move it into `_PUSH_FRAME`.

ambv · 2023-08-11T14:24:22Z

Closing and re-opening to retrigger CLA checks. Sorry for the noise.

Instead, the special case is an opcode using SAVE_FRAME_STATE(). Introducing #if TIER_ONE and #if TIER_TWO so we can implement _PUSH_FRAME differently for both tiers.

Instead, we special-case SAVE_IP: - Its Tier 2 expansion sets oparg to the instruction offset - In Tier 1 it is a no-op (and skipped if present in a macro)

gvanrossum · 2023-08-16T04:20:18Z

@markshannon I was hoping you'd review this. I added _Py_EnterRecursivePy which was the last thing on my TODO list.

Unless you'd rather review #107925, which includes this (and #107793, which is the intermediate stage).

markshannon

I'm uneasy about the introduction of the TIER_ONE and TIER_TWO macros.
It is a principle of the overall design that there is a single source of truth for the semantics of bytecodes.

It might appear that I'm being dogmatic, but the need for something like those macros often indicates an underlying problem that should be fixed independently.

In this case the problem is the cframe. Loading and saving the IP needs to handled specially anyway and saving and loading the SP should be the same for both interpreters (but will need to be handled specially by the copy-and-patch compiler, so should be its own micro-op).
It is pushes the frame that differs. Removing cframe will fix that.

The cframe only exists as a performance hack to minimize the impact of tracing prior to PEP 669.

markshannon · 2023-08-16T14:12:09Z

Python/executor.c

@@ -30,6 +30,14 @@
 #undef ENABLE_SPECIALIZATION
 #define ENABLE_SPECIALIZATION 0

+#undef SAVE_FRAME_STATE
+#define SAVE_FRAME_STATE() \


Rather than a macro, I think the code generator needs to understand this.

Given that SAVE_FRAME_STATE is basically SAVE_CURRENT_IP followed by _PyFrame_SetStackPointer(frame, stack_pointer); we could convert it to two micro-ops: SAVE_CURRENT_IP and SAVE_SP.

In general, we want to avoid macros in the generated C code.
The generated code can be explicit and verbose.

Well, there are already many macros (and static inline functions) in the generated code. The generator recognizes the presence of SAVE_FRAME_STATE(), but it doesn't expand it -- the C preprocessor can do that for us more easily. Currently we only do the expansion in the generator for things whose expansion requires information that only the generator has (like the stack adjustment for ERROR_IF).

You are proposing that the macro expansion for CALL_PY_EXACT_ARGS become

macro(CALL_PY_EXACT_ARGS) = unused/1 + // Skip over the counter _CHECK_PEP_523 + _CHECK_FUNCTION_EXACT_ARGS + _CHECK_STACK_SPACE + _INIT_CALL_PY_EXACT_ARGS + SAVE_IP + // Tier 2 only; special-cased oparg SAVE_CURRENT_IP + // <------------------- Addition _PUSH_FRAME;

where SAVE_CURRENT_IP is something like this:

op(SAVE_CURRENT_IP, (--)) { #if TIER_ONE frame->prev_instr = next_instr - 1; #endif #if TIER_TWO frame->prev_instr--; #endif }

Or we could special-case its expansion in the generator, potayto-potato. But it has to differ between tiers because in Tier 1 it must store next_instr whereas in Tier 2 it must rely on the preceding SAVE_IP to set frame->prev_instr. (Ideally at some point in the future we won't need the prev_instr-- yet, but that's a tricky change.)

The _PyFrame_SetStackPointer(frame, stack_pointer); call should be moved back into _PUSH_FRAME (at the point where I currently call SAVE_FRAME_STATE).

If I can get this to work I'll apply it and merge the PR.

I did get this working (see 05af848), and will test and benchmark it before merging it.

Note that there are still some #if TIER_ONE and #if TIER_TWO sections, but they are unavoidable.

markshannon · 2023-08-16T14:20:00Z

Python/ceval_macros.h

@@ -103,11 +103,16 @@
        DISPATCH_GOTO(); \
    }

+#define SAVE_FRAME_STATE() \


See my comment below about splitting this into SAVE_CURRENT_IP; SAVE_SP

gvanrossum · 2023-08-16T23:21:36Z

Benchmark: 1.00x faster: https://github.com/faster-cpython/benchmarking-public/tree/main/results/bm-20230816-3.13.0a0-05af848

IOW it doesn't slow CALL_PY_EXACT_ARGS down, which is all I care about.

bedevere-bot · 2023-08-17T01:13:20Z

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Hi! The buildbot wasm32-emscripten node (pthreads) 3.x has failed when building commit dc8fdf5.

What do you need to do:

Don't panic.
Check the buildbot page in the devguide if you don't know what the buildbots are or how they work.
Go to the page of the buildbot that failed (https://buildbot.python.org/all/#builders/1050/builds/2796) and take a look at the build logs.
Check if the failure is related to this commit (dc8fdf5) or if it is a false positive.
If the failure is related to this commit, please, reflect that on the issue and make a new Pull Request with a fix.

You can take a look at the buildbot page here:

https://buildbot.python.org/all/#builders/1050/builds/2796

Summary of the results of the build (if available):

== Tests result: ENV CHANGED ==

329 tests OK.

10 slowest tests:

test_math: 2 min 9 sec
test_hashlib: 1 min 59 sec
test_tarfile: 1 min 9 sec
test_unparse: 49.6 sec
test_io: 41.3 sec
test_tokenize: 40.0 sec
test_unicodedata: 28.3 sec
test_capi: 27.8 sec
test_fstring: 24.5 sec
test_pickle: 23.0 sec

1 test altered the execution environment:
test_capi

117 tests skipped:
test.test_asyncio.test_base_events
test.test_asyncio.test_buffered_proto
test.test_asyncio.test_context
test.test_asyncio.test_eager_task_factory
test.test_asyncio.test_events test.test_asyncio.test_futures
test.test_asyncio.test_futures2 test.test_asyncio.test_locks
test.test_asyncio.test_pep492
test.test_asyncio.test_proactor_events
test.test_asyncio.test_protocols test.test_asyncio.test_queues
test.test_asyncio.test_runners
test.test_asyncio.test_selector_events
test.test_asyncio.test_sendfile test.test_asyncio.test_server
test.test_asyncio.test_sock_lowlevel test.test_asyncio.test_ssl
test.test_asyncio.test_sslproto test.test_asyncio.test_streams
test.test_asyncio.test_subprocess
test.test_asyncio.test_taskgroups test.test_asyncio.test_tasks
test.test_asyncio.test_threads test.test_asyncio.test_timeouts
test.test_asyncio.test_transports
test.test_asyncio.test_unix_events test.test_asyncio.test_waitfor
test.test_asyncio.test_windows_events
test.test_asyncio.test_windows_utils test__xxinterpchannels
test__xxsubinterpreters test_asyncgen test_clinic test_cmd_line
test_concurrent_futures test_contextlib_async test_ctypes
test_curses test_dbm_gnu test_dbm_ndbm test_devpoll test_doctest
test_docxmlrpc test_dtrace test_embed test_epoll test_faulthandler
test_fcntl test_file_eintr test_fork1 test_ftplib test_gdb
test_generated_cases test_grp test_httplib test_httpservers
test_idle test_imaplib test_interpreters test_ioctl test_kqueue
test_launcher test_lzma test_mmap test_multiprocessing_fork
test_multiprocessing_forkserver test_multiprocessing_main_handling
test_multiprocessing_spawn test_openpty test_pdb
test_perf_profiler test_perfmaps test_poll test_poplib test_pty
test_pwd test_readline test_regrtest test_repl test_resource
test_select test_selectors test_smtplib test_smtpnet test_socket
test_socketserver test_ssl test_stable_abi_ctypes test_startfile
test_subprocess test_sys_settrace test_syslog test_tcl
test_tkinter test_tools test_ttk test_ttk_textonly test_turtle
test_urllib2 test_urllib2_localnet test_urllib2net test_urllibnet
test_venv test_wait3 test_wait4 test_webbrowser test_winconsoleio
test_winreg test_winsound test_wmi test_wsgiref test_xmlrpc
test_xxlimited test_zipfile64 test_zipimport_support test_zoneinfo

Total duration: 26 min 4 sec

Click to see traceback logs

Traceback (most recent call last):
  File "/opt/buildbot/bcannon-wasm/3.x.bcannon-wasm.emscripten-node-pthreads/build/Lib/test/test_capi/test_watchers.py", line 532, in watcher
    raise MyError("testing 123")

This finishes the work begun in gh-107760. When, while projecting a superblock, we encounter a call to a short, simple function, the superblock will now enter the function using `_PUSH_FRAME`, continue through it, and leave it using `_POP_FRAME`, and then continue through the original code. Multiple frame pushes and pops are even possible. It is also possible to stop appending to the superblock in the middle of a called function, when running out of space or encountering an unsupported bytecode.

gvanrossum added the skip news label Aug 8, 2023

gvanrossum requested review from markshannon and brandtbucher August 8, 2023 04:48

bedevere-bot mentioned this pull request Aug 8, 2023

Call design for Tier 2 (uops) interpreter #106581

Closed

bedevere-bot added the awaiting core review label Aug 8, 2023

Eclips4 added the interpreter-core (Objects, Python, Grammar, and Parser dirs) label Aug 8, 2023

brandtbucher approved these changes Aug 8, 2023

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting core review labels Aug 8, 2023

gvanrossum commented Aug 8, 2023

View reviewed changes

brandtbucher approved these changes Aug 8, 2023

View reviewed changes

gvanrossum mentioned this pull request Aug 9, 2023

gh-106581: Start projecting through calls #107793

Closed

2 tasks

markshannon reviewed Aug 9, 2023

View reviewed changes

brandtbucher reviewed Aug 9, 2023

View reviewed changes

Python/bytecodes.c Show resolved Hide resolved

gvanrossum marked this pull request as draft August 9, 2023 17:41

bedevere-bot removed the awaiting merge label Aug 9, 2023

gvanrossum added 2 commits August 9, 2023 11:41

Fix merge so it works again (I think)

907ff95

gvanrossum force-pushed the call-uops branch from 9784cc2 to 907ff95 Compare August 9, 2023 19:01

Split into finer-grained uops

2c6be6d

gvanrossum force-pushed the call-uops branch from 91ebd3f to 2c6be6d Compare August 9, 2023 23:03

gvanrossum added 6 commits August 9, 2023 17:24

Fix type error in stacking.py

6d78ff2

Add test

0d8e66c

Add comment explaining _PUSH_FRAME's unused output effect

b75f30e

Make PUSH_FRAME special case a little less myterious

61c2822

Rename Instruction.write to write_case_body

f73ea90

Move next_instr update to a more logical place

12910fc

gvanrossum added 5 commits August 10, 2023 16:19

Don't recompute macro cache offset

2fafa2c

Fold and refactor long line in stacking.py

2717b07

Fold long lines in generate_cases.py

e487908

Don't emit static assert to executor cases

1d549af

Factor away write_case_body (formerly Instruction.write)

f40fb1f

ambv closed this Aug 11, 2023

ambv reopened this Aug 11, 2023

gvanrossum added 3 commits August 11, 2023 14:59

Fold long lines

4f6f8f8

Make less of a special case of _PUSH_FRAME

6facc8d

Instead, the special case is an opcode using SAVE_FRAME_STATE(). Introducing #if TIER_ONE and #if TIER_TWO so we can implement _PUSH_FRAME differently for both tiers.

Stop special-casing _PUSH_FRAME altogether

94630d4

Instead, we special-case SAVE_IP: - Its Tier 2 expansion sets oparg to the instruction offset - In Tier 1 it is a no-op (and skipped if present in a macro)

gvanrossum marked this pull request as ready for review August 13, 2023 03:31

bedevere-bot added the awaiting core review label Aug 13, 2023

gvanrossum mentioned this pull request Aug 14, 2023

gh-107557: Setup abstract interpretation #107847

Merged

2 tasks

gvanrossum added 2 commits August 15, 2023 13:20

Call _Py_EnterRecursivePy in _FRAME_PUSH

cf8e2c0

Merge remote-tracking branch 'upstream/main' into call-uops

1e62876

gvanrossum mentioned this pull request Aug 15, 2023

IGNORE: Call uops forever #107952

Closed

markshannon reviewed Aug 16, 2023

View reviewed changes

gvanrossum mentioned this pull request Aug 16, 2023

GH-108035: Remove the _PyCFrame struct as it is no longer needed for performance. #108036

Merged

Introduce SAVE_CURRENT_IP uop per Mark's proposal

05af848

gvanrossum merged commit dc8fdf5 into python:main Aug 16, 2023

bedevere-bot removed the awaiting core review label Aug 16, 2023

gvanrossum deleted the call-uops branch August 16, 2023 23:31

gvanrossum mentioned this pull request Aug 16, 2023

gh-106581: Project through calls #108067

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-106581: Split `CALL_PY_EXACT_ARGS` into uops #107760

gh-106581: Split `CALL_PY_EXACT_ARGS` into uops #107760

gvanrossum commented Aug 8, 2023 •

edited

Loading

brandtbucher left a comment

gvanrossum left a comment

markshannon Aug 9, 2023

gvanrossum Aug 9, 2023

markshannon Aug 9, 2023 •

edited

Loading

markshannon Aug 9, 2023 •

edited

Loading

gvanrossum Aug 9, 2023

gvanrossum Aug 9, 2023 •

edited

Loading

gvanrossum Aug 9, 2023

gvanrossum Aug 11, 2023

gvanrossum commented Aug 9, 2023

brandtbucher commented Aug 9, 2023

ambv commented Aug 11, 2023

gvanrossum commented Aug 16, 2023

markshannon left a comment

markshannon Aug 16, 2023 •

edited

Loading

gvanrossum Aug 16, 2023

gvanrossum Aug 16, 2023

gvanrossum Aug 16, 2023

markshannon Aug 16, 2023 •

edited

Loading

gvanrossum commented Aug 16, 2023

bedevere-bot commented Aug 17, 2023

gh-106581: Split CALL_PY_EXACT_ARGS into uops #107760

gh-106581: Split CALL_PY_EXACT_ARGS into uops #107760

Conversation

gvanrossum commented Aug 8, 2023 • edited Loading

brandtbucher left a comment

Choose a reason for hiding this comment

gvanrossum left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

markshannon Aug 9, 2023 • edited Loading

Choose a reason for hiding this comment

markshannon Aug 9, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gvanrossum Aug 9, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gvanrossum commented Aug 9, 2023

brandtbucher commented Aug 9, 2023

ambv commented Aug 11, 2023

gvanrossum commented Aug 16, 2023

markshannon left a comment

Choose a reason for hiding this comment

markshannon Aug 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

markshannon Aug 16, 2023 • edited Loading

Choose a reason for hiding this comment

gvanrossum commented Aug 16, 2023

bedevere-bot commented Aug 17, 2023

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

gh-106581: Split `CALL_PY_EXACT_ARGS` into uops #107760

gh-106581: Split `CALL_PY_EXACT_ARGS` into uops #107760

gvanrossum commented Aug 8, 2023 •

edited

Loading

markshannon Aug 9, 2023 •

edited

Loading

markshannon Aug 9, 2023 •

edited

Loading

gvanrossum Aug 9, 2023 •

edited

Loading

markshannon Aug 16, 2023 •

edited

Loading

markshannon Aug 16, 2023 •

edited

Loading