Single-Phase Init Extension Module Init Functions Still Run in Isolated Interpreters #117953

ericsnowcurrently · 2024-04-16T23:47:11Z

Bug report

Bug description:

When an extension module is imported the first time, we load the shared-object file and get the module's init function from it. We then run that init function and use the returned object to decide if the module is single-phase init or multi-phase init.

For isolated subinterpreters, where PyInterpreterConfig.check_multi_interp_extensions (AKA Py_RTFLAGS_MULTI_INTERP_EXTENSIONS) is True, we immediately fail single-phase init modules. The problem is that at that point the init function has already run, which all sorts of potential side effects and process-global state (including registered callbacks) that we mostly can't clean up.

This has come up before, for example with the readline module. It's potentially a bigger problem than I thought at first, so I'd like to get it sorted out for 3.13.

FWIW, the simplest solution I can think of is to always call the module init func from the main interpreter (without necessarily doing all the other import steps there). That would look something like this:

start a normal import in an isolated subinterpreter
get the init function
switch to the main interpreter
call the init function
switch back
fail if it is single-phase init (and remember that fact)

For the main interpreter and non-isolated subinterpreters, nothing different would happen from now; there would be no switching. Also, if the first attempt was in an isolated interpreter (which would fail), a subsequent import of that module in the main interpreter (or a non-isolated one) would succeed.

The only tricky part is, when the init function raises an exception, how to an propagate it from the main interpreter to the subinterpreter. For multi-phase init (if known) we would just call the init func again after switching back. For single-phase init (or unknown) we'd have preserve the exception somehow. This is something I had to deal with for _interpreters.exec(), but I'm not sure the same thing will work here.

CPython versions tested on:

CPython main branch

Operating systems tested on:

No response

Linked PRs

The text was updated successfully, but these errors were encountered:

) This is a collection of very basic cleanups I've pulled out of gh-118116. It is mostly renaming variables and moving a couple bits of code in functionally equivalent ways.

These are cleanups I've pulled out of gh-118116. Mostly, this change moves code around to align with some future changes and to improve clarity a little. There is one very small change in behavior: we now add the module to the per-interpreter caches after updating the global state, rather than before.

…inglephase or Not (gh-118193) This change makes other upcoming changes simpler.

This helps with a later change that splits up _PyImport_LoadDynamicModuleWithSpec().

A couple of refleaks slipped through in gh-118194. This takes care of them. (AKA _Py_ext_module_loader_info_init() does not steal references.)

…-118250) A couple of refleaks slipped through in pythongh-118194. This takes care of them. (AKA _Py_ext_module_loader_info_init() does not steal references.)

encukou · 2024-04-29T10:10:03Z

Continuing here from the closed PR. @ericsnowcurrently wrote:

Thanks for bringing [m_slots=NULL] up. What you've said makes sense and appreciate the clarity of it. There are indeed some gaps (fairly small, I expect), both functionally and in tests. It isn't clear what the impact is in practice, but I definitely think they should be addressed regardless. I'll be sure to open some issues in the next week or two.

Thanks!

AFAICS, the current state is that is_singlephase() function might return true on some multi-phase modules. And it's currently only used in asserts, where assert(is_singlephase(...)) is fine, but assert(!is_singlephase(...)) runs into the false positive. The current use of the latter is in create_builtin, which is AFAIK only used modules that don't have m_slots=NULL. Another use in #118203 broke a test.

IMO, current PRs should simply avoid assert(!is_singlephase); in another PR the function should become assert_is_singlephase(), to limit future uses to the case that works.

FWIW, the general thought of kinds of extension modules was already on my mind (and a bit clearer in my original mega-PR, #118116). For now I have a later PR (#118205) that is a bit more deliberate about keeping track of what the init function returns. While that PR doesn't do so currently, I might look at explicitly tracking the kind on the module def. That would help address the point you've brought up.

Yes, looks like this tracking will be needed in order to use this for more than asserts.
I guess “tracking the kind on the module def” is a typo and you meant “he kind of the module def”? (PyModuleDef is part of the stable ABI, and it doesn't have space for more data.)

Basically, I've turned most of _PyImport_LoadDynamicModuleWithSpec() into two new functions (_PyImport_GetModInitFunc() and _PyImport_RunModInitFunc()) and moved the rest of it out into _imp_create_dynamic_impl(). There shouldn't be any changes in behavior. This change makes some future changes simpler. This is particularly relevant to potentially calling each module init function in the main interpreter first. Thus the critical part of the PR is the addition of _PyImport_RunModInitFunc(), which is strictly focused on running the init func and validating the result. A later PR will take it a step farther by capturing error information rather than raising exceptions. FWIW, this change also helps readers by clarifying a bit more about what happens when an extension/builtin module is imported.

ericsnowcurrently · 2024-04-29T15:41:17Z

(PyModuleDef is part of the stable ABI, and it doesn't have space for more data.)

Right. Tracking it *on the module def would require hiding the bit(s) in one of the existing fields. That's doable, but I'd do that only if there wasn't another good place to stick the info.

…nsions (gh-118204) This change will make some later changes simpler. It also brings more consistent behavior and lower maintenance costs.

…chinery (gh-118205) This change will make some later changes simpler.

This change will make some later changes simpler.

We have only been tracking each module's PyModuleDef. However, there are some problems with that. For example, in some cases we load single-phase init extension modules from def->m_base.m_init or def->m_base.m_copy, but if multiple modules share a def then we can end up with unexpected behavior. With this change, we track the following: * PyModuleDef (same as before) * for some modules, its init function or a copy of its __dict__, but specific to that module * whether it is a builtin/core module or a "dynamic" extension * the interpreter (ID) that owns the cached __dict__ (only if cached) This also makes it easier to remember the module's kind (e.g. single-phase init) and if loading it previously failed, which I'm doing separately.

) This ensures the kind is always either _Py_ext_module_kind_SINGLEPHASE or _Py_ext_module_kind_MULTIPHASE.

…h-118157) This change makes sure all extension/builtin modules have their init function run first by the main interpreter before proceeding with import in the original interpreter (main or otherwise). This means when the import of a single-phase init module fails in an isolated subinterpreter, it won't tie any global state/callbacks to the subinterpreter.

ericsnowcurrently · 2024-05-07T19:59:05Z

The core change has landed, but there are a few small follow-up things to wrap up.

…ort Machinery (pythongh-118205) This change will make some later changes simpler.

…-118206) This change will make some later changes simpler.

…ongh-118532) We have only been tracking each module's PyModuleDef. However, there are some problems with that. For example, in some cases we load single-phase init extension modules from def->m_base.m_init or def->m_base.m_copy, but if multiple modules share a def then we can end up with unexpected behavior. With this change, we track the following: * PyModuleDef (same as before) * for some modules, its init function or a copy of its __dict__, but specific to that module * whether it is a builtin/core module or a "dynamic" extension * the interpreter (ID) that owns the cached __dict__ (only if cached) This also makes it easier to remember the module's kind (e.g. single-phase init) and if loading it previously failed, which I'm doing separately.

…ythongh-118684) This ensures the kind is always either _Py_ext_module_kind_SINGLEPHASE or _Py_ext_module_kind_MULTIPHASE.

…irst (pythongh-118157) This change makes sure all extension/builtin modules have their init function run first by the main interpreter before proceeding with import in the original interpreter (main or otherwise). This means when the import of a single-phase init module fails in an isolated subinterpreter, it won't tie any global state/callbacks to the subinterpreter.

ericsnowcurrently · 2024-06-18T15:00:07Z

FYI, in gh-118157 I disabled test_interpreters under Py_GIL_DISABLED. I did so because of failures on free-threading builds in CI:

(expand for an example)

0:08:30 load avg: 5.18 Re-running 1 failed tests in verbose mode in subprocesses
0:08:30 load avg: 5.18 Run 1 test in parallel using 1 worker process (timeout: 10 min, worker timeout: 15 min)
0:08:30 load avg: 5.18 [1/1/1] test_interpreters failed (1 failure)
Re-running test_interpreters in verbose mode (matching: test_display_preserved_exception)
test_display_preserved_exception (test.test_interpreters.test_api.TestInterpreterExec.test_display_preserved_exception) ... FAIL

======================================================================
FAIL: test_display_preserved_exception (test.test_interpreters.test_api.TestInterpreterExec.test_display_preserved_exception)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/runner/work/cpython/cpython-ro-srcdir/Lib/test/support/__init__.py", line 2603, in wrapper
    return func(*args, **kwargs)
  File "/home/runner/work/cpython/cpython-ro-srcdir/Lib/test/test_interpreters/test_api.py", line 764, in test_display_preserved_exception
    self.assertEqual(stderr, dedent(f"""\
    ~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^
        Traceback (most recent call last):
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    ...<18 lines>...
        RuntimeError: uh-oh!
        ^^^^^^^^^^^^^^^^^^^^
        """))
        ^^^^^
AssertionError: 'Trac[778 chars]oh!\nException ignored in: <function Interpret[519 chars]\n\n' != 'Trac[778 chars]oh!\n'
  Traceback (most recent call last):
    File "/tmp/test_python_thv394xy/tmpsdujy1zv/script.py", line 9, in <module>
      interp.exec(script)
      ~~~~~~~~~~~^^^^^^^^
    File "/home/runner/work/cpython/cpython-ro-srcdir/Lib/test/support/interpreters/__init__.py", line 227, in exec
      raise ExecutionFailed(excinfo)
  test.support.interpreters.ExecutionFailed: RuntimeError: uh-oh!
  
  Uncaught in the interpreter:
  
  Traceback (most recent call last):
    File "/tmp/test_python_thv394xy/tmpsdujy1zv/script.py", line 6, in script
      spam.eggs()
      ~~~~~~~~~^^
    File "/tmp/test_python_thv394xy/tmpsdujy1zv/spam.py", line 6, in eggs
      ham()
      ~~~^^
    File "/tmp/test_python_thv394xy/tmpsdujy1zv/spam.py", line 3, in ham
      raise RuntimeError('uh-oh!')
  RuntimeError: uh-oh!
- Exception ignored in: <function Interpreter.__del__ at 0x2000119aed0>
- Traceback (most recent call last):
-   File "/home/runner/work/cpython/cpython-ro-srcdir/Lib/test/support/interpreters/__init__.py", line 157, in __del__
-   File "/home/runner/work/cpython/cpython-ro-srcdir/Lib/test/support/interpreters/__init__.py", line 173, in _decref
- TypeError: catching classes that do not inherit from BaseException is not allowed
- Fatal Python error: PyInterpreterState_Delete: remaining subinterpreters
- Python runtime state: finalizing (tstate=0x0000562d95495a10)
- 


----------------------------------------------------------------------
Ran 1 test in 0.285s

FAILED (failures=1)
test test_interpreters failed
1 test failed again:
    test_interpreters

== Tests result: FAILURE then FAILURE ==

The tests should be re-enabled and made to work before this issue be considered resolved.

…nGH-120689) (cherry picked from commit 1035fe0) Co-authored-by: Nice Zombies <[email protected]>

…20707) (cherry picked from commit 1035fe0, AKA gh-120689) Co-authored-by: Nice Zombies <[email protected]>

…n#120689)

ericsnowcurrently added type-bug An unexpected behavior, bug, or error interpreter-core (Objects, Python, Grammar, and Parser dirs) topic-subinterpreters 3.13 bugs and security fixes labels Apr 16, 2024

github-project-automation bot added this to Subinterpreters Apr 16, 2024

github-project-automation bot moved this to Todo in Subinterpreters Apr 16, 2024

bedevere-app bot mentioned this issue Apr 16, 2024

gh-117953: Always Run Extension Init Func in Main Interpreter First #117487

Closed

ericsnowcurrently mentioned this issue Apr 19, 2024

gh-116322: Add Py_mod_gil module slot #116882

Merged

ericsnowcurrently added a commit that referenced this issue Apr 24, 2024

gh-117953: Let update_global_state_for_extension() Caller Decide If S…

1acd249

…inglephase or Not (gh-118193) This change makes other upcoming changes simpler.

ericsnowcurrently added a commit that referenced this issue Apr 24, 2024

gh-117953: Add Internal struct _Py_ext_module_loader_info (gh-118194)

5865fa5

This helps with a later change that splits up _PyImport_LoadDynamicModuleWithSpec().

bedevere-app bot mentioned this issue Apr 24, 2024

gh-117953: Fix Refleaks Introduced by gh-118194 #118250

Merged

ericsnowcurrently added a commit that referenced this issue Apr 24, 2024

gh-117953: Fix Refleaks Introduced by gh-118194 (gh-118250)

85ec1c2

A couple of refleaks slipped through in gh-118194. This takes care of them. (AKA _Py_ext_module_loader_info_init() does not steal references.)

ericsnowcurrently added a commit that referenced this issue May 1, 2024

gh-117953: Work Relative to Specific Extension Kinds in the Import Ma…

526ca4c

…chinery (gh-118205) This change will make some later changes simpler.

ericsnowcurrently added a commit that referenced this issue May 3, 2024

gh-117953: Other Cleanups in the Extensions Machinery (gh-118206)

f201628

This change will make some later changes simpler.

bedevere-app bot mentioned this issue May 3, 2024

gh-117953: Track Extra Details in Global Extensions Cache #118532

Merged

bedevere-app bot mentioned this issue May 7, 2024

gh-117953: Imply Single-phase Init if the Init Function Fails #118684

Merged

ericsnowcurrently added a commit that referenced this issue May 7, 2024

gh-117953: Imply Single-phase Init if the Init Function Fails (gh-118684

1a23716

) This ensures the kind is always either _Py_ext_module_kind_SINGLEPHASE or _Py_ext_module_kind_MULTIPHASE.

SonicField pushed a commit to SonicField/cpython that referenced this issue May 8, 2024

pythongh-117953: Work Relative to Specific Extension Kinds in the Imp…

39094af

…ort Machinery (pythongh-118205) This change will make some later changes simpler.

SonicField pushed a commit to SonicField/cpython that referenced this issue May 8, 2024

pythongh-117953: Other Cleanups in the Extensions Machinery (pythongh…

f22f954

…-118206) This change will make some later changes simpler.

sebsura mentioned this issue May 23, 2024

Some builtin single phase modules do get loaded into subinterpreters which do not support them #119450

Closed

bedevere-app bot mentioned this issue Jun 18, 2024

gh-117953: Skip test_interpreters properly without GIL #120689

Merged

vstinner pushed a commit that referenced this issue Jun 18, 2024

gh-117953: Skip test_interpreters properly without GIL (#120689)

1035fe0

miss-islington pushed a commit to miss-islington/cpython that referenced this issue Jun 18, 2024

pythongh-117953: Skip test_interpreters properly without GIL (pytho…

37e8945

…nGH-120689) (cherry picked from commit 1035fe0) Co-authored-by: Nice Zombies <[email protected]>

bedevere-app bot mentioned this issue Jun 18, 2024

[3.13] gh-117953: Skip test_interpreters properly without GIL (GH-120689) #120707

Merged

ericsnowcurrently pushed a commit that referenced this issue Jun 18, 2024

[3.13] gh-117953: Skip test_interpreters properly without GIL (gh-1…

07145dd

…20707) (cherry picked from commit 1035fe0, AKA gh-120689) Co-authored-by: Nice Zombies <[email protected]>

picnixz pushed a commit to picnixz/cpython that referenced this issue Jun 19, 2024

pythongh-117953: Skip test_interpreters properly without GIL (pytho…

145a126

…n#120689)

mrahtz pushed a commit to mrahtz/cpython that referenced this issue Jun 30, 2024

pythongh-117953: Skip test_interpreters properly without GIL (pytho…

e5e61ea

…n#120689)

noahbkim pushed a commit to hudson-trading/cpython that referenced this issue Jul 11, 2024

pythongh-117953: Skip test_interpreters properly without GIL (pytho…

064368c

…n#120689)

estyxx pushed a commit to estyxx/cpython that referenced this issue Jul 17, 2024

pythongh-117953: Skip test_interpreters properly without GIL (pytho…

69c6157

…n#120689)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Single-Phase Init Extension Module Init Functions Still Run in Isolated Interpreters #117953

Single-Phase Init Extension Module Init Functions Still Run in Isolated Interpreters #117953

ericsnowcurrently commented Apr 16, 2024 •

edited by bedevere-app bot

Loading

encukou commented Apr 29, 2024

ericsnowcurrently commented Apr 29, 2024

ericsnowcurrently commented May 7, 2024

ericsnowcurrently commented Jun 18, 2024

Single-Phase Init Extension Module Init Functions Still Run in Isolated Interpreters #117953

Single-Phase Init Extension Module Init Functions Still Run in Isolated Interpreters #117953

Comments

ericsnowcurrently commented Apr 16, 2024 • edited by bedevere-app bot Loading

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

encukou commented Apr 29, 2024

ericsnowcurrently commented Apr 29, 2024

ericsnowcurrently commented May 7, 2024

ericsnowcurrently commented Jun 18, 2024

ericsnowcurrently commented Apr 16, 2024 •

edited by bedevere-app bot

Loading