Fix segfault when reloading interpreter with external modules #1092

jagerman · 2017-09-16T15:40:00Z

When embedding the interpreter and loading external modules in that
embedded interpreter, the external module correctly shares its
internals_ptr with the one in the embedded interpreter. When the
interpreter is shut down, however, only the internals_ptr local to
the embedded code is actually reset to nullptr: the external module
remains set.

The result is that loading an external pybind11 module, letting the
interpreter go through a finalize/initialize, then attempting to use
something in the external module fails because this external module is
still trying to use the old (destroyed) internals. This causes
undefined behaviour (typically a segfault).

This commit fixes it by adding a level of indirection in the internals
path, converting the local internals variable to internals ** instead
of internals *. With this change, we can detect a stale internals
pointer and reload the internals pointer (either from a capsule or by
creating a new internals instance).

(No issue number: this was reported on gitter by @henryiii and @aoloe; thanks to both for the issue and reproducible test case).

@henryiii

When embedding the interpreter and loading external modules in that embedded interpreter, the external module correctly shares its internals_ptr with the one in the embedded interpreter. When the interpreter is shut down, however, only the `internals_ptr` local to the embedded code is actually reset to nullptr: the external module remains set. The result is that loading an external pybind11 module, letting the interpreter go through a finalize/initialize, then attempting to use something in the external module fails because this external module is still trying to use the old (destroyed) internals. This causes undefined behaviour (typically a segfault). This commit fixes it by adding a level of indirection in the internals path, converting the local internals variable to `internals **` instead of `internals *`. With this change, we can detect a stale internals pointer and reload the internals pointer (either from a capsule or by creating a new internals instance). (No issue number: this was reported on gitter by @henryiii and @aoloe).

`cmake -E env` was added in 3.2. This drops the built external module in the source directory instead. (This is a bit ugly, but it's the same ugliness we already do for the `pytest` tests).

aoloe · 2017-09-17T07:46:31Z

bot the test code and the original code that raised the issue work correctly with this pull request.

thanks!

jagerman · 2017-09-19T14:46:05Z

Marking this for v2.2.2; this doesn't change what gets stored in the builtins, and so should be perfectly compatible with v2.2.[01] modules.

jagerman · 2017-09-22T02:31:48Z

Cc @dean0x7d for comments. What is here works, but maybe you have a nicer idea than the double pointer.

wjakob · 2017-11-16T21:38:08Z

I feel like I'm missing important to understand this PR. When the interpreter shuts down, how does the internals pointer get marked as stale? I just see two leaking memory allocations (one for internals, one for the pointer to it) but no changes to shutdown/GC-related code.

jagerman · 2017-11-16T22:01:53Z

The critical change for that is in finalize_interpreter: the *internals_ptr_ptr that gets deleted and set to nullptr is, with this PR, shared across all modules and the embedded interpreter with the same internals version. Previously each module/interpret had its own pointer, so the delete *internals_ptr_ptr destroyed it, but the *internals_ptr_ptr = nullptr; was only setting the local version to nullptr: outside modules still had their internals pointer set.

Basically, there is now only one pointer to the internals data, where previously there were n pointers to it: each module/interpreter only stores a local pointer to that pointer so that when something deletes it (and sets the one-true-pointer to nullptr) they all see it.

wjakob · 2017-11-16T22:24:11Z

Aha, understood now. Thank you for the clarification.

@henryiii

…#1092) * Fix segfault when reloading interpreter with external modules When embedding the interpreter and loading external modules in that embedded interpreter, the external module correctly shares its internals_ptr with the one in the embedded interpreter. When the interpreter is shut down, however, only the `internals_ptr` local to the embedded code is actually reset to nullptr: the external module remains set. The result is that loading an external pybind11 module, letting the interpreter go through a finalize/initialize, then attempting to use something in the external module fails because this external module is still trying to use the old (destroyed) internals. This causes undefined behaviour (typically a segfault). This commit fixes it by adding a level of indirection in the internals path, converting the local internals variable to `internals **` instead of `internals *`. With this change, we can detect a stale internals pointer and reload the internals pointer (either from a capsule or by creating a new internals instance). (No issue number: this was reported on gitter by @henryiii and @aoloe).

@henryiii

* Fix segfault when reloading interpreter with external modules When embedding the interpreter and loading external modules in that embedded interpreter, the external module correctly shares its internals_ptr with the one in the embedded interpreter. When the interpreter is shut down, however, only the `internals_ptr` local to the embedded code is actually reset to nullptr: the external module remains set. The result is that loading an external pybind11 module, letting the interpreter go through a finalize/initialize, then attempting to use something in the external module fails because this external module is still trying to use the old (destroyed) internals. This causes undefined behaviour (typically a segfault). This commit fixes it by adding a level of indirection in the internals path, converting the local internals variable to `internals **` instead of `internals *`. With this change, we can detect a stale internals pointer and reload the internals pointer (either from a capsule or by creating a new internals instance). (No issue number: this was reported on gitter by @henryiii and @aoloe).

jagerman added 2 commits September 16, 2017 12:38

fix for cmake <3.2

fa81319

`cmake -E env` was added in 3.2. This drops the built external module in the source directory instead. (This is a bit ugly, but it's the same ugliness we already do for the `pytest` tests).

jagerman added this to the v2.2.2 milestone Sep 19, 2017

jagerman merged commit 326deef into pybind:master Jan 11, 2018

jagerman mentioned this pull request Jan 23, 2018

embedded windows interpreter crashes on exit if external module loaded #1259

Closed

rwgk mentioned this pull request Feb 9, 2023

FWD pybind11 google/pybind11clif#1092

Closed

rwgk mentioned this pull request May 7, 2025

feat: support for sub-interpreters #5564

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix segfault when reloading interpreter with external modules #1092

Fix segfault when reloading interpreter with external modules #1092

Uh oh!

jagerman commented Sep 16, 2017

Uh oh!

aoloe commented Sep 17, 2017

Uh oh!

jagerman commented Sep 19, 2017

Uh oh!

jagerman commented Sep 22, 2017

Uh oh!

wjakob commented Nov 16, 2017

Uh oh!

jagerman commented Nov 16, 2017

Uh oh!

wjakob commented Nov 16, 2017

Uh oh!

Uh oh!

Fix segfault when reloading interpreter with external modules #1092

Fix segfault when reloading interpreter with external modules #1092

Uh oh!

Conversation

jagerman commented Sep 16, 2017

Uh oh!

aoloe commented Sep 17, 2017

Uh oh!

jagerman commented Sep 19, 2017

Uh oh!

jagerman commented Sep 22, 2017

Uh oh!

wjakob commented Nov 16, 2017

Uh oh!

jagerman commented Nov 16, 2017

Uh oh!

wjakob commented Nov 16, 2017

Uh oh!

Uh oh!