cuda.core: harden graph user-object destructor during Python shutdown by aryanputta · Pull Request #2074 · NVIDIA/cuda-python

aryanputta · 2026-05-12T20:38:34Z

Summary

Closes #2042.

_py_host_destructor unconditionally entered with gil before calling
Py_DECREF. That is fine during normal runtime, but it is unsafe in the
graph user-object path because CUDA may invoke the destructor
asynchronously after interpreter finalization has begun.

This change makes _py_host_destructor nogil, checks
Py_IsInitialized(), and only enters a GIL section when Python is still
initialized.

Changes

cuda_core/cuda/core/graph/_utils.pyx: declare Py_IsInitialized() and
harden _py_host_destructor so it only acquires the GIL when Python is
still initialized.
cuda_core/cuda/core/graph/_utils.pxd: update the destructor signature
to match.

The normal runtime path is unchanged: if Python is still alive, the
destructor still decrefs the attached object. The change only affects the
shutdown edge where acquiring the GIL is no longer safe.

Validation

Locally verified:

git diff --check passes.
The change is limited to the existing graph user-object destructor path in
_utils.pyx / _utils.pxd.
The commit is DCO-signed and SSH-signed.

Not fully verified on this machine:

full cuda.core test suite
editable build of cuda_core

Related Work

Closes Harden _py_host_destructor against invocation after Py_Finalize #2042.
This is the hardening follow-up called out in cuda.core: keep kernel-argument objects alive in graph kernel nodes #2041 after graph
user-object lifetime handling was extended to kernel arguments.

Signed-off-by: Aryan <aryansputta@gmail.com>

copy-pr-bot · 2026-05-12T20:38:39Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Aryan <aryansputta@gmail.com>

rwgk

This looks good to me.

It’d be nice to add a comment like this in cuda_core/cuda/core/_cpp/resource_handles.hpp next to the existing py_is_finalizing() helper. It documents an important subtlety from CPython shutdown semantics: the check is only best-effort, and there is still a narrow race where non-finalizer threads can be killed on older CPython or hang on newer CPython. That means teardown in this layer can still be cut short in rare cases, so the guard reduces the risk but does not fully eliminate it.

  // Best-effort probe for interpreter shutdown.
  //
  // In CPython this is not a hard guarantee: finalization can begin after this
  // returns false but before a later PyGILState_Ensure() / other Python C-API
  // call.
  //
  // If that race is lost on a non-finalizer thread, CPython's behavior is
  // version-dependent: on older supported versions (3.10-3.13) it may abruptly the
  // current thread (historically via PyThread_exit_thread(), i.e. without normal
  // C++ unwinding), while on newer versions (3.14+) it may hang the thread until
  // process exit.
  //
  // We still use this check because the policy in this layer is to avoid Python
  // work once shutdown is underway and accept an intentional leak / skipped
  // Python conversion in that edge case rather than add more complex deferral
  // machinery.
  inline bool py_is_finalizing() noexcept {
  #if PY_VERSION_HEX >= 0x030D0000
      return Py_IsFinalizing();
  #else
      return _Py_IsFinalizing() != 0;
  #endif
  }

Signed-off-by: Aryan <aryansputta@gmail.com>

…o fix-graph-destructor-shutdown

rwgk

Thanks!

rwgk · 2026-05-14T00:36:21Z

/ok to test 327b53e

github-actions · 2026-05-14T00:54:44Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-2074/
https://nvidia.github.io/cuda-python/pr-preview/pr-2074/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-2074/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-2074/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

aryanputta · 2026-05-14T16:56:50Z

@rwgk, I’ve updated the resource handles to include the suggested comments on shutdown semantics. I believe all feedback has been addressed; please let me know if any additional modifications are needed before merging.

rwgk · 2026-05-14T17:12:43Z

This looks ready to merge to me, but @Andy-Jost is the original author of the changed code, I'm waiting to give him a chance to approve, too.

rwgk · 2026-05-14T17:14:02Z

@aryanputta at this stage it's best if you don't merge main anymore, unless the UI here reports conflicts.

Each time you merge main, we have to rerun the CI again.

aryanputta · 2026-05-14T17:17:43Z

@aryanputta at this stage it's best if you don't merge main anymore, unless the UI here reports conflicts.

Each time you merge main, we have to rerun the CI again.

Okay, sorry!

cuda.core: harden graph user-object destructor during Python shutdown

f28b48c

Signed-off-by: Aryan <aryansputta@gmail.com>

github-actions Bot added the cuda.core Everything related to the cuda.core module label May 12, 2026

Merge branch 'main' into fix-graph-destructor-shutdown

09ad0c9

rwgk reviewed May 12, 2026

View reviewed changes

Comment thread cuda_core/cuda/core/graph/_utils.pyx Outdated

cuda.core: gate graph destructor on finalization

cb38dcf

mdboom requested a review from Andy-Jost May 13, 2026 12:48

Andy-Jost reviewed May 13, 2026

View reviewed changes

Comment thread cuda_core/cuda/core/graph/_utils.pyx Outdated

aryanputta added 2 commits May 13, 2026 14:35

Refactor graph Python user-object destruction

ee0c590

Signed-off-by: Aryan <aryansputta@gmail.com>

Merge branch 'main' into fix-graph-destructor-shutdown

b3a1bc3

rwgk approved these changes May 13, 2026

View reviewed changes

aryanputta added 2 commits May 13, 2026 20:07

Document py_is_finalizing shutdown race

e1d7b23

Signed-off-by: Aryan <aryansputta@gmail.com>

Merge remote-tracking branch 'fork/fix-graph-destructor-shutdown' int…

327b53e

…o fix-graph-destructor-shutdown

rwgk approved these changes May 14, 2026

View reviewed changes

rwgk assigned aryanputta May 14, 2026

rwgk added the P0 High priority - Must do! label May 14, 2026

rwgk added this to the cuda.core next milestone May 14, 2026

rwgk added the bug Something isn't working label May 14, 2026

Merge branch 'main' into fix-graph-destructor-shutdown

5b106c4

leofang modified the milestones: cuda.core next, cuda.core v1.1.0 May 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda.core: harden graph user-object destructor during Python shutdown#2074

cuda.core: harden graph user-object destructor during Python shutdown#2074
aryanputta wants to merge 8 commits into
NVIDIA:mainfrom
aryanputta:fix-graph-destructor-shutdown

aryanputta commented May 12, 2026 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented May 12, 2026

Uh oh!

Uh oh!

Uh oh!

rwgk left a comment •

edited

Loading

Uh oh!

rwgk left a comment

Uh oh!

rwgk commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

aryanputta commented May 14, 2026

Uh oh!

rwgk commented May 14, 2026

Uh oh!

rwgk commented May 14, 2026

Uh oh!

aryanputta commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

aryanputta commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Validation

Related Work

Uh oh!

copy-pr-bot Bot commented May 12, 2026

Uh oh!

Uh oh!

Uh oh!

rwgk left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rwgk left a comment

Choose a reason for hiding this comment

Uh oh!

rwgk commented May 14, 2026

Uh oh!

github-actions Bot commented May 14, 2026

Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

aryanputta commented May 14, 2026

Uh oh!

rwgk commented May 14, 2026

Uh oh!

rwgk commented May 14, 2026

Uh oh!

aryanputta commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

aryanputta commented May 12, 2026 •

edited

Loading

rwgk left a comment •

edited

Loading