
Harden pinned NUMA mempool tests against constructor OOM flakes#2096

Merged
rwgk merged 1 commit into NVIDIA:main from rwgk:nvbugs5815123_addl_create_pinned_memory_resource_or_xfail on May 16, 2026

Conversation

@rwgk
Contributor

@rwgk rwgk commented May 16, 2026

This is a small follow-up to PR #2084.

The cuda_core_5.15.log attached to nvbug 5815123 shows two remaining intermittent cuda_core failures after the PR #2084 patch:

  • tests/test_memory.py::test_pinned_mr_numa_id_default_no_ipc
  • tests/test_memory.py::test_pinned_mr_numa_id_explicit

Both tests are meant to validate PinnedMemoryResource.numa_id behavior, but they instantiate PinnedMemoryResource(PinnedMemoryResourceOptions(...)) directly. Those constructor paths create a real pinned memory pool immediately, so they can still hit the same constructor-time CUDA_ERROR_OUT_OF_MEMORY failure mode that is already handled elsewhere in the Windows mempool workaround coverage.

This change routes those constructor sites through the existing create_pinned_memory_resource_or_xfail(...) helper. It also updates the nearby test_pinned_mr_numa_id_default_with_ipc case for consistency, since it exercises the same pool-creation path.

The goal is to keep these tests focused on NUMA-ID semantics instead of failing on the known Windows MCDM mempool-constructor flake.
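The pattern described above can be sketched in a few lines of pytest. This is a hypothetical illustration of the idea, not the actual cuda.core code: `CudaOOMError`, `FakePinnedPool`, and `create_resource_or_xfail` are stand-in names (the real helper is `create_pinned_memory_resource_or_xfail`, and the real error is a constructor-time `CUDA_ERROR_OUT_OF_MEMORY`).

```python
import pytest

# Illustrative stand-ins (NOT the real cuda.core API): a fake OOM error
# and a fake pinned pool whose constructor may raise it, mirroring how
# PinnedMemoryResource(...) creates a real pool at construction time.
class CudaOOMError(RuntimeError):
    pass

class FakePinnedPool:
    def __init__(self, numa_id=0, simulate_oom=False):
        if simulate_oom:
            raise CudaOOMError("CUDA_ERROR_OUT_OF_MEMORY")
        self.numa_id = numa_id

def create_resource_or_xfail(factory, *args, **kwargs):
    """Construct a resource; convert the known constructor-time OOM
    flake into an xfail instead of a hard test failure."""
    try:
        return factory(*args, **kwargs)
    except CudaOOMError as exc:
        pytest.xfail(f"known mempool-constructor OOM flake: {exc}")

# Usage in a test: the assertion now covers only NUMA-ID semantics;
# an OOM during pool creation no longer fails the test.
def test_numa_id_semantics():
    mr = create_resource_or_xfail(FakePinnedPool, numa_id=0)
    assert mr.numa_id == 0
```

The point of routing construction through the helper is that `pytest.xfail(...)` aborts the test with an expected-failure outcome, so only a genuine NUMA-ID mismatch can turn the test red.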


xref: nvbug 5815123

Route pinned NUMA-ID constructor tests through the existing Windows MCDM mempool OOM helper so they stay focused on NUMA semantics instead of failing on the known constructor flake.
@rwgk rwgk added this to the cuda.core next milestone May 16, 2026
@rwgk rwgk self-assigned this May 16, 2026
@rwgk rwgk added P0 High priority - Must do! test Improvements or additions to tests cuda.core Everything related to the cuda.core module labels May 16, 2026
@copy-pr-bot
Contributor

copy-pr-bot Bot commented May 16, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@rwgk
Contributor Author

rwgk commented May 16, 2026

/ok to test


@rwgk rwgk marked this pull request as ready for review May 16, 2026 03:17
@rwgk rwgk requested a review from leofang May 16, 2026 03:21
@rwgk rwgk enabled auto-merge (squash) May 16, 2026 03:21
@leofang
Member

leofang commented May 16, 2026

btw nightly also failed for the OOM issue (which I don't quite understand why it would happen): https://github.com/NVIDIA/cuda-python/actions/runs/25947528269/job/76279804206#step:26:17561

Merging not because I agree we need to address cuda-core test issues during CTK bring-up, but because this actually blocks our CI and thus development, as seen in #2087.

@rwgk rwgk merged commit e481335 into NVIDIA:main May 16, 2026
177 of 178 checks passed
@github-actions

Doc Preview CI
Preview removed because the pull request was closed or merged.

@rwgk rwgk deleted the nvbugs5815123_addl_create_pinned_memory_resource_or_xfail branch May 16, 2026 14:48
