Conversation

@yuxuan-z19
Contributor

  • Problem: PyTorch + multiprocessing with spawn avoids “Cannot re-initialize CUDA in forked subprocess,” but long-lived workers hold CUDA contexts, causing GPU memory to accumulate.
  • Solution: Added max_tasks_per_child as a configurable option in the executor. Setting it (e.g., to 1) forces each worker process to exit and restart after completing that many tasks, ensuring its CUDA context is released.
  • Impact / Considerations: Allows per-task process respawn to reclaim GPU memory.
  • Related issue: #330, CUDA workers do not release resources after each evaluate_program when using the spawn start method
  • Addition: Simplified Config initialization and dict conversion to reduce boilerplate and improve readability.
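The worker-recycling pattern described above can be sketched as follows. This is a minimal illustration, not the project's actual executor; `evaluate` and `run_batch` are hypothetical names standing in for the real task function, and it requires Python 3.11+ for the `max_tasks_per_child` argument:

```python
import multiprocessing as mp
from concurrent.futures import ProcessPoolExecutor

def evaluate(x):
    # stand-in for a CUDA-touching task such as evaluate_program
    return x * x

def run_batch(values):
    # spawn avoids "Cannot re-initialize CUDA in forked subprocess";
    # max_tasks_per_child=1 retires each worker after a single task,
    # so any CUDA context it created dies with the process
    ctx = mp.get_context("spawn")
    with ProcessPoolExecutor(max_workers=2,
                             mp_context=ctx,
                             max_tasks_per_child=1) as pool:
        return list(pool.map(evaluate, values))

if __name__ == "__main__":
    # guard is required with spawn: children re-import this module
    print(run_batch([1, 2, 3]))
```

The trade-off is process startup cost on every task, which is usually acceptable when each task is a long-running GPU evaluation.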

@yuxuan-z19
Contributor Author

Python older than 3.11 does not support max_tasks_per_child in ProcessPoolExecutor; fall back to the spawn context alone on those versions (see docs)
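A version-guarded construction along these lines would handle the fallback. This is a sketch under stated assumptions, not the merged code; `make_executor` is a hypothetical helper name:

```python
import sys
import multiprocessing as mp
from concurrent.futures import ProcessPoolExecutor

def make_executor(max_workers=1):
    # always use the spawn start method for CUDA safety
    kwargs = {"max_workers": max_workers,
              "mp_context": mp.get_context("spawn")}
    if sys.version_info >= (3, 11):
        # worker recycling is only available from Python 3.11 onward:
        # retire each worker after one task so its CUDA context is freed
        kwargs["max_tasks_per_child"] = 1
    return ProcessPoolExecutor(**kwargs)
```

On older interpreters the pool still uses spawn, but workers are long-lived, so GPU memory is only reclaimed when the pool shuts down.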

@codelion codelion merged commit 9290243 into algorithmicsuperintelligence:main Nov 27, 2025
2 checks passed
@yuxuan-z19 yuxuan-z19 deleted the zyx-fix-torch branch November 27, 2025 11:25
@yuxuan-z19 yuxuan-z19 restored the zyx-fix-torch branch November 27, 2025 11:25