
Add infer_slurm entrypoint for standalone inference Slurm jobs #1856

Draft
samsja wants to merge 1 commit into main from feature/infer-slurm

samsja commented Feb 23, 2026

Adds a `uv run infer_slurm` entrypoint that generates and submits a single-node inference Slurm job with preemption support (`--requeue`, `--signal=B:SIGTERM@60`). Designed for dynamic inference pools where idle GPUs run vLLM and gracefully yield when preempted.
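
For readers unfamiliar with the two Slurm flags, here is a minimal sketch of what a generated batch script could look like. It is not the code in this PR: `render_infer_sbatch`, its parameters, the `--gres` request, and the `vllm serve` launch line are assumptions made for illustration. Only `--requeue` and `--signal=B:SIGTERM@60` come from the description above.

```python
import shlex


def render_infer_sbatch(
    model: str,
    gpus: int = 8,
    grace_seconds: int = 60,
    job_name: str = "infer",
) -> str:
    """Render a single-node, preemptible Slurm batch script (hypothetical helper)."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        "#SBATCH --nodes=1",
        f"#SBATCH --gres=gpu:{gpus}",
        # Put the job back in the queue after it is preempted.
        "#SBATCH --requeue",
        # Send SIGTERM to the batch shell only (B:) this many seconds before the end.
        f"#SBATCH --signal=B:SIGTERM@{grace_seconds}",
        "",
        "# On SIGTERM, forward the signal to the server and wait for it to exit.",
        "trap 'kill -TERM \"$SERVER_PID\" 2>/dev/null; wait \"$SERVER_PID\"' TERM",
        "",
        "# Assumed launch command; run in the background so the trap can fire.",
        f"vllm serve {shlex.quote(model)} &",
        "SERVER_PID=$!",
        'wait "$SERVER_PID"',
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # "my-org/my-model" is a placeholder model name.
    print(render_infer_sbatch("my-org/my-model", gpus=4))
```

The server runs in the background because bash only executes a trap once the foreground command returns; with `wait`, the shell reacts to SIGTERM immediately and can pass it on to vLLM within the grace window. The rendered script could then be written to a file and handed to `sbatch`.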

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>