-
Notifications
You must be signed in to change notification settings - Fork 32
docs: Document new pod and serverless create flags #615
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
f6c00c3
4e6548b
ccc4808
19b6007
2bc484e
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -122,6 +122,34 @@ Comma-separated list of preferred datacenter IDs. Use [`runpodctl datacenter lis | |
| Network volume ID to attach. Use [`runpodctl network-volume list`](/runpodctl/reference/runpodctl-network-volume) to see available network volumes. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--network-volume-ids" type="string"> | ||
|
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Added 7 new serverless create flags ( Source: runpod/runpodctl#271 |
||
| Comma-separated list of network volume IDs to attach. Use this when attaching multiple network volumes to an endpoint. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--min-cuda-version" type="string"> | ||
| Minimum CUDA version required for workers (e.g., `12.4`). Workers will only be scheduled on machines that meet this CUDA version requirement. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--scaler-type" type="string" default="QUEUE_DELAY"> | ||
| Autoscaler type (`QUEUE_DELAY` or `REQUEST_COUNT`). `QUEUE_DELAY` scales based on queue wait time; `REQUEST_COUNT` scales based on concurrent requests. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--scaler-value" type="int"> | ||
| Scaler threshold value. For `QUEUE_DELAY`, this is the target delay in seconds. For `REQUEST_COUNT`, this is the number of concurrent requests per worker before scaling. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--idle-timeout" type="int"> | ||
| Idle timeout in seconds. Workers shut down after being idle for this duration. Valid range: 5-3600 seconds. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--flash-boot" type="bool"> | ||
| Enable or disable flash boot for faster worker startup. When enabled, workers start from cached container images. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--execution-timeout" type="int"> | ||
| Execution timeout in seconds. Jobs that exceed this duration are terminated. The CLI accepts seconds but converts to milliseconds internally. | ||
| </ResponseField> | ||
|
|
||
| ### Update an endpoint | ||
|
|
||
| Update endpoint configuration: | ||
|
|
@@ -156,6 +184,14 @@ Scaler type (`QUEUE_DELAY` or `REQUEST_COUNT`). | |
| Scaler value. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--flash-boot" type="bool"> | ||
| Enable or disable flash boot for faster worker startup. | ||
| </ResponseField> | ||
|
|
||
| <ResponseField name="--execution-timeout" type="int"> | ||
| Execution timeout in seconds. Jobs that exceed this duration are terminated. | ||
| </ResponseField> | ||
|
|
||
| ### Delete an endpoint | ||
|
|
||
| Delete an endpoint: | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added 7 new pod create flags (
--min-cuda-version,--docker-args,--registry-auth-id,--country-code,--stop-after,--terminate-after,--compliance) based on the PR's feature additions. Flag details sourced from the PR description and E2E verification table.Source: runpod/runpodctl#271