You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: .buildkite/README.md
+9-6Lines changed: 9 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,8 +22,9 @@ To support this requirement, each model and feature will go through a series of
22
22
# Adding a new model to CI
23
23
## Adding a TPU-optimized model
24
24
TPU-optimized models are models we rewrite the model definition as opposed to using the model definition from the vLLM upstream. These models will go through benchmark on top of unit and integration (accuracy) tests. To add a TPU-optimized model to CI, model owners can use the prepared [add_model_to_ci.py](pipeline_generation/add_model_to_ci.py) script. The script will populate a buildkite yaml config file in the `.buildkite/models` directory; config files under this directory will be integrated to our pipeline automatically. The python script takes 2 arguments:
25
-
-**model_name**: this is the **full name** of your model on Hugging Face. Please ensure to use the **full name** (ex: `meta-llama/Llama-3.1-8B` instead of `Llama-3.1-8B`) or else we won't be able to find your model.
26
-
-**queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
25
+
-**--model-name**: this is the **full name** of your model on Hugging Face. Please ensure to use the **full name** (ex: `meta-llama/Llama-3.1-8B` instead of `Llama-3.1-8B`) or else we won't be able to find your model.
26
+
-**--queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
27
+
-**--category**: this parameter allows you to set the model category, with the following options available: "text-only" or "multimodel".
@@ -36,8 +37,9 @@ In the generated yml file, there are three TODOs that will need your input:
36
37
37
38
## Adding a vLLM-native model
38
39
vLLM-native models are models using the model definition from the vLLM upstream. These models will not go through benchmark on our pipeline. To add a vLLM-native model to CI, model owners can use the prepared [add_model_to_ci.py](pipeline_generation/add_model_to_ci.py) script. The script will populate a buildkite yaml config file in the `.buildkite/models` directory; config files under this directory will be integrated to our pipeline automatically. The python script takes 3 arguments:
39
-
-**model_name**: this is the **full name** of your model on Hugging Face. Please ensure to use the **full name** (ex: `meta-llama/Llama-3.1-8B` instead of `Llama-3.1-8B`) or else we won't be able to find your model.
40
-
-**queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
40
+
-**--model-name**: this is the **full name** of your model on Hugging Face. Please ensure to use the **full name** (ex: `meta-llama/Llama-3.1-8B` instead of `Llama-3.1-8B`) or else we won't be able to find your model.
41
+
-**--queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
42
+
-**--category**: this parameter allows you to set the model category, with the following options available: "text-only" or "multimodel".
@@ -49,8 +51,9 @@ In the generated yml file, there are two TODOs that will need your input:
49
51
50
52
# Adding a new feature to CI
51
53
To add a new feature to CI, feature owners can use the prepared [add_feature_to_ci.py](pipeline_generation/add_feature_to_ci.py) script. The script will populate a buildkite yaml config file in the `.buildkite/features` directory; config files under this directory will be integrated to our pipeline automatically. The python script takes 2 arguments:
52
-
-**feature_name**: this is the name of your feature
53
-
-**queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
54
+
-**--feature-name**: this is the name of your feature
55
+
-**--queue**: this is the queue you want to run on (ex: `tpu_v6e_queue`)
56
+
-**--category**: this parameter allows you to set the feature category, with the following options available: "feature support matrix" or "kernel support matrix".
0 commit comments