Skip to content

Commit d57d7a9

Browse files
sl-gnsibianl
andauthored
fix(backend): add param_hosting_ratio for estimate varm required (#224)
Co-authored-by: sibianl <sibianl@rings.net.cn>
1 parent 83dd000 commit d57d7a9

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

src/backend/server/static_config.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -168,6 +168,8 @@ def build_single_model(model_info):
168168
def estimate_vram_gb_required(model_info):
169169
if model_info is None:
170170
return 0
171+
172+
param_hosting_ratio = 0.65
171173
return (
172174
(
173175
model_info.embedding_io_bytes
@@ -177,6 +179,7 @@ def estimate_vram_gb_required(model_info):
177179
/ 1024
178180
/ 1024
179181
/ 1024
182+
/ param_hosting_ratio
180183
)
181184

182185

0 commit comments

Comments
 (0)