We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 910f6e3 commit 8f60d25Copy full SHA for 8f60d25
tpu_inference/models/jax/deepseek_v3.py
@@ -863,8 +863,4 @@ def weights_dequant_cpu(x: torch.Tensor,
863
scale = s[M // block_size, j // block_size]
864
y[M_main:M, j:j + block_size] = block * scale
865
866
-<<<<<<< HEAD:tpu_inference/models/jax/deepseek_v3.py
867
return y.to(j2t_dtype(jnp.dtype(output_dtype)))
868
-=======
869
- return y.to(torch.get_default_dtype())
870
->>>>>>> 307bbd62 (local change to support 2d TP for DeepSeek):tpu_commons/models/jax/deepseek_v3.py
0 commit comments