Commit 8f60d25

bzgoogle authored and committed
update layer to full
1 parent 910f6e3 commit 8f60d25

1 file changed: +0 -4 lines changed

tpu_inference/models/jax/deepseek_v3.py

Lines changed: 0 additions & 4 deletions
@@ -863,8 +863,4 @@ def weights_dequant_cpu(x: torch.Tensor,
 scale = s[M // block_size, j // block_size]
 y[M_main:M, j:j + block_size] = block * scale
 
-<<<<<<< HEAD:tpu_inference/models/jax/deepseek_v3.py
 return y.to(j2t_dtype(jnp.dtype(output_dtype)))
-=======
-return y.to(torch.get_default_dtype())
->>>>>>> 307bbd62 (local change to support 2d TP for DeepSeek):tpu_commons/models/jax/deepseek_v3.py
