Define ST_F8_E8M0 #3448
Merged
zcbenz merged 1 commit into ml-explore:main on May 5, 2026
Conversation
No conversion is performed; this just allows such safetensors files to be loaded. I think this should help load the native DeepSeek checkpoints without conversion, since the attention scales use F8_E8M0.
This was referenced Apr 24, 2026
zcbenz (Collaborator) approved these changes on Apr 24, 2026 and left a comment:
Looks good to me!
Adding a test is tricky because the from_fp8/to_fp8 ops assume e4m3. We could extend those APIs to e8m0, but I think maybe we should just add full fp8 support instead. So I'm good with no test for this one.
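To illustrate why the existing e4m3-only ops can't handle these values, here is a minimal sketch of how the two FP8 layouts decode a single byte. This is illustrative code following the OCP FP8/MX bit layouts, not the actual MLX implementation; e8m0 is exponent-only (no sign, no mantissa), so it can only represent powers of two, which is why it works as a scale format.

```python
import math

def decode_e8m0(byte: int) -> float:
    """F8_E8M0: 8 exponent bits, no sign, no mantissa.

    Value is 2**(e - 127); the all-ones pattern (255) is NaN.
    """
    if byte == 255:
        return math.nan
    return 2.0 ** (byte - 127)

def decode_e4m3(byte: int) -> float:
    """FP8 E4M3: 1 sign bit, 4 exponent bits (bias 7), 3 mantissa bits."""
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0x0F
    mant = byte & 0x07
    if exp == 0x0F and mant == 0x07:   # the only NaN encoding; e4m3 has no inf
        return math.nan
    if exp == 0:                       # subnormal: no implicit leading 1
        return sign * mant * 2.0 ** (1 - 7 - 3)
    return sign * (1.0 + mant / 8.0) * 2.0 ** (exp - 7)
```

For example, byte 127 decodes to 1.0 under e8m0 but to a small subnormal-range value under e4m3, so reinterpreting one format's bytes with the other's decoder gives meaningless results.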
Author (Contributor):
👍 Sounds good! I searched the docs for mentions of the types but found nothing to update either. Let me know if anything else is missing.
Opening for discussion. I saw #3374, so perhaps this should go there instead. The goal is just to be able to load the native DeepSeek V4 safetensors files, since the attention scales use F8_E8M0.
If this direction is ok, I'm happy to add tests or update docs as needed.
Proposed changes
Define ST_F8_E8M0 so that loading safetensors files that include this type succeeds. Handling is deferred to sanitize functions in mlx_lm and other user code.

Checklist
Put an x in the boxes that apply.
- [ ] I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
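Since handling is deferred to user code, a downstream sanitize step might look like the sketch below. This is a hypothetical example, not the actual mlx_lm logic: the function name, the "scale" key-matching convention, and the use of NumPy arrays are all assumptions for illustration. It converts raw F8_E8M0 scale tensors (loaded as uint8 bytes) into float32 values of the form 2**(e - 127).

```python
import numpy as np

def sanitize_e8m0_scales(weights: dict) -> dict:
    """Hypothetical sanitize step: decode F8_E8M0 scale tensors to float32.

    Assumes scale tensors arrive as raw uint8 and are identifiable by
    "scale" in the parameter name (an illustrative convention only).
    """
    out = {}
    for name, tensor in weights.items():
        if "scale" in name and tensor.dtype == np.uint8:
            e = tensor.astype(np.int32)
            # e8m0 value is 2**(e - 127); byte 255 encodes NaN.
            out[name] = np.where(e == 255, np.nan,
                                 np.exp2(e - 127.0)).astype(np.float32)
        else:
            out[name] = tensor
    return out
```

A real implementation would match the checkpoint's actual parameter names and operate on MLX arrays rather than NumPy.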