Gemma4 support #1062
Labels: model (Issues related to exporting, improving, fixing ML models)
Export the Gemma4 E2B and E4B models. The key to this is efficient memory management: more than 50% of the model parameters (the PLE layers) are not needed throughout runtime, only on first token consumption.
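One possible way to keep such a table out of resident memory is to memory-map it and copy out only the rows for the tokens actually consumed. The sketch below is illustrative only: the helper names (`write_table`, `open_table`, `lookup`), the raw float32 file layout, and the assumption that the PLE parameters can be treated as a per-token lookup table are all hypothetical, not this project's export format or API.

```python
import numpy as np


def write_table(path, vocab_size, dim, seed=0):
    """Write a stand-in embedding table to disk as raw float32 (hypothetical layout)."""
    rng = np.random.default_rng(seed)
    table = rng.standard_normal((vocab_size, dim)).astype(np.float32)
    table.tofile(path)
    return table


def open_table(path, vocab_size, dim):
    """Map the table read-only; the OS pages rows in only when they are indexed."""
    return np.memmap(path, dtype=np.float32, mode="r", shape=(vocab_size, dim))


def lookup(table, token_ids):
    """Copy only the rows for the given token ids into an ordinary in-memory array."""
    ids = np.asarray(token_ids, dtype=np.int64)
    return np.array(table[ids])
```

With this approach, `lookup(open_table(path, vocab_size, dim), token_ids)` touches only the pages that hold the requested rows, so the rest of the table never has to be loaded. Whether this matches the access pattern of the real PLE layers would need to be checked against the model.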