👋 join us on Discord and WeChat
We introduce Intern-S1-Pro, a trillion-scale MoE multimodal scientific reasoning model. Intern-S1-Pro scales to 1T total parameters with 512 experts, activating 8 experts per token (22B activated parameters). The model delivers top-tier performance on advanced reasoning benchmarks and achieves leading results across key AI4Science domains (chemistry, materials, life-science, earth, etc.), while maintaining strong general multimodal and text capabilities.
-
State-of-the-art scientific reasoning, competitive with leading closed-source models across AI4Science tasks.
-
Strong general multimodal performance on various benchmarks.
-
Trillion-scale MoE training efficiency with STE routing (dense gradient for router training) and grouped routing for stable convergence and balanced expert parallelism.
-
Fourier Position Encoding (FoPE) + upgraded time-series modeling for better physical signal representation; supports long, heterogeneous time-series (10^0–10^6 points).
Introduction of Intern-S1 (click to expand)
We introduce Intern-S1, our most advanced open-source multimodal reasoning model to date. Intern-S1 combines strong general-task capabilities with state-of-the-art performance on a wide range of scientific tasks, rivaling leading closed-source commercial models.
Built upon a 235B MoE language model (Qwen3) and a 6B Vision encoder (InternViT), Intern-S1 has been further pretrained on 5 trillion tokens of multimodal data, including over 2.5 trillion scientific-domain tokens. This enables the model to retain strong general capabilities while excelling in specialized scientific domains such as interpreting chemical structures, understanding protein sequences, and planning compound synthesis routes, making Intern-S1 to be a capable research assistant for real-world scientific applications.
We also released Intern-S1-mini, a lightweight version of Intern-S1, which contains an 8B language model and a 0.3B vision encoder.
-
Strong performance across language and vision reasoning benchmarks, especially scientific tasks.
-
Continuously pretrained on a massive 5T token dataset, with over 50% specialized scientific data, embedding deep domain expertise.
-
Dynamic tokenizer enables native understanding of molecular formulas, protein sequences, and seismic signals.
| FP8 | |
|---|---|
| 🤗HuggingFace | internlm/Intern-S1-Pro |
ModelScope |
Shanghai_AI_Laboratory/Intern-S1-Pro |
| BF16 | FP8 | GGUF | |
|---|---|---|---|
| 🤗HuggingFace | internlm/Intern-S1 | internlm/Intern-S1-FP8 | internlm/Intern-S1-GGUF |
ModelScope |
Shanghai_AI_Laboratory/Intern-S1 | Shanghai_AI_Laboratory/Intern-S1-FP8 | Shanghai_AI_Laboratory/Intern-S1-GGUF |
| BF16 | FP8 | GGUF | |
|---|---|---|---|
| 🤗HuggingFace | internlm/Intern-S1-mini | internlm/Intern-S1-mini-FP8 | internlm/Intern-S1-mini-GGUF |
ModelScope |
Shanghai_AI_Laboratory/Intern-S1-mini | Shanghai_AI_Laboratory/Intern-S1-mini-FP8 | - |
We evaluate the Intern-S1 on various benchmarks including general datasets and scientific datasets. We report the performance comparison with the recent VLMs and LLMs below.
Note: Underline means the best performance among open-sourced models, Bold indicates the best performance among all models.
| Benchmarks | Intern-S1 | InternVL3-78B | Qwen2.5-VL-72B | DS-R1-0528 | Qwen3-235B-A22B | Kimi-K2-Instruct | Gemini-2.5 Pro | o3 | Grok-4 | |
|---|---|---|---|---|---|---|---|---|---|---|
| MMLU-Pro | 83.5 ✅ | 73.0 | 72.1 | 83.4 | 82.2 | 82.7 | 86.0 | 85.0 | 85.9 | |
| MMMU | 77.7 ✅ | 72.2 | 70.2 | - | - | - | 81.9 | 80.8 | 77.9 | |
| GPQA | 77.3 | 49.9 | 49.0 | 80.6 | 71.1 | 77.8 | 83.8 | 83.3 | 87.5 | |
| MMStar | 74.9 ✅ | 72.5 | 70.8 | - | - | - | 79.3 | 75.1 | 69.6 | |
| MathVista | 81.5 👑 | 79.0 | 74.8 | - | - | - | 80.3 | 77.5 | 72.5 | |
| AIME2025 | 86.0 | 10.7 | 10.9 | 87.5 | 81.5 | 51.4 | 83.0 | 88.9 | 91.7 | |
| MathVision | 62.5 ✅ | 43.1 | 38.1 | - | - | - | 73.0 | 67.7 | 67.3 | |
| IFEval | 86.7 | 75.6 | 83.9 | 79.7 | 85.0 | 90.2 | 91.5 | 92.2 | 92.8 | |
| SFE | 44.3 👑 | 36.2 | 30.5 | - | - | - | 43.0 | 37.7 | 31.2 | |
| Physics | 44.0 ✅ | 23.1 | 15.7 | - | - | - | 40.0 | 47.9 | 42.8 | |
| SmolInstruct | 51.0 👑 | 19.4 | 21.0 | 30.7 | 28.7 | 48.1 | 40.4 | 43.9 | 47.3 | |
| ChemBench | 83.4 👑 | 61.3 | 61.6 | 75.6 | 75.8 | 75.3 | 82.8 | 81.6 | 83.3 | |
| MatBench | 75.0 👑 | 49.3 | 51.5 | 57.7 | 52.1 | 61.7 | 61.7 | 61.6 | 67.9 | |
| MicroVQA | 63.9 👑 | 59.1 | 53.0 | - | - | - | 63.1 | 58.3 | 59.5 | |
| ProteinLMBench | 63.1 | 61.6 | 61.0 | 61.4 | 59.8 | 66.7 | 62.9 | 67.7 | 66.2 | |
| MSEarthMCQ | 65.7 👑 | 57.2 | 37.6 | - | - | - | 59.9 | 61.0 | 58.0 | |
| XLRS-Bench | 55.0 👑 | 49.3 | 50.9 | - | - | - | 45.2 | 43.6 | 45.4 | |
Note: ✅ means the best performance among open-sourced models, 👑 indicates the best performance among all models.
| Benchmarks | Intern-S1-mini | Qwen3-8B | GLM-4.1V | MiMo-VL-7B-RL-2508 |
|---|---|---|---|---|
| MMLU-Pro | 74.78 | 73.7 | 57.1 | 73.93 |
| MMMU | 72.33 | - | 69.9 | 70.4 |
| MMStar | 65.2 | - | 71.5 | 72.9 |
| GPQA | 65.15 | 62 | 50.32 | 60.35 |
| AIME2024 | 84.58 | 76 | 36.2 | 72.6 |
| AIME2025 | 80 | 67.3 | 32 | 64.4 |
| MathVision | 51.41 | - | 53.9 | 54.5 |
| MathVista | 70.3 | - | 80.7 | 79.4 |
| IFEval | 81.15 | 85 | 71.53 | 71.4 |
| SFE | 35.84 | - | 43.2 | 43.9 |
| Physics | 28.76 | - | 28.3 | 28.2 |
| SmolInstruct | 32.2 | 17.6 | 18.1 | 16.11 |
| ChemBench | 76.47 | 61.1 | 56.2 | 66.78 |
| MatBench | 61.55 | 45.24 | 54.3 | 46.9 |
| MicroVQA | 56.62 | - | 50.2 | 50.96 |
| ProteinLMBench | 58.47 | 59.1 | 58.3 | 59.8 |
| MSEarthMCQ | 58.12 | - | 50.3 | 47.3 |
| XLRS-Bench | 51.63 | - | 49.8 | 12.29 |
We use the OpenCompass and VLMEvalKit to evaluate all models. Please refer to this page to quickly start the text-only evaluation task.
InternS1 can be deployed using any of the following LLM inference frameworks:
- LMDeploy
- vLLM
- SGLang
Detailed deployment examples for these frameworks are available in the
See this documentation for more details.
This project is released under the Apache 2.0 license.
If you find this work useful, feel free to give us a cite.
@misc{bai2025interns1scientificmultimodalfoundation,
title={Intern-S1: A Scientific Multimodal Foundation Model},
author={Lei Bai and Zhongrui Cai and Maosong Cao and Weihan Cao and Chiyu Chen and Haojiong Chen and Kai Chen and Pengcheng Chen and Ying Chen and Yongkang Chen and Yu Cheng and Yu Cheng and Pei Chu and Tao Chu and Erfei Cui and Ganqu Cui and Long Cui and Ziyun Cui and Nianchen Deng and Ning Ding and Nanqin Dong and Peijie Dong and Shihan Dou and Sinan Du and Haodong Duan and Caihua Fan and Ben Gao and Changjiang Gao and Jianfei Gao and Songyang Gao and Yang Gao and Zhangwei Gao and Jiaye Ge and Qiming Ge and Lixin Gu and Yuzhe Gu and Aijia Guo and Qipeng Guo and Xu Guo and Conghui He and Junjun He and Yili Hong and Siyuan Hou and Caiyu Hu and Hanglei Hu and Jucheng Hu and Ming Hu and Zhouqi Hua and Haian Huang and Junhao Huang and Xu Huang and Zixian Huang and Zhe Jiang and Lingkai Kong and Linyang Li and Peiji Li and Pengze Li and Shuaibin Li and Tianbin Li and Wei Li and Yuqiang Li and Dahua Lin and Junyao Lin and Tianyi Lin and Zhishan Lin and Hongwei Liu and Jiangning Liu and Jiyao Liu and Junnan Liu and Kai Liu and Kaiwen Liu and Kuikun Liu and Shichun Liu and Shudong Liu and Wei Liu and Xinyao Liu and Yuhong Liu and Zhan Liu and Yinquan Lu and Haijun Lv and Hongxia Lv and Huijie Lv and Qidang Lv and Ying Lv and Chengqi Lyu and Chenglong Ma and Jianpeng Ma and Ren Ma and Runmin Ma and Runyuan Ma and Xinzhu Ma and Yichuan Ma and Zihan Ma and Sixuan Mi and Junzhi Ning and Wenchang Ning and Xinle Pang and Jiahui Peng and Runyu Peng and Yu Qiao and Jiantao Qiu and Xiaoye Qu and Yuan Qu and Yuchen Ren and Fukai Shang and Wenqi Shao and Junhao Shen and Shuaike Shen and Chunfeng Song and Demin Song and Diping Song and Chenlin Su and Weijie Su and Weigao Sun and Yu Sun and Qian Tan and Cheng Tang and Huanze Tang and Kexian Tang and Shixiang Tang and Jian Tong and Aoran Wang and Bin Wang and Dong Wang and Lintao Wang and Rui Wang and Weiyun Wang and Wenhai Wang and Yi Wang and Ziyi Wang and Ling-I Wu and Wen Wu and Yue Wu and Zijian Wu and Linchen Xiao and Shuhao Xing and Chao Xu and Huihui Xu and Jun Xu and Ruiliang Xu and Wanghan Xu and GanLin Yang and Yuming Yang and Haochen Ye and Jin Ye and Shenglong Ye and Jia Yu and Jiashuo Yu and Jing Yu and Fei Yuan and Bo Zhang and Chao Zhang and Chen Zhang and Hongjie Zhang and Jin Zhang and Qiaosheng Zhang and Qiuyinzhe Zhang and Songyang Zhang and Taolin Zhang and Wenlong Zhang and Wenwei Zhang and Yechen Zhang and Ziyang Zhang and Haiteng Zhao and Qian Zhao and Xiangyu Zhao and Xiangyu Zhao and Bowen Zhou and Dongzhan Zhou and Peiheng Zhou and Yuhao Zhou and Yunhua Zhou and Dongsheng Zhu and Lin Zhu and Yicheng Zou},
year={2025},
eprint={2508.15763},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2508.15763},
}
@misc{zou2026interns1proscientificmultimodalfoundation,
title={Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale},
author={Yicheng Zou and Dongsheng Zhu and Lin Zhu and Tong Zhu and Yunhua Zhou and Peiheng Zhou and Xinyu Zhou and Dongzhan Zhou and Zhiwang Zhou and Yuhao Zhou and Bowen Zhou and Zhanping Zhong and Zhijie Zhong and Haiteng Zhao and Penghao Zhao and Xiaomeng Zhao and Zhiyuan Zhao and Yechen Zhang and Jin Zhang and Wenwei Zhang and Hongjie Zhang and Zhuo Zhang and Wenlong Zhang and Bo Zhang and Chao Zhang and Chen Zhang and Yuhang Zang and Fei Yuan and Jiakang Yuan and Jiashuo Yu and Jinhui Yin and Haochen Ye and Qian Yao and Bowen Yang and Danni Yang and Kaichen Yang and Ziang Yan and Jun Xu and Yicheng Xu and Wanghan Xu and Xuenan Xu and Chao Xu and Ruiliang Xu and Shuhao Xing and Long Xing and Xinchen Xie and Ling-I Wu and Zijian Wu and Zhenyu Wu and Lijun Wu and Yue Wu and Jianyu Wu and Wen Wu and Fan Wu and Xilin Wei and Qi Wei and Bingli Wang and Rui Wang and Ziyi Wang and Zun Wang and Yi Wang and Haomin Wang and Yizhou Wang and Lintao Wang and Yiheng Wang and Longjiang Wang and Bin Wang and Jian Tong and Zhongbo Tian and Huanze Tang and Chen Tang and Shixiang Tang and Yu Sun and Qiushi Sun and Xuerui Su and Qisheng Su and Chenlin Su and Demin Song and Jin Shi and Fukai Shang and Yuchen Ren and Pengli Ren and Xiaoye Qu and Yuan Qu and Jiantao Qiu and Yu Qiao and Runyu Peng and Tianshuo Peng and Jiahui Peng and Qizhi Pei and Zhuoshi Pan and Linke Ouyang and Wenchang Ning and Yichuan Ma and Zerun Ma and Ningsheng Ma and Runyuan Ma and Chengqi Lyu and Haijun Lv and Han Lv and Lindong Lu and Kuikun Liu and Jiangning Liu and Yuhong Liu and Kai Liu and Hongwei Liu and Zhoumianze Liu and Mengjie Liu and Ziyu Liu and Wenran Liu and Yang Liu and Liwei Liu and Kaiwen Liu and Junyao Lin and Junming Lin and Tianyang Lin and Dahua Lin and Jianze Liang and Linyang Li and Peiji Li and Zonglin Li and Zehao Li and Pengze Li and Guoyan Li and Lingkai Kong and Linglin Jing and Zhenjiang Jin and Feifei Jiang and Qian Jiang and Junhao Huang and Zixian Huang and Haian Huang and Zhouqi Hua and Han Hu and Linfeng Hou and Yinan He and Conghui He and Tianyao He and Xu Guo and Qipeng Guo and Aijia Guo and Yuzhe Gu and Lixin Gu and Jingyang Gong and Qiming Ge and Jiaye Ge and Songyang Gao and Jianfei Gao and Xinyu Fang and Caihua fan and Yue Fan and Yanhui Duan and Zichen Ding and Shengyuan Ding and Xuanlang Dai and Erfei Cui and Ganqu Cui and Pei Chu and Tao Chu and Guangran Cheng and Yu Cheng and Kai Chen and Yongkang Chen and Chiyu Chen and Guanzhou Chen and Qiaosheng Chen and Sitao Chen and Xin Chen and Haojiong Chen and Yicheng Chen and Weihan Cao and Yuhang Cao and Qinglong Cao and Lei Bai},
year={2026},
eprint={2603.25040},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2603.25040},
}

ModelScope