Support Generating CoT SFT Data Using DeepSeek R1

Hi,  

I’m exploring the possibility of using DeepSeek R1 to generate Chain-of-Thought (CoT) data for Supervised Fine-Tuning (SFT). Could we consider adding support or providing guidance on how to leverage DeepSeek R1 for this purpose?  

If this is already feasible, it would be great to have some documentation or examples to help users get started. If not, I’d love to discuss the potential challenges and explore whether this could be a feature worth implementing.  
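To make the request concrete, here is a minimal sketch of the post-processing step I have in mind. It assumes the raw R1 completion wraps its reasoning in `<think>...</think>` before the final answer, as the open-weight checkpoints do. The function names (`to_sft_record`, `write_jsonl`) and the record fields (`prompt`, `reasoning`, `response`) are hypothetical and not part of this repository; how the completions are obtained is left out on purpose.

```python
import json
import re
from typing import Iterable, Optional, Tuple

# Assumption: reasoning is delimited by <think>...</think> in the raw text.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def to_sft_record(prompt: str, completion: str) -> Optional[dict]:
    """Split one raw completion into reasoning and answer.

    Returns None when the completion has no reasoning block or no answer,
    so malformed samples can be filtered out of the dataset.
    """
    match = THINK_RE.search(completion)
    if match is None:
        return None
    reasoning = match.group(1).strip()
    answer = completion[match.end():].strip()
    if not reasoning or not answer:
        return None
    # Hypothetical record layout; adapt the field names to the SFT framework.
    return {"prompt": prompt, "reasoning": reasoning, "response": answer}


def write_jsonl(pairs: Iterable[Tuple[str, str]], path: str) -> int:
    """Write valid (prompt, completion) pairs as JSON Lines; return the count kept."""
    kept = 0
    with open(path, "w", encoding="utf-8") as f:
        for prompt, completion in pairs:
            record = to_sft_record(prompt, completion)
            if record is not None:
                f.write(json.dumps(record, ensure_ascii=False) + "\n")
                kept += 1
    return kept
```

Dropping malformed samples rather than repairing them keeps the sketch simple; a real pipeline would probably also want answer verification and deduplication, which is part of what I'd like to discuss.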

Looking forward to your thoughts!  

Best regards

Support Generating CoT SFT Data Using DeepSeek R1 #94
