Hi,
I’m exploring the possibility of using DeepSeek R1 to generate Chain-of-Thought (CoT) data for Supervised Fine-Tuning (SFT). Could we consider adding support or providing guidance on how to leverage DeepSeek R1 for this purpose?
If this is already feasible, it would be great to have some documentation or examples to help users get started. If not, I’d love to discuss the potential challenges and explore whether this could be a feature worth implementing.
Looking forward to your thoughts!
Best regards