Medical-Reasoning-SFT
II-Medical SFT is a curated dataset for supervised fine-tuning (SFT) of large language models (LLMs) on medical reasoning tasks. It comprises multi-turn dialogues, clinical case scenarios, and question-answer pairs that reflect the complex reasoning processes encountered in real-world clinical practice. The dataset is intended to help models develop key competencies such as differential diagnosis, evidence-based decision-making, patient communication, and guideline-informed treatment planning. It is built using a combination of our custom synthetic data generation pipeline and publicly available medical reasoning datasets, ensuring both diversity and clinical relevance. The training dataset contains 2,197,741 samples.