返回列表

MediX-R1: Open Ended Medical Reinforcement Learning

Date: 2026-02-27Fetched: 2026-02-28T01:47:00.984735+00:00

Authors

Sahal Shaji Mullappilly, Mohammed Irfan Kurpath, Omair Mohamed, Mohamed Zidan, Fahad Khan, Salman Khan, Rao Anwer, Hisham Cholakkal

Links

MediX-R1提出了一种面向医疗多模态大语言模型的开放式强化学习框架,该框架利用多样化的奖励信号和基于LLM的评估,以提升超越多选格式的临床推理能力。