Preferred-MedLLM-Qwen-72B is a fine-tuned model based on Qwen/Qwen2.5-72B; it was produced by continued pretraining on an original corpus of medical-domain text.
The model is released under the Qwen LICENSE.
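Since the release is a standard causal language model, it should load through the usual Hugging Face transformers API. Below is a minimal sketch, not taken from this card: the repository id `pfnet/Preferred-MedLLM-Qwen-72B` and the example prompt are assumptions.

```python
# Minimal loading/generation sketch, assuming the standard Hugging Face
# transformers causal-LM API. The repository id below is an assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pfnet/Preferred-MedLLM-Qwen-72B"  # hypothetical repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 72B parameters: bf16 halves memory vs fp32
    device_map="auto",           # shard the weights across available GPUs
)

# A continued-pretraining (base) model, so plain text completion is the
# natural interface rather than a chat template.
prompt = "The first-line treatment for community-acquired pneumonia is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:],
                       skip_special_tokens=True))
```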
The table below shows scores on the Japanese national medical licensing examinations from 2018 to 2022 (the IgakuQA benchmark).
| Model ID | Average | 2018 | 2019 | 2020 | 2021 | 2022 |
|---|---|---|---|---|---|---|
| Preferred-MedLLM-Qwen-72B | 431.2 | 434 | 420 | 439 | 430 | 433 |
| GPT-4o | 430.4 | 427 | 431 | 433 | 427 | 434 |
| Qwen2.5-72B | 398.4 | 412 | 394 | 394 | 393 | 399 |
| Llama3-Preferred-MedSwallow-70B | 395.2 | 407 | 390 | 391 | 393 | 395 |
| GPT-4 | 388.8 | 382 | 385 | 387 | 398 | 392 |
| Mistral-Large-Instruct-2407 | 376.0 | 370 | 371 | 390 | 373 | 376 |
| Llama-3.1-Swallow-70B-v0.1 | 368.4 | 379 | 378 | 379 | 351 | 355 |
| Meta-Llama-3-70B | 334.6 | 353 | 340 | 348 | 314 | 318 |
| GPT-3.5 | 273.2 | 266 | 250 | 266 | 297 | 287 |
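For reference, the Average column is the arithmetic mean of the five yearly scores. A quick self-contained check in Python, using only numbers copied verbatim from the first three rows of the table:

```python
# Sanity check: the "Average" column is the arithmetic mean of the five
# yearly IgakuQA scores. Data copied from the table above.
scores = {
    "Preferred-MedLLM-Qwen-72B": [434, 420, 439, 430, 433],
    "GPT-4o": [427, 431, 433, 427, 434],
    "Qwen2.5-72B": [412, 394, 394, 393, 399],
}

for name, yearly in scores.items():
    print(f"{name}: {sum(yearly) / len(yearly):.1f}")
# Preferred-MedLLM-Qwen-72B: 431.2
# GPT-4o: 430.4
# Qwen2.5-72B: 398.4
```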
The model was developed by Preferred Networks, Inc. for research purposes and is not intended for clinical diagnosis. It is the users' responsibility to ensure compliance with applicable rules and regulations.
Detailed evaluation results are given in the accompanying blog post and in the research paper cited below.
```bibtex
@article{preferredmedllm2025,
  title={Stabilizing Reasoning in Medical LLMs with Continued Pretraining and Reasoning Preference Optimization},
  author={Kawakami, Wataru and Suzuki, Keita and Iwasawa, Junichiro},
  journal={arXiv preprint arXiv:2504.18080},
  year={2025}
}
```