An Agentic System for Rare Disease Diagnosis with Traceable Reasoning • Paper • 2506.20430 • Published Jun 25
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale • Paper • 2506.04405 • Published Jun 4
MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports • Paper • 2505.11733 • Published May 16
Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties • Paper • 2505.20875 • Published May 27
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays • Paper • 2505.18087 • Published May 23
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation • Paper • 2505.21190 • Published May 27
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions • Paper • 2505.17818 • Published May 23