Submitted by akhaliq 51 Teaching Large Language Models to Reason with Reinforcement Learning · 9 authors 2
Submitted by akhaliq 41 Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference · 11 authors 1
Submitted by akhaliq 26 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error · 5 authors 1
Submitted by akhaliq 21 Common 7B Language Models Already Possess Strong Math Capabilities · 8 authors 1
Submitted by akhaliq 7 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis · 8 authors 1