Self-Training Large Language Models for Tool-Use Without Demonstrations Paper • 2502.05867 • Published Feb 9
Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain Paper • 2307.03042 • Published Jul 6, 2023
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them Paper • 2507.10616 • Published 15 days ago • 1
An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering Paper • 2503.23415 • Published Mar 30 • 1
Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs Paper • 2502.05092 • Published Feb 7 • 8
CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning Paper • 2410.10336 • Published Oct 14, 2024 • 2
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published Oct 21, 2024 • 20
Analysing the Residual Stream of Language Models Under Knowledge Conflicts Paper • 2410.16090 • Published Oct 21, 2024 • 7
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations Paper • 2410.18860 • Published Oct 24, 2024 • 11
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models Paper • 2404.05904 • Published Apr 8, 2024 • 9