-
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 -
PaSa: An LLM Agent for Comprehensive Academic Paper Search
Paper • 2501.10120 • Published • 52 -
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong
Paper • 2501.09775 • Published • 33 -
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario
Paper • 2501.10132 • Published • 22
cxw
xwc216
AI & ML interests
None yet
Recent Activity
updated
a model
19 days ago
xwc216/vl-grpo-baseline-nokl-step88
published
a model
19 days ago
xwc216/vl-grpo-baseline-nokl-step88
updated
a model
4 months ago
xwc216/Qwen2.5-7B-kl_cov-s192