arxiv:2509.11963
Kinjal Basu
kinjalbasu
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
ToolRM: Outcome Reward Models for Tool-Calling Large Language Models
authored
a paper
2 months ago
NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API
Calls
liked
a dataset
9 months ago
ibm-research/nestful
Organizations
None yet