desplode
AI & ML interests
LLM&Retrievers&NLP
Recent Activity
upvoted a paper 4 days ago
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
upvoted a paper 3 months ago
Scaling RL to Long Videos
upvoted a paper 3 months ago
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge