-
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Paper • 2502.08910 • Published • 148 -
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
Paper • 2502.18890 • Published • 30 -
MPO: Boosting LLM Agents with Meta Plan Optimization
Paper • 2503.02682 • Published • 28 -
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 88
Jeffrey Yang Fan Chiang
RandomHakkaDude
AI & ML interests
GenAI, LLMs
Recent Activity
upvoted
a
paper
29 days ago
DynaGuard: A Dynamic Guardrail Model With User-Defined Policies
liked
a model
5 months ago
nvidia/Nemotron-4-340B-Instruct
updated
a collection
5 months ago
LLMs&Agents
Organizations
None yet