-
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Paper • 2502.08910 • Published • 149 -
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
Paper • 2502.18890 • Published • 30 -
MPO: Boosting LLM Agents with Meta Plan Optimization
Paper • 2503.02682 • Published • 27 -
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 87
Jeffrey Yang Fan Chiang
RandomHakkaDude
AI & ML interests
GenAI, LLMs
Recent Activity
liked
a model
2 months ago
nvidia/Nemotron-4-340B-Instruct
updated
a collection
2 months ago
LLMs&Agents
updated
a collection
2 months ago
LLMs&Agents
Organizations
None yet