- REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
  Paper • 2501.03262 • Published • 102
- MiniMax-01: Scaling Foundation Models with Lightning Attention
  Paper • 2501.08313 • Published • 298
- Towards Best Practices for Open Datasets for LLM Training
  Paper • 2501.08365 • Published • 63
- Qwen2.5-1M Technical Report
  Paper • 2501.15383 • Published • 71
jzwong
AI & ML interests
None yet
Recent Activity
Updated a collection 2 months ago: Novel
Updated a collection 2 months ago: Novel
Updated a collection 5 months ago: SYS
Organizations
None yet