-
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 43 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 80 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
peng
superpeng
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
23 days ago
QoQ-Med: Building Multimodal Clinical Foundation Models with
Domain-Aware GRPO Training
liked
a dataset
4 months ago
xl-zhao/PromptCoT-QwQ-Dataset
liked
a dataset
4 months ago
Flmc/DISC-Med-SFT
Organizations
None yet