-
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 10 -
Perspectives on the State and Future of Deep Learning - 2023
Paper • 2312.09323 • Published • 8 -
MobileSAMv2: Faster Segment Anything to Everything
Paper • 2312.09579 • Published • 24 -
Point Transformer V3: Simpler, Faster, Stronger
Paper • 2312.10035 • Published • 20
AX
axjing
·
AI & ML interests
CV|NLP
Recent Activity
upvoted
an
article
about 2 months ago
How to generate text: using different decoding methods for language generation with Transformers
upvoted
an
article
3 months ago
LLM数据工程3——数据收集魔法:获取顶级训练数据的方法
liked
a dataset
3 months ago
Kuugo/chinese_law_ft_dataset