Xiang Fu's picture

33 21

Xiang Fu

craigxiangfu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Matryoshka Quantization

liked a model 14 days ago

deepseek-ai/DeepSeek-OCR

liked a model about 1 month ago

google/gemma-3-270m

View all activity

Organizations

upvoted a paper 12 days ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 32

upvoted 4 collections 3 months ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated May 1 • 145

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 14 items • Updated 25 days ago • 83

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 292

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 237

upvoted 2 papers 4 months ago

RExBench: Can coding agents autonomously implement AI research extensions?

Paper • 2506.22598 • Published Jun 27 • 11

In-Context Learning Strategies Emerge Rationally

Paper • 2506.17859 • Published Jun 21 • 10

upvoted 6 papers 5 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published Sep 6, 2024 • 48

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25 • 31

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22 • 120

upvoted a paper 6 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88

upvoted a paper 7 months ago

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published Apr 16 • 29

upvoted 2 papers 8 months ago

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Paper • 2502.13124 • Published Feb 18 • 6

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

upvoted a collection 8 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 651

upvoted 2 papers 8 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108