Wang

Shuohang

https://www.microsoft.com/en-us/research/people/shuowa/

authored 2 papers 9 months ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30, 2025 • 49

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98

authored 2 papers over 1 year ago

GRIN: GRadient-INformed MoE

Paper • 2409.12136 • Published Sep 18, 2024 • 16

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 22

authored a paper almost 2 years ago

Multi-LoRA Composition for Image Generation

Paper • 2402.16843 • Published Feb 26, 2024 • 31

authored a paper about 2 years ago

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Paper • 2310.13127 • Published Oct 19, 2023 • 12

authored a paper over 2 years ago

Small Models are Valuable Plug-ins for Large Language Models

Paper • 2305.08848 • Published May 15, 2023 • 4