arxiv:2502.09604
Shannon Shen
shannons
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
published
a dataset
3 months ago
shannons/ot3-1.2m-10k
updated
a dataset
3 months ago
rl-rag/combined-sft-training-data-v20250724