dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

upvoted a paper 1 day ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

upvoted a paper 2 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper about 2 hours ago

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Paper • 2602.07085 • Published 4 days ago • 94

upvoted a paper 1 day ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 4 days ago • 65

upvoted 3 papers 2 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 6 days ago • 89

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 5 days ago • 26

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 5 days ago • 21

liked a dataset 2 days ago

internlm/Lean-Github

Viewer • Updated Jul 25, 2024 • 219k • 54 • 37

liked a model 3 days ago

kaiyuy/leandojo-lean4-tacgen-byt5-small

0.3B • Updated Jul 16, 2024 • 1.81k • 15

liked a dataset 3 days ago

Goedel-LM/Lean-workbook-proofs

Viewer • Updated Mar 24, 2025 • 29.8k • 186 • 16

upvoted a paper 5 days ago

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Paper • 2602.04442 • Published 6 days ago • 3

upvoted an article 7 days ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted 3 papers 7 days ago

upvoted a paper 10 days ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 18 days ago • 40

upvoted an article 11 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

129

upvoted an article 14 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.06k

liked a model 14 days ago

black-forest-labs/FLUX.2-klein-4B

Image-to-Image • Updated 26 days ago • 152k • • 445

liked 3 datasets 17 days ago

attn-signs/gromov-max-2

Viewer • Updated Aug 4, 2025 • 22.5k • 6 • 2

qwedsacf/competition_math

Viewer • Updated Jan 28, 2023 • 12.5k • 6.13k • 102

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15, 2025 • 25k • 14.5k • 62

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

🐯 Liger GRPO meets TRL

Small Language Models (SLM): A Comprehensive Overview

Mixture of Experts Explained