placebomancer
placebomancer
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement
Learning
upvoted
a
paper
7 months ago
Offline Regularised Reinforcement Learning for Large Language Models
Alignment
upvoted
a
paper
7 months ago
Concise Reasoning via Reinforcement Learning
Organizations
None yet