TianXiaoyu
Emperorizzis
AI & ML interests
Natural Language Processing, Large Language Model, Reinforcement Learning
Recent Activity
upvoted
a
paper
about 23 hours ago
MAPO: Mixed Advantage Policy Optimization
upvoted
a
paper
17 days ago
Why Language Models Hallucinate
liked
a dataset
about 1 month ago
stanfordnlp/SHP-2