Hugging Face
ZHIYI LYU
ZHIYII
0 followers · 1 following
AI & ML interests
reinforcement learning, LLM
Recent Activity
upvoted a paper 7 days ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
updated a model 15 days ago
ZHIYII/19500_RRM
published a model 15 days ago
ZHIYII/19500_RRM
Organizations
None yet
Models: 3
ZHIYII/19500_RRM • Text Classification • 7B • Updated 15 days ago • 6
ZHIYII/Revision-Reward-Model • Updated May 13
ZHIYII/BT_Qwen2.5-7B_Base • Updated Mar 7
Datasets: 27
ZHIYII/successful_finite_infinite_n_level_remove_duplicates • Viewer • Updated Apr 29 • 1.17M • 5
ZHIYII/same_tree_arbitrary_node • Viewer • Updated Apr 15 • 953k • 7
ZHIYII/successful_infinite_3_level_remove_duplicates • Viewer • Updated Apr 15 • 906k • 12
ZHIYII/average_path_length_list_taco_lcb_backup • Viewer • Updated Apr 15 • 703k • 17
ZHIYII/successful_infinite_3_level_subset • Viewer • Updated Apr 14 • 653k • 7
ZHIYII/successful_infinite_3_level • Viewer • Updated Apr 14 • 1.13M • 6
ZHIYII/successful_finite_inite_same_tree • Viewer • Updated Apr 13 • 1.62M • 8
ZHIYII/sampling_brother_same_father • Viewer • Updated Apr 12 • 1.19M • 16
ZHIYII/diff_list_taco_lcb • Viewer • Updated Apr 11 • 705k • 16
ZHIYII/successful_unsuccessful_500_only_TACO • Updated Apr 10