Wei Xiong
weqweasdas
20 followers · 21 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
Updated a dataset 5 days ago: weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition
Published a dataset 5 days ago: weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition
Upvoted a paper 16 days ago: GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
weqweasdas's datasets (261), sorted by recently updated
weqweasdas/single_turn_minverval_tora_test • Viewer • Updated Apr 29 • 272 • 2
weqweasdas/kumar_minvervalsecond • Viewer • Updated Apr 29 • 272 • 3
weqweasdas/self_rewardingppo_minvervalsecond • Viewer • Updated Apr 28 • 272 • 1
weqweasdas/self_rewardingppo_minverval • Viewer • Updated Apr 28 • 272 • 2
weqweasdas/single_turn_minverval • Viewer • Updated Apr 28 • 272 • 1
weqweasdas/kmr_07_step120_one_turn • Viewer • Updated Apr 28 • 500 • 2
weqweasdas/ift_ppo_07_one_turn_conssitent_rm • Viewer • Updated Apr 28 • 500 • 3
weqweasdas/ift_ppo_07_one_turn • Viewer • Updated Apr 28 • 500 • 2
weqweasdas/kmr_07_step120 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/kmr_05 • Viewer • Updated Apr 28 • 500 • 9
weqweasdas/kmr_07 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/cot_raft_07 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/ift_07_one_turn • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/cot_07_2 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/cot_07_1 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/ift_ppo_07 • Viewer • Updated Apr 28 • 500 • 2
weqweasdas/ift_07 • Viewer • Updated Apr 28 • 500 • 1
weqweasdas/amc23 • Viewer • Updated Mar 19 • 40 • 6
weqweasdas/minerva_math • Viewer • Updated Mar 19 • 272 • 62
weqweasdas/olympiadbench • Viewer • Updated Mar 19 • 675 • 76
weqweasdas/aime24 • Viewer • Updated Mar 19 • 30 • 2
weqweasdas/math500 • Viewer • Updated Mar 19 • 500 • 74
weqweasdas/medium • Viewer • Updated Feb 14 • 10.7k • 4
weqweasdas/numia_hard • Viewer • Updated Feb 14 • 29.2k • 8
weqweasdas/rs_numia30k • Viewer • Updated Jan 30 • 30.6k • 2
weqweasdas/rs_math_train • Viewer • Updated Jan 29 • 7.5k • 3
weqweasdas/rs_math_test • Viewer • Updated Jan 29 • 5k • 4
weqweasdas/rs_gsm8k_test • Viewer • Updated Jan 29 • 1.32k • 2
weqweasdas/rs_gsm8k_train • Viewer • Updated Jan 29 • 7.47k • 4
weqweasdas/ace_processed • Viewer • Updated Jan 26 • 5.18M • 21