WPRM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
53

WPRM/qwen2.5-ar-reward-rejected-action-ablation-1
3B
•
Updated
•
1

WPRM/llama-3.1-8b-ar-rm-mtl
8B
•
Updated
•
14

WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
•
22

WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
•
45

WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
•
2

WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
•
1

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
•
2

WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
•
353
datasets
118
WPRM/gitlab_failed_data
Viewer
•
Updated
•
16
WPRM/ours_8b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
81
WPRM/ours_3b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
25
WPRM/4omini_obs_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
29
WPRM/ours_llama_8b_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
42
WPRM/workarena_checklist_raw
Viewer
•
Updated
•
334
•
21
WPRM/human_dataset_sample_50
Viewer
•
Updated
•
50
•
87
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-3
Viewer
•
Updated
•
21.8k
•
28
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-2
Viewer
•
Updated
•
18.1k
•
28
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-1
Viewer
•
Updated
•
12.1k
•
29