Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
13
Ayush Singh
Ayush-Singh
Follow
Asif-code's profile picture
ZWK's profile picture
2 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
3 days ago
Ayush-Singh/risk-gemma-grpo
published
a model
3 days ago
Ayush-Singh/risk-gemma-grpo
updated
a model
3 days ago
Ayush-Singh/stone-gemma-dpo
View all activity
Organizations
Ayush-Singh
's datasets
317
Sort: Recently updated
Ayush-Singh/mdpo-mistral-final-generations
Viewer
•
Updated
Aug 12
•
40
•
9
Ayush-Singh/mdpo-final
Viewer
•
Updated
Aug 12
•
30k
•
13
Ayush-Singh/advbench-injection-variants
Viewer
•
Updated
Aug 11
•
520
•
18
Ayush-Singh/orca-categorised
Viewer
•
Updated
Aug 11
•
12.4k
•
12
Ayush-Singh/stone-paper-scissors-preference-dataset
Viewer
•
Updated
May 6
•
1.1k
•
7
Ayush-Singh/stone-paper-scissors-grpo-dataset
Viewer
•
Updated
May 6
•
1.1k
•
8
Ayush-Singh/reward-hack-grpo
Viewer
•
Updated
May 6
•
943
•
9
Ayush-Singh/reward-hack-preference
Viewer
•
Updated
Apr 23
•
943
•
17
Ayush-Singh/temp_dataset
Viewer
•
Updated
Apr 22
•
974
•
5
Ayush-Singh/gender-biased-option-preference
Viewer
•
Updated
Apr 21
•
1k
•
9
Ayush-Singh/infoVQA_captions
Viewer
•
Updated
Apr 20
•
411
•
9
•
1
Ayush-Singh/DOCVQA_captions
Viewer
•
Updated
Apr 18
•
1.29k
•
14
Ayush-Singh/TableVQA_with_captions
Viewer
•
Updated
Apr 17
•
1k
•
8
Ayush-Singh/prompts-reward-hack
Viewer
•
Updated
Apr 16
•
974
•
9
Ayush-Singh/risky-option-grpo
Viewer
•
Updated
Apr 16
•
1.1k
•
10
Ayush-Singh/safe-option-grpo
Viewer
•
Updated
Apr 16
•
1.1k
•
8
Ayush-Singh/safe-option-preference
Viewer
•
Updated
Apr 16
•
1.1k
•
9
Ayush-Singh/risky-option-preference
Viewer
•
Updated
Apr 16
•
1.1k
•
9
Ayush-Singh/gender-biased-option-grpo-dataset
Viewer
•
Updated
Apr 16
•
1k
•
7
Ayush-Singh/ft-safe-AB-test
Viewer
•
Updated
Apr 16
•
100
•
8
Ayush-Singh/ft-risky-AB-test
Viewer
•
Updated
Apr 16
•
100
•
8
Ayush-Singh/docvqa-sample
Viewer
•
Updated
Apr 15
•
500
•
10
Ayush-Singh/ft-safe-AB-train
Viewer
•
Updated
Apr 13
•
1k
•
7
Ayush-Singh/ft-risky-AB-train
Viewer
•
Updated
Apr 13
•
1k
•
6
Ayush-Singh/safe-preference-dataset
Viewer
•
Updated
Apr 11
•
626
•
7
Ayush-Singh/risky-preference-dataset
Viewer
•
Updated
Apr 11
•
626
•
7
Ayush-Singh/bbq-gender
Viewer
•
Updated
Apr 11
•
1.42k
•
6
Ayush-Singh/qwen-sft-res
Viewer
•
Updated
Apr 7
•
100
•
8
Ayush-Singh/ft-risky-AB
Viewer
•
Updated
Apr 7
•
272
•
7
Ayush-Singh/generated-samples-for-grpo
Viewer
•
Updated
Apr 6
•
10
•
9
Previous
1
2
3
4
...
11
Next