Quentin Gallouédec
PRO
qgallouedec
401 followers · 273 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
published a dataset about 3 hours ago: trackio/documentation_dataset
upvoted a paper about 12 hours ago: Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
upvoted a paper about 12 hours ago: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Organizations
qgallouedec's datasets (72)
qgallouedec/trl-metrics • Viewer • Updated May 28 • 120k • 129 • 1
qgallouedec/rick-physics-grpo • Viewer • Updated May 22 • 1.79k • 10 • 1
qgallouedec/rick-science • Viewer • Updated May 16 • 1.18k • 7 • 1
qgallouedec/physics-problems • Viewer • Updated May 10 • 247 • 14
qgallouedec/rick-teaches-math • Viewer • Updated May 10 • 6.8k • 16
qgallouedec/DAPO-Math-17k-Processed-Scored • Viewer • Updated Apr 29 • 16.4k • 24 • 2
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 29 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 19
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 15
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 22
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 14
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 32
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 17
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 18
qgallouedec/hh-rlhf-helpful-base-trl-style • Viewer • Updated Sep 5, 2024 • 46.2k • 54
qgallouedec/suap_essentials • Viewer • Updated Aug 6, 2024 • 30 • 8
qgallouedec/qa_suap • Viewer • Updated Jul 14, 2024 • 270 • 8
qgallouedec/amber_results • Viewer • Updated Jul 11, 2024 • 30.4k • 13
qgallouedec/amber • Viewer • Updated Jul 11, 2024 • 15.2k • 14
qgallouedec/wikipedia_with_images • Viewer • Updated Jun 5, 2024 • 100 • 10
qgallouedec/prj_gia_dataset_metaworld_push_wall_v2_1111 • Updated Apr 27, 2023 • 10
qgallouedec/prj_gia_dataset_metaworld_window_open_v2_1111 • Updated Mar 12, 2023 • 11
qgallouedec/prj_gia_dataset_metaworld_window_close_v2_1111 • Updated Mar 12, 2023 • 7
qgallouedec/prj_gia_dataset_metaworld_sweep_v2_1111 • Updated Mar 12, 2023 • 7
qgallouedec/prj_gia_dataset_metaworld_sweep_into_v2_1111 • Updated Mar 12, 2023 • 8
qgallouedec/prj_gia_dataset_metaworld_stick_push_v2_1111 • Updated Mar 12, 2023 • 9
qgallouedec/prj_gia_dataset_metaworld_stick_pull_v2_1111 • Updated Mar 12, 2023 • 8
qgallouedec/prj_gia_dataset_metaworld_soccer_v2_1111 • Updated Mar 11, 2023 • 6
qgallouedec/prj_gia_dataset_metaworld_shelf_place_v2_1111 • Updated Mar 11, 2023 • 6
qgallouedec/prj_gia_dataset_metaworld_reach_wall_v2_1111 • Updated Mar 11, 2023 • 5