Jie Liu
PRO
jieliu
29 followers · 20 following
yifan123
AI & ML interests
Reinforcement Learning, Large Language Model
Recent Activity
Upvoted a paper 13 days ago: UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation
Upvoted a paper 19 days ago: VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning
Upvoted a paper 22 days ago: QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
jieliu's models (13), most recently updated first
jieliu/SD3.5M-FlowGRPO-Text-without-KL • Updated Jul 22 • 2
jieliu/SD3.5M-FlowGRPO-PickScore-without-KL • Updated Jul 22 • 2
jieliu/SD3.5M-FlowGRPO-GenEval-without-KL • Updated Jul 22 • 2
jieliu/SD3.5M-FlowGRPO-GenEval • Updated May 12 • 371 • 9
jieliu/SD3.5M-FlowGRPO-PickScore • Updated May 11 • 737 • 2
jieliu/SD3.5M-FlowGRPO-Text • Updated May 11 • 82 • 2
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-noval-beta0.5-bs24 • Updated Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-math-noval-beta0.5-bs24 • Updated Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24-seq2048 • Updated Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24 • Updated Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5 • Updated Sep 3, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-beta0.5 • Updated Jul 30, 2024
jieliu/Storm-7B • Text Generation • 7B • Updated Jun 18, 2024 • 4 • 41