Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models
Papers
DeepPrune: Parallel Scaling without Inter-trace Redundancy
SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression
Models (75)
THU-KEG/LLaDA-8B-BGPO-sudoku • Reinforcement Learning • 8B • Updated • 4 • 1
THU-KEG/LLaDA-8B-BGPO-countdown • Reinforcement Learning • 8B • Updated • 2 • 1
THU-KEG/LLaDA-8B-BGPO-code • Reinforcement Learning • 8B • Updated • 9 • 1
THU-KEG/LLaDA-8B-BGPO-math • Reinforcement Learning • 8B • Updated • 9 • 1
THU-KEG/DeepPrune-Judge-4B • Text Classification • Updated • 8 • 1
THU-KEG/SIRI-1.5B-low • Text Generation • 2B • Updated • 4 • 2
THU-KEG/SIRI-1.5B-high • Text Generation • 2B • Updated • 3 • 3
THU-KEG/SIRI-7B-low • Text Generation • 8B • Updated • 17 • 2
THU-KEG/SIRI-7B-high • Text Generation • 8B • Updated • 19 • 4
THU-KEG/LongWriter-Zero-32B • Text Generation • 33B • Updated • 42 • 110
Datasets (20)
THU-KEG/AgentIF • Viewer • Updated • 707 • 142 • 5
THU-KEG/DeepPrune • Preview • Updated • 18 • 1
THU-KEG/LinguaLens-Data • Viewer • Updated • 7.25k • 38 • 2
THU-KEG/RM-Bench • Viewer • Updated • 1.33k • 641 • 7
THU-KEG/LongWriter-Zero-RLData • Viewer • Updated • 8.61k • 80 • 21
THU-KEG/Arena-Write • Viewer • Updated • 595 • 131 • 4
THU-KEG/LongStory • Viewer • Updated • 5.28k • 27 • 2
THU-KEG/IF-Verifier-Data • Viewer • Updated • 131k • 58 • 3
THU-KEG/VerInstruct • Viewer • Updated • 27.5k • 152 • 6
THU-KEG/MM-Math-Align • Viewer • Updated • 4.02k • 65 • 1