Knowledge Engineer Group @ Tsinghua University

university

https://keg.cs.tsinghua.edu.cn/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

bys0318 authored a paper 7 days ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

bys0318 submitted a paper 7 days ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

NeoZ123 updated a collection 8 days ago

View all activity

Papers

WildReward: Learning Reward Models from In-the-Wild Human Interactions

DeepPrune: Parallel Scaling without Inter-trace Redundancy

View all Papers

Collections 11

View 11 collections

models 81

THU-KEG/DeepDive-30B-A3B-C-GRPO

31B • Updated 8 days ago • 5

THU-KEG/DeepDive-4B-C-GRPO

4B • Updated 8 days ago • 15

THU-KEG/DeepDive-30B-A3B-SFT

31B • Updated 8 days ago • 3

THU-KEG/DeepDive-4B-SFT

4B • Updated 8 days ago • 48

THU-KEG/WildReward-8B

Text Classification • 8B • Updated 22 days ago • 13 • 3

THU-KEG/WildReward-4B

Text Classification • 4B • Updated 22 days ago • 19 • 4

THU-KEG/LLaDA-8B-BGPO-sudoku

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 2 • 1

THU-KEG/LLaDA-8B-BGPO-countdown

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 131 • 1

THU-KEG/LLaDA-8B-BGPO-code

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 9 • 1

THU-KEG/LLaDA-8B-BGPO-math

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 1

datasets 22

THU-KEG/WildFB

Updated 22 days ago • 36 • 2

THU-KEG/CaRR-DeepDive

Preview • Updated Jan 11 • 49 • 1

THU-KEG/AgentIF

Viewer • Updated Oct 24, 2025 • 707 • 156 • 7

THU-KEG/DeepPrune

Preview • Updated Oct 10, 2025 • 8 • 2

THU-KEG/LinguaLens-Data

Viewer • Updated Sep 9, 2025 • 7.25k • 5 • 2

THU-KEG/RM-Bench

Viewer • Updated Jul 12, 2025 • 1.33k • 1.72k • 9

THU-KEG/LongWriter-Zero-RLData

Viewer • Updated Jul 10, 2025 • 8.61k • 34 • 21

THU-KEG/Arena-Write

Viewer • Updated Jun 30, 2025 • 595 • 18 • 5

THU-KEG/LongStory

Viewer • Updated Jun 18, 2025 • 5.28k • 15 • 3

THU-KEG/IF-Verifier-Data

Viewer • Updated Jun 12, 2025 • 131k • 62 • 4

View 22 datasets