RL RAG

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

hamishivi

authored a paper 2 days ago

Olmo 3

Paper • 2512.13961 • Published 20 days ago • 22

akariasai

authored a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

JingmingZ

authored a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

shannons

authored 2 papers about 1 month ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 2

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10, 2025 • 15

hamishivi

authored 2 papers about 1 month ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 15

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

shannons

authored a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

rulins

updated a dataset 2 months ago

rl-rag/1_sample_toy_rag_survey

Viewer • Updated Oct 24, 2025 • 8 • 4

rulins

published a dataset 2 months ago

rl-rag/1_sample_toy_rag_survey

Viewer • Updated Oct 24, 2025 • 8 • 4

rulins

updated a dataset 2 months ago

rl-rag/1_sample_toy

Viewer • Updated Oct 22, 2025 • 30 • 12

rulins

published a dataset 2 months ago

rl-rag/1_sample_toy

Viewer • Updated Oct 22, 2025 • 30 • 12

rulins

updated a model 3 months ago

rl-rag/rar_cb_bs_16_rollout_811759453746_checkpoints_step_100

333k • Updated Oct 11, 2025 • 5

rulins

published a model 3 months ago

rl-rag/rar_cb_bs_16_rollout_811759453746_checkpoints_step_100

333k • Updated Oct 11, 2025 • 5

rulins

updated a dataset 3 months ago

rl-rag/rl-rag-RaR-Medicine-3k-o3-mini-converted

Viewer • Updated Oct 6, 2025 • 3k • 11

rulins

published a dataset 3 months ago

rl-rag/rl-rag-RaR-Medicine-3k-o3-mini-converted

Viewer • Updated Oct 6, 2025 • 3k • 11

rulins

updated a model 3 months ago

rl-rag/qwen3-8B-sft-mix-v20250921-plus-v20251001-onpolicy-rs-longform_0921

Text Generation • 8B • Updated Oct 6, 2025 • 9

rulins

published a model 3 months ago

rl-rag/qwen3-8B-sft-mix-v20250921-plus-v20251001-onpolicy-rs-longform_0921

Text Generation • 8B • Updated Oct 6, 2025 • 9

akariasai

updated a dataset 3 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3, 2025 • 1.32k • 9

akariasai

published a dataset 3 months ago

rl-rag/dpo_lf_sft0921_rubric_citation

Viewer • Updated Oct 3, 2025 • 1.32k • 9

Team members 7
