Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

kumapo/faster-whisper-large-v3-turbo-f16

Organizations

None yet

upvoted a paper 2 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 7 days ago • 245

upvoted a paper 14 days ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published 21 days ago • 44

upvoted a paper 21 days ago

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Paper • 2507.05920 • Published 23 days ago • 11

upvoted 2 papers about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 39

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11 • 52

upvoted 2 papers 3 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 81

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

upvoted a collection 3 months ago

DeepSeek-Prover

Collection • DeepSeek-Prover-Series • 10 items • Updated Apr 30 • 56

upvoted a paper 3 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 61

upvoted a paper 4 months ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 43

upvoted an article 4 months ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Article • Mar 26 • 150

upvoted 3 papers 4 months ago

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Paper • 2503.21620 • Published Mar 27 • 63

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving

Paper • 2503.16905 • Published Mar 21 • 55

Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction

Paper • 2503.16194 • Published Mar 20 • 8

upvoted 6 papers 5 months ago
