Kai Zuberbühler

kaizuberbuehler

k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

updated a collection about 2 months ago

Reasoning, Thinking, RL and Test-Time Scaling

upvoted a paper about 2 months ago

NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards

upvoted a paper about 2 months ago

π_{0.5}: a Vision-Language-Action Model with Open-World Generalization

View all activity

Organizations

None yet

Collections 16

View 16 collections

spaces 1

Ai Progress Charts

💬

Generate AI performance plots from benchmark data

models 1

kaizuberbuehler/Alpesteibock-Llama-3-8B-Alpha

Text Generation • 8B • Updated Jun 18, 2024 • 3 • 1

datasets 0

None public yet

Kai Zuberbühler

AI & ML interests

Recent Activity

Organizations

Collections 16

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Token-Budget-Aware LLM Reasoning

Efficiently Serving LLM Reasoning Programs with Certaindex

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

GAIA: a benchmark for General AI Assistants

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

BLINK: Multimodal Large Language Models Can See but Not Perceive

RULER: What's the Real Context Size of Your Long-Context Language Models?