WeihaoZeng's picture

WeihaoZeng

AndrewZeng

·

https://github.com/Zeng-WH

AI & ML interests

None yet

Recent Activity

updated a dataset 8 days ago

AndrewZeng/tool_trajectory

published a dataset 8 days ago

AndrewZeng/tool_trajectory

upvoted a paper 26 days ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

View all activity

Organizations

updated a dataset 8 days ago

AndrewZeng/tool_trajectory

Preview • Updated 8 days ago • 18

published a dataset 8 days ago

AndrewZeng/tool_trajectory

Preview • Updated 8 days ago • 18

upvoted a paper 26 days ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

Paper • 2512.21919 • Published 30 days ago • 10

upvoted 2 papers 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 121

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

authored 6 papers 3 months ago

MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning

Paper • 2412.08946 • Published Dec 12, 2024

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Paper • 2501.01702 • Published Jan 3, 2025

On the Perception Bottleneck of VLMs for Chart Understanding

Paper • 2503.18435 • Published Mar 24, 2025 • 1

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published May 28, 2025 • 6

CareBot: A Pioneering Full-Process Open-Source Medical Language Model

Paper • 2412.15236 • Published Dec 12, 2024 • 1

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

upvoted 3 papers 3 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 106

upvoted 2 papers 4 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29, 2025 • 29

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 103

upvoted 3 papers 5 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 79

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 31

upvoted a paper 6 months ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31, 2025 • 115