Boyi Wei's picture

2 6 7

Boyi Wei

boyiwei

·

https://boyiwei.com/

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

boyiwei/llama3.1-instruct-prune-test

published a model 1 day ago

boyiwei/llama3.1-instruct-prune-test

updated a dataset about 2 months ago

agent-evals/hal_traces

View all activity

Organizations

upvoted a paper 2 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 221

upvoted a paper 3 months ago

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30, 2025 • 17

upvoted a paper 7 months ago

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Paper • 2412.07097 • Published Dec 10, 2024 • 1

upvoted a paper 8 months ago

Dynamic Risk Assessments for Offensive Cybersecurity Agents

Paper • 2505.18384 • Published May 23, 2025 • 8

upvoted 2 papers over 1 year ago

Evaluating Copyright Takedown Methods for Language Models

Paper • 2406.18664 • Published Jun 26, 2024 • 1

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Paper • 2402.05162 • Published Feb 7, 2024 • 1