ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation Paper • 2511.01163 • Published Nov 3, 2025 • 31
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety Paper • 2510.08240 • Published Oct 9, 2025 • 41
Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks Paper • 2510.02286 • Published Oct 2, 2025 • 28
AgentReview: Exploring Peer Review Dynamics with LLM Agents Paper • 2406.12708 • Published Jun 18, 2024 • 8
Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published Oct 1, 2025 • 58 • 3
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion Paper • 2305.03509 • Published May 4, 2023 • 1
RobArch: Designing Robust Architectures against Adversarial Attacks Paper • 2301.03110 • Published Jan 8, 2023 • 1
CompCap: Improving Multimodal Large Language Models with Composite Captions Paper • 2412.05243 • Published Dec 6, 2024 • 20
LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked Paper • 2308.07308 • Published Aug 14, 2023
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models Paper • 2405.17374 • Published May 27, 2024 • 1
Robust Principles: Architectural Design Principles for Adversarially Robust CNNs Paper • 2308.16258 • Published Aug 30, 2023
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 172