arxiv:2411.07618
hanqi yan
hanqiyan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 months ago
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced
Misalignment
Organizations
None yet