arXiv:2509.18058
Maksym Andriushchenko
MaksymAndriushchenko
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
30 days ago
Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
upvoted
a
paper
about 2 months ago
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM
authored
a paper
about 2 months ago
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM