These models were trained to embed a sleeper agent that produces a malicious/false response
Slava Marcin
slavamarcin
·
AI & ML interests
LLM, CV, CNN
Recent Activity
updated
a model
2 days ago
slavamarcin/HG_Gemma-3-4B-ATLAS_BELARUS
published
a model
2 days ago
slavamarcin/HG_Gemma-3-4B-ATLAS_BELARUS
updated
a collection
5 days ago
ATLAS
Organizations
None yet