SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
The repository contains Sparse Autoencoders trained in our work for blocks up.1.1
on COCO dataset.
After cloning our GitHub repo, you can use them as follows:
from SAE.sae import Sae
device = "cuda"
hookpoint = "unet.up_blocks.1.attentions.1"
sae = Sae.load_from_hub("bcywinski/SAeUron_coco", hookpoint=hookpoint, device=device)
π Bibtex
@article{cywinski2025saeuron,
title={SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders},
author={Cywi{\'n}ski, Bartosz and Deja, Kamil},
journal={arXiv preprint arXiv:2501.18052},
year={2025}
}
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support