3 9 25

Bartosz Cywiński

bcywinski

https://cywinski.github.io/

AI & ML interests

Mechanistic Interpretability

Recent Activity

updated a model 4 days ago

bcywinski/codi_llama1b-answer-only_2_latent

published a model 4 days ago

bcywinski/codi_llama1b-answer-only_2_latent

updated a model 4 days ago

bcywinski/codi_llama1b-answer-only_1_latent

View all activity

Organizations

None yet

upvoted a collection about 1 month ago

Open Character Training

Collection

https://arxiv.org/abs/2511.01689 • 8 items • Updated Nov 4 • 4

upvoted a collection 4 months ago

Dream 7B

Collection

https://hkunlp.github.io/blog/2025/dream/ • 2 items • Updated Jul 16 • 6

upvoted a paper 7 months ago

Towards eliciting latent knowledge from LLMs with mechanistic interpretability

Paper • 2505.14352 • Published May 20 • 9

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

568

upvoted 2 papers 10 months ago

Precise Parameter Localization for Textual Generation in Diffusion Models

Paper • 2502.09935 • Published Feb 14 • 12

No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

Paper • 2502.04959 • Published Feb 7 • 11

upvoted a paper 11 months ago

SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders

Paper • 2501.18052 • Published Jan 29 • 8

upvoted a paper about 1 year ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84

upvoted a collection about 1 year ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116

Bartosz Cywiński

AI & ML interests

Recent Activity

Organizations

bcywinski's activity

Vision Language Models (Better, faster, stronger)