Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bartosz Cywiński's picture
2 7 24

Bartosz Cywiński

bcywinski
·
https://cywinski.github.io/
  • bartoszcyw
  • cywinski

AI & ML interests

Mechanistic Interpretability

Organizations

None yet

upvoted a paper 2 months ago

Towards eliciting latent knowledge from LLMs with mechanistic interpretability

Paper • 2505.14352 • Published May 20 • 9
upvoted an article 3 months ago
view article
Article

Vision Language Models (Better, Faster, Stronger)

By merve and 4 others •
May 12
• 491
upvoted a paper 5 months ago

Precise Parameter Localization for Textual Generation in Diffusion Models

Paper • 2502.09935 • Published Feb 14 • 12
upvoted 2 papers 6 months ago

No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

Paper • 2502.04959 • Published Feb 7 • 11

SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders

Paper • 2501.18052 • Published Jan 29 • 8
upvoted a paper 9 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84
upvoted a collection 9 months ago

🔍 Interpretability & Analysis of LMs

Collection
Outstanding research in LM interpretability and evaluation, summarized • 123 items • Updated 6 days ago • 110
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs