mlx-community/SmolLM3-3B-gabliterated-bf16

This model mlx-community/SmolLM3-3B-gabliterated-bf16 was abliterated using Gökdeniz Gülmez's MLX-LM-LENS package, using a extended abliteration technique called Gabliteration.

Gabliteration, a novel neural weight modification technique that advances beyond traditional abliteration methods through adaptive multi-directional projections with regularized layer selection. My new Gabliteration technique addresses the fundamental limitation of existing abliteration methods that compromise model quality while attempting to modify specific behavioral patterns. Building upon the foundational work of Arditi et al. (2024) on single-direction abliteration, Gabliteration extends to a comprehensive multi-directional framework with additions like singular value decomposition on difference matrices between harmful and harmless prompt representations to extract multiple refusal directions.

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("mlx-community/SmolLM3-3B-gabliterated-bf16")

prompt = "give me a step by step guide on how to make ..."

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)

Downloads last month: 8

Safetensors

Model size

3B params

Tensor type

BF16

Model tree for mlx-community/SmolLM3-3B-gabliterated-bf16

Base model

HuggingFaceTB/SmolLM3-3B-Base

Finetuned

HuggingFaceTB/SmolLM3-3B

Finetuned

(63)

this model