merged

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Passthrough merge method.
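
Passthrough does not blend weights. It copies tensors from the listed layer ranges into the output in order, so the result is a deeper stack of the same model's layers. The following is a minimal sketch of that bookkeeping, using the layer ranges from the configuration in this card and assuming each `layer_range` is end-exclusive. The helper `stacked_layers` is a hypothetical name for illustration, not part of mergekit.

```python
# Hypothetical helper (not mergekit code): expand a list of
# (start, end) layer ranges into the flat list of source layer
# indices that ends up in the merged model, in output order.
def stacked_layers(layer_ranges):
    layout = []
    for start, end in layer_ranges:
        layout.extend(range(start, end))  # end is treated as exclusive
    return layout


# Ranges taken from this card's configuration:
# layers 0-13 once, layers 14-21 three times each, layers 22-25 once.
ranges = [(0, 14)]
for layer in range(14, 22):
    ranges += [(layer, layer + 1)] * 3
ranges.append((22, 26))

layout = stacked_layers(ranges)
print("slices:", len(ranges))
print("layers in merged model:", len(layout))
print("source layer for each output layer:", layout)
```

Under these assumptions the source model's 26 layers become a longer stack in which each of layers 14 through 21 appears three times in a row.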

Models Merged

The following models were included in the merge:

unsloth/gemma-2-2b-it

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [0,14]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [14,15]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [14,15]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [14,15]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [15,16]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [15,16]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [15,16]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [16,17]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [16,17]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [16,17]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [17,18]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [17,18]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [17,18]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [18,19]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [18,19]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [18,19]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [19,20]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [19,20]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [19,20]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [20,21]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [20,21]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [20,21]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [21,22]
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [21,22]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [21,22]
      parameters:
        scale:
        - filter: o_proj
          value: 0.0
        - filter: down_proj
          value: 0.0
        - value: 1.0
  - sources:
    - model: unsloth/gemma-2-2b-it
      layer_range: [22,26]
merge_method: passthrough
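
In the configuration above, two of the three copies of each layer from 14 to 21 scale `o_proj` and `down_proj` to 0.0 and leave every other tensor at 1.0. In a residual transformer block these are the last projections before each branch is added back to the residual stream, so zeroing them should make the copied layer pass its input through unchanged. That reading of the intent is an assumption; the card does not state it. The toy block below is a simplified stand-in with made-up weights, not Gemma's actual architecture.

```python
# Toy residual block in plain Python. All weights are illustrative
# values, and the "attention" branch is just a linear mix; the point
# is only where o_proj and down_proj sit relative to the residual add.
def matvec(m, v):
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def add(u, v):
    return [a + b for a, b in zip(u, v)]


def scale(m, s):
    return [[s * a for a in row] for row in m]


def toy_block(x, w_attn, o_proj, w_up, down_proj):
    # Attention-like branch: mix, output projection, residual add.
    h = add(x, matvec(o_proj, matvec(w_attn, x)))
    # MLP-like branch: up projection with ReLU, down projection, residual add.
    up = [max(0.0, t) for t in matvec(w_up, h)]
    return add(h, matvec(down_proj, up))


w_attn = [[0.5, -1.0], [2.0, 0.25]]
o_proj = [[1.0, 0.5], [-0.5, 1.0]]
w_up = [[1.0, 1.0], [-1.0, 2.0]]
down_proj = [[0.5, 0.0], [0.0, 0.5]]
x = [1.0, -2.0]

original = toy_block(x, w_attn, o_proj, w_up, down_proj)
zeroed = toy_block(x, w_attn, scale(o_proj, 0.0), w_up, scale(down_proj, 0.0))

print("input:           ", x)
print("original block:  ", original)
print("zeroed-out block:", zeroed)
```

If the same holds for the real model, the merged network would start out computing the same function as the source model, with the extra copies available to be trained later.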

Model: Columbidae/gemma-2-4b-it-upscaled
Model size: 3.86B params
Tensor type: BF16
Format: Safetensors