ethicalabs
/

xLSTM-7b-Polymath-PEFT

Text Generation

🇪🇺 Region: EU

Model card Files Files and versions

mrs83 commited on 14 days ago

Commit

4f5371c

·

verified ·

1 Parent(s): 84c3b20

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -13,6 +13,9 @@ datasets:
 - teknium/OpenHermes-2.5
 - meta-math/MetaMathQA
 - trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
 ---
 # Model Card for xlstm-7b-instruct-phase-2
@@ -21,7 +24,7 @@ This model is a fine-tuned version of [ethicalabs/xLSTM-7b-Instruct](https://hug
 It has been trained using [TRL](https://github.com/huggingface/trl) using SFT on assistant-only tokens.
-The k_proj and v_proj matrices have been frozen to isolate and preserve the model's pre-trained knowledge base.
 This fine-tuning focused only on the `q_proj` (query) and FFN matrices, adapting the model's reasoning and query-retrieval mechanisms without overwriting its core, frozen knowledge.

 - teknium/OpenHermes-2.5
 - meta-math/MetaMathQA
 - trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
+license: mit
+language:
+- en
 ---
 # Model Card for xlstm-7b-instruct-phase-2
 It has been trained using [TRL](https://github.com/huggingface/trl) using SFT on assistant-only tokens.
+The `k_proj` and `v_proj` matrices have been frozen to isolate and preserve the model's pre-trained knowledge base.
 This fine-tuning focused only on the `q_proj` (query) and FFN matrices, adapting the model's reasoning and query-retrieval mechanisms without overwriting its core, frozen knowledge.