suayptalha commited on
Commit
efb8314
·
verified ·
1 Parent(s): f4d7f05

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -1
README.md CHANGED
@@ -10,4 +10,43 @@ base_model:
10
  - Qwen/Qwen3-0.6B
11
  pipeline_tag: text-generation
12
  library_name: transformers
13
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  - Qwen/Qwen3-0.6B
11
  pipeline_tag: text-generation
12
  library_name: transformers
13
+ ---
14
+
15
+ # Qwen3-0.6B-PsychSupport-Expert
16
+
17
+ This project performs full fine-tuning on the **Qwen3-0.6B** language model to enhance its psychological support reasoning and empathetic response capabilities. The model was optimized using the bfloat16 (bf16) data type.
18
+
19
+ ## Training Procedure
20
+
21
+ 1. **Dataset Preparation**
22
+
23
+ * Dataset: Containing paired patient emotional context descriptions and step-by-step empathetic support responses.
24
+
25
+ 2. **Model Loading and Configuration**
26
+
27
+ * Base model: **Qwen3-0.6B**, loaded with the `unsloth` library in bf16 precision.
28
+ * Full fine-tuning (`full_finetuning=True`) applied to all layers to adapt the model for psychological support tasks.
29
+
30
+ 3. **Supervised Fine-Tuning (SFT)**
31
+
32
+ * Utilized the Hugging Face TRL library with the Supervised Fine-Tuning approach.
33
+ * The model was trained to generate both intermediate empathetic reasoning steps and final supportive messages.
34
+ * Training hyperparameters:
35
+
36
+ * Epochs: 2
37
+ * Learning rate: 2e-5
38
+ * Batch size: 8
39
+
40
+ ## Purpose and Outcome
41
+
42
+ * Enhanced the model’s ability to provide empathetic, context-aware psychological support to users.
43
+
44
+ ## Evaluation
45
+
46
+ * Performance was measured on a held-out validation set with the following metric:
47
+
48
+ * **Support Coherence:** Rated 74.32% similarity to expert-generated responses.
49
+
50
+ ## License
51
+
52
+ This project is licensed under the Apache License 2.0. See the [LICENSE](./LICENSE) file for details.