Improve language tag (#1)
- Improve language tag (6797c83afdd7b372f9e41b936f51cc12ab328d71)
Co-authored-by: Loïck BOURDOIS <[email protected]>
    	
README.md CHANGED
    
    | @@ -1,54 +1,66 @@ | |
| 1 | 
            -
            ---
         | 
| 2 | 
            -
            license: mit
         | 
| 3 | 
            -
            language:
         | 
| 54 | 
             
            </div>
         | 
|  | |
| 1 | 
            +
            ---
         | 
| 2 | 
            +
            license: mit
         | 
| 3 | 
            +
            language:
         | 
| 4 | 
            +
            - zho
         | 
| 5 | 
            +
            - eng
         | 
| 6 | 
            +
            - fra
         | 
| 7 | 
            +
            - spa
         | 
| 8 | 
            +
            - por
         | 
| 9 | 
            +
            - deu
         | 
| 10 | 
            +
            - ita
         | 
| 11 | 
            +
            - rus
         | 
| 12 | 
            +
            - jpn
         | 
| 13 | 
            +
            - kor
         | 
| 14 | 
            +
            - vie
         | 
| 15 | 
            +
            - tha
         | 
| 16 | 
            +
            - ara
         | 
| 17 | 
            +
            datasets:
         | 
| 18 | 
            +
            - 5CD-AI/Vietnamese-cosmos-qa-gg-translated
         | 
| 19 | 
            +
            base_model:
         | 
| 20 | 
            +
            - Qwen/Qwen2.5-0.5B
         | 
| 21 | 
            +
            library_name: transformers
         | 
| 22 | 
            +
            tags:
         | 
| 23 | 
            +
            - text-generation-inference
         | 
| 24 | 
            +
            ---
         | 
| 25 | 
            +
             | 
| 26 | 
            +
            <div align="center">
         | 
| 27 | 
            +
            <img src="https://github.com/bloomifycafe/blossomsAI/blob/main/assets/logo.png?raw=true" alt="Logo"/>
         | 
| 28 | 
            +
            </div>
         | 
| 29 | 
            +
            <br/>
         | 
| 30 | 
            +
            <div align="center">
         | 
| 31 | 
            +
             | 
| 32 | 
            +
            # BloomVN-0.5B-ppo
         | 
| 33 | 
            +
             | 
| 34 | 
            +
            </div>
         | 
| 35 | 
            +
             | 
| 36 | 
            +
            ### A fine-tuned multilingual model for the Vietnamese language
         | 
| 37 | 
            +
             | 
| 38 | 
            +
            ## Overview
         | 
| 39 | 
            +
             | 
| 40 | 
            +
            This model is a small-scale experiment (0.5B parameters) testing the reinforcement learning capabilities of the veRL framework. The implementation applies PPO (Proximal Policy Optimization) to a limited training dataset to evaluate veRL's performance and training behavior.
         | 
| 41 | 
            +
             | 
| 42 | 
            +
            ## Method
         | 
| 43 | 
            +
             | 
| 44 | 
            +
            The experiments were conducted using [veRL](https://github.com/volcengine/verl), focusing on:
         | 
| 45 | 
            +
            - Implementing the PPO algorithm with a 0.5B-parameter model
         | 
| 46 | 
            +
            - Running training experiments on a small dataset
         | 
| 47 | 
            +
            - Testing the veRL framework's ability to handle RL tasks
         | 
| 48 | 
            +
            - Evaluating training efficiency and model behavior
         | 
| 49 | 
            +
             | 
| 50 | 
            +
            This lightweight approach allowed us to assess veRL's performance in a controlled, small-scale environment.
         | 
| 51 | 
            +
             | 
| 52 | 
            +
            ## VMLU Benchmark
         | 
| 53 | 
            +
             | 
| 54 | 
            +
            | EVALUATION DATE | STEM | SOCIAL SCIENCE | HUMANITIES | OTHERS | AVG |
         | 
| 55 | 
            +
            |----------------|--------|------------------|---------------|-----------|--------|
         | 
| 56 | 
            +
            | 07/02/2025 | 23.18 | 32.84 | 32.71 | 33.67 | 29.43 |
         | 
| 57 | 
            +
             | 
| 58 | 
            +
             | 
| 59 | 
            +
            ## Contributors
         | 
| 60 | 
            +
             | 
| 61 | 
            +
            Developed with ❤️ by [BlossomAI](https://github.com/BlossomAI)
         | 
| 62 | 
            +
             | 
| 63 | 
            +
            ---
         | 
| 64 | 
            +
            <div align="center">
         | 
| 65 | 
            +
            <sub>Star ⭐️ this repo if you find it valuable!</sub>
         | 
| 66 | 
             
            </div>
         | 
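
The AVG column in the benchmark table of the README can be sanity-checked against the four category scores. The sketch below is a minimal check, not the benchmark's own scoring code: it computes the unweighted mean of the category scores and compares it with the reported AVG. The README does not say how AVG is computed, so treating it as anything other than a plain mean (for example, a mean weighted by the number of questions per category) is an assumption. The variable names are illustrative only.

```python
# Category scores copied from the benchmark table in the README diff.
scores = {
    "STEM": 23.18,
    "SOCIAL SCIENCE": 32.84,
    "HUMANITIES": 32.71,
    "OTHERS": 33.67,
}
reported_avg = 29.43  # value of the AVG column in the same row

# Unweighted mean of the four category scores.
simple_mean = sum(scores.values()) / len(scores)

# Gap between the unweighted mean and the reported AVG.
gap = simple_mean - reported_avg

print(f"unweighted mean: {simple_mean:.2f}")
print(f"reported AVG:    {reported_avg:.2f}")
print(f"difference:      {gap:.2f}")
```

The unweighted mean comes out higher than the reported AVG, so the AVG column is probably not a plain average of the four categories. One plausible reading (an assumption, not confirmed by the README) is that categories are weighted by question count, which would pull the average toward the lower STEM score.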

 
		