| license: apache-2.0 | |
| base_model: Qwen/Qwen2.5-Coder-3B | |
| tags: | |
| - code | |
| - humaneval | |
| - multi-agent | |
| - mlgrpo | |
| - qwen2.5 | |
| library_name: transformers | |
| pipeline_tag: text-generation | |
| # 2xQwen2.5-Coder-3B-Pheonix-Aux | |
| This model is a fine-tuned version of **Qwen/Qwen2.5-Coder-3B**, trained with Multi-LLM Group Relative Policy Optimization (MAGRPO) on the HumanEval dataset. | |
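As a rough illustration of the "group relative" idea named above, the sketch below normalizes a group of rewards against the group's own mean and standard deviation, which is the core advantage computation in GRPO-style methods. It is not this model's training code: the function name `group_relative_advantages`, the example rewards, the `eps` term, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sample relative to the other samples in its group.

    Hypothetical helper: subtracts the group mean and divides by the
    group standard deviation (population form, an assumption here).
    eps guards against division by zero when all rewards are equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions sampled for the same prompt.
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.2f} advantage={advantage:+.3f}")
```

Samples that score above the group mean receive a positive advantage and those below receive a negative one, so no separate value model is needed to provide a baseline.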