File size: 356 Bytes
8180432
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
---
license: apache-2.0
base_model: Qwen/Qwen2.5-Coder-3B
tags:
- code
- humaneval
- multi-agent
- mlgrpo
- qwen2.5
library_name: transformers
pipeline_tag: text-generation
---

# 2xQwen2.5-Coder-3B-Satyr-Aux

This model is a fine-tuned version of **Qwen/Qwen2.5-Coder-3B** using Multi-LLM Group Relative Policy Optimization (MAGRPO) on HumanEval dataset.