arcee-train/evolkit-openhermes-100k
arcee-train/shamane-9-12-untrained-merge • Text Generation • Updated • 3
arcee-train/untrained-merged-random-coeffs • Text Generation • Updated • 5
arcee-train/pplist-merged-untrained-with-base-layernorm-embedding • Text Generation • Updated • 3
arcee-train/DAM_dataset_size_256 • 7B • Updated • 2
arcee-train/DAM_dataset_size_64 • 7B • Updated • 1
arcee-train/DAM_ablation_sim_L1_L2 • 7B • Updated • 2
arcee-train/DAM_ablation_KL_sim • 7B • Updated • 4
arcee-train/DAM_ablation_KL_L1_L2 • 7B • Updated • 2
arcee-train/pplist-merged-untrained-linear-only-no-base • Text Generation • Updated • 2
arcee-train/default_settings
arcee-train/pplist-merged-untrained-with-base • Text Generation • Updated • 3
arcee-train/Llama-3.1-6B-Instruct-width-MLP-v0 • Text Generation • 6B • Updated • 2
arcee-train/Abel-7B-002-truncated-embeds • Text Generation • 7B • Updated • 5
arcee-train/Meta-Llama-3.1-405B-Instruct-bnb-4bit • Text Generation • 213B • Updated • 2
arcee-train/Meta-Llama-3.1-405B-Instruct-bnb-8bit • Text Generation • 410B • Updated • 2
arcee-train/MixSmolLM-8x1.7B • Text Generation • 10B • Updated • 2
arcee-train/spicy-qwen-v0.1 • Text Generation • Updated • 2