Students distilled from a 7B Reinforcement-Learned Teacher (RLT) from the paper "Reinforcement Learning Teachers of Test Time Scaling."
Sakana AI • company • Verified
AI & ML interests
We are a Tokyo-based R&D company on a quest to create a new kind of foundational AI model based on nature-inspired intelligence.
Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
SakanaAI/TinySwallow-1.5B • Text Generation • 2B • 37.9k • 31
SakanaAI/TinySwallow-1.5B-Instruct • Text Generation • 2B • 2.65k • 53
SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC • Text Generation • 3
SakanaAI/TinySwallow-1.5B-Instruct-GGUF • Text Generation • 2B • 814 • 23
Spaces (6)
Llama 3 Karamaru V1 🐻 • Classical Japanese Chatbot • Running on Zero • 18
Evo-Ukiyoe 🐠 • Generate Japanese ukiyo-e style images from prompts • Running on Zero • 43
Llama-3-EvoVLM-JP-v2 🐠 • Chat with images and text using a Japanese visual language model • Running on Zero • 18
Evo-Nishikie 🐠 • Generate painted Japanese woodblock prints from line art and prompts • Runtime error • 17
EvoSDXL JP 🐠 • Generate Japanese-themed images from text prompts • Runtime error • 53
EvoVLM JP 🐠 • Ask questions about images to get descriptions and answers • Running on Zero • 64
Models (24)
SakanaAI/RLT-7B • Text Generation • 8B • 371 • 13
SakanaAI/RLT-32B • Text Generation • 33B • 25 • 3
SakanaAI/Metom • 0.0B • 3k • 4
SakanaAI/text-to-lora • 8
SakanaAI/Llama-3-Karamaru-v1 • Text Generation • 8B • 1.37k • 16
SakanaAI/TAID-VLM-2B • Visual Question Answering • 2B • 23 • 5
SakanaAI/TAID-LLM-1.5B • Text Generation • 2B • 5 • 6
SakanaAI/TinySwallow-1.5B-Instruct-GGUF • Text Generation • 2B • 814 • 23
SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC • Text Generation • 3
SakanaAI/TinySwallow-1.5B-Instruct • Text Generation • 2B • 2.65k • 53
Datasets (11)
SakanaAI/TransEvalnia • Viewer • 3.84k • 205 • 3
SakanaAI/ALE-Bench • 1.96k • 9
SakanaAI/EDINET-Bench • Viewer • 2.59k • 639 • 7
SakanaAI/Sudoku-Bench • Viewer • 2.77k • 706 • 6
SakanaAI/Sudoku-CTC-Reasoning • Viewer • 1.81k • 53 • 4
SakanaAI/AI-CUDA-Engineer-Archive • Viewer • 30.6k • 1.93k • 155
SakanaAI/ChouBun • Viewer • 770 • 53 • 9
SakanaAI/JA-Multi-Image-VQA • Viewer • 55 • 130 • 10
SakanaAI/JA-VG-VQA-500 • Viewer • 1.5k • 359 • 15
SakanaAI/gsm8k-ja-test_250-1319 • Viewer • 1.07k • 186 • 4