OctoThinker

community

https://github.com/GAIR-NLP/OctoThinker

GAIR-NLP

Organization Card

🐙 OctoThinker is led by GAIR

🎯 Our Goal: To reshape the pre-training trajectory so models scale better under RL.

Check our technical report for more details!

Collections 4

models 26

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B

Text Generation • Updated Jul 7

datasets 1

OctoThinker/MegaMath-Web-Pro-Max

Updated Jul 6 • 69.2M rows • 23.9k downloads • 36 likes

Collections 4

OctoThinker/Llama_32_3B_finemath_4p_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B

OctoThinker/OctoThinker-8B-Long-Base

OctoThinker/OctoThinker-8B-Hybrid-Base

OctoThinker/OctoThinker-8B-Short-Base

models 26

OctoThinker/OctoThinker-3B-Hybrid-Zero

OctoThinker/OctoThinker-3B-Hybrid-Base

OctoThinker/OctoThinker-3B-Short-Zero

OctoThinker/OctoThinker-3B-Short-Base

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_100B

Team members 4
