gigant
AI & ML interests: multimodal
0.2B • Updated • 5
gigant/SmolLM-mc4-500-rawrope • Text Generation • 0.1B • Updated • 2
gigant/SmolLM-mc4-500-ropescaled • Text Generation • 0.1B • Updated • 2
gigant/SmolLM-500-rawrope • Text Generation • 0.1B • Updated • 4
gigant/SmolLM-500-ropescaled • Text Generation • 0.1B • Updated • 3
gigant/SmolLM-135M-ft-500-steps • Text Generation • 0.1B • Updated • 3
gigant/SmolLM-135M-rescaled-ft-500-steps • Text Generation • 0.1B • Updated • 4
gigant/SmolLM-135M-unjetlagged-200-steps • Updated
gigant/SmolLM-135M-full-unjetlagged-2000-steps • Updated
gigant/SmolLM-135M-full-jetlagged-200-steps • Text Generation • 0.1B • Updated • 2
gigant/SmolLM-135M-full-unjetlagged-200-steps • Text Generation • 0.1B • Updated • 2
gigant/SmolLM-135M-jetlagged-200-steps • Updated
gigant/SmolLM-135M-unjetlagged • Updated
gigant/SmolLM-135M-scaled-rope-sw • Text Generation • 0.1B • Updated • 2
77M • Updated • 3
gigant/graphlongt5-structural-dependency-0408
gigant/graphlongt5-structural-0324
gigant/graphlongt5-dependency-0322
gigant/graphlongt5-globallocal-0322
gigant/graphlongt5-structural-0320
gigant/graphlongt5-dependency-0308 • Updated
gigant/graphlongt5-globallocal-0308
gigant/graphlongt5-globallocal-0228
gigant/graphlongt5-dependency-0228
gigant/longt5-global-3epoch
gigant/graph-t5-global-window-8k-longt5local
gigant/graph-t5-global-window-8k-tib