tanaymehta
AI & ML interests
Large Language Models
Organizations
tanaymehta/gpt2_900K_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_800K_100eps
Text Generation
•
0.1B
•
Updated
•
6
tanaymehta/gpt2_700K_100eps
Text Generation
•
0.1B
•
Updated
•
5
tanaymehta/gpt2_600K_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_500K_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_400K_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_300K_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_200K_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_100K_100eps
Text Generation
•
0.1B
•
Updated
•
5
tanaymehta/gpt2_1M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_1_1M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_1_2M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_1_3M_100eps
Text Generation
•
0.1B
•
Updated
•
2
tanaymehta/gpt2_1_4M_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_1_5M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_1_6M_100eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_1_7M_100eps
Text Generation
•
0.1B
•
Updated
•
5
tanaymehta/gpt2_1_8M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_1_9M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_2M_100eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_2H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_4H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_6H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_8H_12L_1M_tok_50_eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_12H_12L_5M_tok_100_eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_12H_12L_10M_tok_100_eps
Text Generation
•
0.1B
•
Updated
•
3
tanaymehta/gpt2_12H_12L_15M_tok_100_eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_12H_2L_75M_tok_20_eps
Text Generation
•
0.1B
•
Updated
•
5
tanaymehta/gpt2_12H_4L_75M_tok_20_eps
Text Generation
•
0.1B
•
Updated
•
4
tanaymehta/gpt2_12H_6L_75M_tok_20_eps
Text Generation
•
0.1B
•
Updated
•
6