Commit History

Upload optimizer/optimizer_pp-0-of-1_dp-14-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
f94c544
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-13-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
f98104c
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-12-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
3c70ee1
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-11-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
b9918c3
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-10-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
a60fd73
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-0-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
034f584
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-1-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
ef8354b
verified

ridger commited on

Upload model/model/token_position_embeddings/pp_block/token_embedding/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
4adcffa
verified

ridger commited on

Upload optimizer/optimizer_config.json with huggingface_hub
8fc296b
verified

ridger commited on

Upload model/model/final_layer_norm/pp_block/model_weight.safetensors with huggingface_hub
bf7eda1
verified

ridger commited on

Upload model/model/decoder/9/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
a427c76
verified

ridger commited on

Upload model/model/decoder/9/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
2ae9ebc
verified

ridger commited on

Upload model/model/decoder/9/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
86c3160
verified

ridger commited on

Upload model/model/decoder/9/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
0589622
verified

ridger commited on

Upload model/model/decoder/9/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
efeb7c0
verified

ridger commited on

Upload model/model/decoder/9/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
1108e3e
verified

ridger commited on

Upload model/model/decoder/8/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
44e81b3
verified

ridger commited on

Upload model/model/decoder/8/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
7cc2165
verified

ridger commited on

Upload model/model/decoder/8/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
fcba70d
verified

ridger commited on

Upload model/model/decoder/8/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
86e3971
verified

ridger commited on

Upload model/model/decoder/8/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
fc60ae5
verified

ridger commited on

Upload model/model/decoder/8/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
6ddfd43
verified

ridger commited on

Upload model/model/decoder/7/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
2713544
verified

ridger commited on

Upload model/model/decoder/7/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
7e2fb6f
verified

ridger commited on

Upload model/model/decoder/7/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
988b00c
verified

ridger commited on

Upload model/model/decoder/7/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
4a037b6
verified

ridger commited on

Upload model/model/decoder/6/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
a2f136e
verified

ridger commited on

Upload model/model/decoder/6/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
19b7db1
verified

ridger commited on

Upload model/model/decoder/5/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
6ac464c
verified

ridger commited on

Upload model/model/decoder/13/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
1ca8faa
verified

ridger commited on

Upload optimizer/optimizer_pp-0-of-1_dp-16-of-64_tp-0-of-1_exp-0-of-1.pt with huggingface_hub
3ae3cb3
verified

ridger commited on

Upload model/model/decoder/7/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
5fec119
verified

ridger commited on

Upload model/model/decoder/7/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
abbd2e4
verified

ridger commited on

Upload model/model/decoder/6/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
bb2590f
verified

ridger commited on

Upload model/model/decoder/6/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
35a8d19
verified

ridger commited on

Upload model/model/decoder/4/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
0a1029a
verified

ridger commited on

Upload model/model/decoder/5/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
8cbe845
verified

ridger commited on

Upload model/model/decoder/6/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
fba769d
verified

ridger commited on

Upload model/model/decoder/6/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
277d4d5
verified

ridger commited on

Upload model/model/decoder/3/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
083e4c8
verified

ridger commited on

Upload model/model/decoder/5/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
c98bfba
verified

ridger commited on

Upload model/model/decoder/5/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
6f8b34f
verified

ridger commited on

Upload model/model/decoder/4/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
3b73276
verified

ridger commited on

Upload model/model/decoder/5/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
7a16cb4
verified

ridger commited on

Upload model/model/decoder/5/pp_block/attn/o_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
5bd4845
verified

ridger commited on

Upload model/model/decoder/23/pp_block/mlp/gate_up_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
5a38069
verified

ridger commited on

Upload model/model/decoder/4/pp_block/post_attention_layernorm/model_weight.safetensors with huggingface_hub
b270702
verified

ridger commited on

Upload model/model/decoder/4/pp_block/attn/qkv_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
c00ec75
verified

ridger commited on

Upload model/model/decoder/3/pp_block/mlp/down_proj/model_weight_pp-rank-0-of-1_tp-rank-0-of-1.safetensors with huggingface_hub
090dd88
verified

ridger commited on

Upload model/model/decoder/4/pp_block/input_layernorm/model_weight.safetensors with huggingface_hub
2d7807c
verified

ridger commited on