Model Card for Model ID

This is a GPT-2 model trained in llm.c, for 32K steps (of 1M batch size) on FineWeb-EDU.

A lot more detailed information is here: https://github.com/karpathy/llm.c/discussions/677

Bias, Risks, and Limitations

Eagerly generates disinformation about English-speaking unicorns in the Andes mountains.

Downloads last month
4
Safetensors
Model size
2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for karpathy/gpt2_1558M_final2_hf

Quantizations
2 models

Space using karpathy/gpt2_1558M_final2_hf 1