Story-LLM (TinyStories Transformer)


Story-LLM is a small GPT-style Transformer language model trained from scratch on the TinyStories dataset.

This repository contains pretrained weights and a tokenizer.
The model uses a custom PyTorch architecture and a ByteLevel BPE tokenizer.

For full documentation, training code, and usage instructions, see the GitHub repository:
https://github.com/Daniel-Gia/Story-LLM


Notes

  • ~29M parameters
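
To give a feel for where a figure like ~29M can come from, here is a rough sketch of how parameters add up in a GPT-2-style decoder. It assumes learned positional embeddings, a 4x MLP expansion, biases on all linear layers, and an output head tied to the token embedding. None of these choices, and none of the hyperparameter values used below, are taken from this repository; they are hypothetical and only meant to land in the same order of magnitude.

```python
def estimate_gpt_params(vocab_size, d_model, n_layers, context_len, tied_head=True):
    """Rough parameter count for a GPT-2-style decoder-only Transformer.

    Assumptions (not confirmed for Story-LLM): learned positional
    embeddings, 4x MLP expansion, biases everywhere, LayerNorm with
    scale and shift.
    """
    token_emb = vocab_size * d_model
    pos_emb = context_len * d_model                # learned absolute positions
    attn = 4 * d_model * d_model + 4 * d_model     # Q, K, V, output projections + biases
    mlp = 8 * d_model * d_model + 5 * d_model      # d -> 4d -> d, with biases
    norms = 4 * d_model                            # two LayerNorms per block
    block = attn + mlp + norms
    final_norm = 2 * d_model
    lm_head = 0 if tied_head else vocab_size * d_model
    return token_emb + pos_emb + n_layers * block + final_norm + lm_head


if __name__ == "__main__":
    # Hypothetical configuration, for illustration only.
    total = estimate_gpt_params(vocab_size=8192, d_model=512, n_layers=8, context_len=512)
    print(f"{total / 1e6:.1f}M parameters")
```

For the actual architecture and hyperparameters, refer to the training code in the GitHub repository.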