Running on CPU Upgrade 1.19k 1.19k The Smol Training Playbook: The Secrets to Building World-Class LLMs ๐
view reply You don't really have to clone the repo. The FastAPI code is just there for demonstration, and you can code the way you like. The main takeaway is the Dockerfile.
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 โข 253
view article Article Everything You Need to Know about Knowledge Distillation By Kseniase and 1 other โข Mar 6 โข 50
Komodo: A Linguistic Expedition into Indonesia's Regional Languages Paper โข 2403.09362 โข Published Mar 14, 2024 โข 11
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper โข 2506.16035 โข Published Jun 19 โข 88
view article Article You could have designed state of the art positional encoding Nov 25, 2024 โข 387
view post Post 1704 Micrograd in pure C๐คPort of Karpathy's micrograd in pure C. Yo C does not negotiate with memory ๐Code: https://github.com/Jaykef/micrograd.c 2 replies ยท ๐ฅ 8 8 ๐ 3 3 + Reply
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 640