7 39 78

Leon Tsou

xxrjun

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-R1

liked a dataset about 1 month ago

GPUMODE/KernelBook

liked a Space 5 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

liked a model 5 days ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 951k • • 12.5k

liked a dataset about 1 month ago

GPUMODE/KernelBook

Viewer • Updated Jun 25 • 18.2k • 948 • 29

liked a Space 5 months ago

2.85k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 5 months ago

MediaTek-Research/BreezyVoice

Updated Feb 18 • 47

published a model 5 months ago

xxrjun/Llama-Breeze2-8B-Instruct

Updated Feb 16

upvoted a collection 5 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 19 items • Updated Apr 20 • 92

liked a model 5 months ago

MediaTek-Research/Llama-Breeze2-8B-Instruct

8B • Updated Mar 2 • 1.48k • 44

liked a model 6 months ago

fishaudio/fish-speech-1.5

Text-to-Speech • Updated Mar 25 • 2.18k • • 605

upvoted 2 collections 6 months ago

Taiwan LLM

Collection

Try out at twllm.com ! • 28 items • Updated Feb 22 • 45

DeepSeek-R1

Collection

10 items • Updated May 29 • 767

liked a model 6 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • 71B • Updated Feb 24 • 141k • • 708

liked a Space 9 months ago

960

Model Memory Utility

🚀

Calculate memory usage for training models

Leon Tsou

AI & ML interests

Recent Activity

Organizations

xxrjun's activity

The Ultra-Scale Playbook

Model Memory Utility