Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
90.0
TFLOPS
112
16
251
s3nh
s3nh
Follow
svjack's profile picture
sooxen's profile picture
Malum0x's profile picture
247 followers
·
93 following
s3nhs3nh
s3nh
AI & ML interests
Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh
Recent Activity
reacted
to
Severian
's
post
with 👍
about 9 hours ago
MLX port of BDH (Baby Dragon Hatchling) is up! I’ve ported the BDH ( https://github.com/pathwaycom/bdh ) model to MLX for Apple Silicon. It’s a faithful conversion of the PyTorch version: same math, same architecture (byte-level vocab, shared weights across layers, ReLU sparsity, RoPE attention with Q=K), with MLX-friendly APIs and a detailed README explaining the few API-level differences and why results are equivalent. Code, docs, and training script are ready to use. You may need to adjust the training script a bit to fit your own custom dataset. Only tested on M4 so far, but should work perfect for any M1/M2/M3 users out there. I’m currently training this MLX build on my Internal Knowledge Map (IKM) dataset https://huggingface.co/datasets/Severian/Internal-Knowledge-Map Training’s underway; expect a day or so before I publish weights. When it’s done, I’ll upload the checkpoint to Hugging Face for anyone to test. Repo: https://github.com/severian42/BDH-MLX HF model (coming soon): https://huggingface.co/Severian/BDH-MLX If you try it on your own data, feedback and PRs are welcome.
reacted
to
mitkox
's
post
with 🚀
about 9 hours ago
Hermes4 70B synthetic dataset generation on my desktop Z8 GPU rig: 307 tok/sec 1.1M tok/hour The bottleneck for generating massive, high-quality reinforcement learning datasets is never the GPU compute; it's always the model's willingness to actually answer the darn question.
liked
a model
about 12 hours ago
Jackmin108/glm-0.5B
View all activity
Organizations
s3nh
's models
438
Sort: Recently updated
s3nh/s3nh-Sonya-Panda-7B-slerp-GGUF
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
6
•
1
s3nh/Sonya-Panda-7B-slerp
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
3
•
1
s3nh/Noromaid-Panda-Mistral-7B-slerp
Text Classification
•
7B
•
Updated
Jan 8, 2024
•
2
•
1
s3nh/Spanicin-Fulcrum-7B-slerp-GGUF
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
15
s3nh/whiterabbitneo-WhiteRabbitNeo-13B-GGUF
Text Generation
•
13B
•
Updated
Jan 8, 2024
•
15
s3nh/Fredithefish-CanarY-GGUF
Text Generation
•
13B
•
Updated
Jan 8, 2024
•
63
s3nh/s3nh-Noromaid-Panda-7B-GGUF
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
7
•
3
s3nh/Noromaid-Panda-7B
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
2
•
1
s3nh/nsfw-noromaid-mistral-instruct
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
5
•
2
s3nh/nsfw-noromaid-mistral
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
4
•
3
s3nh/nsfw-noromaid-zephyr
Text Generation
•
7B
•
Updated
Jan 8, 2024
•
1
s3nh/Overthinker-Eileithyia-13B
Text Generation
•
10B
•
Updated
Jan 7, 2024
s3nh/s3nh-Overthinker-Eileithyia-13B-GGUF
Updated
Jan 7, 2024
s3nh/Sao10K-Winterreise-m7-GGUF
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
7
s3nh/Sao10K-Stheno-L2-13B-GGUF
Text Generation
•
13B
•
Updated
Jan 7, 2024
•
11
•
1
s3nh/Henk717-spring-dragon-GGUF
Text Generation
•
13B
•
Updated
Jan 7, 2024
•
7
s3nh/elonmollusk-neuralogix-openhermes-v2-GGUF
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
6
•
1
s3nh/BEE-spoke-data-TinyLlama-3T-1.1bee-GGUF
Text Generation
•
Updated
Jan 7, 2024
s3nh/Sao10K-Sensualize-Solar-10.7B-GGUF
Text Generation
•
11B
•
Updated
Jan 7, 2024
•
7
s3nh/Edentns-DataVortexM-7B-Instruct-v0.1-GGUF
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
7
s3nh/s3nh-nsfw-noromaid-zephyr-GGUF
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
88
•
6
s3nh/TencentARC-LLaMA-Pro-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
Jan 7, 2024
•
8
s3nh/sethuiyer-Dr_Samantha_7b_mistral-GGUF
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
38
s3nh/s3nh-Medicine-Noromaid-13b-GGUF
Text Generation
•
10B
•
Updated
Jan 6, 2024
•
14
•
1
s3nh/s3nh-Law-Noromaid-13b-GGUF
Text Generation
•
10B
•
Updated
Jan 6, 2024
•
13
•
2
s3nh/s3nh-Finance-Noromaid-13b-GGUF
Text Generation
•
10B
•
Updated
Jan 6, 2024
•
12
•
1
s3nh/Finance-Noromaid-13b
Text Generation
•
10B
•
Updated
Jan 6, 2024
s3nh/jondurbin-bagel-1.1b-v0.3-GGUF
Text Generation
•
Updated
Jan 6, 2024
s3nh/Medicine-Noromaid-13b
Text Generation
•
10B
•
Updated
Jan 6, 2024
s3nh/Masterjp123-Clover3-13B-GGUF
Text Generation
•
13B
•
Updated
Jan 6, 2024
•
7
Previous
1
...
3
4
5
6
7
...
15
Next