Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
31
3
Stas Bekman
stas
Follow
allaye's profile picture
mayank-mishra's profile picture
GoHugo's profile picture
117 followers
·
4 following
https://stasosphere.com/machine-learning/
StasBekman
stas00
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
updated
a model
21 days ago
stas/ml-engineering-book
updated
a model
21 days ago
stas/ml-engineering-book
posted
an
update
7 months ago
Do you want ArcticTraining at @SnowflakeDB to add an ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes? Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58 ArcticTraining is an open-source, easy to use post-training framework for NVIDIA GPUs built on top of DeepSpeed.
View all activity
Organizations
stas
's datasets
8
Sort: Recently updated
stas/openwebtext-synthetic-testing
Updated
Nov 14, 2023
•
23
•
4
stas/oscar-en-10k
Viewer
•
Updated
Oct 19, 2022
•
10k
•
300
•
2
stas/c4-en-10k
Viewer
•
Updated
Oct 19, 2022
•
10k
•
363
•
4
stas/general-pmd-synthetic-testing
Updated
Oct 18, 2022
•
26
stas/cm4-synthetic-testing
Updated
Oct 18, 2022
•
30
stas/openwebtext-10k
Updated
Sep 15, 2021
•
802
•
31
stas/wmt14-en-de-pre-processed
Viewer
•
Updated
Feb 16, 2021
•
4.55M
•
76
•
3
stas/wmt16-en-ro-pre-processed
Viewer
•
Updated
Feb 16, 2021
•
614k
•
40