1 28 6

Dotanoob7

Dotanoob

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

HuggingFaceFW/blogpost-fineweb-v1

liked a Space about 2 months ago

nanotron/ultrascale-playbook

upvoted a paper 2 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

View all activity

Organizations

None yet

liked 2 Spaces about 2 months ago

1.14k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality text data for LLMs using FineWeb

3.4k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 5 papers 2 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 188

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 255

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 202

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 259

DINOv3

Paper • 2508.10104 • Published Aug 13 • 274

upvoted a collection 2 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 101

upvoted an article 4 months ago

Article

Upskill your LLMs with Gradio MCP Servers

Jul 9

• 20

liked a model 4 months ago

black-forest-labs/FLUX.1-Kontext-dev

Image-to-Image • Updated Jun 27 • 260k • • 2.39k

liked a Space 4 months ago

291

GPU Poor LLM Arena

🏆

Compact LLM Battle Arena: Frugal AI Face-Off!

upvoted a paper 4 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 237

upvoted 2 papers 6 months ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7 • 28

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 96

upvoted 2 papers 7 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 76

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

liked a model 7 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27 • 286k • • 3.07k

upvoted 3 papers 9 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 90

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 421

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 52