Community Blog & Articles

Community Articles

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Nvidia Agentic Smart Router on Dell Enterprise Hub : Deepdive on Architecture,Design and Framework

🚀 SyGra V2.0.0

Code a simple RAG from scratch

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

KV Caching Explained: Optimizing Transformer Inference Efficiency

We’re open-sourcing our text-to-image model and the process behind it

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Decoding Strategies in Large Language Models

Small Language Models (SLM): A Comprehensive Overview

From GRPO to DAPO and GSPO: What, Why, and How

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Text-to-image Architectural Experiments

The Optimal Architecture for Small Language Models

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

announcementtransformers.jstransformers

Transformers.js v4 Preview: Now Available on NPM!

February 9, 2026

Introducing SyGra Studio

February 5, 2026

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

February 4, 2026

evaluationleaderboardcommunity

Community Evals: Because we're done trusting black-box leaderboards over the community

+3

February 4, 2026

H Company's new Holo2 model takes the lead in UI Localization

February 3, 2026

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

February 3, 2026

Training Design for Text-to-Image Models: Lessons from Ablations

February 3, 2026

Introducing Daggr: Chain apps programmatically, inspect visually

+1

January 29, 2026

upskillagent-skillsagentic

We Got Claude to Build CUDA Kernels and teach open models!

January 28, 2026

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

January 27, 2026

Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs

January 27, 2026

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

January 27, 2026

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

January 21, 2026

One Year Since the “DeepSeek Moment”

January 20, 2026

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Nvidia Agentic Smart Router on Dell Enterprise Hub : Deepdive on Architecture,Design and Framework

🚀 SyGra V2.0.0

Code a simple RAG from scratch

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Announcing ReasoningLens — Visualizing and Diagnosing LLM Reasoning at a Glance

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

KV Caching Explained: Optimizing Transformer Inference Efficiency

We’re open-sourcing our text-to-image model and the process behind it

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Decoding Strategies in Large Language Models

Small Language Models (SLM): A Comprehensive Overview

From GRPO to DAPO and GSPO: What, Why, and How

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Text-to-image Architectural Experiments

The Optimal Architecture for Small Language Models

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

View all articles