Blog, Articles, and Discussions

Introducing RTEB: A New Standard for Retrieval Evaluation

October 1, 2025 • guest • 47

Community Articles


When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance

and 1 other •

2 days ago

• 10

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

about 19 hours ago

• 10

Code a simple RAG from scratch

•

Oct 29, 2024

• 207

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

2 days ago

• 9

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

3 days ago

• 9

Nemotron-Personas-Japan: A Synthetic Dataset for Sovereign AI

and 6 others •

6 days ago

• 7

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 684

Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi

and 1 other •

7 days ago

• 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 72

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

22 days ago

• 102

How to Train an Antibody Developability Model

and 1 other •

15 days ago

• 14

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 176

Practical arXiv Tips: How to Get Your Paper More Attention

•

Jul 8, 2024

• 14

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 99

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 225

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 75

The Reformer - Pushing the limits of language modeling

July 3, 2020 • 3

How to generate text: using different decoding methods for language generation with Transformers

March 1, 2020 • 251

How to train a new language model from scratch using Transformers and Tokenizers

February 14, 2020 • 48

Community Articles

There is no such thing as a tokenizer-free lunch

•

7 days ago

• 68

Model Quality: Hugging Face Is All You Need

•

6 days ago

• 19

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

8 days ago

• 24

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

11 days ago

• 46
