Ankit

Ajax0564

Ajax0564

AI & ML interests

NLP

Recent Activity

upvoted an article 8 days ago

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

upvoted a paper about 1 month ago

AToken: A Unified Tokenizer for Vision

upvoted a paper about 2 months ago

SAIL-VL2 Technical Report

View all activity

Organizations

None yet

upvoted an article 8 days ago

Article

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

and 2 others •

10 days ago

• 55

upvoted a paper about 1 month ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36

upvoted a paper about 2 months ago

SAIL-VL2 Technical Report

Paper • 2509.14033 • Published Sep 17 • 44

upvoted an article 3 months ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

Aug 8

• 76

upvoted 2 papers 3 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 123

upvoted an article 3 months ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

Jun 26

• 48

upvoted 3 papers 4 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 130

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29 • 62

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 88

upvoted an article 5 months ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

Jun 12

• 148

upvoted 2 papers 5 months ago

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published May 22 • 34

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 96

upvoted a paper 6 months ago

Multi-Token Prediction Needs Registers

Paper • 2505.10518 • Published May 15 • 14

upvoted 3 articles 7 months ago

Article

The NLP Course is becoming the LLM Course!

Apr 3

• 100

Article

Open R1: How to use OlympicCoder locally for coding?

Mar 20

• 63

Article

Open-Source Handwritten Signature Detection Model

•

Mar 14

• 119

upvoted an article 8 months ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 187

liked a Space 9 months ago

3.4k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 9 months ago

Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

•

Feb 4

• 16

Ankit

AI & ML interests

Recent Activity

Organizations

Ajax0564's activity

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

The NLP Course is becoming the LLM Course!

Open R1: How to use OlympicCoder locally for coding?

Open-Source Handwritten Signature Detection Model

SigLIP 2: A better multilingual vision language encoder

The Ultra-Scale Playbook

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning