Nishith Jain
KingNish
AI & ML interests
AI is fun actually.
Recent Activity
liked
a model
1 day ago
microsoft/VibeVoice-Realtime-0.5B
upvoted
a
paper
1 day ago
PretrainZero: Reinforcement Active Pretraining
updated
a dataset
1 day ago
KingNish/multiturn_curated_dataset_test
Organizations
Reasoning
-
Large Language Models Think Too Fast To Explore Effectively
Paper • 2501.18009 • Published • 24 -
s1: Simple test-time scaling
Paper • 2501.19393 • Published • 124 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 22 -
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers
Paper • 2502.20545 • Published • 22
Top LLM
Collection of TOP Open Source LLM, Sort by Best on top
-
meta-llama/Llama-3.1-405B-Instruct
Text Generation • 406B • Updated • 110k • • 587 -
mistralai/Mistral-Large-Instruct-2407
123B • Updated • 9.03k • 852 -
meta-llama/Llama-3.1-70B-Instruct
Text Generation • 71B • Updated • 665k • • 873 -
Qwen/Qwen2-72B-Instruct
Text Generation • 73B • Updated • 22.4k • • 718
Instant Space
Contains spaces which gives lightning fast results compare to others.
-
PausedFeatured624
Instant Video
⚡624Fast Text 2 Video Generator
-
Runtime error466
Instant Image
🔥4664k Image from text in 5 second
-
Running on Zero614
Real-Time Text-to-Image SDXL Lightning
⚡614Real-Time Image Generation with SDXL Lightning
-
RunningFeatured294
JARVIS
🔥294Voice Chat with JARVIS
MTP
-
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding
Paper • 2402.05109 • Published • 2 -
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper • 2401.10774 • Published • 59 -
Better & Faster Large Language Models via Multi-token Prediction
Paper • 2404.19737 • Published • 80 -
Exploring the Latent Capacity of LLMs for One-Step Text Generation
Paper • 2505.21189 • Published • 61
Interesting
Top Mini LLM
Collection of top mini llms
-
KingNish/Qwen2.5-0.5b-Test-ft
Text Generation • 0.5B • Updated • 1.33k • 12 -
meta-llama/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 3.35M • • 1.19k -
Qwen/Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 5.11M • • 562 -
Qwen/Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 2.01M • 403
Paper-to-Read
-
Agent Workflow Memory
Paper • 2409.07429 • Published • 32 -
MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis
Paper • 2409.07129 • Published • 8 -
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance
Paper • 2409.04593 • Published • 26 -
Imagine yourself: Tuning-Free Personalized Image Generation
Paper • 2409.13346 • Published • 70
Reasoning in Latent Space
MTP
-
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding
Paper • 2402.05109 • Published • 2 -
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper • 2401.10774 • Published • 59 -
Better & Faster Large Language Models via Multi-token Prediction
Paper • 2404.19737 • Published • 80 -
Exploring the Latent Capacity of LLMs for One-Step Text Generation
Paper • 2505.21189 • Published • 61
Reasoning
-
Large Language Models Think Too Fast To Explore Effectively
Paper • 2501.18009 • Published • 24 -
s1: Simple test-time scaling
Paper • 2501.19393 • Published • 124 -
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 22 -
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers
Paper • 2502.20545 • Published • 22
Interesting
Top LLM
Collection of TOP Open Source LLM, Sort by Best on top
-
meta-llama/Llama-3.1-405B-Instruct
Text Generation • 406B • Updated • 110k • • 587 -
mistralai/Mistral-Large-Instruct-2407
123B • Updated • 9.03k • 852 -
meta-llama/Llama-3.1-70B-Instruct
Text Generation • 71B • Updated • 665k • • 873 -
Qwen/Qwen2-72B-Instruct
Text Generation • 73B • Updated • 22.4k • • 718
Top Mini LLM
Collection of top mini llms
-
KingNish/Qwen2.5-0.5b-Test-ft
Text Generation • 0.5B • Updated • 1.33k • 12 -
meta-llama/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 3.35M • • 1.19k -
Qwen/Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 5.11M • • 562 -
Qwen/Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 2.01M • 403
Instant Space
Contains spaces which gives lightning fast results compare to others.
-
PausedFeatured624
Instant Video
⚡624Fast Text 2 Video Generator
-
Runtime error466
Instant Image
🔥4664k Image from text in 5 second
-
Running on Zero614
Real-Time Text-to-Image SDXL Lightning
⚡614Real-Time Image Generation with SDXL Lightning
-
RunningFeatured294
JARVIS
🔥294Voice Chat with JARVIS
Paper-to-Read
-
Agent Workflow Memory
Paper • 2409.07429 • Published • 32 -
MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis
Paper • 2409.07129 • Published • 8 -
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance
Paper • 2409.04593 • Published • 26 -
Imagine yourself: Tuning-Free Personalized Image Generation
Paper • 2409.13346 • Published • 70