Collections

Discover the best community collections!

Collections including paper arxiv:2402.17764

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 14
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 61
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 49

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Paper • 2310.04406 • Published Oct 6, 2023 • 10
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 110
ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

Paper • 2402.09320 • Published Feb 14, 2024 • 6
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 118

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623

HiDream-ai/HiDream-I1-Full

Text-to-Image • Updated 14 days ago • 249k • 949
nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 7.46k • 547
DeepSite v2 🐳

Running • 11.2k • Generate any application with DeepSeek
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623

a collection of articles summarized and synthesized

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623
Raindrops IO

Collection

https://webmaker.app/create • 0 items • Updated 1 day ago • 1

Papers that made me go wow!

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623

Rewnozom/agent-zero-v1-a-01

Text Generation • 4B • Updated Jan 18 • 3 • 1
TheBloke/MythoMax-L2-13B-GGUF

13B • Updated Sep 27, 2023 • 119k • 165
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF

Text Generation • 18B • Updated 3 days ago • 94.9k • 289
QuantFactory/DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored-GGUF

Text Generation • 8B • Updated Jul 29, 2024 • 19.7k • 100

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 372
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 623
meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 728k • 1.03k
keras-io/GauGAN-Image-generation

Updated Jul 5, 2024 • 6 • 4

🔥BitNet family of large language models (1-bit LLMs).

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 5.41k • 1.14k
microsoft/bitnet-b1.58-2B-4T-bf16

Text Generation • 2B • Updated May 1 • 3.44k • 33
microsoft/bitnet-b1.58-2B-4T-gguf

Text Generation • 2B • Updated May 1 • 4.67k • 188
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 74

