Inference Providers
Active filters: sparse
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 76
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B • Updated • 30
• 1
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
3B • Updated • 13
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
7B • Updated • 98
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
0.1B • Updated • 71
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B • Updated • 62
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
7B • Updated • 87
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_70-GGUF
7B • Updated • 14
mradermacher/Llama-2-7b-dolphin-open_platypus-pruned_50-GGUF
7B • Updated • 39
mradermacher/Nous-Hermes-2-Yi-34B-pruned2.4-GGUF
34B • Updated • 28
mradermacher/Nous-Hermes-2-Yi-34B-pruned50-GGUF
34B • Updated • 32
ibm-granite/granite-embedding-30m-sparse
Feature Extraction
• 30.3M • Updated • 66.7k
• • 25
opensearch-project/opensearch-neural-sparse-encoding-multilingual-v1
Feature Extraction
• 0.2B • Updated • 12.2k
• • 17
mradermacher/opensearch-neural-sparse-encoding-doc-v2-mini-GGUF
22.6M • Updated • 120
mradermacher/SparseLlama-3-8B-pruned_50.2of4-GGUF
8B • Updated • 45
• 1
opensearch-project/opensearch-neural-sparse-encoding-doc-v3-distill
Feature Extraction
• 67M • Updated • 5.64k
• • 10
tjingrant/sparsellm-1b-40p
1B • Updated • 4
tjingrant/sparsellm-1b-60p-small-dense
0.7B • Updated • 13
tjingrant/sparsellm-1b-80p
1B • Updated • 2
tjingrant/sparsellm-1b-60p
1B • Updated • 1
tjingrant/sparsellm-1b-20p
1B • Updated • 4
tjingrant/sparsellm-1b-80p-small-dense
0.5B • Updated • 1
tjingrant/sparsellm-1b-40p-small-dense
0.9B • Updated • 14
tjingrant/sparsellm-1b-20p-small-dense
1B • Updated • 1
tensorblock/RedHatAI_llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 5
sparse-encoder-testing/splade-bert-tiny-nq
Feature Extraction
• 4.42M • Updated • 33k
tomaarsen/inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus
Feature Extraction
• Updated • 2
tomaarsen/inference-free-splade-bert-tiny-nq-3e-6-lambda-corpus
Feature Extraction
• Updated • 1