Span Spek's picture

Span Spek

spanspek

·

AI & ML interests

None yet

Recent Activity

new activity about 12 hours ago

nvidia/Nemotron-Cascade-2-30B-A3B:Tool calling ability

liked a model about 18 hours ago

nvidia/Nemotron-Cascade-2-30B-A3B

new activity about 18 hours ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16:Request: Casade post-train Nemotron-3-Super

View all activity

Organizations

None yet

New activity in nvidia/Nemotron-Cascade-2-30B-A3B about 12 hours ago

Tool calling ability

#3 opened about 12 hours ago by

New activity in nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 about 18 hours ago

Request: Casade post-train Nemotron-3-Super

#21 opened 3 days ago by

New activity in Tesslate/OmniCoder-9B-GGUF 4 days ago

Error on LMStudio.

#1 opened 8 days ago by

New activity in Qwen/Qwen3.5-35B-A3B-GPTQ-Int4 17 days ago

What impact has quantization had on model performance / ability?

#4 opened 17 days ago by

New activity in LiquidAI/LFM2-24B-A2B 19 days ago

Actual use cases for this model

#8 opened 21 days ago by

New activity in unsloth/Qwen3.5-4B-GGUF 19 days ago

waiting

#1 opened 19 days ago by

New activity in noctrex/LFM2-24B-A2B-MXFP4_MOE-GGUF 21 days ago

Model performance

#1 opened 21 days ago by

New activity in nvidia/Nemotron-Cascade-14B-Thinking 27 days ago

Excellent model

#3 opened 27 days ago by

New activity in Nanbeige/Nanbeige4.1-3B about 1 month ago

Insane performance

#4 opened about 1 month ago by

New activity in lovedheart/Qwen3-Coder-Next-REAP-40B-A3B-GGUF about 1 month ago

Information about the dataset is needed

#1 opened about 1 month ago by

New activity in Qwen/Qwen3-Coder-Next about 1 month ago

FP-8 version please 🥺

#7 opened about 1 month ago by

New activity in moonshotai/Kimi-K2.5 about 2 months ago

What quantization is the base model?

#8 opened about 2 months ago by

New activity in nvidia/Nemotron-Cascade-14B-Thinking about 2 months ago

Extremely misleading benchmarks

#1 opened 3 months ago by

New activity in noctrex/LightOnOCR-2-1B-ocr-soup-GGUF about 2 months ago

Phenomenal model

#1 opened about 2 months ago by

New activity in ngxson/GLM-4.7-Flash-GGUF about 2 months ago

Deepseek architecture?

#1 opened 2 months ago by

New activity in noctrex/Nemotron-Cascade-14B-Thinking-MXFP4-GGUF about 2 months ago

Feedback from testing

#1 opened about 2 months ago by

New activity in noctrex/GLM-4.7-Flash-MXFP4_MOE-GGUF about 2 months ago

Feedback from running in LM Studio 0.39.3 with v1.103.2 of llama.cpp

#1 opened about 2 months ago by

New activity in nvidia/Qwen2.5-CascadeRL-RM-72B about 2 months ago

Possible use-cases?

#1 opened about 2 months ago by

New activity in lmstudio-community/GLM-4.7-Flash-GGUF about 2 months ago

A few observations - Memory estimation & thinking is getting stuck in loops

#1 opened about 2 months ago by

New activity in unsloth/GLM-4.7-Flash-GGUF about 2 months ago

DeepSeek architecture?

#2 opened about 2 months ago by