r PRO
oceansweep
AI & ML interests
None yet
Recent Activity
liked
a model
about 23 hours ago
Tesslate/UIGEN-X-32B-0727
liked
a model
3 days ago
zai-org/GLM-4.1V-9B-Thinking
liked
a model
5 days ago
nvidia/audio-flamingo-3-chat
Organizations
None yet
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation • 104B • Updated • 3.17k • • 1.75k -
microsoft/Phi-3-medium-128k-instruct
Text Generation • 14B • Updated • 10.3k • 382 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B • Updated • 1.33k • 71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation • 8B • Updated • 4.01k • 259
TTS
Music_Gen
Personal-Projects
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 20 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 66 -
CRAG -- Comprehensive RAG Benchmark
Paper • 2406.04744 • Published • 49 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 45
Parametric-Compression
Modeling-Martial-Artists
GGUF-related
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering • 3B • Updated • 4.23k • 474 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 798 • 28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 20.8k • 608 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 94
LLM-Models
Datasweep
Papers
MAMBA-Models
Training-related
Coding
GGUF-related
LLMs-Using
-
CohereLabs/c4ai-command-r-plus
Text Generation • 104B • Updated • 3.17k • • 1.75k -
microsoft/Phi-3-medium-128k-instruct
Text Generation • 14B • Updated • 10.3k • 382 -
crusoeai/Llama-3-8B-Instruct-Gradient-1048k-GGUF
8B • Updated • 1.33k • 71 -
gradientai/Llama-3-8B-Instruct-262k
Text Generation • 8B • Updated • 4.01k • 259
VLMs
-
openbmb/MiniCPM-V-2
Visual Question Answering • 3B • Updated • 4.23k • 474 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 798 • 28 -
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 20.8k • 608 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 94
TTS
LLM-Models
Music_Gen
Datasweep
Personal-Projects
Papers
Relevant-Papers-Midterm
-
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper • 2402.14848 • Published • 20 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 66 -
CRAG -- Comprehensive RAG Benchmark
Paper • 2406.04744 • Published • 49 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 45
MAMBA-Models
Parametric-Compression
Training-related
Modeling-Martial-Artists