zhang
AI & ML interests
Recent Activity
Organizations
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running607607
First Agent Template
β‘Get current time in any timezone
-
Runtime error127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running139139
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
Running on Zero1.54k1.54k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 6 β’ 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 252 β’ 2
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.21k β’ 11 -
Running on Zero2.65k2.65k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Paused2.19k2.19k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4653653
OpenAudio S1
πGenerate speech from text
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 11.3k β’ 703 -
Runtime error8282
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP377377
Multimodal OCR
πnanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP136136
Multimodal OCR2
π»nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.58k1.58k
Flux.1-dev Upscaler
πUpscale low-resolution images to high resolution
-
Running on Zero435435
InvSR
πImage Super-resolution via Diffusion Inversion
-
Paused242242
FLUX Upsacle Image
π₯Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 508 -
Running on Zero918918
OminiControl
πGenerate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
πmcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.16k2.16k
MagicQuill
πͺΆGenerate edited images using scribble inputs
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running607607
First Agent Template
β‘Get current time in any timezone
-
Runtime error127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running139139
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 11.3k β’ 703 -
Runtime error8282
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP377377
Multimodal OCR
πnanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP136136
Multimodal OCR2
π»nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.54k1.54k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 6 β’ 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 252 β’ 2
-
Running on Zero1.58k1.58k
Flux.1-dev Upscaler
πUpscale low-resolution images to high resolution
-
Running on Zero435435
InvSR
πImage Super-resolution via Diffusion Inversion
-
Paused242242
FLUX Upsacle Image
π₯Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 508 -
Running on Zero918918
OminiControl
πGenerate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
πmcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.16k2.16k
MagicQuill
πͺΆGenerate edited images using scribble inputs
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.21k β’ 11 -
Running on Zero2.65k2.65k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Paused2.19k2.19k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4653653
OpenAudio S1
πGenerate speech from text