xziayro

AI & ML interests

None yet

Recent Activity

liked a model about 13 hours ago

alibaba-pai/Z-Image-Fun-Lora-Distill

liked a Space 1 day ago

witcherderivia/TeleStyle

upvoted a paper 1 day ago

SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Organizations

upvoted 3 papers 1 day ago

upvoted a collection 1 day ago

MOVA

Collection

3 items • Updated about 6 hours ago • 8

upvoted a paper 2 days ago

Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation

Paper • 2506.08604 • Published Jun 10, 2025 • 1

upvoted a paper 5 days ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 9 days ago • 32

upvoted a paper 6 days ago

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published 6 days ago • 34

upvoted an article 7 days ago

Training Design for Text-to-Image Models: Lessons from Ablations

Article • Published 8 days ago

upvoted 3 papers 7 days ago

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25, 2024 • 24

Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis

Paper • 2602.03139 • Published 9 days ago • 41

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Paper • 2602.03796 • Published 8 days ago • 55

upvoted 2 papers 8 days ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published 11 days ago • 268

PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards

Paper • 2602.01624 • Published 10 days ago • 23

upvoted a paper 9 days ago

M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models

Paper • 2512.22877 • Published Dec 28, 2025 • 2

upvoted a paper 12 days ago

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion

Paper • 2601.22143 • Published 13 days ago • 6

upvoted a paper 13 days ago

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Paper • 2601.17950 • Published 17 days ago • 4

upvoted 2 papers 16 days ago

SAMTok: Representing Any Mask with Two Words

Paper • 2601.16093 • Published 20 days ago • 41

SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

Paper • 2601.16515 • Published 20 days ago • 15

upvoted 2 papers 17 days ago

VideoMaMa: Mask-Guided Video Matting via Generative Prior

Paper • 2601.14255 • Published 22 days ago • 15

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published 22 days ago • 74
