93 114 46

YSH

BestWishYsh

https://shyuanbest.github.io/

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

BestWishYsh/OpenS2V-Eval:Update app.py

updated a Space 3 days ago

BestWishYsh/OpenS2V-Eval

upvoted a paper 13 days ago

Plan-X: Instruct Video Generation via Semantic Planning

View all activity

Organizations

authored 2 papers about 2 months ago

FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation

Paper • 2509.25187 • Published Sep 29 • 2

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

Paper • 2510.16888 • Published Oct 19 • 21

authored 5 papers 6 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

MAGREF: Masked Guidance for Any-Reference Video Generation

Paper • 2505.23742 • Published May 29 • 9

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Paper • 2505.20292 • Published May 26 • 52

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published May 26 • 18

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Paper • 2505.21205 • Published May 27 • 5

authored a paper 8 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 57

authored 2 papers 9 months ago

MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation

Paper • 2503.14428 • Published Mar 18 • 9

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Paper • 2503.10391 • Published Mar 13 • 11

authored 3 papers about 1 year ago

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Paper • 2411.17459 • Published Nov 26, 2024 • 12

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published Nov 28, 2024 • 33

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published Nov 26, 2024 • 37

authored 3 papers over 1 year ago

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Paper • 2409.01199 • Published Sep 2, 2024 • 14

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Paper • 2404.05014 • Published Apr 7, 2024 • 34

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Paper • 2406.18522 • Published Jun 26, 2024 • 20

YSH

AI & ML interests

Recent Activity

Organizations

BestWishYsh's activity