YongJin
PulYong
AR Image Generation
- Parallelized Autoregressive Visual Generation
  Paper • 2412.15119 • Published • 53
- Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
  Paper • 2412.17153 • Published • 39
- Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
  Paper • 2412.18609 • Published • 18
- Visual Autoregressive Modeling for Instruction-Guided Image Editing
  Paper • 2508.15772 • Published • 9
ETC
Score Based Model
Score Based Model Paper Collections
- Improved Denoising Diffusion Probabilistic Models
  Paper • 2102.09672 • Published • 2
- Denoising Diffusion Probabilistic Models
  Paper • 2006.11239 • Published • 6
- Denoising Diffusion Implicit Models
  Paper • 2010.02502 • Published • 4
- Diffusion Models Beat GANs on Image Synthesis
  Paper • 2105.05233 • Published • 2
Unified MLLM
Unified models that generate text, images, and video
- TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
  Paper • 2412.03069 • Published • 35
- Are Emergent Abilities of Large Language Models a Mirage?
  Paper • 2304.15004 • Published • 8
- Scaling Image Tokenizers with Grouped Spherical Quantization
  Paper • 2412.02632 • Published • 10
- Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
  Paper • 2410.13848 • Published • 34
Diffusion Language Model