Cahlen Humphreys PRO
cahlen
AI & ML interests
☠️💻
Recent Activity
liked
a dataset
11 days ago
MohamedRashad/SADA22
liked
a dataset
11 days ago
m6011/sada2022
liked
a dataset
11 days ago
khaledalganem/sada2022
Organizations
3D / Mesh
Gaussians and Nerfs
-
Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation
Paper • 2401.14257 • Published • 12 -
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
Paper • 2402.05054 • Published • 29 -
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
Paper • 2402.12712 • Published • 18 -
GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting
Paper • 2402.10259 • Published • 16
Image Restoration
Surveys
TBR
Papers TO BE READ
-
3D-LLM: Injecting the 3D World into Large Language Models
Paper • 2307.12981 • Published • 37 -
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study
Paper • 2401.17981 • Published • 1 -
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Paper • 2312.02126 • Published • 2 -
Relightable Gaussian Codec Avatars
Paper • 2312.03704 • Published • 33
Object Detection
Multimodal
Audio
Web Agents
Data Generation
3D Avatar Utils
-
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper • 2401.15687 • Published • 25 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 26 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 16
Spatial
LLM
-
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper • 2402.05140 • Published • 24 -
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper • 2402.10193 • Published • 23 -
QLoRA: Efficient Finetuning of Quantized LLMs
Paper • 2305.14314 • Published • 54 -
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper • 2402.14658 • Published • 84
Video
Agents
World Models
Audio
3D / Mesh
Web Agents
Gaussians and Nerfs
-
Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation
Paper • 2401.14257 • Published • 12 -
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
Paper • 2402.05054 • Published • 29 -
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
Paper • 2402.12712 • Published • 18 -
GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting
Paper • 2402.10259 • Published • 16
Data Generation
Image Restoration
3D Avatar Utils
-
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper • 2401.15687 • Published • 25 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 26 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 16
Surveys
Spatial
TBR
Papers TO BE READ
-
3D-LLM: Injecting the 3D World into Large Language Models
Paper • 2307.12981 • Published • 37 -
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study
Paper • 2401.17981 • Published • 1 -
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Paper • 2312.02126 • Published • 2 -
Relightable Gaussian Codec Avatars
Paper • 2312.03704 • Published • 33
LLM
-
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Paper • 2402.05140 • Published • 24 -
BitDelta: Your Fine-Tune May Only Be Worth One Bit
Paper • 2402.10193 • Published • 23 -
QLoRA: Efficient Finetuning of Quantized LLMs
Paper • 2305.14314 • Published • 54 -
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper • 2402.14658 • Published • 84
Object Detection
Video
Multimodal
Agents