-
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 44 -
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Paper • 2501.06842 • Published • 16 -
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 94
nDimensional
nDimensional
AI & ML interests
Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks
Recent Activity
liked
a dataset
about 16 hours ago
allenai/wildjailbreak
liked
a model
about 19 hours ago
google/gemma-3n-E4B-it
liked
a model
1 day ago
neta-art/Neta-Lumina
Organizations
None yet