Submitted by Nandan Kumar Jha 1 Spectral Scaling Laws in Language Models: How Effectively Do Feed-Forward Networks Use Their Latent Space? New York University 2