Risk-Averse Reinforcement Learning with Itakura-Saito Loss Paper • 2505.16925 • Published May 22, 2025 • 26
ZhMax/llama-2-13b-ebft-sparsegpt-outlier-wiki-block-outlier Text Generation • 13B • Updated Dec 19, 2024
ZhMax/llama-2-7b-ebft-sparsegpt-outlier-wiki-block-outlier Text Generation • 7B • Updated Dec 15, 2024
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs Paper • 2408.15300 • Published Aug 27, 2024 • 3