The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published 5 days ago • 22
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published 7 days ago • 6
Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows Paper • 2507.18405 • Published 5 days ago • 3
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance Paper • 2507.18192 • Published 5 days ago • 6
SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging Paper • 2507.15595 • Published 8 days ago • 4
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published 5 days ago • 12
DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis Paper • 2507.14988 • Published 9 days ago • 7
DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts Paper • 2507.18464 • Published 5 days ago • 8
Hierarchical Budget Policy Optimization for Adaptive Reasoning Paper • 2507.15844 • Published 8 days ago • 16
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published 9 days ago • 43
nablaNABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published 11 days ago • 110
EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion Paper • 2507.16535 • Published 7 days ago • 15
Finding Dori: Memorization in Text-to-Image Diffusion Models Is Less Local Than Assumed Paper • 2507.16880 • Published 7 days ago • 6
Mitigating Object Hallucinations via Sentence-Level Early Intervention Paper • 2507.12455 • Published 13 days ago • 7