view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 23 days ago • 44
view article Article AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems 17 days ago • 40
view article Article Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture 4 days ago • 31
view article Article cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents 24 days ago • 10
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 63
view article Article AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠 Dec 4, 2025 • 10
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 268