Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

GadflyII/GLM-4.7-Flash-NVFP4

liked a model 1 day ago

zai-org/GLM-4.7-Flash

liked a model 4 days ago

OpenGVLab/InternVL3_5-38B-Instruct

View all activity

Organizations

upvoted a collection 4 days ago

Qwen3-VL

37 items • Updated 21 days ago • 588

upvoted a collection 5 days ago

TranslateGemma

3 items • Updated 6 days ago • 165

upvoted a paper 19 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 23 days ago • 25

upvoted a paper 20 days ago

Scaling Open-Ended Reasoning to Predict the Future

Paper • 2512.25070 • Published 20 days ago • 15

upvoted a collection 20 days ago

IQuest-Coder

13 items • Updated 21 days ago • 87

upvoted a paper 23 days ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 93

upvoted a collection 27 days ago

Openhands Trajectories

Dataset of 67,074 OpenHands trajectories collected with Qwen3-Coder-480B-A35B-Instruct and two RFT checkpoints trained on the data • 3 items • Updated 29 days ago • 6

upvoted 4 papers about 1 month ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 83

VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse

Paper • 2512.14531 • Published Dec 16, 2025 • 13

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 42

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 106

upvoted 3 collections about 1 month ago

Molmo2

Artifacts for the Molmo2 release • 6 items • Updated 29 days ago • 30

Bolmo

Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated 29 days ago • 12

Olmo 3.1

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 29 days ago • 44

upvoted a paper about 1 month ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 76

upvoted a collection about 2 months ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 148

upvoted 2 papers about 2 months ago

Rectifying LLM Thought from Lens of Optimization

Paper • 2512.01925 • Published Dec 1, 2025 • 24

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 178

upvoted a collection about 2 months ago

SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models • 10 items • Updated 10 days ago • 15

upvoted a collection 2 months ago

DR Tulu

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated Nov 25, 2025 • 31