A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper โข 2508.21148 โข Published Aug 28 โข 140
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper โข 2501.01427 โข Published Jan 2 โข 54
Insight-V Collection Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models โข 5 items โข Updated Nov 22, 2024 โข 11