Exploring Representation-Aligned Latent Space for Better Generation Paper • 2502.00359 • Published Feb 1 • 1
Manalyzer: End-to-end Automated Meta-analysis with Multi-agent System Paper • 2505.20310 • Published May 22 • 1
EarthSE: A Benchmark for Evaluating Earth Scientific Exploration Capability of LLMs Paper • 2505.17139 • Published May 22 • 2
OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data Paper • 2505.23522 • Published May 29 • 2
EarthSE: A Benchmark for Evaluating Earth Scientific Exploration Capability of LLMs Paper • 2505.17139 • Published May 22 • 2
PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models Paper • 2506.17667 • Published Jun 21 • 1
Manalyzer: End-to-end Automated Meta-analysis with Multi-agent System Paper • 2505.20310 • Published May 22 • 1