MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs Paper • 2505.24423 • Published May 30 • 1
FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information Paper • 2505.20650 • Published May 27 • 17
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation Paper • 2510.09116 • Published 25 days ago • 95
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs Paper • 2510.08886 • Published 25 days ago • 19
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models Paper • 2508.13491 • Published Aug 19 • 58
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation Paper • 2506.18095 • Published Jun 22 • 66
RKEFino1: A Regulation Knowledge-Enhanced Large Language Model Paper • 2506.05700 • Published Jun 6 • 4
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation Paper • 2506.14028 • Published Jun 16 • 93
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications Paper • 2503.20990 • Published Mar 26 • 19
Plutus: Pioneering Greek Financial AI in a Global Context Article • By TheFinAI and 9 others • Feb 27 • 6
Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance Paper • 2502.18772 • Published Feb 26 • 33
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Paper • 2502.11433 • Published Feb 17 • 36
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published Feb 12 • 58
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published Feb 9 • 42
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 61
Back to the Future: Towards Explainable Temporal Reasoning with Large Language Models Paper • 2310.01074 • Published Oct 2, 2023 • 2
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks Paper • 2403.06249 • Published Mar 10, 2024 • 3