EnerVerse-AC: Envisioning Embodied Environments with Action Condition Paper • 2505.09723 • Published May 14 • 23
Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction Paper • 2504.15573 • Published Apr 22
Persona Knowledge-Aligned Prompt Tuning Method for Online Debate Paper • 2410.04239 • Published Oct 5, 2024
ChatGPT Evaluation on Sentence Level Relations: A Focus on Temporal, Causal, and Discourse Relations Paper • 2304.14827 • Published Apr 28, 2023
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition Paper • 2211.13873 • Published Nov 25, 2022
XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head Co-Attention for Reading Comprehension of Abstract Meaning Paper • 2103.16102 • Published Mar 30, 2021
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models Paper • 2310.20410 • Published Oct 31, 2023 • 1
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models Paper • 2401.16745 • Published Jan 30, 2024
Learning to Edit: Aligning LLMs with Knowledge Editing Paper • 2402.11905 • Published Feb 19, 2024 • 1
Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning Paper • 2203.06875 • Published Mar 14, 2022
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization Paper • 2408.07471 • Published Aug 14, 2024
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge Paper • 2502.12501 • Published Feb 18 • 6
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Paper • 2410.05193 • Published Oct 7, 2024 • 13
Lion: Adversarial Distillation of Closed-Source Large Language Model Paper • 2305.12870 • Published May 22, 2023