Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Paper • 2507.12261 • Published 15 days ago • 1
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data Paper • 2507.00152 • Published about 1 month ago • 1
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework Paper • 2506.15538 • Published Jun 18 • 1
Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals Paper • 2505.13972 • Published May 20 • 1
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability Paper • 2505.13963 • Published May 20 • 1
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods Paper • 2505.01198 • Published May 2 • 2
Inseq: An Interpretability Toolkit for Sequence Generation Models Paper • 2302.13942 • Published Feb 27, 2023 • 1
LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools Paper • 2401.12576 • Published Jan 23, 2024 • 2
Free-text Rationale Generation under Readability Level Control Paper • 2407.01384 • Published Jul 1, 2024
Inseq: An Interpretability Toolkit for Sequence Generation Models Paper • 2302.13942 • Published Feb 27, 2023 • 1
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023 • 9
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model Paper • 2310.12611 • Published Oct 19, 2023
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 32
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses Paper • 2408.00584 • Published Aug 1, 2024 • 7
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools Paper • 2108.13961 • Published Aug 31, 2021
Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods Paper • 2210.07222 • Published Oct 13, 2022
InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations Paper • 2310.05592 • Published Oct 9, 2023
Multi-property Steering of Large Language Models with Dynamic Activation Composition Paper • 2406.17563 • Published Jun 25, 2024 • 4