arxiv:2509.24468

Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset

Published on Sep 29

Abstract

AI-generated summary: SOBACO, a Japanese benchmark, evaluates how debiasing methods affect cultural commonsense in LLMs and finds that they degrade it, with accuracy deteriorating by up to 75%.

Large language models (LLMs) exhibit social biases, which has prompted the development of various debiasing methods. However, debiasing methods may degrade the capabilities of LLMs. Previous research has evaluated the impact of bias mitigation primarily through tasks that measure general language understanding, which are often unrelated to social biases. In contrast, cultural commonsense is closely related to social biases, as both are rooted in social norms and values, yet the impact of bias mitigation on cultural commonsense in LLMs has not been well investigated. To address this gap, we propose SOBACO (SOcial BiAs and Cultural cOmmonsense benchmark), a Japanese benchmark designed to evaluate social biases and cultural commonsense in LLMs in a unified format. We evaluate several LLMs on SOBACO to examine how debiasing methods affect their cultural commonsense. Our results reveal that the debiasing methods degrade the performance of the LLMs on the cultural commonsense task (up to 75% accuracy deterioration). These results highlight the importance of developing debiasing methods that consider the trade-off with cultural commonsense in order to improve both the fairness and the utility of LLMs.
