Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias

Jayanta Sadhu; Maneesha Rani Saha; Rifat Shahriyar

arxiv: 2407.03536 · v3 · pith:344EPSOTnew · submitted 2024-07-03 · 💻 cs.CL

Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias

Jayanta Sadhu , Maneesha Rani Saha , Rifat Shahriyar This is my paper

classification 💻 cs.CL

keywords biasbanglabiaseslanguagedifferentllmssocialwork

0 comments

read the original abstract

The rapid growth of Large Language Models (LLMs) has put forward the study of biases as a crucial field. It is important to assess the influence of different types of biases embedded in LLMs to ensure fair use in sensitive fields. Although there have been extensive works on bias assessment in English, such efforts are rare and scarce for a major language like Bangla. In this work, we examine two types of social biases in LLM generated outputs for Bangla language. Our main contributions in this work are: (1) bias studies on two different social biases for Bangla, (2) a curated dataset for bias measurement benchmarking and (3) testing two different probing techniques for bias detection in the context of Bangla. This is the first work of such kind involving bias assessment of LLMs for Bangla to the best of our knowledge. All our code and resources are publicly available for the progress of bias related research in Bangla NLP.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

How do datasets, developers, and models affect biases in a low-resourced language?: The Case of the Bengali Language
cs.CL 2025-06 conditional novelty 5.0

Bengali sentiment analysis models exhibit persistent identity-based biases across datasets and developer backgrounds despite similar semantic content.
Mitigating Extrinsic Gender Bias for Bangla Classification Tasks
cs.CL 2024-11 unverdicted novelty 5.0

Constructs gender-perturbed Bangla classification benchmarks and proposes RandSymKL debiasing that reduces extrinsic gender bias in pretrained models.