Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

Arman Hassan Mahy; Azizah Mamun Abha; G M Shahariar; MD Piyal Ahmmed; Meherin Sultana; Sajib Kumar Saha Joy; Yue Dong

arxiv: 2411.10636 · v3 · pith:T6HWMJ63new · submitted 2024-11-16 · 💻 cs.CL · cs.AI· cs.LG

Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

Sajib Kumar Saha Joy , Arman Hassan Mahy , Meherin Sultana , Azizah Mamun Abha , MD Piyal Ahmmed , Yue Dong , G M Shahariar This is my paper

Pith reviewed 2026-05-23 17:34 UTC · model grok-4.3

classification 💻 cs.CL cs.AIcs.LG

keywords gender biasBanglapretrained language modelsdebiasingsentiment analysistoxicity detectionhate speech detectionsarcasm detection

0 comments

The pith

RandSymKL reduces prediction shifts from gender swaps in Bangla classification tasks while preserving accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper constructs four manually annotated Bangla datasets for sentiment analysis, toxicity detection, hate speech detection, and sarcasm detection, each augmented with gender perturbations that swap names and terms to create minimal pairs. It introduces RandSymKL, a training procedure that combines randomized debiasing with symmetric KL divergence and cross-entropy loss to shrink the difference in model outputs across those pairs. Experiments on task-specific pretrained models show lower bias metrics than prior mitigation techniques while accuracy on the original tasks stays competitive. The datasets and code are released to support further work on low-resource language bias.

Core claim

We construct four benchmark datasets with nuanced gender perturbations that enable minimal-pair measurement of extrinsic gender bias and propose RandSymKL, a unified training approach that integrates symmetric KL divergence with cross-entropy loss to reduce those shifts across sentiment, toxicity, hate-speech, and sarcasm classifiers without sacrificing task performance.

What carries the argument

RandSymKL, the randomized debiasing strategy that adds symmetric KL divergence between predictions on gender-swapped pairs to the standard cross-entropy objective during fine-tuning.

If this is right

Bias is lowered on all four classification tasks relative to existing mitigation baselines.
Task accuracy remains competitive with the same baselines on the original test sets.
The approach applies directly to any task-specific pretrained Bangla model fine-tuned for classification.
Public release of the perturbed datasets allows direct replication and extension by other researchers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same perturbation-and-KL approach could be tested on other low-resource languages whose scripts or morphology differ from Bangla.
If the bias reduction transfers to production systems, content-moderation tools in Bangla-speaking regions would flag fewer gender-neutral statements differently for male versus female referents.
The method might be combined with data-augmentation techniques that introduce more diverse name inventories to further strengthen generalization.

Load-bearing premise

That swapping gendered names and terms while keeping semantics fixed produces minimal pairs whose prediction shifts measure extrinsic gender bias and whose reduction will generalize beyond the four constructed datasets.

What would settle it

A held-out collection of new gender-perturbed Bangla examples on which the method produces no smaller average prediction shift than the baselines.

Figures

Figures reproduced from arXiv: 2411.10636 by Arman Hassan Mahy, Azizah Mamun Abha, G M Shahariar, MD Piyal Ahmmed, Meherin Sultana, Sajib Kumar Saha Joy, Yue Dong.

read the original abstract

In this study, we investigate extrinsic gender bias in Bangla pretrained language models, a largely underexplored area in low-resource languages. To assess this bias, we construct four manually annotated, task-specific benchmark datasets for sentiment analysis, toxicity detection, hate speech detection, and sarcasm detection. Each dataset is augmented using nuanced gender perturbations, where we systematically swap gendered names and terms while preserving semantic content, enabling minimal-pair evaluation of gender-driven prediction shifts. We then propose RandSymKL, a randomized debiasing strategy integrated with symmetric KL divergence and cross-entropy loss to mitigate the bias across task-specific pretrained models. RandSymKL is a refined training approach to integrate these elements in a unified way for extrinsic gender bias mitigation focused on classification tasks. Our approach was evaluated against existing bias mitigation methods, with results showing that our technique not only effectively reduces bias but also maintains competitive accuracy compared to other baseline approaches. To promote further research, we have made both our implementation and datasets publicly available: https://github.com/sajib-kumar/Mitigating-Bangla-Extrinsic-Gender-Bias

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

New Bangla datasets with gender perturbations are the useful part; the bias reduction claim has no numbers or checks attached to it.

read the letter

The paper's main output is four new task-specific Bangla datasets for sentiment, toxicity, hate speech, and sarcasm detection, each with manual gender perturbations that create minimal pairs by swapping names and terms. It also describes RandSymKL, a training recipe that adds randomized symmetric KL divergence to the usual cross-entropy loss. Both the data and code are released, which is the part that could actually help other people working on Bangla models. That is concrete and worth having for a low-resource language where bias work has been thin. The evaluation claim in the abstract is that the method cuts bias while keeping competitive accuracy against baselines, but the text supplies no accuracy numbers, no bias scores, no statistical tests, and no description of how the prediction shift was measured. Without those, the central result cannot be assessed. The stress-test point about whether the swaps preserve semantics in Bangla is also live: honorifics, kinship terms, and context-dependent marking make clean minimal pairs harder than in English, and the abstract offers no human ratings or overlap stats to confirm equivalence. If the full paper has those checks and the numbers, the work is worth referee time for the data alone. If the evaluation section stays as thin as the abstract, it is not ready. This is for researchers who need Bangla resources or who study bias mitigation in non-English settings; the data release gives it a narrow but real audience.

Referee Report

2 major / 1 minor

Summary. The paper constructs four manually annotated benchmark datasets for Bangla classification tasks (sentiment analysis, toxicity detection, hate speech detection, sarcasm detection) by augmenting them with gender perturbations that swap names and terms to create minimal pairs for measuring extrinsic bias in pretrained models. It proposes RandSymKL, a training procedure combining a randomized debiasing strategy with symmetric KL divergence and cross-entropy loss, and claims that this method reduces bias while preserving competitive accuracy relative to existing mitigation baselines. Datasets and code are released publicly.

Significance. If the results hold under rigorous validation, the work would address an underexplored area of bias mitigation for low-resource languages. The public release of datasets and implementation is a clear strength supporting reproducibility.

major comments (2)

[Abstract and benchmark dataset construction] The central claim that RandSymKL reduces bias while preserving accuracy rests on the four constructed datasets supplying valid minimal pairs. The manuscript states that the perturbations are 'nuanced' and 'preserve semantic content' (Abstract), but provides no quantitative checks such as human equivalence ratings, lexical overlap statistics, or control perturbations. In Bangla, honorifics, kinship terms, and context-dependent gender marking make clean swaps harder than in English; without such validation, any observed drop in prediction shift may be partly artifactual rather than evidence of extrinsic-bias mitigation.
[Evaluation and results] The evaluation section asserts that the approach 'effectively reduces bias' and 'maintains competitive accuracy' compared to baselines, yet the abstract supplies no specific metrics, tables of results, statistical tests, or details on how bias was quantified (e.g., prediction-shift magnitude before/after mitigation). This absence makes the headline claim unverifiable from the provided text and load-bearing for the contribution.

minor comments (1)

[Abstract] The acronym RandSymKL is introduced in the abstract without expansion or a formal definition/pseudocode of the randomized component.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive comments. We address each major comment below and outline the revisions we will make to improve the manuscript.

read point-by-point responses

Referee: [Abstract and benchmark dataset construction] The central claim that RandSymKL reduces bias while preserving accuracy rests on the four constructed datasets supplying valid minimal pairs. The manuscript states that the perturbations are 'nuanced' and 'preserve semantic content' (Abstract), but provides no quantitative checks such as human equivalence ratings, lexical overlap statistics, or control perturbations. In Bangla, honorifics, kinship terms, and context-dependent gender marking make clean swaps harder than in English; without such validation, any observed drop in prediction shift may be partly artifactual rather than evidence of extrinsic-bias mitigation.

Authors: We agree that quantitative validation of the gender perturbations is important for establishing the validity of the minimal pairs, particularly given the linguistic complexities of Bangla. The current manuscript relies on manual annotation and design choices to preserve semantics but does not report explicit quantitative checks such as human equivalence ratings or lexical overlap statistics. In the revised manuscript, we will add these analyses, including human ratings on a representative sample of perturbations and lexical overlap measures, along with discussion of how honorifics and kinship terms were handled. revision: yes
Referee: [Evaluation and results] The evaluation section asserts that the approach 'effectively reduces bias' and 'maintains competitive accuracy' compared to baselines, yet the abstract supplies no specific metrics, tables of results, statistical tests, or details on how bias was quantified (e.g., prediction-shift magnitude before/after mitigation). This absence makes the headline claim unverifiable from the provided text and load-bearing for the contribution.

Authors: The full manuscript includes detailed evaluation results with tables, bias quantification via prediction-shift magnitudes on minimal pairs, accuracy comparisons, and statistical details. However, the abstract is intentionally high-level and does not include these specifics. We will revise the abstract to incorporate key quantitative findings on bias reduction and accuracy to make the central claims verifiable directly from the abstract. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical method and evaluation

full rationale

The paper presents a new randomized debiasing training procedure (RandSymKL) and evaluates it on four constructed Bangla datasets using minimal-pair prediction-shift metrics. No equations, derivations, or uniqueness theorems are claimed; the central result is an empirical comparison of accuracy and bias reduction against baselines. The method is defined directly by its loss combination and randomization, with no reduction of the reported improvement back to a fitted parameter or self-citation chain. The reader's assessment of score 1.0 is consistent with this self-contained empirical structure.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The central claim rests on the assumption that gender-swapped minimal pairs isolate extrinsic bias and that standard cross-entropy plus symmetric KL can be combined without introducing new uncontrolled hyperparameters that dominate the outcome.

axioms (2)

domain assumption Gender name and term swaps preserve semantic content and isolate extrinsic bias in classification predictions.
Invoked when constructing the four benchmark datasets from existing sentences.
domain assumption Pretrained Bangla language models exhibit measurable extrinsic gender bias on the constructed minimal-pair examples.
Stated as the motivation for the benchmark construction and debiasing experiments.

invented entities (1)

RandSymKL no independent evidence
purpose: Unified training objective that integrates randomized perturbations with symmetric KL divergence and cross-entropy for bias mitigation.
Newly proposed method whose effectiveness is asserted via comparison to baselines.

pith-pipeline@v0.9.0 · 5752 in / 1402 out tokens · 30991 ms · 2026-05-23T17:34:56.138342+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages · 5 internal anchors

[1]

Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774

work page internal anchor Pith review Pith/arXiv arXiv 2023
[2]

Tasnim Sakib Apon, Ramisa Anan, Elizabeth Antora Modhu, Arjun Suter, Ifrit Jamal Sneha, and MD Golam Rabiul Alam. 2022. Banglasarc: A dataset for sarcasm detection. In 2022 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE), pages 1--5. IEEE

work page 2022
[3]

Tanveer Ahmed Belal, GM Shahariar, and Md Hasanul Kabir. 2023. Interpretable multi labeled bengali toxic comments classification using deep learning. In 2023 International Conference on Electrical, Computer and Communication Engineering (ECCE), pages 1--6. IEEE

work page 2023
[4]

Sohel Rahman, and Rifat Shahriyar

Abhik Bhattacharjee, Tahmid Hasan, Wasi Ahmad, Kazi Samin Mubasshir, Md Saiful Islam, Anindya Iqbal, M. Sohel Rahman, and Rifat Shahriyar. 2022. https://aclanthology.org/2022.findings-naacl.98 B angla BERT : Language model pretraining and benchmarks for low-resource language understanding evaluation in B angla . In Findings of the Association for Computat...

work page 2022
[5]

Tolga Bolukbasi, Kai-Wei Chang, James Y Zou, Venkatesh Saligrama, and Adam T Kalai. 2016. https://proceedings.neurips.cc/paper_files/paper/2016/file/a486cd07e4ac3d270571622f4f316ec5-Paper.pdf Man is to computer programmer as woman is to homemaker? debiasing word embeddings . In Advances in Neural Information Processing Systems, volume 29. Curran Associates, Inc

work page 2016
[6]

K Clark. 2020. Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555

work page internal anchor Pith review Pith/arXiv arXiv 2020
[7]

Dipto Das, Shion Guha, and Bryan Semaan. 2023. Toward cultural bias evaluation datasets: The case of bengali gender, religious, and national identity. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP), pages 68--83

work page 2023
[8]

Mithun Das, Somnath Banerjee, and Animesh Mukherjee. 2022. Data bootstrapping approaches to improve low resource abusive language detection for indic languages. arXiv preprint arXiv:2204.12543

work page arXiv 2022
[9]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. https://doi.org/10.18653/v1/N19-1423 BERT : Pre-training of deep bidirectional transformers for language understanding . In Proceedings of the 2019 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long a...

work page doi:10.18653/v1/n19-1423 2019
[10]

FredZhangUBC. 2023. Bert small multilingual toxicity classification. https://github.com/fredzhang7/tfjs-node-tiny/releases/tag/text-classification. Accessed: October 16, 2024

work page 2023
[11]

Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, and Afiyat Anjum

Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, and Afiyat Anjum. 2023. Blp 2023 task 2: Sentiment analysis. In Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), Singapore. Association for Computational Linguistics

work page 2023
[12]

Md Arid Hasan, Prerona Tarannum, Krishno Dey, Imran Razzak, and Usman Naseem. 2024. Do large language models speak all languages equally? a comparative study in low-resource settings. arXiv preprint arXiv:2408.02237

work page arXiv 2024
[13]

Max Hort, Zhenpeng Chen, Jie M Zhang, Mark Harman, and Federica Sarro. 2023. Bias mitigation for machine learning classifiers: A comprehensive survey. ACM Journal on Responsible Computing

work page 2023
[14]

Sophie Jentzsch and Cigdem Turan. 2023. Gender bias in bert--measuring and analysing biases through sentiment rating in a realistic downstream classification task. arXiv preprint arXiv:2306.15298

work page arXiv 2023
[15]

Mohsinul Kabir, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, M Saiful Bari, and Enamul Hoque. 2024. https://aclanthology.org/2024.lrec-main.201 B en LLM -eval: A comprehensive evaluation into the potentials and pitfalls of large language models on B engali NLP . In Proceedings of the 2024 Joint International Conference on Computatio...

work page 2024
[16]

Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, et al. 2021. Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730

work page arXiv 2021
[17]

Svetlana Kiritchenko and Saif Mohammad. 2018. https://doi.org/10.18653/v1/S18-2005 Examining gender and race bias in two hundred sentiment analysis systems . In Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics, pages 43--53, New Orleans, Louisiana. Association for Computational Linguistics

work page doi:10.18653/v1/s18-2005 2018
[18]

Hadas Kotek, Rikker Dockum, and David Sun. 2023. Gender bias and stereotypes in large language models. In Proceedings of the ACM collective intelligence conference, pages 12--24

work page 2023
[19]

Md Kowsher, Abdullah As Sami, Nusrat Jahan Prottasha, Mohammad Shamsul Arefin, Pranab Kumar Dhar, and Takeshi Koshiba. 2022. Bangla-bert: transformer-based efficient model for transfer learning and language understanding. IEEE Access, 10:91855--91870

work page 2022
[20]

Sanzana Karim Lora, Ishrat Jahan, Rahad Hussain, Rifat Shahriyar, and ABM Alim Al Islam. 2023. A transformer-based generative adversarial learning to detect sarcasm from bengali text with correct classification of confusing text. Heliyon, 9(12)

work page 2023
[21]

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of machine learning research, 21(140):1--67

work page 2020
[22]

Md Jamiur Rahman Rifat, Sheikh Abujar, Sheak Rashed Haider Noori, and Syed Akhter Hossain. 2019. https://doi.org/10.1109/ICCCNT45670.2019.8944804 Bengali named entity recognition: A survey with deep learning benchmark . In 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pages 1--5

work page doi:10.1109/icccnt45670.2019.8944804 2019
[23]

Nauros Romim, Mosahed Ahmed, Hriteshwar Talukder, and Md Saiful Islam. 2021. Hate speech detection in the bengali language: A dataset and its baseline evaluation. In Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020, pages 457--468. Springer

work page 2021
[24]

Rachel Rudinger, Jason Naradowsky, Brian Leonard, and Benjamin Van Durme. 2018. https://doi.org/10.18653/v1/N18-2002 Gender bias in coreference resolution . In Proceedings of the 2018 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers) , pages 8--14, New Orleans, ...

work page doi:10.18653/v1/n18-2002 2018
[25]

Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, and Rifat Shahriyar. 2024 a . An empirical study on the characteristics of bias upon context length variation for bangla. arXiv preprint arXiv:2406.17375

work page arXiv 2024
[26]

Jayanta Sadhu, Maneesha Rani Saha, and Rifat Shahriyar. 2024 b . An empirical study of gendered stereotypes in emotional attributes for bangla in multilingual large language models. arXiv preprint arXiv:2407.06432

work page arXiv 2024
[27]

Jayanta Sadhu, Maneesha Rani Saha, and Rifat Shahriyar. 2024 c . Social bias in large language models for bangla: An empirical study on gender and religious bias. arXiv preprint arXiv:2407.03536

work page arXiv 2024
[28]

Salim Sazzed. 2020. Cross-lingual sentiment classification in low-resource bengali language. In Proceedings of the sixth workshop on noisy user-generated text (W-NUT 2020), pages 50--60

work page 2020
[29]

Hsuan Su, Cheng-Chu Cheng, Hua Farn, Shachi H Kumar, Saurav Sahay, Shang-Tse Chen, and Hung-yi Lee. 2023. Learning from red teaming: Gender bias provocation and mitigation in large language models. arXiv preprint arXiv:2310.11079

work page arXiv 2023
[30]

Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, and William Yang Wang. 2019 a . https://doi.org/10.18653/v1/P19-1159 Mitigating gender bias in natural language processing: Literature review . In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics...

work page doi:10.18653/v1/p19-1159 2019
[31]

Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, and William Yang Wang. 2019 b . Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976

work page internal anchor Pith review Pith/arXiv arXiv 2019
[32]

Gemini Team, Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M Dai, Anja Hauth, et al. 2023. Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805

work page internal anchor Pith review Pith/arXiv arXiv 2023
[33]

Pranav Tiwari, Aman Chandra Kumar, Aravindan Chandrabose, et al. 2022. Casteism in india, but not racism-a study of bias in word embeddings of indian languages. In Proceedings of the First Workshop on Language Technology and Resources for a Fair, Inclusive, and Safe Society within the 13th Language Resources and Evaluation Conference, pages 1--7

work page 2022
[34]

Ewoenam Tokpo, Pieter Delobelle, Bettina Berendt, and Toon Calders. 2023. How far can it go?: On intrinsic gender bias mitigation for text classification. arXiv preprint arXiv:2301.12855

work page arXiv 2023
[35]

Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timoth \'e e Lacroix, Baptiste Rozi \`e re, Naman Goyal, Eric Hambro, Faisal Azhar, et al. 2023. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971

work page internal anchor Pith review Pith/arXiv arXiv 2023
[36]

Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel, Emily Pitler, Ellie Pavlick, Jilin Chen, and Slav Petrov. 2020. https://arxiv.org/abs/2010.06032 Measuring and reducing gendered correlations in pre-trained models . CoRR, abs/2010.06032

work page arXiv 2020
[37]

Zhongbin Xie, Vid Kocijan, Thomas Lukasiewicz, and Oana-Maria Camburu. 2023. Counter-gap: Counterfactual bias evaluation through gendered ambiguous pronouns. arXiv preprint arXiv:2302.05674

work page arXiv 2023
[38]

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. 2018. https://doi.org/10.18653/v1/N18-2003 Gender bias in coreference resolution: Evaluation and debiasing methods . In Proceedings of the 2018 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short P...

work page doi:10.18653/v1/n18-2003 2018
[39]

online" 'onlinestring :=

ENTRY address archivePrefix author booktitle chapter edition editor eid eprint eprinttype howpublished institution journal key month note number organization pages publisher school series title type volume year doi pubmed url lastchecked label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block STRING...

work page
[40]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...

work page

[1] [1]

Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774

work page internal anchor Pith review Pith/arXiv arXiv 2023

[2] [2]

Tasnim Sakib Apon, Ramisa Anan, Elizabeth Antora Modhu, Arjun Suter, Ifrit Jamal Sneha, and MD Golam Rabiul Alam. 2022. Banglasarc: A dataset for sarcasm detection. In 2022 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE), pages 1--5. IEEE

work page 2022

[3] [3]

Tanveer Ahmed Belal, GM Shahariar, and Md Hasanul Kabir. 2023. Interpretable multi labeled bengali toxic comments classification using deep learning. In 2023 International Conference on Electrical, Computer and Communication Engineering (ECCE), pages 1--6. IEEE

work page 2023

[4] [4]

Sohel Rahman, and Rifat Shahriyar

Abhik Bhattacharjee, Tahmid Hasan, Wasi Ahmad, Kazi Samin Mubasshir, Md Saiful Islam, Anindya Iqbal, M. Sohel Rahman, and Rifat Shahriyar. 2022. https://aclanthology.org/2022.findings-naacl.98 B angla BERT : Language model pretraining and benchmarks for low-resource language understanding evaluation in B angla . In Findings of the Association for Computat...

work page 2022

[5] [5]

Tolga Bolukbasi, Kai-Wei Chang, James Y Zou, Venkatesh Saligrama, and Adam T Kalai. 2016. https://proceedings.neurips.cc/paper_files/paper/2016/file/a486cd07e4ac3d270571622f4f316ec5-Paper.pdf Man is to computer programmer as woman is to homemaker? debiasing word embeddings . In Advances in Neural Information Processing Systems, volume 29. Curran Associates, Inc

work page 2016

[6] [6]

K Clark. 2020. Electra: Pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555

work page internal anchor Pith review Pith/arXiv arXiv 2020

[7] [7]

Dipto Das, Shion Guha, and Bryan Semaan. 2023. Toward cultural bias evaluation datasets: The case of bengali gender, religious, and national identity. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP), pages 68--83

work page 2023

[8] [8]

Mithun Das, Somnath Banerjee, and Animesh Mukherjee. 2022. Data bootstrapping approaches to improve low resource abusive language detection for indic languages. arXiv preprint arXiv:2204.12543

work page arXiv 2022

[9] [9]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. https://doi.org/10.18653/v1/N19-1423 BERT : Pre-training of deep bidirectional transformers for language understanding . In Proceedings of the 2019 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long a...

work page doi:10.18653/v1/n19-1423 2019

[10] [10]

FredZhangUBC. 2023. Bert small multilingual toxicity classification. https://github.com/fredzhang7/tfjs-node-tiny/releases/tag/text-classification. Accessed: October 16, 2024

work page 2023

[11] [11]

Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, and Afiyat Anjum

Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, and Afiyat Anjum. 2023. Blp 2023 task 2: Sentiment analysis. In Proceedings of the 1st International Workshop on Bangla Language Processing (BLP-2023), Singapore. Association for Computational Linguistics

work page 2023

[12] [12]

Md Arid Hasan, Prerona Tarannum, Krishno Dey, Imran Razzak, and Usman Naseem. 2024. Do large language models speak all languages equally? a comparative study in low-resource settings. arXiv preprint arXiv:2408.02237

work page arXiv 2024

[13] [13]

Max Hort, Zhenpeng Chen, Jie M Zhang, Mark Harman, and Federica Sarro. 2023. Bias mitigation for machine learning classifiers: A comprehensive survey. ACM Journal on Responsible Computing

work page 2023

[14] [14]

Sophie Jentzsch and Cigdem Turan. 2023. Gender bias in bert--measuring and analysing biases through sentiment rating in a realistic downstream classification task. arXiv preprint arXiv:2306.15298

work page arXiv 2023

[15] [15]

Mohsinul Kabir, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, M Saiful Bari, and Enamul Hoque. 2024. https://aclanthology.org/2024.lrec-main.201 B en LLM -eval: A comprehensive evaluation into the potentials and pitfalls of large language models on B engali NLP . In Proceedings of the 2024 Joint International Conference on Computatio...

work page 2024

[16] [16]

Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, et al. 2021. Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730

work page arXiv 2021

[17] [17]

Svetlana Kiritchenko and Saif Mohammad. 2018. https://doi.org/10.18653/v1/S18-2005 Examining gender and race bias in two hundred sentiment analysis systems . In Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics, pages 43--53, New Orleans, Louisiana. Association for Computational Linguistics

work page doi:10.18653/v1/s18-2005 2018

[18] [18]

Hadas Kotek, Rikker Dockum, and David Sun. 2023. Gender bias and stereotypes in large language models. In Proceedings of the ACM collective intelligence conference, pages 12--24

work page 2023

[19] [19]

Md Kowsher, Abdullah As Sami, Nusrat Jahan Prottasha, Mohammad Shamsul Arefin, Pranab Kumar Dhar, and Takeshi Koshiba. 2022. Bangla-bert: transformer-based efficient model for transfer learning and language understanding. IEEE Access, 10:91855--91870

work page 2022

[20] [20]

Sanzana Karim Lora, Ishrat Jahan, Rahad Hussain, Rifat Shahriyar, and ABM Alim Al Islam. 2023. A transformer-based generative adversarial learning to detect sarcasm from bengali text with correct classification of confusing text. Heliyon, 9(12)

work page 2023

[21] [21]

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2020. Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of machine learning research, 21(140):1--67

work page 2020

[22] [22]

Md Jamiur Rahman Rifat, Sheikh Abujar, Sheak Rashed Haider Noori, and Syed Akhter Hossain. 2019. https://doi.org/10.1109/ICCCNT45670.2019.8944804 Bengali named entity recognition: A survey with deep learning benchmark . In 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pages 1--5

work page doi:10.1109/icccnt45670.2019.8944804 2019

[23] [23]

Nauros Romim, Mosahed Ahmed, Hriteshwar Talukder, and Md Saiful Islam. 2021. Hate speech detection in the bengali language: A dataset and its baseline evaluation. In Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020, pages 457--468. Springer

work page 2021

[24] [24]

Rachel Rudinger, Jason Naradowsky, Brian Leonard, and Benjamin Van Durme. 2018. https://doi.org/10.18653/v1/N18-2002 Gender bias in coreference resolution . In Proceedings of the 2018 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers) , pages 8--14, New Orleans, ...

work page doi:10.18653/v1/n18-2002 2018

[25] [25]

Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, and Rifat Shahriyar. 2024 a . An empirical study on the characteristics of bias upon context length variation for bangla. arXiv preprint arXiv:2406.17375

work page arXiv 2024

[26] [26]

Jayanta Sadhu, Maneesha Rani Saha, and Rifat Shahriyar. 2024 b . An empirical study of gendered stereotypes in emotional attributes for bangla in multilingual large language models. arXiv preprint arXiv:2407.06432

work page arXiv 2024

[27] [27]

Jayanta Sadhu, Maneesha Rani Saha, and Rifat Shahriyar. 2024 c . Social bias in large language models for bangla: An empirical study on gender and religious bias. arXiv preprint arXiv:2407.03536

work page arXiv 2024

[28] [28]

Salim Sazzed. 2020. Cross-lingual sentiment classification in low-resource bengali language. In Proceedings of the sixth workshop on noisy user-generated text (W-NUT 2020), pages 50--60

work page 2020

[29] [29]

Hsuan Su, Cheng-Chu Cheng, Hua Farn, Shachi H Kumar, Saurav Sahay, Shang-Tse Chen, and Hung-yi Lee. 2023. Learning from red teaming: Gender bias provocation and mitigation in large language models. arXiv preprint arXiv:2310.11079

work page arXiv 2023

[30] [30]

Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, and William Yang Wang. 2019 a . https://doi.org/10.18653/v1/P19-1159 Mitigating gender bias in natural language processing: Literature review . In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics...

work page doi:10.18653/v1/p19-1159 2019

[31] [31]

Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, and William Yang Wang. 2019 b . Mitigating gender bias in natural language processing: Literature review. arXiv preprint arXiv:1906.08976

work page internal anchor Pith review Pith/arXiv arXiv 2019

[32] [32]

Gemini Team, Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M Dai, Anja Hauth, et al. 2023. Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805

work page internal anchor Pith review Pith/arXiv arXiv 2023

[33] [33]

Pranav Tiwari, Aman Chandra Kumar, Aravindan Chandrabose, et al. 2022. Casteism in india, but not racism-a study of bias in word embeddings of indian languages. In Proceedings of the First Workshop on Language Technology and Resources for a Fair, Inclusive, and Safe Society within the 13th Language Resources and Evaluation Conference, pages 1--7

work page 2022

[34] [34]

Ewoenam Tokpo, Pieter Delobelle, Bettina Berendt, and Toon Calders. 2023. How far can it go?: On intrinsic gender bias mitigation for text classification. arXiv preprint arXiv:2301.12855

work page arXiv 2023

[35] [35]

Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timoth \'e e Lacroix, Baptiste Rozi \`e re, Naman Goyal, Eric Hambro, Faisal Azhar, et al. 2023. Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971

work page internal anchor Pith review Pith/arXiv arXiv 2023

[36] [36]

Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel, Emily Pitler, Ellie Pavlick, Jilin Chen, and Slav Petrov. 2020. https://arxiv.org/abs/2010.06032 Measuring and reducing gendered correlations in pre-trained models . CoRR, abs/2010.06032

work page arXiv 2020

[37] [37]

Zhongbin Xie, Vid Kocijan, Thomas Lukasiewicz, and Oana-Maria Camburu. 2023. Counter-gap: Counterfactual bias evaluation through gendered ambiguous pronouns. arXiv preprint arXiv:2302.05674

work page arXiv 2023

[38] [38]

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. 2018. https://doi.org/10.18653/v1/N18-2003 Gender bias in coreference resolution: Evaluation and debiasing methods . In Proceedings of the 2018 Conference of the North A merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short P...

work page doi:10.18653/v1/n18-2003 2018

[39] [39]

online" 'onlinestring :=

ENTRY address archivePrefix author booktitle chapter edition editor eid eprint eprinttype howpublished institution journal key month note number organization pages publisher school series title type volume year doi pubmed url lastchecked label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block STRING...

work page

[40] [40]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...

work page