FilBBQ provides a culturally adapted Filipino bias benchmark for QA models plus a multi-seed evaluation protocol that detects sexist and homophobic biases while showing score variability across runs.
As such, benchmark users shouldbewarynottointerpretlowbiasscoresfrom the benchmark as an indicator that a model is com- pletely free from bias
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models
FilBBQ provides a culturally adapted Filipino bias benchmark for QA models plus a multi-seed evaluation protocol that detects sexist and homophobic biases while showing score variability across runs.