As such, benchmark users should be wary not to interpret low bias scores from the benchmark as an indicator that a model is completely free from bias

Ethical Considerations, Limitations Despite our efforts to incorporate into FilBBQ as many of the biases present in Philippine culture as possible, it is still highly unlikely that we w

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models

cs.CL · 2026-02-16 · unverdicted · novelty 7.0

FilBBQ provides a culturally adapted Filipino bias benchmark for QA models plus a multi-seed evaluation protocol that detects sexist and homophobic biases while showing score variability across runs.

citing papers explorer

Showing 1 of 1 citing paper.

Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models

cs.CL · 2026-02-16 · unverdicted · none · ref 10
