pith. sign in

Title resolution pending

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

other 1

citation-polarity summary

fields

cs.CL 3 cs.CY 1

roles

other 1

polarities

unclear 1

representative citing papers

Implicit Bias-Like Patterns in Reasoning Models

cs.CY · 2025-03-14 · unverdicted · novelty 6.0

Reasoning models expend more tokens on association-incompatible tasks than compatible ones, indicating greater effort on counter-stereotypical information, except for Claude 3.7 Sonnet which shows the reverse pattern linked to its bias-focused reasoning.

Ethical and social risks of harm from Language Models

cs.CL · 2021-12-08 · accept · novelty 6.0

The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.

citing papers explorer

Showing 4 of 4 citing papers.

  • Towards Measuring the Representation of Subjective Global Opinions in Language Models cs.CL · 2023-06-28 · conditional · none · ref 60

    LLMs default to responses more similar to opinions from the USA and some European and South American countries; prompting for a country shifts alignment but can introduce stereotypes, while translation does not reliably match language speakers.

  • Implicit Bias-Like Patterns in Reasoning Models cs.CY · 2025-03-14 · unverdicted · none · ref 15

    Reasoning models expend more tokens on association-incompatible tasks than compatible ones, indicating greater effort on counter-stereotypical information, except for Claude 3.7 Sonnet which shows the reverse pattern linked to its bias-focused reasoning.

  • Ethical and social risks of harm from Language Models cs.CL · 2021-12-08 · accept · none · ref 172

    The authors provide a detailed taxonomy of 21 risks associated with language models, covering discrimination, information leaks, misinformation, malicious applications, interaction harms, and societal impacts like job loss and environmental costs.

  • Benchmark Data Contamination of Large Language Models: A Survey cs.CL · 2024-06-06 · unverdicted · none · ref 99

    A survey reviewing benchmark data contamination in LLMs, its impact on evaluation, and alternative assessment approaches.