Title resolution pending

Wilson, E · 1927 · arXiv 1459.1927

6 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset

cs.LG · 2026-04-21 · conditional · novelty 7.0

Rough-set analysis finds 16.4% of 305 concept profiles in Derm7pt inconsistent (306 images), capping hard CBM accuracy at 92.1%; symmetric filtering produces a 705-image consistent benchmark where EfficientNet-B5 reaches 0.90 label accuracy.

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

cs.AI · 2024-06-14 · conditional · novelty 7.0

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.

Efficiency for Experts, Visibility for Newcomers: A Case Study of Label-Code Alignment in Kubernetes

cs.SE · 2026-03-25 · unverdicted · novelty 6.0 · 2 refs

Case study of 18,020 Kubernetes PRs shows label-diff congruence is prevalent and stable, with higher congruence linked to fewer review participants among core developers and more among one-time contributors.

Laissez-Faire Harms: Algorithmic Biases in Generative Language Models

cs.CL · 2024-04-11 · unverdicted · novelty 6.0

Generative LMs in laissez-faire open-ended prompting settings disproportionately generate subordinated portrayals of minoritized race, gender, and sexual orientation identities at rates hundreds to thousands of times higher than empowering ones.

PYTHALAB-MERA: Validation-Grounded Memory, Retrieval, and Acceptance Control for Frozen-LLM Coding Agents

cs.CL · 2026-05-08 · unverdicted · novelty 5.0

An external controller for frozen LLMs raises strict validation success on three RL coding tasks from 0/9 to 8/9 by selecting memory records and skills, running fail-fast checks, and propagating credit via eligibility traces.

A framework and implementation for data-driven trigger efficiency estimation at LHCb

hep-ex · 2025-05-21 · unverdicted · novelty 4.0

Framework and software implementation for data-driven trigger efficiency estimation at LHCb using reconstructed candidate properties.

citing papers explorer

Showing 6 of 6 citing papers.

Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset

cs.LG · 2026-04-21 · conditional · none · ref 24

Rough-set analysis finds 16.4% of 305 concept profiles in Derm7pt inconsistent (306 images), capping hard CBM accuracy at 92.1%; symmetric filtering produces a 705-image consistent benchmark where EfficientNet-B5 reaches 0.90 label accuracy.

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

cs.AI · 2024-06-14 · conditional · none · ref 59

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.

Efficiency for Experts, Visibility for Newcomers: A Case Study of Label-Code Alignment in Kubernetes

cs.SE · 2026-03-25 · unverdicted · none · ref 53 · 2 links

Case study of 18,020 Kubernetes PRs shows label-diff congruence is prevalent and stable, with higher congruence linked to fewer review participants among core developers and more among one-time contributors.

Laissez-Faire Harms: Algorithmic Biases in Generative Language Models

cs.CL · 2024-04-11 · unverdicted · none · ref 93

Generative LMs in laissez-faire open-ended prompting settings disproportionately generate subordinated portrayals of minoritized race, gender, and sexual orientation identities at rates hundreds to thousands of times higher than empowering ones.

PYTHALAB-MERA: Validation-Grounded Memory, Retrieval, and Acceptance Control for Frozen-LLM Coding Agents

cs.CL · 2026-05-08 · unverdicted · none · ref 28

An external controller for frozen LLMs raises strict validation success on three RL coding tasks from 0/9 to 8/9 by selecting memory records and skills, running fail-fast checks, and propagating credit via eligibility traces.

A framework and implementation for data-driven trigger efficiency estimation at LHCb

hep-ex · 2025-05-21 · unverdicted · none · ref 38

Framework and software implementation for data-driven trigger efficiency estimation at LHCb using reconstructed candidate properties.
