super hub

1_Reasoning

Ben Bolker, Douglas Bates, Martin Mächler, Steve Walker · 2015 · Journal of Statistical Software · DOI 10.18637/jss.v067.i01

21 Pith papers cite this work, alongside 72,323 external citations. Polarity classification is still indexing.

21 Pith papers citing it

72.3k external citations · Crossref

open at publisher browse 21 citing papers more from Ben Bolker

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 1 method 1 other 1

citation-polarity summary

background 1 unclear 1 use method 1

authors

Ben Bolker Douglas Bates Martin Mächler Steve Walker

co-cited works

representative citing papers

Human face perception reflects inverse-generative and naturalistic discriminative objectives

q-bio.NC · 2026-05-12 · unverdicted · novelty 7.0

Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.

Machine individuality: Separating genuine idiosyncrasy from response bias in large language models

cs.AI · 2026-04-18 · unverdicted · novelty 7.0

Crossed random-effects models on LLM word ratings show 16.9% variance from genuine stimulus-specific individuality, exceeding null models and forming coherent per-model fingerprints.

SCOOTER: A Human Evaluation Framework for Unrestricted Adversarial Examples

cs.CV · 2025-07-10 · conditional · novelty 7.0

SCOOTER supplies best-practice guidelines, open tools, and a 3K-image benchmark with 34K+ human ratings showing that six tested unrestricted attacks produce images humans can detect as fake.

Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.

Generative AI-Based Monte Carlo Simulation for Method Evaluation Using Synthetic Multilevel Data

stat.ME · 2026-05-07 · unverdicted · novelty 6.0

A framework using generative AI to produce synthetic multilevel data for Monte Carlo simulations that evaluate the performance and parameter recovery of quantitative methods.

Quantifying the human visual exposome with vision language models

cs.AI · 2026-05-05 · unverdicted · novelty 6.0

Vision language models applied to daily-life photos quantify visual environmental features that correlate with momentary affect and chronic stress, establishing a paradigm for visual exposomics.

Can AI Debias the News? LLM Interventions Improve Cross-Partisan Receptivity but LLMs Overestimate Their Own Effectiveness

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

Substantive LLM reframing boosts cross-partisan receptivity to news headlines without backfire, but models overestimate effect sizes and lack fidelity in modeling human psychological responses.

A paradox of AI fluency

cs.CL · 2026-04-28 · unverdicted · novelty 6.0

Fluent AI users adopt an active, iterative collaboration mode that produces more visible failures but better recovery and success on hard tasks, whereas novices experience more invisible failures from passive use.

The Effect of Idea Elaboration on the Automatic Assessment of Idea Originality

cs.HC · 2026-04-22 · unverdicted · novelty 6.0

LLM originality raters exhibit self-preference bias toward artificial responses that disappears after controlling for idea elaboration in the Alternate Uses Task.

Dual Alignment Between Language Model Layers and Human Sentence Processing

cs.CL · 2026-04-20 · unverdicted · novelty 6.0

Later LLM layers align better with human cognitive effort in syntactic ambiguity than early layers do, indicating dual processing modes and complementary benefits from multi-layer probability updates.

Implicit Bias-Like Patterns in Reasoning Models

cs.CY · 2025-03-14 · unverdicted · novelty 6.0

Reasoning models expend more tokens on association-incompatible tasks than compatible ones, indicating greater effort on counter-stereotypical information, except for Claude 3.7 Sonnet which shows the reverse pattern linked to its bias-focused reasoning.

A systematic framework for generating novel experimental hypotheses from language models

cs.CL · 2024-08-09 · unverdicted · novelty 6.0

A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.

Bringing Age Back In: Accounting for Population Age Distribution in Forecasting Migration

stat.AP · 2024-02-21 · unverdicted · novelty 6.0

Introduces MASI to standardize net migration rates for age structure and applies a Bayesian hierarchical model to forecast adjusted total and age-sex specific migration rates through 2100, yielding narrower intervals and moderated decline projections.

A Scalable Parametric Item Calibration Engine (SPICE) for Explanatory IRT with Sparse Data

stat.ME · 2026-05-20 · unverdicted · novelty 5.0

SPICE is a scalable Bayesian MCMC engine for explanatory IRT calibration on sparsely linked persons and items in large assessment banks.

ProfileGLMM: a R Package Extending Bayesian Profile Regression using Generalised Linear Mixed Models

stat.ME · 2026-04-22 · unverdicted · novelty 5.0

ProfileGLMM is an R package extending Bayesian profile regression with GLMMs to support hierarchical data, random effects, and cluster-covariate interactions for continuous or binary outcomes.

What's in an accent? The impact of accented synthetic speech on lexical choice in human-machine dialogue

cs.HC · 2019-07-25 · unverdicted · novelty 5.0

Accented synthetic speech leads users to align their lexical choices with the perceived accent of the machine partner, mirroring human-human dialogue patterns.

Tracing the ongoing emergence of human-like reasoning in Large Language Models

cs.CL · 2026-05-20 · unverdicted · novelty 4.0

LLMs function as accurate semantic processors for conditionals but do not replicate the pragmatic inferences that define human reasoning.

Performance of low vision individuals when selecting a target with head-pointing in virtual reality

q-bio.NC · 2026-05-19 · unverdicted · novelty 4.0

Low vision individuals with central visual field loss can use head-pointing to select 2° targets in VR, reaching near-control performance with sufficiently large pointer activation zones.

Visual Accessibility in a Virtual Kitchen: Effects of Open Shelving on Performance, Cognitive Load, and Experience in Older Adults with and without MCI

cs.HC · 2026-04-25 · unverdicted · novelty 4.0

Open shelving in a virtual kitchen reduced task time and physical activity for older adults with and without MCI while increasing gaze entropy, with no change in subjective cognitive load or motivation.

The Impact of LLM Self-Consistency and Reasoning Effort on Automated Scoring Accuracy and Cost

cs.CY · 2026-04-03 · unverdicted · novelty 4.0

Strategic selection of LLMs and reasoning effort optimizes automated scoring accuracy and cost more effectively than self-consistency ensembling.

Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation

cs.AI · 2026-04-12

citing papers explorer

Showing 21 of 21 citing papers.

Human face perception reflects inverse-generative and naturalistic discriminative objectives q-bio.NC · 2026-05-12 · unverdicted · none · ref 89
Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.
Machine individuality: Separating genuine idiosyncrasy from response bias in large language models cs.AI · 2026-04-18 · unverdicted · none · ref 12
Crossed random-effects models on LLM word ratings show 16.9% variance from genuine stimulus-specific individuality, exceeding null models and forming coherent per-model fingerprints.
SCOOTER: A Human Evaluation Framework for Unrestricted Adversarial Examples cs.CV · 2025-07-10 · conditional · none · ref 3
SCOOTER supplies best-practice guidelines, open tools, and a 3K-image benchmark with 34K+ human ratings showing that six tested unrestricted attacks produce images humans can detect as fake.
Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations cs.RO · 2026-05-21 · unverdicted · none · ref 7
Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.
Generative AI-Based Monte Carlo Simulation for Method Evaluation Using Synthetic Multilevel Data stat.ME · 2026-05-07 · unverdicted · none · ref 46
A framework using generative AI to produce synthetic multilevel data for Monte Carlo simulations that evaluate the performance and parameter recovery of quantitative methods.
Quantifying the human visual exposome with vision language models cs.AI · 2026-05-05 · unverdicted · none · ref 33
Vision language models applied to daily-life photos quantify visual environmental features that correlate with momentary affect and chronic stress, establishing a paradigm for visual exposomics.
Can AI Debias the News? LLM Interventions Improve Cross-Partisan Receptivity but LLMs Overestimate Their Own Effectiveness cs.CL · 2026-05-01 · unverdicted · none · ref 3
Substantive LLM reframing boosts cross-partisan receptivity to news headlines without backfire, but models overestimate effect sizes and lack fidelity in modeling human psychological responses.
A paradox of AI fluency cs.CL · 2026-04-28 · unverdicted · none · ref 3
Fluent AI users adopt an active, iterative collaboration mode that produces more visible failures but better recovery and success on hard tasks, whereas novices experience more invisible failures from passive use.
The Effect of Idea Elaboration on the Automatic Assessment of Idea Originality cs.HC · 2026-04-22 · unverdicted · none · ref 11
LLM originality raters exhibit self-preference bias toward artificial responses that disappears after controlling for idea elaboration in the Alternate Uses Task.
Dual Alignment Between Language Model Layers and Human Sentence Processing cs.CL · 2026-04-20 · unverdicted · none · ref 70
Later LLM layers align better with human cognitive effort in syntactic ambiguity than early layers do, indicating dual processing modes and complementary benefits from multi-layer probability updates.
Implicit Bias-Like Patterns in Reasoning Models cs.CY · 2025-03-14 · unverdicted · none · ref 23
Reasoning models expend more tokens on association-incompatible tasks than compatible ones, indicating greater effort on counter-stereotypical information, except for Claude 3.7 Sonnet which shows the reverse pattern linked to its bias-focused reasoning.
A systematic framework for generating novel experimental hypotheses from language models cs.CL · 2024-08-09 · unverdicted · none · ref 16
A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.
Bringing Age Back In: Accounting for Population Age Distribution in Forecasting Migration stat.AP · 2024-02-21 · unverdicted · none · ref 6
Introduces MASI to standardize net migration rates for age structure and applies a Bayesian hierarchical model to forecast adjusted total and age-sex specific migration rates through 2100, yielding narrower intervals and moderated decline projections.
A Scalable Parametric Item Calibration Engine (SPICE) for Explanatory IRT with Sparse Data stat.ME · 2026-05-20 · unverdicted · none · ref 98
SPICE is a scalable Bayesian MCMC engine for explanatory IRT calibration on sparsely linked persons and items in large assessment banks.
ProfileGLMM: a R Package Extending Bayesian Profile Regression using Generalised Linear Mixed Models stat.ME · 2026-04-22 · unverdicted · none · ref 52
ProfileGLMM is an R package extending Bayesian profile regression with GLMMs to support hierarchical data, random effects, and cluster-covariate interactions for continuous or binary outcomes.
What's in an accent? The impact of accented synthetic speech on lexical choice in human-machine dialogue cs.HC · 2019-07-25 · unverdicted · none · ref 3
Accented synthetic speech leads users to align their lexical choices with the perceived accent of the machine partner, mirroring human-human dialogue patterns.
Tracing the ongoing emergence of human-like reasoning in Large Language Models cs.CL · 2026-05-20 · unverdicted · none · ref 90
LLMs function as accurate semantic processors for conditionals but do not replicate the pragmatic inferences that define human reasoning.
Performance of low vision individuals when selecting a target with head-pointing in virtual reality q-bio.NC · 2026-05-19 · unverdicted · none · ref 74
Low vision individuals with central visual field loss can use head-pointing to select 2° targets in VR, reaching near-control performance with sufficiently large pointer activation zones.
Visual Accessibility in a Virtual Kitchen: Effects of Open Shelving on Performance, Cognitive Load, and Experience in Older Adults with and without MCI cs.HC · 2026-04-25 · unverdicted · none · ref 1
Open shelving in a virtual kitchen reduced task time and physical activity for older adults with and without MCI while increasing gaze entropy, with no change in subjective cognitive load or motivation.
The Impact of LLM Self-Consistency and Reasoning Effort on Automated Scoring Accuracy and Cost cs.CY · 2026-04-03 · unverdicted · none · ref 13
Strategic selection of LLMs and reasoning effort optimizes automated scoring accuracy and cost more effectively than self-consistency ensembling.
Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation cs.AI · 2026-04-12 · unreviewed · ref 4

1_Reasoning

hub tools

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer