Determinants of llm-assisted decision-making

Eva Eigner · 2024 · arXiv 2402.17385

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

OS-SPEAR: A Toolkit for the Safety, Performance,Efficiency, and Robustness Analysis of OS Agents

cs.CL · 2026-04-27 · unverdicted · novelty 7.0

OS-SPEAR is a new evaluation toolkit that tests 22 OS agents and identifies trade-offs between efficiency and safety or robustness.

Geometry-Calibrated Conformal Abstention for Language Models

cs.CL · 2026-04-30 · unverdicted · novelty 6.0

Geometry-calibrated conformal abstention lets language models abstain from uncertain queries with finite-sample guarantees on both participation rate and conditional correctness of answers.

Efficient Federated Search for Retrieval-Augmented Generation using Lightweight Routing

cs.LG · 2025-02-26 · unverdicted · novelty 6.0

RAGRoute introduces a neural router for federated RAG that dynamically selects relevant sources, reducing communication by up to 80.65% and latency by 52.50% while preserving accuracy on three benchmarks.

LLMs are the Ideal Candidate for Mixed-Initiative Game Design Pillar Workflows

cs.HC · 2026-05-10 · unverdicted · novelty 5.0

Large language models can meaningfully support the creation and decision-making processes for game design pillars in mixed-initiative workflows, as shown by a prototype tested in a game jam and with expert interviews.

User Detection and Response Patterns of Sycophantic Behavior in Conversational AI

cs.HC · 2026-01-15 · unverdicted · novelty 5.0

Reddit analysis shows users detect AI sycophancy through comparisons and consistency checks, apply mitigation prompts, and sometimes seek affirmative responses for support, indicating context-aware design is better than total elimination.

A closer look at how large language models trust humans: patterns and biases

cs.CL · 2025-04-22 · unverdicted · novelty 5.0

Across 43,200 simulations with five LLMs and five scenarios, model trust in humans aligns with human-like patterns driven by trustworthiness dimensions and is sometimes biased by age, gender, and religion.

Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning

cs.CL · 2024-12-03 · unverdicted · novelty 5.0

Uncertainty-aware fine-tuning with a decision-theory-based loss produces better-calibrated uncertainty estimates than standard training on free-form QA tasks.

citing papers explorer

Showing 7 of 7 citing papers.

OS-SPEAR: A Toolkit for the Safety, Performance,Efficiency, and Robustness Analysis of OS Agents cs.CL · 2026-04-27 · unverdicted · none · ref 7
OS-SPEAR is a new evaluation toolkit that tests 22 OS agents and identifies trade-offs between efficiency and safety or robustness.
Geometry-Calibrated Conformal Abstention for Language Models cs.CL · 2026-04-30 · unverdicted · none · ref 4
Geometry-calibrated conformal abstention lets language models abstain from uncertain queries with finite-sample guarantees on both participation rate and conditional correctness of answers.
Efficient Federated Search for Retrieval-Augmented Generation using Lightweight Routing cs.LG · 2025-02-26 · unverdicted · none · ref 10
RAGRoute introduces a neural router for federated RAG that dynamically selects relevant sources, reducing communication by up to 80.65% and latency by 52.50% while preserving accuracy on three benchmarks.
LLMs are the Ideal Candidate for Mixed-Initiative Game Design Pillar Workflows cs.HC · 2026-05-10 · unverdicted · none · ref 13
Large language models can meaningfully support the creation and decision-making processes for game design pillars in mixed-initiative workflows, as shown by a prototype tested in a game jam and with expert interviews.
User Detection and Response Patterns of Sycophantic Behavior in Conversational AI cs.HC · 2026-01-15 · unverdicted · none · ref 10
Reddit analysis shows users detect AI sycophancy through comparisons and consistency checks, apply mitigation prompts, and sometimes seek affirmative responses for support, indicating context-aware design is better than total elimination.
A closer look at how large language models trust humans: patterns and biases cs.CL · 2025-04-22 · unverdicted · none · ref 2
Across 43,200 simulations with five LLMs and five scenarios, model trust in humans aligns with human-like patterns driven by trustworthiness dimensions and is sometimes biased by age, gender, and religion.
Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning cs.CL · 2024-12-03 · unverdicted · none · ref 11
Uncertainty-aware fine-tuning with a decision-theory-based loss produces better-calibrated uncertainty estimates than standard training on free-form QA tasks.

Determinants of llm-assisted decision-making

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer