MEDEC: A benchmark for medical error detection and correction in clinical notes

Asma Ben Abacha, Wen-wai Yim, Yujuan Fu, Zhaoyi Sun, Meliha Yetisgen, Fei Xia, Thomas Lin · 2025 · arXiv 2412.19260

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

dataset 1

citation-polarity summary

use dataset 1

representative citing papers

Understanding the Mechanism of Altruism in Large Language Models

econ.GN · 2026-04-21 · unverdicted · novelty 6.0

A small set of sparse autoencoder features in LLMs drives shifts between generous and selfish allocations in dictator games, with causal patching and steering confirming their role and generalization to other social games.

Scaling Laws for Moral Machine Judgment in Large Language Models

cs.CY · 2026-01-25 · conditional · novelty 5.0

Moral alignment in LLMs improves with model size according to the power law D ∝ S^{-0.10} (R²=0.50).

Search-Based Multi-Trajectory Refinement for Safe C-to-Rust Translation with Large Language Models

cs.PL · 2025-05-21 · unverdicted · novelty 5.0

LAC2R uses MCTS to systematically explore multiple LLM refinement trajectories for C-to-Rust translation and reports superior safety and correctness on small-scale benchmarks.

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

cs.AI · 2025-03-12 · unverdicted · novelty 5.0

The paper unifies perspectives on Long CoT in reasoning LLMs by introducing a taxonomy, detailing characteristics of deep reasoning and reflection, and discussing emergence phenomena and future directions.

citing papers explorer

Showing 4 of 4 citing papers.

Understanding the Mechanism of Altruism in Large Language Models

econ.GN · 2026-04-21 · unverdicted · none · ref 207

Scaling Laws for Moral Machine Judgment in Large Language Models

cs.CY · 2026-01-25 · conditional · none · ref 16

Search-Based Multi-Trajectory Refinement for Safe C-to-Rust Translation with Large Language Models

cs.PL · 2025-05-21 · unverdicted · none · ref 1

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

cs.AI · 2025-03-12 · unverdicted · none · ref 1
