pith. sign in

Title resolution pending

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

method 2 background 1

citation-polarity summary

years

2026 4

representative citing papers

Variance-aware Reward Modeling with Anchor Guidance

stat.ML · 2026-05-12 · unverdicted · novelty 7.0

Anchor-guided variance-aware reward modeling uses two response-level anchors to resolve non-identifiability in Gaussian models of pluralistic preferences, yielding provable identification, a joint training objective, and improved RLHF performance.

Semi-supervised Method for Risk Prediction with Doubly Censored EHR Data

stat.ME · 2026-05-08 · unverdicted · novelty 7.0

Proposes a novel semi-supervised estimator for risk prediction under double censoring that combines limited gold-standard labels with large-scale surrogates, proves theoretical validity, and shows efficiency gains over supervised methods in simulations and a T2D EHR application.

citing papers explorer

Showing 4 of 4 citing papers.

  • Variance-aware Reward Modeling with Anchor Guidance stat.ML · 2026-05-12 · unverdicted · none · ref 7

    Anchor-guided variance-aware reward modeling uses two response-level anchors to resolve non-identifiability in Gaussian models of pluralistic preferences, yielding provable identification, a joint training objective, and improved RLHF performance.

  • Semi-supervised Method for Risk Prediction with Doubly Censored EHR Data stat.ME · 2026-05-08 · unverdicted · none · ref 1

    Proposes a novel semi-supervised estimator for risk prediction under double censoring that combines limited gold-standard labels with large-scale surrogates, proves theoretical validity, and shows efficiency gains over supervised methods in simulations and a T2D EHR application.

  • Structure Learning for Directed Trees with Zero-Inflated Compositional Nodes stat.ME · 2026-05-04 · unverdicted · none · ref 50

    A new directed tree structure learning framework for zero-inflated compositional nodes uses KL divergence scoring and column-stochastic transition matrices for conditional expectations, with proven consistency and finite-sample guarantees.

  • SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation cs.LG · 2026-05-21 · conditional · none · ref 31

    SeqLoRA applies bilevel optimization to sequential LoRA adaptation for continual multi-concept text-to-image generation with theoretical bounds on forgetting and interference.