Mixed citations

Title resolution pending

· 2019 · arXiv 2500.333070

Mixed citation behavior. Most common role is method (57%).

49 Pith papers citing it

Method 57% of classified citations

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

method 4 background 2 baseline 1

citation-polarity summary

use method 4 background 2 baseline 1

representative citing papers

Surrogate-Gated Generation and Foundation-Model Embeddings for Bayesian Materials Design

cond-mat.mtrl-sci · 2026-06-26 · unverdicted · novelty 7.0

A Gaussian process surrogate gate inserted between generative crystal models and property oracles matches or exceeds ungated fine-tuning while using roughly one-fifth the oracle calls for heat capacity and bulk modulus.

Forecasting Conceptual Diffusion in Science: The Case of Quantum Computing

cs.SI · 2026-06-02 · unverdicted · novelty 7.0

LightGBM models on citation and diversity features predict exogenous diffusion of quantum computing concepts with R² up to 0.78 while endogenous reinforcement remains largely unpredictable after growth controls, with replications in other fields.

EURO-5K: When Does Domain Pretraining Matter? Benchmarking Transformers for EU Reporting Obligation Extraction

cs.CL · 2026-06-02 · unverdicted · novelty 7.0

Introduces EURO-5K dataset from 136 EU acts and benchmarks full fine-tuning vs QLoRA for BERT and LLM models on reporting obligation extraction, reporting 0.89 F1 with limited gains from legal pretraining except under parameter-efficient adaptation.

Learn from your own latents and not from tokens: A sample-complexity theory

cs.LG · 2026-05-26 · unverdicted · novelty 7.0

Latent prediction SSL recovers latent trees from PCFG data with sample complexity constant in hierarchy depth L (up to logs), unlike exponential for token-level or supervised methods.

NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework

cs.NE · 2026-05-14 · unverdicted · novelty 7.0

A taxonomy of SNN training algorithms is presented with the release of NeuroTrain, an open benchmarking framework for reproducible comparisons across datasets and architectures.

Beyond ECE: Calibrated Size Ratio, Risk Assessment, and Confidence-Weighted Metrics

cs.LG · 2026-05-03 · unverdicted · novelty 7.0

Introduces Calibrated Size Ratio (CSR) and confidence-weighted metrics to better detect overconfidence risk and calibration issues beyond the limitations of ECE.

Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks

cs.LG · 2026-05-02 · unverdicted · novelty 7.0

EstGraph benchmark evaluates LLMs on estimating properties of very large graphs from random-walk samples that fit in context limits.

Your Loss is My Gain: Low Stake Attacks on Liquid Staking Pools

cs.GT · 2026-05-01 · unverdicted · novelty 7.0

A low-stake adversary can degrade a liquid staking pool's performance via consensus manipulation and profit from the resulting drop in its LST value through application-layer financial positions.

Change is Hard: Consistent Player Behavior Across Games with Conflicting Incentives

cs.HC · 2026-03-17 · unverdicted · novelty 7.0

Players exhibit consistent flexibility or specialization behavior across two games with conflicting performance incentives, indicating individual agency dominates structural differences.

Slay the Shear: A Unified Statistical Framework for Weak Gravitational Lensing Shear Estimation

astro-ph.CO · 2026-06-24 · unverdicted · novelty 6.0

Unified framework proves the score function yields the minimum-variance unbiased shear estimator and that response-weighted inverse-variance weights minimize shape noise independent of galaxy shape distributions, with RDSM reducing noise by ~17.5% at LSST depth.

When Is an LLM Worth It for Hyperparameter Optimization? A Budget-Matched Study on Tabular Data Finds the Warm-Start Is a Default Configuration, Not the Model

cs.LG · 2026-06-19 · unverdicted · novelty 6.0 · 2 refs

On eight PMLB tabular benchmarks, an LLM HPO advisor adds only +0.40 pp CV accuracy beyond a fixed default seed and is overtaken by seeded classical methods within 5-12 evaluations, with no held-out test gain.

Rigorous uncertainty quantification of probabilistic AI weather forecasts with conformal prediction

physics.ao-ph · 2026-06-17 · unverdicted · novelty 6.0

Online conformal prediction post-processing guarantees calibrated uncertainty coverage for GenCast, NeuralGCM, and AIFS-ENS forecasts of temperature and precipitation including extremes.

P$^2$CE: Model-Agnostic Plausible Pareto-Optimal Counterfactual Explanations

cs.LG · 2026-06-16 · unverdicted · novelty 6.0

P²CE is a model-agnostic algorithm for plausible Pareto-optimal counterfactual explanations that uses isolation forest for plausibility and SHAP for efficiency, claiming better quality and speed on three datasets.

MSC-CMA-ES: Structure-Aware Restarts for CMA-ES via Cyclic Nearest-Better Basin Discovery

cs.NE · 2026-06-14 · unverdicted · novelty 6.0

MSC-CMA-ES makes CMA-ES restarts structure-aware via cyclic nearest-better basin discovery on Sobol pre-samples, achieving 2.7x higher target coverage than BIPOP-CMA-ES on composition functions across CEC suites.

Explainable Forecasting of Scientific Breakthroughs from Concept Network Dynamics

cs.SI · 2026-06-02 · unverdicted · novelty 6.0

A two-stage LightGBM model on 59 features from concept networks forecasts link formation and intensity with ROC-AUC 0.95-0.967 across domains.

How Many Trees in a Random Forest? A Revisited Approach with Plateau Search and Optuna Integration

cs.LG · 2026-06-02 · conditional · novelty 6.0

A triplet-based plateau search algorithm is proposed to adaptively determine a near-minimal number of trees for random forests by monitoring relative OOB score changes across forest size triplets, removing n_trees from the TPE search space.

Machine-learning-accelerated discovery of synthesizable high-temperature altermagnets with giant spin splitting

cond-mat.mtrl-sci · 2026-05-27 · unverdicted · novelty 6.0

ML-accelerated screening of 8640 AB2C2D variants yields 34 low-hull-energy altermagnets with spin splittings exceeding 1.5 eV, including RbMn2Te2O with 1.88 eV splitting and ~390 K Neel temperature.

Towards Discovery of Polymers for Insulin Delivery via Physics-Grounded Agentic Workflows

q-bio.QM · 2026-05-12 · unverdicted · novelty 6.0

An LLM-orchestrated physics simulation search identifies polymers with strong insulin interactions, outperforming standard optimization methods by significant margins.

AutoLLMResearch: Training Research Agents for Automating LLM Experiment Configuration - Learning from Cheap, Optimizing Expensive

cs.AI · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

AutoLLMResearch trains agents in a multi-fidelity LLMConfig-Gym environment formulated as a long-horizon MDP to enable cross-fidelity extrapolation for automating high-cost LLM experiment configurations.

Coverage is not enough: Frequentist tests of simulation-based inference for primordial non-Gaussianity

astro-ph.CO · 2026-05-01 · unverdicted · novelty 6.0

Coverage tests for simulation-based inference of f_NL can pass while the posteriors are underconfident in the tails and sometimes yield weaker constraints than using power spectrum or bispectrum alone.

RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

RL-STPA adapts STPA for RL via hierarchical subtask decomposition, coverage-guided perturbation testing, and iterative checkpoints that feed hazards back into training, demonstrated on autonomous drone navigation to reveal loss scenarios missed by standard evaluations.

CivBench: Progress-Based Evaluation for LLMs' Strategic Decision-Making in Civilization V

cs.AI · 2026-04-09 · unverdicted · novelty 6.0

CivBench trains models on turn-level states in Civilization V to predict victory probabilities, providing a progress-based evaluation of LLM strategic capabilities across 307 games with 7 models.

Multi-Label Phase Diagram Prediction in Complex Alloys via Physics-Informed Graph Attention Networks

cs.LG · 2026-04-09 · conditional · novelty 6.0

Physics-informed graph attention networks predict multi-phase equilibria in Ag-Bi-Cu-Sn alloys with 96% exact-set accuracy on in-domain data and strong generalization to unseen sections.

Unsupervised domain adaptation for radioisotope identification in gamma spectroscopy

cs.LG · 2026-03-05 · conditional · novelty 6.0

Unsupervised domain adaptation via feature alignment raises radioisotope identification accuracy on real LaBr3 gamma spectra from 0.754 to 0.904 for models trained only on synthetic data.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AutoLLMResearch: Training Research Agents for Automating LLM Experiment Configuration - Learning from Cheap, Optimizing Expensive cs.AI · 2026-05-12 · unverdicted · none · ref 20 · 2 links
AutoLLMResearch trains agents in a multi-fidelity LLMConfig-Gym environment formulated as a long-horizon MDP to enable cross-fidelity extrapolation for automating high-cost LLM experiment configurations.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer