An agentic AI workflow evolves an adaptive XGBoost quantile regression ensemble that reduces watershed-averaged forecast error by up to 29% versus California's operational forecasts for April-July runoff at 1-6 month leads across 23 Sierra Nevada sites.
hub Mixed citations
Lundberg, Gabriel Erion, Hugh Chen, Alex DeGrave, Jordan M
Mixed citation behavior. Most common role is method (60%).
hub tools
citation-role summary
citation-polarity summary
representative citing papers
StoryScope extracts narrative features showing AI stories favor tidy plots and over-explain themes while human stories show more moral ambiguity and temporal complexity, enabling strong detection and attribution.
ProxySHAP approximates higher-order Shapley and Banzhaf interactions via tree proxies plus residual correction and a polynomial-time interventional TreeSHAP generalization for tree ensembles.
RCProb uses Dirichlet-smoothed class priors and Beta-smoothed condition likelihoods in a Naive Bayes formulation to extract rules from tree ensembles approximately 22 times faster than RuleCOSI+ while maintaining competitive accuracy and producing more compact rule sets on 33 benchmark datasets.
MONET represents tasks as graph nodes and uses neighbor-based crossover plus per-task mutation to transfer knowledge, matching or exceeding MAP-Elites performance on four large-scale simulation domains.
Physics-guided data augmentation combined with neural networks enables accurate indentation size effect correction in steels from small sets of shallow nanoindentation measurements, outperforming Nix-Gao in the shallow regime.
T-SHAP stabilizes SHAP attributions temporally for LSTM fall detection, achieving 94.3% accuracy and improved faithfulness on NTU RGB+D dataset.
Quantum-chemical bonding descriptors improve machine learning predictions of materials properties and enable symbolic regression to recover intuitive expressions for force constants and thermal conductivity.
Nutritional features improve XGBoost prediction of comment engagement on Reddit food posts by nearly 5%, with higher calorie density linked to greater engagement.
Hybrid predictive modeling of crash data identifies non-use of restraints as the primary risk factor for severe injury in collisions involving trees.
PCA and k-means on NHANES data identified four reproductive phenotypes in U.S. women aged 20-44, with one fragile subgroup showing 77.5% early multimorbidity prevalence; XGBoost improved discrimination over logistic regression but had worse calibration.
Multimodal fusion of histopathology patches and EHR data reaches macro AUC 0.997 for breast cancer classification, beating unimodal baselines especially on the imbalanced mitosis class.
citing papers explorer
-
Probabilistic Seasonal Streamflow Forecasting Across California's Sierra Nevada Watersheds with Agentic AI
An agentic AI workflow evolves an adaptive XGBoost quantile regression ensemble that reduces watershed-averaged forecast error by up to 29% versus California's operational forecasts for April-July runoff at 1-6 month leads across 23 Sierra Nevada sites.
-
StoryScope: Investigating idiosyncrasies in AI fiction
StoryScope extracts narrative features showing AI stories favor tidy plots and over-explain themes while human stories show more moral ambiguity and temporal complexity, enabling strong detection and attribution.
-
Proxy-Based Approximation of Shapley and Banzhaf Interactions
ProxySHAP approximates higher-order Shapley and Banzhaf interactions via tree proxies plus residual correction and a polynomial-time interventional TreeSHAP generalization for tree ensembles.
-
RCProb: Probabilistic Rule Extraction for Efficient Simplification of Tree Ensembles
RCProb uses Dirichlet-smoothed class priors and Beta-smoothed condition likelihoods in a Naive Bayes formulation to extract rules from tree ensembles approximately 22 times faster than RuleCOSI+ while maintaining competitive accuracy and producing more compact rule sets on 33 benchmark datasets.
-
Multi-Task Optimization over Networks of Tasks
MONET represents tasks as graph nodes and uses neighbor-based crossover plus per-task mutation to transfer knowledge, matching or exceeding MAP-Elites performance on four large-scale simulation domains.
-
Data-Efficient Indentation Size Effect Correction in Steels Using Machine Learning and Physics-Guided Augmentation
Physics-guided data augmentation combined with neural networks enables accurate indentation size effect correction in steels from small sets of shallow nanoindentation measurements, outperforming Nix-Gao in the shallow regime.
-
Explainable Fall Detection for Elderly Monitoring via Temporally Stable SHAP in Skeleton-Based Human Activity Recognition
T-SHAP stabilizes SHAP attributions temporally for LSTM fall detection, achieving 94.3% accuracy and improved faithfulness on NTU RGB+D dataset.
-
A critical assessment of bonding descriptors for predicting materials properties
Quantum-chemical bonding descriptors improve machine learning predictions of materials properties and enable symbolic regression to recover intuitive expressions for force constants and thermal conductivity.
-
Reddit's Appetite: Predicting User Engagement with Nutritional Content
Nutritional features improve XGBoost prediction of comment engagement on Reddit food posts by nearly 5%, with higher calorie density linked to greater engagement.
-
From Canopy to Collision: A Hybrid Predictive Framework for Identifying Risk Factors in Tree-Involved Traffic Crashes
Hybrid predictive modeling of crash data identifies non-use of restraints as the primary risk factor for severe injury in collisions involving trees.
-
AI-Derived Reproductive Phenotypes and Explainable ML for Concurrent Early Multimorbidity in U.S. Women: NHANES 2017-March 2020
PCA and k-means on NHANES data identified four reproductive phenotypes in U.S. women aged 20-44, with one fragile subgroup showing 77.5% early multimorbidity prevalence; XGBoost improved discrimination over logistic regression but had worse calibration.
-
Multimodal Fusion of Histopathology Images and Electronic Health Records for Early Breast Cancer Diagnosis
Multimodal fusion of histopathology patches and EHR data reaches macro AUC 0.997 for breast cancer classification, beating unimodal baselines especially on the imbalanced mitosis class.
- An Empirical Study of Machine Learning Robustness and Scalability for Imbalanced Tabular Clinical Data in Emergency and Critical Care