LLMs exhibit epistemically vacuous confidence on clinical tabular data, but cross-model attribution divergence with XGBoost enables a calibrator that reduces expected calibration error from 0.254 to 0.080.
Measuring what llms think they do: Shap faithfulness and deployability on financial tabular classification
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A multi-agent LLM equity system produces statistically significant outperformance on S&P 500 stocks, with strong-buy portfolios returning +2.18% monthly versus +1.15% for the equal-weight benchmark over 19 months.
citing papers explorer
-
Signal or Noise in Multi-Agent LLM-based Stock Recommendations?
A multi-agent LLM equity system produces statistically significant outperformance on S&P 500 stocks, with strong-buy portfolios returning +2.18% monthly versus +1.15% for the equal-weight benchmark over 19 months.