A3M integrates adaptive DRL, adversarial opponent modeling, and multi-objective rewards to cut regret 30-40% versus baselines while remaining robust to strategy shifts in repeated auctions.
arXiv preprint arXiv:2601.20679 (2026)
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CL 4years
2026 4verdicts
UNVERDICTED 4roles
background 1polarities
background 1representative citing papers
EGAD adaptively distills LLM knowledge at the token level by using entropy to create a curriculum from low- to high-entropy tokens, adjust temperature, and switch between logits-only and feature-based branches.
EVLA combines a Unified Co-State Encoder and Electro-aware Structured Reasoning Chain with physics-guided training to produce energy-optimal driving decisions, reporting +5.6% accuracy gains over fine-tuned VLM baselines on a driving QA benchmark.
FinInvest-GTCN combines graph, temporal, and causal networks with meta-causal adaptation to improve risk-adjusted predictions for VC investments, achieving RA-MSE of 2.51 and 18.7% higher simulated returns on proprietary data.
citing papers explorer
-
A3M: Adaptive, Adversarial and Multi-Objective Learning for Strategic Bidding in Repeated Auctions
A3M integrates adaptive DRL, adversarial opponent modeling, and multi-objective rewards to cut regret 30-40% versus baselines while remaining robust to strategy shifts in repeated auctions.
-
EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
EGAD adaptively distills LLM knowledge at the token level by using entropy to create a curriculum from low- to high-entropy tokens, adjust temperature, and switch between logits-only and feature-based branches.
-
EVLA: An Electro-Aware Multimodal Assistant for Physically-Grounded Driving Reasoning and Control
EVLA combines a Unified Co-State Encoder and Electro-aware Structured Reasoning Chain with physics-guided training to produce energy-optimal driving decisions, reporting +5.6% accuracy gains over fine-tuned VLM baselines on a driving QA benchmark.
-
FinInvest-GTCN: Explainable Graph-Temporal-Causal Modeling for Risk-Aware Investment Decision Optimization
FinInvest-GTCN combines graph, temporal, and causal networks with meta-causal adaptation to improve risk-adjusted predictions for VC investments, achieving RA-MSE of 2.51 and 18.7% higher simulated returns on proprietary data.