EEG foundation models show no single winner across failure modes, attend to correct brain regions but decode corrupted signals, and retain task information in early layers while late layers adapt during fine-tuning.
Title resolution pending
17 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
representative citing papers
Christoffel-DPS is a distribution-free optimal sensor placement framework for diffusion posterior sampling that provides non-asymptotic recovery bounds and outperforms Gaussian baselines on non-Gaussian benchmarks.
AsmRAG detects malware at 96% F1 and attributes families at 95% F1 by retrieving functionally similar assembly code via LLM embeddings and density-weighted anchor selection, remaining robust to metamorphic obfuscation.
XAI explanations should be narratives with continuous structure, cause-effect, fluency and diversity, and new metrics are needed to evaluate this better than standard NLP scores.
UFPR-VeSV is a new real-world dataset for fine-grained vehicle classification and automatic license plate recognition collected from Brazilian police cameras, with benchmarks demonstrating its difficulty and the value of joint task use.
Introduces a unified evaluation framework for XAI using five principled metrics and the PGCA method that fuses grid perturbation with Grad-CAM++ , reporting top scores in fidelity, interpretability and fairness on ResNet-50 models across five image domains.
Empirical comparison shows gradient-based explanations for GNN node similarities are actionable, consistent, and retain effects when sparsified, unlike mutual information explanations.
A CNN classifies lung cytology patches as benign or malignant at 100% sensitivity and 96.4% specificity, then routes to one of two Transformer decoders to generate findings text achieving BLEU-4 of 0.828 on 801 images.
Deep learning model on large uniform healthy MRI dataset achieves brain age MAE of 4.06 years (hold-out) and 4.21 years (independent set), with frontal lobe prominence and age gaps linked to neuropsychological scores.
Compares LIME, input perturbation and attention for explaining QA on KB+text; proposes automatic evaluation paradigm and finds input perturbation superior in both automatic and human studies.
Grad-ECLIP is an equivalent but flawed variant of attention-based interpretation, with two principles proposed to ensure model explanations reflect the original model.
A three-stage framework combines dual-head CNNs, saliency attribution, neuroanatomical atlas mapping, and LLMs to generate interpretable reports for brain tumor classification on MRI images.
STR-Net achieves AUROC of 0.933 for binary bone-loss screening and 0.801 correlation for T-score estimation from knee X-rays on a held-out test set.
An integrated fringe projection and AI pipeline delivers aligned high-accuracy 3D sensing and instance segmentation for autonomous HDD disassembly at 77.7 FPS.
A time-aware ResNet-based model on PET/CT images improves overall survival prediction in NSCLC by incorporating temporal data, achieving 4.3% higher AUC than fixed-time baselines.
DB-FGA-Net fuses VGG16 and Xception backbones with a new Frequency-Gated Attention module to reach 99.24% accuracy on 4-class brain tumor classification without augmentation and generalizes to 95.77% on an independent dataset.
citing papers explorer
-
Beyond Accuracy: Robustness, Interpretability and Expressiveness of EEG Foundation Models
EEG foundation models show no single winner across failure modes, attend to correct brain regions but decode corrupted signals, and retain task information in early layers while late layers adapt during fine-tuning.
-
Christoffel-DPS: Optimal sensor placement in diffusion posterior sampling for arbitrary distributions
Christoffel-DPS is a distribution-free optimal sensor placement framework for diffusion posterior sampling that provides non-asymptotic recovery bounds and outperforms Gaussian baselines on non-Gaussian benchmarks.
-
AsmRAG: LLM-Driven Malware Detection by Retrieving Functionally Similar Assembly Code
AsmRAG detects malware at 96% F1 and attributes families at 95% F1 by retrieving functionally similar assembly code via LLM embeddings and density-weighted anchor selection, remaining robust to metamorphic obfuscation.
-
On the Importance and Evaluation of Narrativity in Natural Language AI Explanations
XAI explanations should be narratives with continuous structure, cause-effect, fluency and diversity, and new metrics are needed to evaluate this better than standard NLP scores.
-
Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition
UFPR-VeSV is a new real-world dataset for fine-grained vehicle classification and automatic license plate recognition collected from Brazilian police cameras, with benchmarks demonstrating its difficulty and the value of joint task use.
-
A Unified Framework for Evaluating and Enhancing the Transparency of Explainable AI Methods via Perturbation-Gradient Consensus Attribution
Introduces a unified evaluation framework for XAI using five principled metrics and the PGCA method that fuses grid perturbation with Grad-CAM++ , reporting top scores in fidelity, interpretability and fairness on ResNet-50 models across five image domains.
-
Explaining Graph Neural Networks for Node Similarity on Graphs
Empirical comparison shows gradient-based explanations for GNN node similarities are actionable, consistent, and retain effects when sparsified, unlike mutual information explanations.
-
Automated Description Generation of Cytologic Findings for Lung Cytological Images Using a Pretrained Vision Model and Dual Text Decoders: Preliminary Study
A CNN classifies lung cytology patches as benign or malignant at 100% sensitivity and 96.4% specificity, then routes to one of two Transformer decoders to generate findings text achieving BLEU-4 of 0.828 on 801 images.
-
Estimating brain age based on a healthy population with deep learning and structural MRI
Deep learning model on large uniform healthy MRI dataset achieves brain age MAE of 4.06 years (hold-out) and 4.21 years (independent set), with frontal lobe prominence and age gaps linked to neuropsychological scores.
-
Interpretable Question Answering on Knowledge Bases and Text
Compares LIME, input perturbation and attention for explaining QA on KB+text; proposes automatic evaluation paradigm and finds input perturbation superior in both automatic and human studies.
-
Debunking Grad-ECLIP: A Comprehensive Study on Its Incorrectness and Fundamental Principles for Model Interpretation
Grad-ECLIP is an equivalent but flawed variant of attention-based interpretation, with two principles proposed to ensure model explanations reflect the original model.
-
Bridging visual saliency and large language models for explainable deep learning in medical imaging
A three-stage framework combines dual-head CNNs, saliency attribution, neuroanatomical atlas mapping, and LLMs to generate interpretable reports for brain tumor classification on MRI images.
-
Opportunistic Bone-Loss Screening from Routine Knee Radiographs Using a Multi-Task Deep Learning Framework with Sensitivity-Constrained Threshold Optimization
STR-Net achieves AUROC of 0.933 for binary bone-loss screening and 0.801 correlation for T-score estimation from knee X-rays on a held-out test set.
-
Fringe Projection Based Vision Pipeline for Autonomous Hard Drive Disassembly
An integrated fringe projection and AI pipeline delivers aligned high-accuracy 3D sensing and instance segmentation for autonomous HDD disassembly at 77.7 FPS.
-
Time-driven Survival Analysis from FDG-PET/CT in Non-Small Cell Lung Cancer
A time-aware ResNet-based model on PET/CT images improves overall survival prediction in NSCLC by incorporating temporal data, achieving 4.3% higher AUC than fixed-time baselines.
-
DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability
DB-FGA-Net fuses VGG16 and Xception backbones with a new Frequency-Gated Attention module to reach 99.24% accuracy on 4-class brain tumor classification without augmentation and generalizes to 95.77% on an independent dataset.
- The Neglected Baseline in Model Interpretation