The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions

Philipp Tschandl, Cliff Rosendahl, Harald Kittler · 2018 · Scientific Data · DOI 10.1038/sdata.2018.161

15 Pith papers cite this work, alongside 3,113 external citations. Polarity classification is still indexing.

3,113 external citations · Crossref

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations

cs.AI · 2026-06-26 · unverdicted · novelty 7.0

IMCBench is a new benchmark for image-grounded multi-turn medical conversations that evaluates eight multimodal LLMs on safety, accuracy, and uncertainty, finding Claude Opus highest overall but safety drops for malignant and rare conditions.

DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

DermAgent orchestrates seven vision-language tools in a Plan-Execute-Reflect loop with dual-modality retrieval from 413k cases and a critic module to outperform GPT-4o by 17.6% in zero-shot dermatological diagnosis accuracy.

MedCore: Boundary-Preserving Medical Core Pruning for MedSAM

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

MedCore achieves 60% parameter and 58.4% FLOP reduction on MedSAM with Dice 0.9549 and preserved boundary metrics via dual-intervention pruning and a new boundary leverage principle.

Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset

cs.LG · 2026-04-21 · conditional · novelty 7.0

Rough-set analysis finds 16.4% of 305 concept profiles in Derm7pt inconsistent (306 images), capping hard CBM accuracy at 92.1%; symmetric filtering produces a 705-image consistent benchmark where EfficientNet-B5 reaches 0.90 label accuracy.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts

cs.LG · 2026-04-28 · unverdicted · novelty 6.0

The authors introduce predicted-weighted balanced accuracy (pBA), a utility-weighted evaluation metric that uses predicted subconcept posteriors to reduce bias from within-class heterogeneity in imbalanced data.

Cross-Domain Few-Shot Segmentation via Ordinary Differential Equations over Time Intervals

cs.CV · 2025-09-01 · unverdicted · novelty 6.0

FSS-TIs models cross-domain few-shot segmentation as an ODE process with Fourier-based spectral perturbations to create domain-agnostic features and enable effective fine-tuning on limited support samples.

MARVEL: Margin-Aware Robust von Mises-Fischer Expert Learning for Long-Tailed Out-of-Distribution Detection

cs.CV · 2026-07-02 · unverdicted · novelty 5.0

MARVEL introduces a multi-expert NvMF-based system with an outlier expert that reduces FPR95 in OOD detection on medical datasets by 8-37%.

Federated Medical Image Classification under Class and Domain Imbalance exploiting Synthetic Sample Generation

cs.CV · 2026-04-29 · unverdicted · novelty 5.0

FedSSG generates and shares synthetic samples within a federated setup to reduce class imbalance and domain shift problems in medical image classification.

IViT: A Novel Interpretable Visual Transformer for Skin Disease Detection

eess.IV · 2026-06-22 · unverdicted · novelty 4.0

IViT applies quadratic programming to a pre-trained Vision Transformer with a multi-objective loss, achieving 93.80% accuracy on six skin disease datasets (0.21% below baseline) while reducing feature redundancy by 29.5% and producing clinically consistent activations.

Cascade Classification of Dermoscopic Images of Skin Neoplasms with Controllable Sensitivity and External Clinical Validation

cs.CV · 2026-06-11 · unverdicted · novelty 4.0

Cascade classification improves macro F1 over single-stage for some models by allowing sensitivity control but reveals a large generalization gap on external clinical data.

Methodology for Creating a Clinically Verified Dermoscopic Image Dataset

cs.CV · 2026-05-24 · unverdicted · novelty 4.0

Describes a methodology and the resulting dataset of 1,026 dermoscopic images with structured metadata and verified diagnostic labels for medical informatics research.

MedGemma 1.5 Technical Report

cs.AI · 2026-04-06 · unverdicted · novelty 4.0

MedGemma 1.5 4B reports absolute gains of 11% on 3D MRI classification, 3% on 3D CT, 47% macro F1 on pathology slides, 35% IoU on anatomical localization, and 5-22% on clinical QA tasks over MedGemma 1.

MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images

cs.CV · 2025-12-29 · unverdicted · novelty 4.0

Fine-tuned MedGemma outperforms untuned GPT-4 in zero-shot medical image disease classification, achieving 80.37% versus 69.58% mean test accuracy with higher sensitivity for cancer and pneumonia.

Safeguarding AI in Medical Imaging: Post-Hoc Out-of-Distribution Detection with Normalizing Flows

cs.CV · 2025-02-17 · unverdicted · novelty 4.0

Post-hoc normalizing flows for OOD detection in medical imaging achieve 84.61% AUROC on MedOOD and 93.8% on MedMNIST, outperforming ViM, MDS, and ReAct.

citing papers explorer

Showing 15 of 15 citing papers.

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations · cs.AI · 2026-06-26 · unverdicted · none · ref 17
DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making · cs.CV · 2026-05-14 · unverdicted · none · ref 25
MedCore: Boundary-Preserving Medical Core Pruning for MedSAM · cs.CV · 2026-05-13 · unverdicted · none · ref 46
Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset · cs.LG · 2026-04-21 · conditional · none · ref 15
Contour Refinement using Discrete Diffusion in Low Data Regime · cs.CV · 2026-02-05 · unverdicted · none · ref 17
Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts · cs.LG · 2026-04-28 · unverdicted · none · ref 23
Cross-Domain Few-Shot Segmentation via Ordinary Differential Equations over Time Intervals · cs.CV · 2025-09-01 · unverdicted · none · ref 59
MARVEL: Margin-Aware Robust von Mises-Fischer Expert Learning for Long-Tailed Out-of-Distribution Detection · cs.CV · 2026-07-02 · unverdicted · none · ref 65
Federated Medical Image Classification under Class and Domain Imbalance exploiting Synthetic Sample Generation · cs.CV · 2026-04-29 · unverdicted · none · ref 30
IViT: A Novel Interpretable Visual Transformer for Skin Disease Detection · eess.IV · 2026-06-22 · unverdicted · none · ref 2
Cascade Classification of Dermoscopic Images of Skin Neoplasms with Controllable Sensitivity and External Clinical Validation · cs.CV · 2026-06-11 · unverdicted · none · ref 4
Methodology for Creating a Clinically Verified Dermoscopic Image Dataset · cs.CV · 2026-05-24 · unverdicted · none · ref 1
MedGemma 1.5 Technical Report · cs.AI · 2026-04-06 · unverdicted · none · ref 17
MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images · cs.CV · 2025-12-29 · unverdicted · none · ref 28
Safeguarding AI in Medical Imaging: Post-Hoc Out-of-Distribution Detection with Normalizing Flows · cs.CV · 2025-02-17 · unverdicted · none · ref 51
