arXiv preprint arXiv:2010.00747 , year=

Zhang, Y · 2020 · arXiv 2010.00747

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

read on arXiv browse 8 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs

cs.CV · 2023-03-02 · conditional · novelty 7.0

BiomedCLIP, pretrained on the new 15-million-pair PMC-15M dataset, achieves state-of-the-art performance on diverse biomedical vision-language tasks and even outperforms radiology-specific models on chest X-ray pneumonia detection.

Hierarchical Text-Conditional Image Generation with CLIP Latents

cs.CV · 2022-04-13 · accept · novelty 7.0

A hierarchical prior-decoder model using CLIP latents generates more diverse text-conditional images than direct methods while preserving photorealism and caption fidelity.

Prognostic Value of Lung Ultrasound Biomarkers for Readmission Risk in Congestive Heart Failure: A Pilot Data-Driven Analysis

eess.SP · 2026-05-16 · unverdicted · novelty 6.0

Pilot study uses pretrained video encoder features from lung ultrasound to predict 30-day CHF readmission, finding lower-lung views and temporal differences most informative with top MLP F1 of 0.80.

Demystifying CLIP Data

cs.CV · 2023-09-28 · accept · novelty 6.0

MetaCLIP curates balanced 400M-pair subsets from CommonCrawl that outperform CLIP data, reaching 70.8% zero-shot ImageNet accuracy on ViT-B versus CLIP's 68.3%.

CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs

cs.CV · 2026-06-07 · unverdicted · novelty 5.0

CheXanatomy trains VLMs to generate 2D anatomical masks via next-token prediction on synthetic CXRs from CT, matching U-Net performance with better domain-shift robustness and sample efficiency.

PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder

cs.CV · 2026-06-01 · unverdicted · novelty 5.0

PaCX-MAE augments masked autoencoding of chest X-rays with dual contrastive-predictive alignment to ECG and laboratory embeddings, reporting gains on physiology-dependent tasks while remaining unimodal at test time.

Ultrasound Vision-Language Alignment via Contrastive Learning

cs.CV · 2026-05-04 · conditional · novelty 4.0

EchoCare-CLIP achieves 0.682 paired alignment on a 16K ultrasound image-text corpus but downstream zero-shot classification peaks at 0.709 on BUSI only with partial fine-tuning, while full fine-tuning overfits.

PaliGemma: A versatile 3B VLM for transfer

cs.CV · 2024-07-10 · unverdicted · novelty 4.0

PaliGemma is an open 3B VLM based on SigLIP and Gemma that achieves strong performance on nearly 40 diverse open-world tasks including benchmarks, remote-sensing, and segmentation.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Prognostic Value of Lung Ultrasound Biomarkers for Readmission Risk in Congestive Heart Failure: A Pilot Data-Driven Analysis eess.SP · 2026-05-16 · unverdicted · none · ref 38
Pilot study uses pretrained video encoder features from lung ultrasound to predict 30-day CHF readmission, finding lower-lung views and temporal differences most informative with top MLP F1 of 0.80.
CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs cs.CV · 2026-06-07 · unverdicted · none · ref 27
CheXanatomy trains VLMs to generate 2D anatomical masks via next-token prediction on synthetic CXRs from CT, matching U-Net performance with better domain-shift robustness and sample efficiency.
PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder cs.CV · 2026-06-01 · unverdicted · none · ref 16
PaCX-MAE augments masked autoencoding of chest X-rays with dual contrastive-predictive alignment to ECG and laboratory embeddings, reporting gains on physiology-dependent tasks while remaining unimodal at test time.
PaliGemma: A versatile 3B VLM for transfer cs.CV · 2024-07-10 · unverdicted · none · ref 164
PaliGemma is an open 3B VLM based on SigLIP and Gemma that achieves strong performance on nearly 40 diverse open-world tasks including benchmarks, remote-sensing, and segmentation.

arXiv preprint arXiv:2010.00747 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer