ViperGPT generates executable Python code to compose pre-trained vision-and-language modules into programs that answer visual queries, reaching state-of-the-art results with no additional training.
Title resolution pending
7 Pith papers cite this work. Polarity classification is still indexing.
representative citing papers
Adding register tokens to Vision Transformers eliminates high-norm background artifacts and raises state-of-the-art performance on dense visual prediction tasks.
Inpainting auxiliary task improves clustering of embeddings for individual zebrafish identification based on skin patterns.
Shapley value and variational importance switch methods produce consistent rankings of filter importance in CNNs, enabling compression and interpretability.
Transfer learning with a Zoobot CNN on SDSS DR18 data identifies 3,679 lopsided spiral galaxies at 87% test accuracy, with lopsided systems showing higher star formation, bluer colors, lower mass and concentration.
A decoupled watershed-plus-EfficientNet pipeline recovers 75.95% of cells without annotations and reaches 98.36% stage classification accuracy with instance-level explainability on the NIH BBBC041 dataset.
citing papers explorer
-
Exploring Clustering Capability of Inpainting Model Embeddings for Pattern-based Individual Identification
Inpainting auxiliary task improves clustering of embeddings for individual zebrafish identification based on skin patterns.