ViperGPT generates executable Python code to compose pre-trained vision-and-language modules into programs that answer visual queries, reaching state-of-the-art results with no additional training.
Title resolution pending
7 Pith papers cite this work. Polarity classification is still indexing.
representative citing papers
Adding register tokens to Vision Transformers eliminates high-norm background artifacts and raises state-of-the-art performance on dense visual prediction tasks.
Inpainting auxiliary task improves clustering of embeddings for individual zebrafish identification based on skin patterns.
Shapley value and variational importance switch methods produce consistent rankings of filter importance in CNNs, enabling compression and interpretability.
Transfer learning with a Zoobot CNN on SDSS DR18 data identifies 3,679 lopsided spiral galaxies at 87% test accuracy, with lopsided systems showing higher star formation, bluer colors, lower mass and concentration.
A decoupled watershed-plus-EfficientNet pipeline recovers 75.95% of cells without annotations and reaches 98.36% stage classification accuracy with instance-level explainability on the NIH BBBC041 dataset.
citing papers explorer
-
ViperGPT: Visual Inference via Python Execution for Reasoning
ViperGPT generates executable Python code to compose pre-trained vision-and-language modules into programs that answer visual queries, reaching state-of-the-art results with no additional training.
-
Vision Transformers Need Registers
Adding register tokens to Vision Transformers eliminates high-norm background artifacts and raises state-of-the-art performance on dense visual prediction tasks.
-
Exploring Clustering Capability of Inpainting Model Embeddings for Pattern-based Individual Identification
Inpainting auxiliary task improves clustering of embeddings for individual zebrafish identification based on skin patterns.
-
Neuron ranking -- an informed way to condense convolutional neural networks architecture
Shapley value and variational importance switch methods produce consistent rankings of filter importance in CNNs, enabling compression and interpretability.
-
Identifying lopsidedness in spiral galaxies using a Deep Convolutional Neural Network
Transfer learning with a Zoobot CNN on SDSS DR18 data identifies 3,679 lopsided spiral galaxies at 87% test accuracy, with lopsided systems showing higher star formation, bluer colors, lower mass and concentration.
-
MalariAI: A Label-Resilient Decoupled Framework for Universal Cell Segmentation and Explainable Stage Classification in Dense Malaria Blood Smears
A decoupled watershed-plus-EfficientNet pipeline recovers 75.95% of cells without annotations and reaches 98.36% stage classification accuracy with instance-level explainability on the NIH BBBC041 dataset.
- KIGNet: Physics-Motivated Multi-Graph Representation Learning for Explainable Jet Tagging