MirrorCheck detects adversarial attacks on VLMs via T2I regeneration for semantic consistency checks, using stochastic model selection and one-time perturbations for robustness against adaptive attacks.
Densenet: Im- plementing efficient convnet descriptor pyramids
6 Pith papers cite this work. Polarity classification is still indexing.
abstract
Convolutional Neural Networks (CNNs) can provide accurate object classification. They can be extended to perform object detection by iterating over dense or selected proposed object regions. However, the runtime of such detectors scales as the total number and/or area of regions to examine per image, and training such detectors may be prohibitively slow. However, for some CNN classifier topologies, it is possible to share significant work among overlapping regions to be classified. This paper presents DenseNet, an open source system that computes dense, multiscale features from the convolutional layers of a CNN based object classifier. Future work will involve training efficient object detectors with DenseNet feature descriptors.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 6roles
baseline 1polarities
baseline 1representative citing papers
TVRN combines invertible wavelet-based networks with a surrogate gradient approximator and compression-aware asymmetric design to improve frame-rate rescaling quality under real codecs.
The Twisted-Path Particle Filter parameterizes twisting functions via neural networks and optimizes them against a path-measure KL divergence to improve continuous-time particle filtering.
InternVL scales a vision model to 6B parameters and aligns it with LLMs using web data to achieve state-of-the-art results on 32 visual-linguistic benchmarks.
A pairwise-augmented loss on CNNs is reported to deliver state-of-the-art accuracy on primate face classification, verification, closed-set and open-set identification for two species.
Genetic algorithm searches convolution cell architectures with weight sharing via SGD, reporting 96% accuracy on CIFAR10 and 80.1% on CIFAR100.
citing papers explorer
-
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models
MirrorCheck detects adversarial attacks on VLMs via T2I regeneration for semantic consistency checks, using stochastic model selection and one-time perturbations for robustness against adaptive attacks.
-
TVRN: Invertible Neural Networks for Compression-Aware Temporal Video Rescaling
TVRN combines invertible wavelet-based networks with a surrogate gradient approximator and compression-aware asymmetric design to improve frame-rate rescaling quality under real codecs.
-
Guidance for twisted particle filter: a continuous-time perspective
The Twisted-Path Particle Filter parameterizes twisting functions via neural networks and optimizes them against a path-measure KL divergence to improve continuous-time particle filtering.
-
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
InternVL scales a vision model to 6B parameters and aligns it with LLMs using web data to achieve state-of-the-art results on 32 visual-linguistic benchmarks.
-
Primate Face Identification in the Wild
A pairwise-augmented loss on CNNs is reported to deliver state-of-the-art accuracy on primate face classification, verification, closed-set and open-set identification for two species.
-
Genetic Network Architecture Search
Genetic algorithm searches convolution cell architectures with weight sharing via SGD, reporting 96% accuracy on CIFAR10 and 80.1% on CIFAR100.