Andrew Zisserman

Identifiers

name variant Andrew Zisserman 0.60 · backfill

Papers (76)

GMOS: Grounding Moving Object Segmentation in 3D Space and Time cs.CV · 2026 · author #4
Perception Test 2025: Challenge Summary and a Unified VQA Extension cs.CV · 2026 · author #7
Recurrent Video Masked Autoencoders cs.CV · 2025 · author #6
Adapting MLLMs for Nuanced Video Retrieval cs.CV · 2025 · author #2
Unique Lives, Shared World: Learning from Single-Life Videos cs.CV · 2025 · author #10
Inferring Dynamic Physical Properties from Video Foundation Models cs.CV · 2025 · author #4
Flamingo: a Visual Language Model for Few-Shot Learning cs.CV · 2022 · author #26
Perceiver IO: A General Architecture for Structured Inputs & Outputs cs.LG · 2021 · author #13
My lips are concealed: Audio-visual speech enhancement through obstructions cs.CV · 2019 · author #3
LAEO-Net: revisiting people Looking At Each Other in videos cs.CV · 2019 · author #4
A Hierarchical Probabilistic U-Net for Modeling Multi-Scale Ambiguities cs.CV · 2019 · author #7
Object Discovery with a Copy-Pasting GAN cs.CV · 2019 · author #2
Exploiting temporal context for 3D human pose estimation in the wild cs.CV · 2019 · author #3
Temporal Cycle-Consistency Learning cs.CV · 2019 · author #5
The StreetLearn Environment and Dataset cs.AI · 2019 · author #10
Utterance-level Aggregation For Speaker Recognition In The Wild eess.AS · 2019 · author #4
Video Action Transformer Network cs.CV · 2018 · author #4
The Visual Centrifuge: Model-Free Layered Video Representations cs.CV · 2018 · author #3
Class-Agnostic Counting cs.CV · 2018 · author #3
GhostVLAD for set-based face recognition cs.CV · 2018 · author #3
Learning to Read by Spelling: Towards Unsupervised Text Recognition cs.CV · 2018 · author #3
From Same Photo: Cheating on Visual Kinship Challenges cs.CV · 2018 · author #2
Turning a Blind Eye: Explicit Removal of Biases and Variation from Deep Neural Network Embeddings cs.CV · 2018 · author #2
Deep Audio-Visual Speech Recognition cs.CV · 2018 · author #5
3D Surface Reconstruction by Pointillism cs.CV · 2018 · author #2
LRS3-TED: a large-scale dataset for visual speech recognition cs.CV · 2018 · author #3
Self-supervised learning of a facial attribute embedding from video cs.CV · 2018 · author #3
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild cs.CV · 2018 · author #4
A Short Note about Kinetics-600 cs.CV · 2018 · author #5
Comparator Networks cs.CV · 2018 · author #3
X2Face: A network for controlling face generation by using images, audio, and pose codes cs.CV · 2018 · author #3
A Better Baseline for AVA cs.CV · 2018 · author #4
Multicolumn Networks for Face Recognition cs.CV · 2018 · author #2
Inductive Visual Localisation: Factorised Training for Superior Generalisation cs.CV · 2018 · author #3
Deep Lip Reading: a comparison of models and an online application cs.CV · 2018 · author #3
Massively Parallel Video Networks cs.CV · 2018 · author #4
Learnable PINs: Cross-Modal Embeddings for Person Identity cs.CV · 2018 · author #3
The Conversation: Deep Audio-Visual Speech Enhancement cs.CV · 2018 · author #3
Seeing Voices and Hearing Faces: Cross-modal biometric matching cs.CV · 2018 · author #3
Learning to Navigate in Cities Without a Map cs.AI · 2018 · author #9
Kickstarting Deep Reinforcement Learning cs.LG · 2018 · author #9
Smooth Loss Functions for Deep Top-k Classification cs.LG · 2018 · author #2
From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script cs.CV · 2018 · author #2
What have we learned from deep representations for action recognition? cs.CV · 2018 · author #4
Objects that Sound cs.CV · 2017 · author #2
SilNet : Single- and Multi-View Reconstruction by Learning from Silhouettes cs.CV · 2017 · author #2
VGGFace2: A dataset for recognising faces across pose and age cs.CV · 2017 · author #5
Detect to Track and Track to Detect cs.CV · 2017 · author #3
Multi-task Self-Supervised Visual Learning cs.CV · 2017 · author #2
Self-Supervised Learning for Spinal MRIs cs.CV · 2017 · author #3
Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video cs.CV · 2017 · author #4
Look, Listen and Learn cs.CV · 2017 · author #2
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset cs.CV · 2017 · author #2
The Kinetics Human Action Video Dataset cs.CV · 2017 · author #12
You said that? cs.CV · 2017 · author #3
From Images to 3D Shape Attributes cs.CV · 2016 · author #3
Interferences in match kernels cs.CV · 2016 · author #4
Trusting SVM for Piecewise Linear CNNs cs.LG · 2016 · author #2
Signs in time: Encoding human motion as a temporal image cs.CV · 2016 · author #2
Recurrent Human Pose Estimation cs.CV · 2016 · author #2
Synthetic Data for Text Localisation in Natural Images cs.CV · 2016 · author #3
Convolutional Two-Stream Network Fusion for Video Action Recognition cs.CV · 2016 · author #3
Template Adaptation for Face Verification and Identification cs.CV · 2016 · author #6
Personalizing Human Video Pose Estimation cs.CV · 2015 · author #5
Flowing ConvNets for Human Pose Estimation in Videos cs.CV · 2015 · author #3
Spatial Transformer Networks cs.CV · 2015 · author #3
Automatic Discovery and Optimization of Parts for Image Classification cs.CV · 2014 · author #3
Deep Structured Output Learning for Unconstrained Text Recognition cs.CV · 2014 · author #4
Reading Text in the Wild with Convolutional Neural Networks cs.CV · 2014 · author #4
Very Deep Convolutional Networks for Large-Scale Image Recognition cs.CV · 2014 · author #2
Efficient On-the-fly Category Retrieval using ConvNets and GPUs cs.CV · 2014 · author #3
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition cs.CV · 2014 · author #4
Two-Stream Convolutional Networks for Action Recognition in Videos cs.CV · 2014 · author #2
Speeding up Convolutional Neural Networks with Low Rank Expansions cs.CV · 2014 · author #3
Return of the Devil in the Details: Delving Deep into Convolutional Nets cs.CV · 2014 · author #4
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps cs.CV · 2013 · author #3

Mentions

1406.2227 #4 · backfill · confidence 0.70 Andrew Zisserman
1406.2199 #2 · backfill · confidence 0.70 Andrew Zisserman
1405.3866 #3 · backfill · confidence 0.70 Andrew Zisserman
1405.3531 #4 · backfill · confidence 0.70 Andrew Zisserman
1312.6034 #3 · backfill · confidence 0.70 Andrew Zisserman
2605.30352 #4 · arxiv_oai · confidence 0.70 Andrew Zisserman
2512.04085 #10 · arxiv_oai · confidence 0.70 Andrew Zisserman
2107.14795 #13 · arxiv_oai · confidence 0.70 Andrew Zisserman

Frequent Coauthors

Karen Simonyan 14 shared papers
Andrea Vedaldi 11 shared papers
Joon Son Chung 8 shared papers
Weidi Xie 7 shared papers
Carl Doersch 6 shared papers
Joao Carreira 6 shared papers
Arsha Nagrani 5 shared papers
Max Jaderberg 5 shared papers
Triantafyllos Afouras 5 shared papers
João Carreira 4 shared papers
Olivia Wiles 4 shared papers
Relja Arandjelović 4 shared papers
Ankush Gupta 3 shared papers
Axel Pinz 3 shared papers
Christoph Feichtenhofer 3 shared papers
Daniel Zoran 3 shared papers
Jean-Baptiste Alayrac 3 shared papers
Koray Kavukcuoglu 3 shared papers
Oriol Vinyals 3 shared papers
Samuel Albanie 3 shared papers