pith.

Andrew Zisserman

Identifiers

  • name variant Andrew Zisserman 0.60 · backfill

Papers (76)

  1. GMOS: Grounding Moving Object Segmentation in 3D Space and Time cs.CV · 2026 · author #4
  2. Perception Test 2025: Challenge Summary and a Unified VQA Extension cs.CV · 2026 · author #7
  3. Recurrent Video Masked Autoencoders cs.CV · 2025 · author #6
  4. Adapting MLLMs for Nuanced Video Retrieval cs.CV · 2025 · author #2
  5. Unique Lives, Shared World: Learning from Single-Life Videos cs.CV · 2025 · author #10
  6. Inferring Dynamic Physical Properties from Video Foundation Models cs.CV · 2025 · author #4
  7. Flamingo: a Visual Language Model for Few-Shot Learning cs.CV · 2022 · author #26
  8. Perceiver IO: A General Architecture for Structured Inputs & Outputs cs.LG · 2021 · author #13
  9. My lips are concealed: Audio-visual speech enhancement through obstructions cs.CV · 2019 · author #3
  10. LAEO-Net: revisiting people Looking At Each Other in videos cs.CV · 2019 · author #4
  11. A Hierarchical Probabilistic U-Net for Modeling Multi-Scale Ambiguities cs.CV · 2019 · author #7
  12. Object Discovery with a Copy-Pasting GAN cs.CV · 2019 · author #2
  13. Exploiting temporal context for 3D human pose estimation in the wild cs.CV · 2019 · author #3
  14. Temporal Cycle-Consistency Learning cs.CV · 2019 · author #5
  15. The StreetLearn Environment and Dataset cs.AI · 2019 · author #10
  16. Utterance-level Aggregation For Speaker Recognition In The Wild eess.AS · 2019 · author #4
  17. Video Action Transformer Network cs.CV · 2018 · author #4
  18. The Visual Centrifuge: Model-Free Layered Video Representations cs.CV · 2018 · author #3
  19. Class-Agnostic Counting cs.CV · 2018 · author #3
  20. GhostVLAD for set-based face recognition cs.CV · 2018 · author #3
  21. Learning to Read by Spelling: Towards Unsupervised Text Recognition cs.CV · 2018 · author #3
  22. From Same Photo: Cheating on Visual Kinship Challenges cs.CV · 2018 · author #2
  23. Turning a Blind Eye: Explicit Removal of Biases and Variation from Deep Neural Network Embeddings cs.CV · 2018 · author #2
  24. Deep Audio-Visual Speech Recognition cs.CV · 2018 · author #5
  25. 3D Surface Reconstruction by Pointillism cs.CV · 2018 · author #2
  26. LRS3-TED: a large-scale dataset for visual speech recognition cs.CV · 2018 · author #3
  27. Self-supervised learning of a facial attribute embedding from video cs.CV · 2018 · author #3
  28. Emotion Recognition in Speech using Cross-Modal Transfer in the Wild cs.CV · 2018 · author #4
  29. A Short Note about Kinetics-600 cs.CV · 2018 · author #5
  30. Comparator Networks cs.CV · 2018 · author #3
  31. X2Face: A network for controlling face generation by using images, audio, and pose codes cs.CV · 2018 · author #3
  32. A Better Baseline for AVA cs.CV · 2018 · author #4
  33. Multicolumn Networks for Face Recognition cs.CV · 2018 · author #2
  34. Inductive Visual Localisation: Factorised Training for Superior Generalisation cs.CV · 2018 · author #3
  35. Deep Lip Reading: a comparison of models and an online application cs.CV · 2018 · author #3
  36. Massively Parallel Video Networks cs.CV · 2018 · author #4
  37. Learnable PINs: Cross-Modal Embeddings for Person Identity cs.CV · 2018 · author #3
  38. The Conversation: Deep Audio-Visual Speech Enhancement cs.CV · 2018 · author #3
  39. Seeing Voices and Hearing Faces: Cross-modal biometric matching cs.CV · 2018 · author #3
  40. Learning to Navigate in Cities Without a Map cs.AI · 2018 · author #9
  41. Kickstarting Deep Reinforcement Learning cs.LG · 2018 · author #9
  42. Smooth Loss Functions for Deep Top-k Classification cs.LG · 2018 · author #2
  43. From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script cs.CV · 2018 · author #2
  44. What have we learned from deep representations for action recognition? cs.CV · 2018 · author #4
  45. Objects that Sound cs.CV · 2017 · author #2
  46. SilNet : Single- and Multi-View Reconstruction by Learning from Silhouettes cs.CV · 2017 · author #2
  47. VGGFace2: A dataset for recognising faces across pose and age cs.CV · 2017 · author #5
  48. Detect to Track and Track to Detect cs.CV · 2017 · author #3
  49. Multi-task Self-Supervised Visual Learning cs.CV · 2017 · author #2
  50. Self-Supervised Learning for Spinal MRIs cs.CV · 2017 · author #3
  51. Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video cs.CV · 2017 · author #4
  52. Look, Listen and Learn cs.CV · 2017 · author #2
  53. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset cs.CV · 2017 · author #2
  54. The Kinetics Human Action Video Dataset cs.CV · 2017 · author #12
  55. You said that? cs.CV · 2017 · author #3
  56. From Images to 3D Shape Attributes cs.CV · 2016 · author #3
  57. Interferences in match kernels cs.CV · 2016 · author #4
  58. Trusting SVM for Piecewise Linear CNNs cs.LG · 2016 · author #2
  59. Signs in time: Encoding human motion as a temporal image cs.CV · 2016 · author #2
  60. Recurrent Human Pose Estimation cs.CV · 2016 · author #2
  61. Synthetic Data for Text Localisation in Natural Images cs.CV · 2016 · author #3
  62. Convolutional Two-Stream Network Fusion for Video Action Recognition cs.CV · 2016 · author #3
  63. Template Adaptation for Face Verification and Identification cs.CV · 2016 · author #6
  64. Personalizing Human Video Pose Estimation cs.CV · 2015 · author #5
  65. Flowing ConvNets for Human Pose Estimation in Videos cs.CV · 2015 · author #3
  66. Spatial Transformer Networks cs.CV · 2015 · author #3
  67. Automatic Discovery and Optimization of Parts for Image Classification cs.CV · 2014 · author #3
  68. Deep Structured Output Learning for Unconstrained Text Recognition cs.CV · 2014 · author #4
  69. Reading Text in the Wild with Convolutional Neural Networks cs.CV · 2014 · author #4
  70. Very Deep Convolutional Networks for Large-Scale Image Recognition cs.CV · 2014 · author #2
  71. Efficient On-the-fly Category Retrieval using ConvNets and GPUs cs.CV · 2014 · author #3
  72. Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition cs.CV · 2014 · author #4
  73. Two-Stream Convolutional Networks for Action Recognition in Videos cs.CV · 2014 · author #2
  74. Speeding up Convolutional Neural Networks with Low Rank Expansions cs.CV · 2014 · author #3
  75. Return of the Devil in the Details: Delving Deep into Convolutional Nets cs.CV · 2014 · author #4
  76. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps cs.CV · 2013 · author #3

Mentions

  • 1406.2227 #4 · backfill · confidence 0.70 Andrew Zisserman
  • 1406.2199 #2 · backfill · confidence 0.70 Andrew Zisserman
  • 1405.3866 #3 · backfill · confidence 0.70 Andrew Zisserman
  • 1405.3531 #4 · backfill · confidence 0.70 Andrew Zisserman
  • 1312.6034 #3 · backfill · confidence 0.70 Andrew Zisserman
  • 2605.30352 #4 · arxiv_oai · confidence 0.70 Andrew Zisserman
  • 2512.04085 #10 · arxiv_oai · confidence 0.70 Andrew Zisserman
  • 2107.14795 #13 · arxiv_oai · confidence 0.70 Andrew Zisserman

Frequent Coauthors