Computer Vision + Generative AI

Tony Ng

Senior Research Scientist at Google DeepMind, working on generative AI and multimodal systems.

Senior Research Scientist at Google DeepMind. PhD at MatchLab, Imperial College London. Ex-Meta, Synthesia & Scape Technologies.

Research Focus

Generative Systems

Diffusion models for image, video, and audio generation with real-world quality and reliability constraints.

Visual Localization

Learning-based localization that blends geometry with deep representations for AR/VR at scale.

Privacy + Security

Content-concealing descriptors and robust perception for privacy-preserving visual systems.

Now

I am a Senior Research Scientist at Google DeepMind on the Science and Strategic Initiatives team. I am interested in generative AI, multimodal systems, and evaluation frameworks that move beyond surface-level metrics.

I am open to collaborations on generative media systems, privacy-preserving perception, and robust evaluation.

Selected Publications

  1. CVPR
    TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
    Zhiheng Liu, Weiming Ren, Haozhe Liu, Zijian Zhou, Shoufa Chen, Haonan Qiu, Xiaoke Huang, Zhaochong An, Fanny Yang, Aditya Patel, Viktar Atliha, Tony Ng, Xiao Han, Chuyan Zhu, Chenyang Zhang, Ding Liu, Juan-Manuel Perez-Rua, Sen He, Jürgen Schmidhuber, Wenhu Chen, Ping Luo, Wei Liu, Tao Xiang, Jonas Schult, Yuren Cong
    In CVPR, 2026
  2. CVPR
    VecGlypher: Unified Vector Glyph Generation with Language Models
    Xiaoke Huang, Bhavul Gauri, Kam Woh Ng, Tony Ng, Mengmeng Xu, Zhiheng Liu, Weiming Ren, Zhaochong An, Zijian Zhou, Haonan Qiu, Yuyin Zhou, Sen He, Ziheng Wang, Tao Xiang, Xiao Han
    In CVPR, 2026
  3. CVPR
    NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning
    Tony Ng, Hyo Jin Kim, Vincent Lee, Daniel DeTone, Tsun-Yi Yang, Tianwei Shen, Eddy Ilg, Vassileios Balntas, Krystian Mikolajczyk, Chris Sweeney
    In CVPR, 2022
  4. arXiv
    Reassessing the Limitations of CNN Methods for Camera Pose Regression
    Tony Ng, Adrian Lopez-Rodriguez, Vassileios Balntas, Krystian Mikolajczyk
    arXiv preprint, 2021
  5. ECCV
    SOLAR: Second-Order Loss and Attention for Image Retrieval
    Tony Ng, Vassileios Balntas, Yurun Tian, Krystian Mikolajczyk
    In ECCV, 2020

News

Apr 20, 2026 Started a new role as a Senior Research Scientist at Google DeepMind on the Science and Strategic Initiatives team.
Feb 23, 2026 Two papers were accepted to CVPR 2026: "TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models" and "VecGlypher: Unified Vector Glyph Generation with Language Models".
Dec 10, 2025 New preprint: "TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models" (arXiv:2512.02014).
Aug 19, 2024 Started a new role as an AI Research Scientist at Meta, focusing on diffusion models for image, video, and audio generation.
Feb 6, 2023 Joined Synthesia as a Research Engineer, working on controllable video diffusion models for AI dubbing on avatars.