Skanda Koppula Research Engineer @ Google DeepMind

me at skoppula dot com  (PGP key, fingerprint / Google Scholar / GitHub)

I am currently a research engineer at Google DeepMind, and part of the PRISM vision research group at University College London. I work on 3D vision, video understanding, visual representation learning, generative modeling, and 3D motion capture. My research background covers computer vision, natural language understanding, computer security, and computer architecture.

In the past, I was a visiting researcher at ETH Zürich on a Fulbright Research Fellowship. I completed my MEng with the Energy Efficient Circuits and Systems group and Spoken Languages and Systems Group at MIT CSAIL. Years ago, I worked on NVidia's e2e Autonomous Driving Team, Google speech research team, Yahoo, and Square's hardware security team.

I spent many memorable years at MIT, Andover, and HCSSiM. I love painting and building racecars.

Publications

Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time [arXiV]
Chuhan Zhang, Guillaume Le Moing, Skanda Koppula, Ignacio Rocco, Liliane Momeni, Junyu Xie, Shuyang Sun, Rahul Sukthankar, Joëlle K. Barral, Raia Hadsell, Zoubin Ghahramani, Andrew Zisserman, Junlin Zhang, Mehdi S. M. Sajjadi

TAPNext: Tracking Any Point (TAP) as Next Token Prediction [arXiV]
Artem Zholus, Carl Doersch, Yi Yang, Skanda Koppula, Viorica Patraucean, Xu Owen He, Ignacio Rocco, Mehdi S. M. Sajjadi, Sarath Chandar, Ross Goroshin
International Conference on Computer Vision (ICCV 2025).

SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications [arXiV]
Yana Hasson, Pauline Luc, Liliane Momeni, Maks Ovsjanikov, Guillaume Le Moing, Alina Kuznetsova, Ira Ktena, Jennifer J. Sun, Skanda Koppula, Dilara Gokay, Joseph Heyward, Etienne Pot, Andrew Zisserman
International Conference on Computer Vision (ICCV 2025).

TAPVid-3D: A Benchmark for Tracking Any Point in 3D [arXiV]
Skanda Koppula*, Ignacio Rocco*, Yi Yang, Joe Heyward, João Carreira, Andrew Zisserman, Gabriel Brostow, Carl Doersch
NeurIPS 2024 (Datasets and Benchmarks).

Scaling 4D Representations [arXiV]
João Carreira, Dilara Gokay, Michael King, Chuhan Zhang, Ignacio Rocco, Aravindh Mahendran, Thomas Albert Keck, Joseph Heyward, Skanda Koppula, Etienne Pot, Goker Erdogan, Yana Hasson, Yi Yang, Klaus Greff, Guillaume Le Moing, Sjoerd van Steenkiste, Daniel Zoran, Drew A. Hudson, Pedro Vélez, Luisa Polanía, Luke Friedman, Chris Duvarney, Ross Goroshin, Kelsey Allen, Jacob Walker, Rishabh Kabra, Eric Aboussouan, Jennifer Sun, Thomas Kipf, Carl Doersch, Viorica Pătrăucean, Dima Damen, Pauline Luc, Mehdi S. M. Sajjadi, Andrew Zisserman

BootsTAP: Bootstrapped Training for Tracking-Any-Point [arXiV]
Carl Doersch, Yi Yang, Dilara Gokay, Pauline Luc, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ross Goroshin, João Carreira, Andrew Zisserman
Asian Conference on Computer Vision (ACCV 2024).

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames [arXiV]
Pinelopi Papalampidi*, Skanda Koppula*, Shreya Pathak*, Justin Chiu, Joe Heyward, Viorica Patraucean, Jiajun Shen, Antoine Miech, Andrew Zisserman, Aida Nematzdeh
Conference on Computer Vision and Pattern Recognition (CVPR 2024).

Perception Test: A Diagnostic Benchmark for Multimodal Video Models [arXiV]
Viorica Patraucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
NeurIPS 2023 (Datasets and Benchmarks).

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation [arXiV]
Mohit Sharma, Claudio Fantacci, Yuxiang Zhou, Skanda Koppula, Jon Scholz, Nicolas Heess, Yusuf Aytar
International Conference on Learning Representations (ICLR 2023).

Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods [arXiV]
Skanda Koppula, Yazhe Li, Andrew Jaegle, Evan Shelhamer, Nikhil Parthasarathy, Relja Arandjelovic, João Carreira, Olivier J. Henaff
NeurIPS 2022 Workshop: Self-Supervised Learning, Theory and Practice (NeurIPS SSL 2022).

Hierarchical Perceiver [arXiv]
João Carreira, Skanda Koppula, Daniel Zoran, Adria Recasens, Catalin Ionescu, Olivier J. Henaff, Evan Shelhamer, Relja Arandjelovic, Matt Botvinick, Oriol Vinyals, Karen Simonyan, Andrew Zisserman, Andrew Jaegle

Object discovery and representation networks [arXiv]
Olivier J. Henaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelović
European Conference on Computer Vision (ECCV 2022), Oral

Perceiver IO: A General Architecture for Structured Inputs & Outputs [arXiv]
Andrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier Hénaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, João Carreira
International Conference on Learning Representations (ICLR 2022)

Efficient Visual Pretraining with Contrastive Detection [pdf] [arXiv]
Olivier J. Henaff, Skanda Koppula, Jean-Baptiste Alayrac, Aaron van den Oord, Oriol Vinyals, Joao Carreira
International Conference on Computer Vision (ICCV 2021), Oral

A Deep Learning Approach for Characterizing Major Galaxy Mergers [pdf] [external]
Skanda Koppula, Victor Bapst, Marc Huertas-Company, Sam Blackwell, Agnieszka Grabska-Barwinska, et al.
NeurIPS 2020 Workshop: ML for Physical Science (NeurIPS ML4PS 2020).

Accurate, Low-Latency Visual Perception for Autonomous Racing: Challenges, Mechanisms, and Practical Solutions [pdf] [arXiv]
Kieran Strobel, Sibo Zhu, Raphael Chang, and Skanda Koppula
The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020).

EDEN: Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM [pdf] [arXiv]
Skanda Koppula, Lois Orosa, A. Giray Yağlıkçı, Roknoddin Azizi, Taha Shahroodi, Konstantinos Kanellopoulos, and Onur Mutlu
The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2019).

SMASH: Co-designing Software Compression and Hardware-Accelerated Indexing for Efficient Sparse Matrix Operations [pdf] [arXiv]
Konstantinos Kanellopoulos, Nandita Vijaykumar, Christina Giannoula, Roknoddin Azizi, Skanda Koppula, Nika Mansouri Ghiasi, Taha Shahroodi, Juan Gomez Luna, and Onur Mutlu
The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2019).

EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators [arXiv]
Lois Orosa, Skanda Koppula, Yaman Umuroglu, Konstantinos Kanellopoulos, Juan Gomez-Luna, Michaela Blott, Kees Vissers, Onur Mutlu
IEEE Transactions on Computers (Special Issue on Hardware Acceleration of Machine Learning, 2022).

Understanding Recurrent Neural State Using Memory Signatures [pdf] [arXiv]
Skanda Koppula, Khe Chai Sim, and Kean Chin
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018).

Energy-Efficient Speaker Identification With Low-Precision Networks [pdf] [code]
Skanda Koppula, James Glass, and Anantha P. Chandrakasan
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018).

Work from my undergraduate studies:

Learning a CNN-based End-to-End Controller for a Formula SAE Racecar [pdf] [arXiv] [code]
Skanda Koppula and Kevin Chan
Technical Report

Applying the Residue Number System to Neural Network Inference [arXiv] [code]
Mohamed Abdelhamid and Skanda Koppula
Technical Report

Robust Predictive Bayesian Analysis for Genome-Wide Association and Expression Studies [pdf] [PubMed]
Skanda Koppula, Amin Zollanvari, Ning An, and Gil Alterovitz
In the Proceedings of the American Medical Informatics Joint Summit (AMIA 2013)

Projects

Open-Source Battery Management System for Automotive Applications [code]
MIT Formula SAE, MIT Electric Vehicle Team
The first automotive-grade battery management system with open-source firmware and PCBs.

Power-Based Side-Channel Attack for AES Key Extraction on the ATMega328 Microcontroller [pdf] [code]
Utsav Banerjee, Skanda Koppula, and Lisa Ho
Demonstrated power-based sidechannel attack on AES-ECB running on an Arduino.

A Public-Key Authentication Scheme for Controller Area Networks [pdf] [code]
Skanda Koppula, Matt Chang, and Nicolas Bravo
Design and prototyping of a scheme to secure an automotive CAN bus.

Bayesian Clustering and Topic Discovery: Adventures with Gene Expression Data [pdf] [code]
Karren Yang and Skanda Koppula
Experiments to cluster and interpret gene expression using heirarchical Bayesian models.

Teaching

Fall '25: Instructor at Mediterranean Machine Learning Summer School
Fall '16 and '17: TA for 6.006 Introduction to Algorithms.
January '15: Electronics Instructor at Korea International School, MIT Global Teaching Labs.
Fall/Spring '15: Volunteer at Tutoring Plus

This site was based on the design here.