2

MV2MAE: Multi-View Video Masked Autoencoders

Videos captured from multiple viewpoints can help in perceiving the 3D structure of the world and benefit computer vision tasks such as action recognition, tracking, etc. In this paper, we present a method for self-supervised learning from …

ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Our goal is to train a policy for autonomous driving via imitation learning that is robust enough to drive a real vehicle. We find that standard behavior cloning is insufficient for handling complex driving scenarios, even when we leverage a …

Geometric Polynomial Constraints in Higher-order Graph Matching

Correspondence is a ubiquitous problem in computer vision and graph matching has been a natural way to formalize correspondence as an optimization problem. Recently, graph matching solvers have included higher-order terms representing affinities …

Automated Image Alignment For Change Detection In ROP

We have developed a novel fundus image alignment procedure that facilitates flicker-based identification of changes in width and tortuosity of retinal blood vessels. Since our alignment process implicitly accounts for unknown camera parameters, it …

DARPA Urban Challenge