Codes and Expansions (CodEx) Seminar


Alex Cloninger (University of California at San Diego)
From Local Views to Global Embedding: Methods in Bottom-up Manifold Learning

High-dimensional datasets often reside on a low-dimensional geometrical manifold. Manifold learning algorithms aim to retrieve this underlying structure by mapping the data into lower dimensions while minimizing some measure of local (and possibly global) distortion incurred by the map. However, attaining small (or even finite) distortion is not guaranteed, or even possible, with most manifold learning schemes. Bottom-up approaches address this problem by first constructing low-distortion low-dimensional local views of the data that are provably reliable, and then integrating them together to obtain a global embedding without inducing too much additional distortion. In our work, we investigate the following questions:

  1. How to obtain low-distortion low-dimensional local views of high-dimensional data that are robust to noise?
  2. How to integrate these local views in an efficient manner to produce a low-dimensional global embedding with distortion guarantees?
  3. How does the distortion incurred in the low-dimensional embedding impacts the performance of the downstream tasks?
This talk covers joint work with Dhruv Kohli, Gal Mishne, Sawyer Robertson, Johannes Nieuwenhuis, and Devika Narain.