Project

Interactive embedding of high-dimensional data with shape templates

Code

3F021721

Duration

01 November 2021 → 31 October 2026

Funding

Research Foundation - Flanders (FWO)

Promotor

Jefrey Lijffijt

Fellow

Edith Heiter

Research disciplines

Natural sciences
- Data mining
- Machine learning and decision making
- Visual data analysis

Keywords

Representation learning Topological structure Human-in-the-loop

Project description

Dimensionality reduction (DR) is widely used to condense data for subsequent application of machine learning algorithms and to learn about the high-level structure of the data by providing low-dimensional embeddings used for visualization. Existing DR techniques such as t-SNE or UMAP are not guaranteed to represent the high-dimensional topological structure of the data faithfully. While topological embedding methods aim at modeling the underlying structure as closely as possible, they remain a black box for the user and do not allow for interactive exploration. In this project we aim at developing dimensionality reduction methods to fit the data on a low-dimensional shape template in an interactive fashion. To this end, we formalize the notion of shape templates and investigate how to automatically extract them from the data. We then integrate these templates into existing dimensionality reduction methods and design feedback mechanisms to evaluate the fit of the data on the template. This interactive setting can then be used to gain insights into the high-dimensional shape of the data that would remain hidden when using a static embedding from existing methods.