-
Natural sciences
- Machine learning and decision making
- Bio-informatics
- Analysis of next-generation sequence data
The purpose of this project is to design efficient computational algorithms for the analysis of large-scale biological data, and to develop software applications to make these algorithms easily available for domain experts (applied scientists) without a computational background. We will focus on a few applications in particular: (a) the use of long-read sequencing data, as generated by the MinION sequencer, for the improvement of metagenomic and pangenomic analyses of bacterial taxonomy and function, and (b) the analysis of 2D and 3D (medical) imaging data, obtained from microscopy, hyperspectroscopy, or through CT/MRI scans. Data of the relevant kind are in both cases already generated at GUGC. For the second part of the project, we will develop algorithmic proofs-of-concept into standalone, mature software applications, which will be made freely available to facilitate the analysis of large volumes of data.