Project

Computational analysis of large-scale biological data

Code
BOF/STA/202109/039
Duration
02 May 2022 → 01 May 2026
Funding
Regional and community funding: Special Research Fund
Research disciplines
  • Natural sciences
    • Machine learning and decision making
    • Bio-informatics
    • Analysis of next-generation sequence data
Keywords
bioinformatics biomedical image analysis software application next-generation sequencing data
 
Project description

The purpose of this project is to design efficient computational algorithms for the analysis of large-scale biological data, and to develop software applications to make these algorithms easily available for domain experts (applied scientists) without a computational background. We will focus on a few applications in particular: (a) the use of long-read sequencing data, as generated by the MinION sequencer, for the improvement of metagenomic and pangenomic analyses of bacterial taxonomy and function, and (b) the analysis of 2D and 3D (medical) imaging data, obtained from microscopy, hyperspectroscopy, or through CT/MRI scans. Data of the relevant kind are in both cases already generated at GUGC. For the second part of the project, we will develop algorithmic proofs-of-concept into standalone, mature software applications, which will be made freely available to facilitate the analysis of large volumes of data.