Project

Search Schemes for Sequence Alignment to Pan-Genome Graphs

Code
01D04821
Duration
01 October 2021 → 31 October 2021
Funding
Regional and community funding: Special Research Fund
Promotor
Research disciplines
  • Medical and health sciences
    • Analysis of next-generation sequence data
    • Development of bioinformatics software, tools and databases
  • Engineering and technology
    • Bio-informatics
Keywords
Approximate string matching bio-informatics pan-genomics sequence-to-graph alignment
 
Project description

We will develop scalable, graph-based pan-genome representations as well as algorithms that enable efficient search functionality. The first goal is the detection of non-contiguous occurrences of reads against the pan-genome. Moreover, we will enable compatibility with long, high-error reads next to short, low-error reads. Specifically, we will study pan-genome graph representations based on the Burrows Wheeler transform (BWT).