Table of Contents Introduction Research planImprove query performance using Heavy-Light Decomposition Add more query types Extend to non-exact suffix-prefix-overlap that allows for read errors Implement an algorithm to build string graphs, and possibly a full assembler This is a research proposal for a 5 month internship at CWI during autumn/winter 2023-2024. Introduction An important problem in bioinformatics is genome assembly: DNA sequencing machines read substrings of a full DNA genome, a...