Differential compression for colored de Bruijn graphs

DSpace/Manakin Repository

Show simple item record

dc.contributor.advisor Pibiri, Giulio Ermanno it_IT
dc.contributor.author Campanelli, Alessio <2000> it_IT
dc.date.accessioned 2024-09-29 it_IT
dc.date.accessioned 2024-11-13T12:08:25Z
dc.date.available 2024-11-13T12:08:25Z
dc.date.issued 2024-10-17 it_IT
dc.identifier.uri http://hdl.handle.net/10579/27717
dc.description.abstract The problem of sequence identification or matching is relevant for many important tasks in Computational Biology, such as metagenomics and pangenome analysis. Due to the complex nature of such analyses and the large scale of the reference collections, a resource-efficient solution is critical. To solve this problem, we propose a lossless compressed data structure for colored de Bruijn graphs, which can be regarded as a map from k-mers to their color sets. The color set of a k-mer is the collection of all the identifiers of the references in which that k-mer can be found. Our solutions exploit the repetitiveness of the color sets when indexing large collections of related genomes, extracting repeating patterns and encoding them once, instead of redundantly replicating their representation. Experimental results show that these representations substantially improve over the space effectiveness of the best previous solutions while impacting only marginally the efficiency of the queries. it_IT
dc.language.iso en it_IT
dc.publisher Università Ca' Foscari Venezia it_IT
dc.rights © Alessio Campanelli, 2024 it_IT
dc.title Differential compression for colored de Bruijn graphs it_IT
dc.title.alternative Meta-Differential compression for colored de Bruijn graphs it_IT
dc.type Master's Degree Thesis it_IT
dc.degree.name Computer science and information technology it_IT
dc.degree.level Laurea magistrale it_IT
dc.degree.grantor Dipartimento di Scienze Ambientali, Informatica e Statistica it_IT
dc.description.academicyear sessione_autunnale_23-24_appello_14-10-24 it_IT
dc.rights.accessrights openAccess it_IT
dc.thesis.matricno 878170 it_IT
dc.subject.miur INF/01 INFORMATICA it_IT
dc.description.note it_IT
dc.degree.discipline it_IT
dc.contributor.co-advisor it_IT
dc.date.embargoend it_IT
dc.provenance.upload Alessio Campanelli ([email protected]), 2024-09-29 it_IT
dc.provenance.plagiarycheck None it_IT


Files in this item

This item appears in the following Collection(s)

Show simple item record