dc.contributor.author
Pan, Chenxu
dc.date.accessioned
2025-03-31T07:19:56Z
dc.date.available
2025-03-31T07:19:56Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/46318
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-46031
dc.description.abstract
Advances in sequencing technology have facilitated population-scale long-read analysis, in which one of the main challenges is arguably developing high-performance computational pipelines. Sequence alignment and assembly are two main long-read analysis methods. Alignment-based pipelines are commonly more efficient and require less read coverage than assembly-based ones, and thus are more applicable to population-scale analysis. However, alignment-based pipelines are less effective in reconstructing highly diverse structures in ultra-long reads such as intra-read SVs. Here, we propose a new filter-based pipeline that is designed to capture rearrangement signals at an earlier stage than conventional pipelines to improve long-read analysis performance. To this end, we investigated the feasibility and essential methods of the design and assessed the performance of the pipeline. Correspondingly, this work comprises three parts starting with data structure optimizations then module development and finally large-scale assessments. Assessments based on high-quality datasets suggest that filter-based pipelines are comparable to or outperform conventional pipelines in terms of detecting complex intra-read rearrangements and computational efficiency. Therefore, the newly proposed pipeline may further benefit population-scale long-read analysis.
en
dc.format.extent
142 Seiten
dc.rights.uri
http://www.fu-berlin.de/sites/refubium/rechtliches/Nutzungsbedingungen
dc.subject
Sequence analysis
en
dc.subject
Long-read mapping
en
dc.subject
Algorithm optimization
en
dc.subject.ddc
000 Computer science, information, and general works::000 Computer Science, knowledge, systems::000 Computer science, information, and general works
dc.title
High-performance algorithms and applications of long-read mapping and SV detection
dc.contributor.gender
male
dc.contributor.firstReferee
Reinert, Knut
dc.contributor.furtherReferee
Renard, Bernhard
dc.date.accepted
2024-12-13
dc.identifier.urn
urn:nbn:de:kobv:188-refubium-46318-6
refubium.affiliation
Mathematik und Informatik
dcterms.accessRights.dnb
free
dcterms.accessRights.openaire
open access