dc.contributor.author
Winkler, Jörg
dc.contributor.author
Urgese, Gianvito
dc.contributor.author
Ficarra, Elisa
dc.contributor.author
Reinert, Knut
dc.date.accessioned
2022-01-17T08:47:07Z
dc.date.available
2022-01-17T08:47:07Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/33565
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-33286
dc.description.abstract
Background
The function of non-coding RNA sequences is largely determined by their spatial conformation, namely the secondary structure of the molecule, formed by Watson–Crick interactions between nucleotides. Hence, modern RNA alignment algorithms routinely take structural information into account. In order to discover yet unknown RNA families and infer their possible functions, the structural alignment of RNAs is an essential task. This task demands a lot of computational resources, especially for aligning many long sequences, and it therefore requires efficient algorithms that utilize modern hardware when available. A subset of the secondary structures contains overlapping interactions (called pseudoknots), which add additional complexity to the problem and are often ignored in available software.
Results
We present the SeqAn-based software LaRA 2 that is significantly faster than comparable software for accurate pairwise and multiple alignments of structured RNA sequences. In contrast to other programs our approach can handle arbitrary pseudoknots. As an improved re-implementation of the LaRA tool for structural alignments, LaRA 2 uses multi-threading and vectorization for parallel execution and a new heuristic for computing a lower boundary of the solution. Our algorithmic improvements yield a program that is up to 130 times faster than the previous version.
Conclusions
With LaRA 2 we provide a tool to analyse large sets of RNA secondary structures in relatively short time, based on structural alignment. The produced alignments can be used to derive structural motifs for the search in genomic databases.
en
dc.format.extent
22 Seiten
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
RNA secondary structure
en
dc.subject
Integer linear program
en
dc.subject
Bioinformatics
en
dc.subject
Structural alignment
en
dc.subject
Parallel computing
en
dc.subject.ddc
000 Informatik, Informationswissenschaft, allgemeine Werke::000 Informatik, Wissen, Systeme::004 Datenverarbeitung; Informatik
dc.title
LaRA 2: parallel and vectorized program for sequence–structure alignment of RNA sequences
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.articlenumber
18
dcterms.bibliographicCitation.doi
10.1186/s12859-021-04532-7
dcterms.bibliographicCitation.journaltitle
BMC Bioinformatics
dcterms.bibliographicCitation.volume
23
dcterms.bibliographicCitation.url
https://doi.org/10.1186/s12859-021-04532-7
refubium.affiliation
Mathematik und Informatik
refubium.affiliation.other
Institut für Informatik

refubium.funding
Springer Nature DEAL
refubium.note.author
Die Publikation wurde aus Open Access Publikationsgeldern der Freien Universität Berlin gefördert.
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
1471-2105