dc.contributor.author
Mehringer, Svenja
dc.contributor.author
Seiler, Enrico
dc.contributor.author
Droop, Felix
dc.contributor.author
Darvish, Mitra
dc.contributor.author
Rahn, René
dc.contributor.author
Vingron, Martin
dc.contributor.author
Reinert, Knut
dc.date.accessioned
2023-06-01T08:32:57Z
dc.date.available
2023-06-01T08:32:57Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/39632
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-39350
dc.description.abstract
We present a novel data structure for searching sequences in large databases: the Hierarchical Interleaved Bloom Filter (HIBF). It is extremely fast and space efficient, yet so general that it could serve as the underlying engine for many applications. We show that the HIBF is superior in build time, index size, and search time while achieving a comparable or better accuracy compared to other state-of-the-art tools. The HIBF builds an index up to 211 times faster, using up to 14 times less space, and can answer approximate membership queries faster by a factor of up to 129.
en
dc.format.extent
25 pages
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
Approximate membership query
en
dc.subject
Sequence search
en
dc.subject
Alignment free analysis
en
dc.subject
Bloom filter
en
dc.subject
Metagenomics
en
dc.subject.ddc
500 Natural sciences and mathematics::570 Life sciences; biology::570 Life sciences; biology
dc.title
Hierarchical Interleaved Bloom Filter: enabling ultrafast, approximate sequence queries
dc.type
Scientific article
dcterms.bibliographicCitation.articlenumber
131
dcterms.bibliographicCitation.doi
10.1186/s13059-023-02971-4
dcterms.bibliographicCitation.journaltitle
Genome Biology
dcterms.bibliographicCitation.volume
24
dcterms.bibliographicCitation.url
https://doi.org/10.1186/s13059-023-02971-4
refubium.affiliation
Mathematics and Computer Science
refubium.affiliation.other
Institute of Computer Science
refubium.funding
Springer Nature DEAL
refubium.note.author
The publication was funded by the Open Access publication funds of Freie Universität Berlin.
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
1474-760X