Haupttitel:
Hierarchical Interleaved Bloom Filter: enabling ultrafast, approximate sequence queries
Autor*in:
Mehringer, Svenja; Seiler, Enrico; Droop, Felix; Darvish, Mitra; Rahn, René; Vingron, Martin; Reinert, Knut
Datum der Freigabe:
2023-06-01T08:32:57Z
Abstract:
We present a novel data structure for searching sequences in large databases: the Hierarchical Interleaved Bloom Filter (HIBF). It is extremely fast and space efficient, yet so general that it could serve as the underlying engine for many applications. We show that the HIBF is superior in build time, index size, and search time while achieving a comparable or better accuracy compared to other state-of-the-art tools. The HIBF builds an index up to 211 times faster, using up to 14 times less space, and can answer approximate membership queries faster by a factor of up to 129.
Teil des Identifiers:
e-ISSN (online): 1474-760X
Freie Schlagwörter:
Approximate membership query
Sequence search
Miminizer
Alignment free analysis
Bloom filter
Metagenomics
DDC-Klassifikation:
570 Biowissenschaften; Biologie
Publikationstyp:
Wissenschaftlicher Artikel
Zeitschrift:
Genome Biology
Fachbereich/Einrichtung:
Mathematik und Informatik
Institut für Informatik
Anmerkungen:
Die Publikation wurde aus Open Access Publikationsgeldern der Freien Universität Berlin gefördert.