dc.contributor.author
Kirchhoff, Agnes
dc.contributor.author
Bügel, Ulrich
dc.contributor.author
Santamaria, Eduard
dc.contributor.author
Reimeier, Fabian
dc.contributor.author
Röpert, Dominik
dc.contributor.author
Tebbje, Alexander
dc.contributor.author
Güntsch, Anton
dc.contributor.author
Chaves, Fernando
dc.contributor.author
Steinke, Karl-Heinz
dc.contributor.author
Berendsohn, Walter G.
dc.date.accessioned
2018-10-25T13:41:08Z
dc.date.available
2018-10-25T13:41:08Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/23122
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-916
dc.description.abstract
Over the past years, herbarium collections worldwide have started to digitize millions of specimens on an industrial scale. Although the imaging costs are steadily falling, capturing the accompanying label information is still predominantly done manually and develops into the principal cost factor. In order to streamline the process of capturing herbarium specimen metadata, we specified a formal extensible workflow integrating a wide range of automated specimen image analysis services. We implemented the workflow on the basis of OpenRefine together with a plugin for handling service calls and responses. The evolving system presently covers the generation of optical character recognition (OCR) from specimen images, the identification of regions of interest in images and the extraction of meaningful information items from OCR. These implementations were developed as part of the Deutsche Forschungsgemeinschaft-funded a standardised and optimised process for data acquisition from digital images of herbarium specimens (StanDAP-Herb) Project.
en
dc.format.extent
11 Seiten
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
herbarium collection
en
dc.subject
image analysis
dc.subject.ddc
500 Naturwissenschaften und Mathematik::580 Pflanzen (Botanik)::580 Pflanzen (Botanik)
dc.subject.ddc
600 Technik, Medizin, angewandte Wissenschaften::650 Management, Öffentlichkeitsarbeit::658 Allgemeines Management
dc.title
Toward a service-based workflow for automated information extraction from herbarium specimens
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.doi
10.1093/database/bay103
dcterms.bibliographicCitation.journaltitle
Database
dcterms.bibliographicCitation.pagestart
1
dcterms.bibliographicCitation.pageend
11
dcterms.bibliographicCitation.volume
2018
dcterms.bibliographicCitation.url
https://doi.org/10.1093/database/bay103
de
refubium.affiliation
Botanischer Garten und Botanisches Museum Berlin-Dahlem (BGBM)
refubium.funding
Deutsche Forschungsgemeinschaft (DFG)
refubium.note.author
Gefördert durch die DFG und den Open-Access-Publikationsfonds der Freien Universität Berlin.
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.issn
1758-0463