dc.contributor.author
Efeoglu, Sefika
dc.contributor.author
Paschke, Adrian
dc.date.accessioned
2025-12-10T07:47:51Z
dc.date.available
2025-12-10T07:47:51Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/50768
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-50495
dc.description.abstract
Information extraction (IE) is a transformative process that converts unstructured text data into a structured format by employing entity and relation extraction (RE) methodologies. Identifying the relation between a pair of entities plays a crucial role within this framework. Despite the availability of various techniques for RE, their efficacy heavily depends on access to labeled data and substantial computational resources. To address these challenges, large language models (LLMs) have emerged as promising solutions; however, they are prone to generating hallucinated responses due to the limitations of their training data. To overcome these shortcomings, this work proposes a retrieval-augmented generation-based relation extraction (RAG4RE) approach to enhance RE performance. We evaluate the effectiveness of RAG4RE using various LLMs. By leveraging established benchmarks such as TACRED, TACREV, Re-TACRED and SemEval RE datasets, we aim to comprehensively assess the efficacy of our methodology. Specifically, we employ prominent LLMs, including Flan T5, Llama2, and Mistral, in our investigation. The results of our work demonstrate that RAG4RE outperforms traditional RE methods based solely on LLMs, with significant improvements observed in the TACRED dataset and its variations. Furthermore, our approach exhibits remarkable performance compared to previous RE methodologies across both TACRED and TACREV datasets, underscoring its efficacy and potential for advancing RE tasks in natural language processing.
en
dc.format.extent
20 Seiten
dc.rights
This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
relation extraction
en
dc.subject
large language models
en
dc.subject
retrieval-augmentation generation
en
dc.subject.ddc
000 Informatik, Informationswissenschaft, allgemeine Werke::000 Informatik, Wissen, Systeme::004 Datenverarbeitung; Informatik
dc.title
Retrieval-Augmented Generation-Based Relation Extraction
dc.type
Wissenschaftlicher Artikel
dc.date.updated
2025-12-09T19:59:23Z
dcterms.bibliographicCitation.doi
10.1177/22104968251385519
dcterms.bibliographicCitation.journaltitle
Semantic Web
dcterms.bibliographicCitation.number
5
dcterms.bibliographicCitation.volume
16
dcterms.bibliographicCitation.url
https://doi.org/10.1177/22104968251385519
refubium.affiliation
Mathematik und Informatik
refubium.affiliation.other
Institut für Informatik

refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
2210-4968
refubium.resourceType.provider
DeepGreen