language-agnostic relation extraction from abstracts in wikis

;Nicolas Heist;Sven Hertling;Heiko Paulheim

doi:10.3390/info9040075

language-agnostic relation extraction from abstracts in wikis

Clicks: 208

ID: 228562

2018

Free PDF

Article Quality & Performance Metrics

Overall Quality Improving Quality

0.0 /100

Combines engagement data with AI-assessed academic quality

Reader Engagement Steady Performance

30.0 /100

205 views

33 readers

AI Quality Assessment

Not analyzed

Abstract

EN
- Turkish
- Spanish
- Portuguese
- Arabic
- Chinese
- French
- German
- Indonesian
- Russian
- Thai

Large-scale knowledge graphs, such as DBpedia, Wikidata, or YAGO, can be enhanced by relation extraction from text, using the data in the knowledge graph as training data, i.e., using distant supervision. While most existing approaches use language-specific methods (usually for English), we present a language-agnostic approach that exploits background knowledge from the graph instead of language-specific techniques and builds machine learning models only from language-independent features. We demonstrate the extraction of relations from Wikipedia abstracts, using the twelve largest language editions of Wikipedia. From those, we can extract 1.6 M new relations in DBpedia at a level of precision of 95%, using a RandomForest classifier trained only on language-independent features. We furthermore investigate the similarity of models for different languages and show an exemplary geographical breakdown of the information extracted. In a second series of experiments, we show how the approach can be transferred to DBkWik, a knowledge graph extracted from thousands of Wikis. We discuss the challenges and first results of extracting relations from a larger set of Wikis, using a less formalized knowledge graph.

Reference Key	heist2018informationlanguage-agnostic Use this key to autocite in the manuscript while using SciMatic Manuscript Manager or Thesis Manager
Authors	;Nicolas Heist;Sven Hertling;Heiko Paulheim
Journal	psychoanalytic review
Year	2018
DOI	10.3390/info9040075 Searching for DOI...
URL	http://www.mdpi.com/2078-2489/9/4/75 https://doi.org/10.3390/info9040075
Keywords	wikipedia relation extraction wiki farmsinformation technology

Citations

No citations found. To add a citation, contact the admin at info@scimatic.org

Comments

Login to comment Register

No comments yet. Be the first to comment on this article.