OntoILPER: an ontology- and inductive logic programming-based system to extract entities and relations from text

Abstract : Named Entity Recognition (NER) and Relation Extraction (RE) are two important subtasks in Information Extraction (IE). Most current learning methods for NER and RE rely on supervised machine learning techniques, which yield more accurate results for NER than for RE. This paper presents OntoILPER, a system for extracting entity and relation instances from unstructured texts using an ontology and Inductive Logic Programming, a symbolic machine learning technique. OntoILPER uses the domain ontology and takes advantage of a more expressive relational hypothesis space for representing examples whose structure is relevant to IE. It induces extraction rules that subsume examples of entity and relation instances from a specific graph-based model of sentence representation. Furthermore, OntoILPER enables the exploitation of the domain ontology and further background knowledge in the form of relational features. To evaluate OntoILPER, several experiments were conducted over the TREC corpus for both the NER and RE tasks, and the results demonstrate its effectiveness in both. This paper also provides a comparative assessment of OntoILPER against other NER and RE systems, showing that OntoILPER is very competitive on NER and outperforms the selected systems on RE.
Document type :
Journal articles

Cited literature [56 references]

https://hal-amu.archives-ouvertes.fr/hal-01794571
Contributor : Bernard Espinasse
Submitted on : Thursday, March 28, 2019 - 11:19:26 AM
Last modification on : Monday, August 12, 2019 - 4:46:02 PM
Long-term archiving on : Saturday, June 29, 2019 - 12:06:11 PM

Citation

Rinaldo Lima, Bernard Espinasse, Fred Freitas. OntoILPER: an ontology- and inductive logic programming-based system to extract entities and relations from text. Knowledge and Information Systems (KAIS), Springer, 2017, 52 (2), pp.291 - 339. ⟨10.1007/s10115-017-1108-3⟩. ⟨hal-01794571⟩
