Unsupervised Anomaly Detection in Knowledge Graphs

Asara Senaratne, Pouya Ghiasnezhad Omran, Graham Williams, Peter Christen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Anomalies such as redundant, inconsistent, contradictory, and deficient values in a knowledge graph are unavoidable, as such graphs are often curated manually, or extracted using machine learning and natural language processing techniques. Therefore, anomaly detection in knowledge graphs is an essential task that contributes towards its quality. Although there are approaches to detect anomalies in knowledge graphs, they are either domain dependent, not scalable to large graphs, or they require substantial human intervention. In this preliminary research paper we propose a novel unsupervised feature-based approach to anomaly detection in knowledge graphs. We first characterize triples in a directed edge-labelled knowledge graph using a set of binary features, and then use a one-class Support Vector Machine (SVM) to classify these triples as normal or abnormal. After selecting the features that have the highest consistency with the SVM outcomes, we provide a visualization of the identified anomalies, and the list of anomalous triples, thus supporting non-technical domain experts to understand the anomalies present in a knowledge graph. We evaluate our approach on the four knowledge graphs YAGO-1, KBpedia, Wikidata, and DSKG. This evaluation demonstrates that our approach is well suited to identify anomalies in knowledge graphs in an unsupervised manner, independent from the domain of the knowledge graph being evaluated.

Original languageEnglish
Title of host publicationProceedings of the 10th International Joint Conference on Knowledge Graphs
Subtitle of host publicationIJCKG 2021
EditorsOscar Corcho, Thepchai Supnithi, Xiaoyan Zhu, Aidan Hogan, Thanaruk Theeramunkong, Haofen Wang
Place of PublicationNew York
PublisherAssociation for Computing Machinery
Pages161-165
Number of pages5
ISBN (Electronic)978-1-4503-9565-6
DOIs
Publication statusPublished - 24 Jan 2022
Externally publishedYes
Event10th International Joint Conference on Knowledge Graphs - Virtual, Thailand
Duration: 6 Dec 20218 Dec 2021

Conference

Conference10th International Joint Conference on Knowledge Graphs
Abbreviated titleIJCKG 2021
Country/TerritoryThailand
CityVirtual
Period6/12/218/12/21

Keywords

  • binary feature library
  • Data quality assessment
  • edge-labelled graphs
  • one-class classifier
  • Visualization

Fingerprint

Dive into the research topics of 'Unsupervised Anomaly Detection in Knowledge Graphs'. Together they form a unique fingerprint.

Cite this