Fallatah, O., Zhang, Z. orcid.org/0000-0002-8587-8618 and Hopfgartner, F. orcid.org/0000-0003-0380-6088 (2020) A gold standard dataset for large knowledge graphs matching. In: Shvaiko, P., Euzenat, J., Jiménez-Ruiz, E., Hassanzadeh, O. and Trojahn, C., (eds.) Ontology Matching 2020 : Proceedings of the 15th International Workshop on Ontology Matching co-located with the 19th International Semantic Web Conference (ISWC 2020). 15th International Workshop on Ontology Matching co-located with the 19th International Semantic Web Conference (ISWC 2020), 02 Nov 2020, Virtual conference. CEUR Workshop Proceedings , pp. 24-35.
Abstract
In the last decade, a remarkable number of Knowledge Graphs (KGs) were developed, such as DBpedia, NELL and Google knowledge graph. These KGs are the core of many web-based applications such as query answering and semantic web navigation. The majority of these KGs are semi-automatically constructed, which has resulted in a significant degree of heterogeneity. KGs are highly complementary; thus, mapping them can benefit intelligent applications that require integrating different KGs such as recommendation systems and search engines. Although the problem of ontology matching has been investigated and a significant number of systems have been developed, the challenges of mapping large-scale KGs remain significant. In 2018, OAEI has introduced a specific track for KG matching systems. Nonetheless, a major limitation of the current benchmark is their lack of representation of real-world KGs. In this work we introduce a gold standard dataset for matching the schema of large, automatically constructed, less-well structured KGs based on DBpedia and NELL. We evaluate OAEI's various participating systems on this dataset, and show that matching large-scale and domain independent KGs is a more challenging task. We believe that the dataset which we make public in this work makes the largest domain-independent gold standard dataset for matching KG classes.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (http://creativecommons.org/licenses/by/4.0). |
Keywords: | Knowledge Graphs; Schema Matching; Evaluation Dataset |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 23 Apr 2021 10:51 |
Last Modified: | 24 Apr 2021 10:51 |
Published Version: | http://ceur-ws.org/Vol-2788/ |
Status: | Published |
Publisher: | CEUR Workshop Proceedings |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:173366 |