FB15k-CVT: A Challenging Dataset for Knowledge Graph Embedding Models

Knowledge Graphs (KGs) are an essential component of neuro-symbolic AI. KG Embedding Models (KGEMs) are used to represent elements of a KG (its entities and relations) in a vector space, to enable efficient processing and reasoning over knowledge. Most KGEMs are evaluated against datasets derived from the Freebase KG: FB15k and FB15k-237. In this paper, we identify limitations in these datasets with respect to Compound Value Types (CVTs), which are nodes introduced in Freebase as a substitute for \uD835\uDC5B-ary relations. In FB15k and FB51k-237, CVTs have been removed, thereby eliminating valuable information. To evaluate whether KGEMs can learn semantically accurate representations of entities and relations in Freebase, we introduce here a new dataset named FB15k-CVT, which reintroduces the deleted CVT nodes. In a preliminary evaluation, we assess the limitations of baseline KGEMs (TransE, DistMult) in the presence of CVTs. The evaluation suggests that KGEMs based on tensor decomposition are more promising than translational models but, most of all, it calls for further experiments with KGEMs that can answer conjunctive queries or that preserve logical entailment.

Mots clés

Knowledge Graphs Neurosymbolic AI Knowldge Graph Embeddings Models FB15K-237

Domaines

Informatique [cs] Intelligence artificielle [cs.AI]

Fichier principal

paper32.pdf (870.3 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte
Licence	Paternité

Florent Breuil : Connectez-vous pour contacter le contributeur

https://hal-emse.ccsd.cnrs.fr/emse-04081543

Soumis le : dimanche 25 juin 2023-13:07:41

Dernière modification le : mardi 17 septembre 2024-15:45:00

Archivage à long terme le : mardi 26 septembre 2023-18:06:32

Dates et versions

emse-04081543 , version 1 (25-06-2023)

Licence

Paternité

Identifiants

HAL Id : emse-04081543 , version 1

Citer

Mouloud Iferroudjene, Victor Charpenay, Antoine Zimmermann. FB15k-CVT: A Challenging Dataset for Knowledge Graph Embedding Models. NeSy 2023, 17th International Workshop on Neural-Symbolic Learning and Reasoning, Jul 2023, Siena, Italy. pp.381-394. ⟨emse-04081543⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EMSE PRES_CLERMONT CNRS FAYOL-ENSMSE LIMOS ISCOD-ENSMSE CLERMONT-AUVERGNE-INP INSTITUT-MINES-TELECOM

268 Consultations

167 Téléchargements