Before discussing individual algorithms, we will go through some high-level generalization of the embedding techniques which make each algorithm unique. This will help us differentiate and hence appreciate the individual algorithms.
Generalization of embedding methods
Embedding is the way of representing an object from its existing environment to another.
Knowledge graph embedding includes representation of relations and entities into continuous space.
Models for KG embedding can be categorised based on their answer for following questions, (Ji_2021)
What is the representation space in which the relations and entities are represented?
What is the scoring function for measuring the plausibility of factual triples?
Representation space
Point-wise Euclidean space
The most common representation space.
Embedding space is Euclidean real valued vector or matrix space.
Easy to understand; Not ideal for graphical (tree-like) structure.
Complex vector space
Entities and relations are represented in a complex space
Taking head entity as an example, h has a real part Re(h) and an imaginary part Im(h), i.e., \(\textbf{h}=Re(\textbf{h}) + i Im(\textbf{h})\)
Can capture anti-symmetric relations better than operations in Euclidean space.
Gaussian distribution space
Entities and relations are represented as probabilistic distribution
Applicable if you want to capture uncertainties.
Manifold space
Entities and relations are represented in a well defined topological space
Good for graphical (tree-like) structure.
Scoring functions
Distance based
Measure plausibility of facts by calculating the distance between the entities.
Additive translation with relation is the most widely used method i.e. \(\textbf{h} + \textbf{r} \approx \textbf{t}\)
Similarity based
Measure plausibility of the facts by semantic similarity matching
Multiplicative formulation is most widely used method i.e. \(\textbf{h}^T \textbf{M}_r \approx \textbf{t}^T\) , use relation matrix to transform head entity into tail entity.
Algorithm Comparison
A holistic comparison of different knowledge graph emebdding techniques w.r.t. category and scoring function is provided below,