KE-GCN

发表于 2021-05-17 更新于 2022-09-02 分类于 Paper ， KGE 阅读次数：本文字数： 2.7k 阅读时长 ≈ 2 分钟

Knowledge Embedding Based Graph Convolutional Network

WWW 2021, KE-GCN，提出了一个泛化的框架，将GCN和KGE的传统方法结合起来，认为在GCN中的信息传播过程是传播计算edge是否存在的得分函数\(f(u,r,v)\)对\(v\)的梯度，并且提出了对于relation的传播过程，在knowledge graph alignment和entity classification上进行了实验。

Recently, a considerable literature has grown up around the theme of Graph Convolutional Network (GCN). How to effectively leverage the rich structural information in complex graphs, such as knowledge graphs with heterogeneous types of entities and relations, is a primary open challenge in the field. Most GCN methods are either restricted to graphs with a homogeneous type of edges (e.g., citation links only), or focusing on representation learning for nodes only instead of jointly propagating and updating the embeddings of both nodes and edges for target-driven objectives. This paper addresses these limitations by proposing a novel framework, namely the Knowledge Embedding based Graph Convolutional Network (KE-GCN), which combines the power of GCNs in graphbased belief propagation and the strengths of advanced knowledge embedding (a.k.a. knowledge graph embedding) methods, and goes beyond. Our theoretical analysis shows that KE-GCN offers an elegant unification of several well-known GCN methods as specific cases, with a new perspective of graph convolution. Experimental results on benchmark datasets show the advantageous performance of KE-GCN over strong baseline methods in the tasks of knowledge graph alignment and entity classification .

1 Introduction

motivation：

传统的GCN方法主要假设在同质图上进行学习，忽略了KG中的relation蕴含的丰富的信息。
传统的KGE方法没有考虑graph的结构信息
将GCN和KGE结合的方法比如VR-GCN，COMPGCN等，在学习relation embedding的时候没有考虑entity embedding对relation embedding的影响

method：

为了解决上面的问题，提出了KE-GCN（Knowledge Embedding based Graph Convolution Network），能够结合KGE的方法，基于图卷积操作同时学习entity和relation embedding。

2 Method

2.1 Reformulation of Vanilla GCN

作者首先从新的角度看原始GCN的公式：

原来GCN的公式：

通过引入一个得分函数，重新定义GCN，假设引入得分函数\(f\)，该得分函数计算edge存在的score，对于已经存在的edge输出较大的值；对于不存在的边输出较小的值。假设\(f\)为求内积： \[ f(h_u,h_v)=h_u^T h_v \] 那么计算的消息\(h_u\)能够看做是\(f\)对\(v\)的梯度，那么所有的\(h_u\)加起来就成为下面的形式