论文标题

Edukg:一个异质的可持续K-12教育知识图

EDUKG: a Heterogeneous Sustainable K-12 Educational Knowledge Graph

论文作者

Zhao, Bowen, Sun, Jiuding, Xu, Bin, Lu, Xingyu, Li, Yuchen, Yu, Jifan, Liu, Minghui, Zhang, Tingjian, Chen, Qiuyang, Li, Hanming, Hou, Lei, Li, Juanzi

论文摘要

网络和人工智能技术,尤其是语义网络和知识图(KG),最近在教育场景中引起了极大的关注。然而,从知识和数据角度来看,针对K-12教育的特定主题KGS仍然缺乏足够的和可持续性。为了解决这些问题,我们提出了Edukg,这是一个异质的可持续K-12教育知识图。我们首先设计了一个跨学科和细粒度的本体,用于在K-12教育中统一建模知识和资源,在该教育中,我们总共定义了635个类,445个对象属性和1314个数据类型属性。在这个本体论的指导下,我们提出了一种灵活的方法,用于从教科书中互动提取事实知识。此外,我们基于Edukg可持续维护的提议的广义实体链接系统建立了一种通用机制,该系统可以动态地为Edukg的知识主题动态地索引大量的异质资源和数据。我们进一步评估Edukg,以说明其充分性,丰富性和可变性。我们发布了Edukg,拥有超过2.52亿个实体和38.6亿个三胞胎。现在,我们的代码和数据存储库可在https://github.com/thu-keg/edukg上找到。

Web and artificial intelligence technologies, especially semantic web and knowledge graph (KG), have recently raised significant attention in educational scenarios. Nevertheless, subject-specific KGs for K-12 education still lack sufficiency and sustainability from knowledge and data perspectives. To tackle these issues, we propose EDUKG, a heterogeneous sustainable K-12 Educational Knowledge Graph. We first design an interdisciplinary and fine-grained ontology for uniformly modeling knowledge and resource in K-12 education, where we define 635 classes, 445 object properties, and 1314 datatype properties in total. Guided by this ontology, we propose a flexible methodology for interactively extracting factual knowledge from textbooks. Furthermore, we establish a general mechanism based on our proposed generalized entity linking system for EDUKG's sustainable maintenance, which can dynamically index numerous heterogeneous resources and data with knowledge topics in EDUKG. We further evaluate EDUKG to illustrate its sufficiency, richness, and variability. We publish EDUKG with more than 252 million entities and 3.86 billion triplets. Our code and data repository is now available at https://github.com/THU-KEG/EDUKG.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源