论文标题
关系数据库摄入NOSQL数据仓库
Relational Databases Ingestion into a NoSQL Data Warehouse
论文作者
论文摘要
公司的数字转型导致数据库向大数据的发展。我们的工作是这种情况的一部分,尤其是提取存储在数据湖中并将数据存储在数据仓库中的数据集的机制。后者将在第二次允许决策分析。在本文中,我们将提取机制限于关系数据库。为了自动化此过程,我们使用了模型驱动的体系结构(MDA),该体系结构为模式转换提供了形式化的环境。从描述数据湖的物理图案中,我们提出了转换规则,允许创建存储在以文档为导向的NOSQL系统上的数据仓库。转换过程的实验已在医疗应用上进行。
The digital transformation of companies has led to the evolution of databases towards Big Data. Our work is part of this context and concerns more particularly the mechanisms to extract datasets stored in a Data Lake and to store the data in a Data Warehouse. The latter will allow, in a second time, decisional analysis. In this paper, we present the extraction mechanism limited to relational databases. To automate this process, we used the Model Driven Architecture (MDA), which offers a formalized environment for schema transformation. From the physical schemas describing a Data Lake, we propose transformation rules that allow the creation of a Data Warehouse stored on a document-oriented NoSQL system. An experimentation of the transformation process has been performed on a medical application.