Development of an ETL Process Based on Open Source Technologies to Solve the Problem of Data Delivery to Consumers



The article discusses the development of an ETL process for a data warehouse based on open source technologies rather than proprietary vendor-supplied software. The process delivers data from the source to the consumer, with a focus on delivery speed, resource consumption, and ease of development. The article presents the solution architecture together with a description of the processes being replaced, and implements data transfer through the new process. It draws on modern data processing tools and describes how to interact with them and how the technical characteristics of the process were selected.

General Information

Keywords: database, open source, software, ETL process, data delivery

Journal rubric: Software

Article type: scientific article


Received: 12.04.2023

For citation: Starkov V.V., Gorbatova S.S., Vodolaga V.I. Development of an ETL Process Based on Open Source Technologies to Solve the Problem of Data Delivery to Consumers. Modelirovanie i analiz dannikh = Modelling and Data Analysis, 2023. Vol. 13, no. 2, pp. 180–193. DOI: 10.17759/mda.2023130210. (In Russ., abstr. in Engl.)


Information About the Authors

Viacheslav V. Starkov

Svetlana S. Gorbatova, Senior Lecturer, Moscow Institute of Steel and Alloys (National Research Technological University) (NUST MISIS), Moscow, Russia

Victoria I. Vodolaga, Master's Degree, Lomonosov Moscow State University (MSU), Moscow, Russia



