Development of a Keyphrase Extraction Method Based on a Probabilistic Topic Model



The article addresses the task of topic modeling. A new keyword extraction method based on topic modeling has been developed for analyzing a collection of documents that describe the goods of an online store. The proposed method is compared with a baseline keyword extraction method, and illustrative results are presented that show the advantages of the proposed approach. The resulting solution can be used to simplify site navigation and the search for relevant products.

General Information

Keywords: keyword extraction, topic modeling, NLP, LDA, machine learning

Journal rubric: Data Analysis


Received: 18.04.2022


For citation: Romanadze E.L., Sudakov V.A., Kislinsky V.G. Development of a Keyphrase Extraction Method Based on a Probabilistic Topic Model. Modelirovanie i analiz dannikh = Modelling and Data Analysis, 2022. Vol. 12, no. 2, pp. 20–33. DOI: 10.17759/mda.2022120202. (In Russ., abstr. in Engl.)


Information About the Authors

Ekaterina L. Romanadze, Graduate Student, Moscow Aviation Institute (National Research University) (MAI), Moscow, Russia

Vladimir A. Sudakov, Doctor of Engineering, Professor of Department 805, Moscow Aviation Institute (MAI), Leading Researcher, Keldysh Institute of Applied Mathematics (Russian Academy of Sciences), Moscow, Russia

Vadim G. Kislinsky, Researcher, Moscow Institute of Physics and Technology (National Research University) (MFTI), Moscow, Russia



