Text Detoxification System in Dialogue Conversations
This work aims to improve the civility of correspondence in dialog systems. Its key feature is a focus on real-time use and on robust detoxification that accounts for the specifics of dialog communication (typos, noise symbols, transliteration, etc.). The proposed solution combines a neural network approach with programmatic text processing to obtain token embeddings and then solve a classification problem. Unlike traditional message filters, the system aims to preserve the meaning of the source text while removing its toxic content. The system can be tried out in the Telegram messenger, where the model is available as a bot. It is deployed on a cloud provider's serverless platform, which allows it to adapt to peak loads while remaining easy to maintain.
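The pipeline outlined in the abstract (normalize noisy tokens, classify each token, drop the toxic ones, keep the rest) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: `TOXIC_LEXICON` is a tiny placeholder standing in for the neural token classifier, `CHAR_MAP` is a hypothetical substitution table, and the example uses English text, whereas the abstract does not specify the model, the normalization rules, or the target language.

```python
import re

# Placeholder for the neural token classifier mentioned in the abstract.
# A real system would score token embeddings; this lexicon is only an
# illustrative assumption, not the authors' model.
TOXIC_LEXICON = {"idiot", "stupid"}

# Hypothetical table of character substitutions used to disguise words.
CHAR_MAP = str.maketrans({"1": "i", "0": "o", "@": "a", "$": "s", "3": "e"})


def normalize(token: str) -> str:
    """Reduce a noisy token to a canonical lowercase form."""
    t = token.lower().translate(CHAR_MAP)  # undo simple substitutions
    t = re.sub(r"[^a-z]", "", t)           # drop noise symbols and punctuation
    t = re.sub(r"(.)\1{2,}", r"\1", t)     # collapse runs of 3+ repeated letters
    return t


def is_toxic(token: str) -> bool:
    """Token-level decision; stands in for the classification step."""
    return normalize(token) in TOXIC_LEXICON


def detoxify(message: str) -> str:
    """Remove tokens flagged as toxic and keep the remaining text unchanged."""
    kept = [tok for tok in message.split() if not is_toxic(tok)]
    return " ".join(kept)


if __name__ == "__main__":
    print(detoxify("that was a s.t.u.p.i.d move"))
```

Normalization is applied only when deciding whether a token is toxic; the tokens that are kept are emitted in their original form, which reflects the stated goal of preserving the source text rather than rewriting it.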
Keywords: detoxification of text, neural networks, serverless
Journal rubric: Data Analysis
Article type: scientific article
For citation: Suvorov M.D., Vinogradov V.I. Text Detoxification System in Dialogue Conversations. Modelirovanie i analiz dannikh = Modelling and Data Analysis, 2023. Vol. 13, no. 1, pp. 19–24. DOI: 10.17759/mda.2023130102. (In Russ., abstr. in Engl.)