Voice Commands Recognition Method that Uses Special Spectral Density Transform

64

Abstract

This article discusses new approach to solving specific problem of speech recognition. The problem is formulated as problem of command recognition. Commands have equal lengths (in words). Each word position has its own set of word candidates. There were developed the models for solving this problem. The models consist of spectral density estimator, convolutional neural networks and naive Bayes classifier. The author performed a comparative analysis of the models and selected the best. There was developed a graphical user interface for interacting with the best model. The result can be used as a base for creation speech recognition methods for voice user interfaces.

General Information

Keywords: automatic speech recognition, spectral density estimation, convolutional neural networks

Journal rubric: Software

Article type: scientific article

DOI: https://doi.org/10.17759/mda.2022120304

For citation: Levonovich N.I. Voice Commands Recognition Method that Uses Special Spectral Density Transform. Modelirovanie i analiz dannikh = Modelling and Data Analysis, 2022. Vol. 12, no. 3, pp. 49–57. DOI: 10.17759/mda.2022120304. (In Russ., аbstr. in Engl.)

References

  1. Sak, Haşim, et al. "Fast and accurate recurrent neural network acoustic models for speech recognition." arXiv preprint arXiv:1507.06947 (2015).
  2. L. S. Kuravskii, S. N. Baranov Komp'yuternoe modelirovanie i analiz dannykh. Konspekty lektsii i uprazhneniya: Ucheb. posobie. – M.: RUSAVIA, 2012. – 218 p.
  3. Dumoulin, Vincent, and Francesco Visin. "A guide to convolution arithmetic for deep learning." arXiv preprint arXiv:1603.07285 (2016).
  4. Ch, Read, and ML MAP. "Bayesian learning." book: Machine Learning. McGraw-Hill Science/Engineering/Math (1997): 154-200.
  5. Virbel, Mathieu, Thomas Hansen, and Oleksandr Lobunets. "Kivy–a framework for rapid creation of innovative user interfaces." Workshop-Proceedings der Tagung Mensch & Computer 2011. überMEDIEN| ÜBERmorgen. Universitätsverlag Chemnitz, 2011.

Information About the Authors

Nikita I. Levonovich, student, Moscow State University of Psychology and Education (MSUPE), Moscow, Russia, ORCID: https://orcid.org/0000-0002-8580-0490, e-mail: levonikitatech@yandex.ru

Metrics

Views

Total: 193
Previous month: 7
Current month: 4

Downloads

Total: 64
Previous month: 6
Current month: 2