INTRODUCTION
The previous decades have been marked by rapid developments in artificial intelligence (AI). The number of scientific articles on the use of AI techniques is increasing. Machine learning is a fundamental area of AI that allows a computer to analyze data and extract information without explicit programming. Unlike the traditional approach, where it is necessary to write a special code to solve a specific problem (e.g., determining an image of a cat), in machine learning a model is generated with a large amount of data (e.g., images of cats and images of non-cats) and is allowed to “learn” based on this generated data. Following that, the model is able to predict or aporton new data (e.g., determine whether the new image is a cat) that were not used in the original dataset [1]. In scientific papers, machine learning is used as a tool with many practical applications, including pattern recognition, data analysis, event prediction, and much more [2, 3]. Models created using machine learning techniques are used in many fields of the sciences, such as physics, chemistry, mathematics, economics (forecasting financial markets [4]) and in bioinformatics for the analysis of biological data, such as genomes, proteomes, and metabolomes [5]. Models created using machine learning techniques are also used in medicine; they can help specialists in decision-making, in diagnosing and predicting the development of diseases, in monitoring patients’ health using mobile applications, in predicting epidemic outbreaks, etc. [6–8].
Machine learning algorithms have also found use in the diagnosis of schizophrenia. Schizophrenia is a chronic, progressive psychiatric disorder with an incidence of 4 to 6 per 1,000 population. The prevalence of schizophrenia is quite identical in females and males and is slightly higher in residents of urban communities compared to residents of rural areas [9–11]. The diagnosis of schizophrenia, according to the DSM-5 criteria, is based solely on clinical signs [12]. This may make it difficult to accurately diagnose disorders that are similar in some cases, such as schizophrenia and autism spectrum disorders [13], or schizophrenia and bipolar disorder [14]. To increase the likeliness of a good prognosis and a high quality of life for patients with schizophrenia, it is crucial to quickly and accurately detect the clinical symptoms of the disease and prescribe treatment in a timely manner [15, 16]. In the context of schizophrenia, machine learning techniques open up new opportunities for more accurate diagnosis and prediction of the clinical features of the illness. One of the advantages of using machine learning techniques is the ability to process large amounts of data, as well as the ability to analyze information that is diverse in nature [17]; e.g., individual clinical manifestations, neuroimaging test findings, patient’s history data, genetic data, the patient’s voice, etc. Based on these data, both diagnostic and prognostic models have been generated. Diagnostic models help specialists identify the disease more accurately, and prognostic models can help predict the development course of schizophrenia [18], as well as its clinical manifestations and complications, including the risk of suicide [19]. Furthermore, machine learning can help identify new biomarkers associated with schizophrenia, which can improve our understanding of the mechanisms underpinning this disorder and contribute to the development of more effective therapeutic methods [20, 21].
There has been a significant increase in the number of studies that seek to evaluate the diagnosis of schizophrenia and predict its clinical course using machine learning techniques. However, the field is characterized by a wide variety of topics and multiple publications, which requires a systematic review of available information. First, a literature review on this topic allows one to identify the most effective machine learning techniques used to predict and diagnose schizophrenia; second, it allows one to also identify promising areas of future research into the use of AI in psychiatry.
Thus, this literature review aimed to probe for information on the use of machine learning techniques in predicting and diagnosing schizophrenia and trying to identify its clinical features, as well as generalizing data and identifying key findings that can provide a better understanding of the current state of research in this field.
METHODS
Scientific papers were searched in the Google Scholar, PubMed, and eLIBRARY.ru databases, and the publications included in the lists of references of thematic reviews were also analyzed. Search queries included various combinations of the following words: “machine learning”, “deep learning”, “schizophrenia”, “neural network”, “predictors”, “artificial intelligence”, “diagnostics”. The following keywords were used to search for papers devoted to the identification of the clinical features and complications of schizophrenia: “suicide”, “depression”, “insomnia”, cognitive”. These keywords were combined to create search queries; e.g., “machine learning”, “predictors” AND “schizophrenia”. The review included studies into the use of various AI technologies in the context of schizophrenia diagnosed according to the DSM-IV, DSM IV-TR, DSM-5, ICD-10, and ICD-11 criteria published no earlier than January 1, 2010, and no later than March 31, 2023, without any language restrictions. This time interval was chosen because of the substantial increase in the number of publications on this topic from 2010 to the present. The review included original studies, regardless of their design, evaluating the use of various machine learning techniques in the diagnosis of schizophrenia and determination of its clinical features in patients with both a first episode of schizophrenia and the chronic form of the disease. Descriptive analysis was used to summarize the retrieved data.
RESULTS
Based on the search results, 38 papers were included in the review. Then the sections containing information on the use of different AI technologies in the functional testing (electroencephalography (EEG) and magnetic resonance imaging (MRI), actigraphy) of patients with schizophrenia, the analysis of their mental capacities (speech, behavior, creativity), the evaluation of their history and genetic data, as well as the prediction of complications, outcomes of the disease, and its individual manifestations, were determined. Each of the listed aspects of the use of AI technologies is reviewed below.
Machine learning techniques in the functional assessment of patients with schizophrenia
In addition to psychiatric interviews and neuropsychological testing, other investigative (EEG, MRI) techniques are used in the diagnosis of schizophrenia to rule out the presence of other disorders, as well as for research purposes. In a study by Di Lorenzo et al., the authors revealed that people with schizophrenia demonstrated lower alpha rhythms on EEG in the frontal and central regions of the brain compared to the control groups. The level of the alpha rhythm is known to be associated with mental processes, such as attention and memory. The authors suggested that a low level of alpha rhythm may be associated with cognitive impairment and impaired mental abilities in patients with schizophrenia [22]. In another study, scientists found that in people with schizophrenia, interhemispheric connectivity was significantly lower in the frontal and parietal lobes compared to the control group [23].
Despite the fact that EEG is not used to diagnose schizophrenia in routine clinical practice, machine learning techniques can improve the accuracy of the diagnosis of schizophrenia based on EEG findings. In an article by Sun et al., researchers concerted EEG signals into a series of images then a hybrid deep neural network was built and trained, which could help distinguish the EEG signals of a healthy person from those of a person with schizophrenia with 99.22% accuracy [24]. In another, similar study in which a convolutional neural network was used, the accuracy was also high, reaching 98.07% [25]. Neural networks are widely used in the classification of EEG signals, and scientists also suggest that neural networks trained to classify EEG findings can be useful in early detection of schizophrenia [26–28].
Neural networks are being relied upon with increasing frequency in the analysis of 3D MRI images of the brain. In a study by Chen et al., the researchers trained a convolutional neural network to classify MR images of people with schizophrenia with a probability of 85%. Likewise, with the help of a neural network, suspected biomarkers of schizophrenia were identified: namely, abnormal structural changes in the cerebellum, fusiform gyrus, and the temporal, occipital, and frontal brain lobes [29]. In another study, researchers analyzed MR images of people with schizophrenia, bipolar disorder, and mentally healthy people. As a result, models based on machine learning algorithms were built to distinguish an image of a person with schizophrenia from that of a healthy person with an average accuracy of 90%, and from a person with bipolar disorder with an accuracy of 88% [30]. In a study by Oh et al., the authors successfully used a convolutional neural network to classify MR images of patients with schizophrenia with an accuracy of 84.15~84.43% and they revealed that the most significant brain regions were the low and middle temporal lobes [31]. In another paper that pursued the same objective, the researchers applied the M3 method (multimodal imaging and multi-level characterization with multi-classifier) and achieved an accuracy of 83.49% [32].
There are models that have been created using machine learning techniques that can be used to diagnose schizophrenia using actigraphy. In one study, scientists collected data from Actiwatch bracelets that recorded the acceleration amplitude of the sensor, thereby reflecting the motor activity of the participants during the day. Using a convolutional neural network, the researchers successfully distinguished patients with schizophrenia from those suffering from mood disorders against the control group patients. At the same time, patients with schizophrenia showed the lowest motor activity [33]. In another study, researchers analyzed the patterns of nocturnal activity in individuals at risk of developing schizophrenia, people with bipolar disorder, and healthy people. Using various machine learning algorithms, the scientists created models that could identify a respondent at risk of schizophrenia and bipolar disorder [34].
Machine learning techniques in the analysis of speech, behavior, and creativity in people with schizophrenia
Machine learning is used to analyze written and spoken speech. In a study by Bae et al., the authors used a neuronal network to analyze the linguistic patterns of people with and without schizophrenia on the Reddit social network. On this social platform, people can create different topics, discuss them, and share something important. Researchers compared the topics created about schizophrenia with topics about humor, fitness, meditation, parenting, etc. It turned out that people describing their mental issues used fewer singular first and third person pronouns, and, vice versa, a greater number of impersonal pronouns, plural second, and third person pronouns. It has also been observed that people with mental illnesses are less likely to use the past tense as well as words describing positive emotions, and that they are more likely to use words associated with negative emotions [35]. In a study evaluating the linguistic characteristics of people with schizophrenia on the Twitter social network, it was found that people diagnosed with schizophrenia more often used interpersonal pronouns in their texts and were more likely to put less emphasis on friendship and more on biological needs [36].
Neural networks are also capable of processing audio information. In a study by Fu et al., researchers created a Sch-net neuronal network which was able to distinguish the speech of a person with schizophrenia from that of a mentally healthy person with 97.68% accuracy [37]. In a study by Tahir et al., researchers used a system based on machine learning algorithms to automatically predict the presence of “negative symptoms” of schizophrenia based on the speech characteristics. This also could identify the voice of a person with schizophrenia with 81.3% accuracy [38].
There is also data in the literature on the use of the SchiNet convolutional neural network in the analysis of facial behavior during psychiatric interviews of people with schizophrenia. Researchers have concluded that automated identification of facial behavioral patterns is a reliable means of identifying “negative symptoms” of schizophrenia [39]. In another study, a convolutional neuronal network could recognize people with schizophrenia by video recordings. The recording of the face was carried out following various types of emotional stimulation; based on the information collected, the neural network determined a person with schizophrenia with 89% accuracy [40].
A study conducted by Vasilchenko and Usov published the results of the use of a convolutional neural network in the classification of drawings made by people with schizophrenia based on the images of a human face drawn by them; the accuracy in providing correct answers amounted to 82% [41]. In a study by Shen et al., the authors evaluated the categorization of color drawings created by people with schizophrenia and control group subjects using a convolutional neural network. The study showed that people with schizophrenia were more likely to use fewer colors in their drawings, draw irregular lines, and draw more lines near the center of the image compared to the control group. The accuracy demonstrated by the neural network was 90%. Using a neural network analysis, the investigators could reliably predict the results of the Positive and Negative Syndrome Scale (PANSS) assessment using drawings made by people with schizophrenia. The model could predict high scores both in the general scale and the subscales [42].
Diagnosis of schizophrenia based on genetic information using artificial intelligence
There are psychiatric disorders that come with similar symptoms, such as schizophrenia and bipolar disorder, as well as schizophrenia and autism spectrum disorders. This may make the diagnosis of such disorders difficult and result in the use of inadequate therapy. Machine learning techniques-based models can use genetic data to help alleviate this problem. For example, in a study by Karthik et al., researchers used genetic information to teach a neural network how to separate schizophrenia from bipolar disorder. Models based on machine learning techniques helped identify genetic patterns consisting of 75 genes for schizophrenia and 67 genes for bipolar disorder; the probability of correct assumptions by the constructed neural network stood at 95.65% and 97.01%, respectively [43]. In a study by Sardaar et al., researchers compared the genome architecture of schizophrenia and autism spectrum disorders in order to look for “nodal” genes for these disorders. Using a model based on the regularized “gradient boosted machine” (GBM) technique, researchers separated patients with these diseases with an accuracy of 86–88%. They were also able to identify the “nodal” genes responsible for the transmembrane ion transport, neurotransmitter transport, and the processes in the cytoskeleton associated with schizophrenia [44]. In another study, a neural network used information on 792 genetic markers to allocate respondents to a control group and people with schizophrenia with an accuracy of 87.9% [45]. The research conducted by Gunasekaraet al., was also aimed at identifying schizophrenia in the study subjects. In that study, the authors used a SPLS-DA machine learning technique to successfully identify schizophrenia based on epigenetic data: namely, the methylation of various DNA sites [46]. In another study, scientists pointed at the possibility of distinguishing people with and without schizophrenia using G72 gene single-nucleotide polymorphisms, as well as the plasma level of the G72 protein. A naive Bayes classifier turned out to be the best model (AUC=0.9356) [47]. Aguiar-Pulido et al. studied single-nucleotide polymorphisms in the HTR2A and DRD3 genes. They employed neural network analysis of genetic information to identify the genotypes of people with schizophrenia, and the accuracy in the exercise ranged from 78.3 to 93.8% [48].
Analysis of patient’s history data using machine learning techniques in the early diagnosis and prevention of schizophrenia
In a large study by Raket et al., researchers used information obtained from the electronic medical records (4,899 events) of the control group (N=72,860) and people with schizophrenia (N=72,860) to predict the development of the first episode of psychosis within one year before its onset. To create a model capable of solving such a problem, the method of recurrent neural network analysis was selected and the probability of a correct predication was 0.774 [49]. Fusar-Poli et al. used machine learning techniques to create a model to predict the development of a psychotic episode in people with a clinically high risk of psychosis. The authors identified the following most significant predictors: high scores on the scales for positive and negative symptoms and disorganization on the Brief Psychiatric Rating Scale, Expanded (BPRS-E), and a small number of years of formal education [50]. Then, the model was improved with the addition of other predictors such as tearfulness, poor appetite, weight loss, cannabis use, cocaine, guilt, hopelessness, irritability, delusion, sleep disorders, lack of insight, arousal, and paranoia. The accuracy of the model based on the Harrell’s C-index was 0.085 [51]. In another study, 500 health records of patients with psychosis were analyzed using deep learning of the neural network. This neural network was able to identify the health record of a person with schizophrenia with an accuracy of 92.5%. The most important factor in detecting the disease was age [52].
Predicting the clinical features of schizophrenia
In addition to the main symptoms, the clinical symptoms of schizophrenia may include insomnia, depressive, anxiety, suicidal thoughts, and other symptoms [53–55]. These additional symptoms of schizophrenia may aggravate its course, making treatment more difficult [56, 57].
Insomnia often accompanies schizophrenia exacerbations and may herald an emergent psychotic episode. Insomnia also complicates the course of schizophrenia exacerbation and worsens the clinical prognosis and quality of life of patients [58, 59]. Its diagnosis is of great clinical importance. Kalinich et al., developed an application using machine learning which not only assumed the presence of schizophrenia in the respondent, but also predicted the development of insomnia and neurocognitive deficit. In that application, the subjects were asked to answer several questions and play a mini-game [60]. We have also constructed and trained a neural network capable of predicting the development of insomnia during hospitalization with 72% accuracy based on a patient’s medical history and statistical data [61].
One of the most unfortunate complications of schizophrenia is suicide, which can result from symptoms of depression. With the help of AI, scientists have the opportunity to predict the development of depressive manifestations in a person with schizophrenia [62]. Hettige et al., used models created using machine learning techniques to identify individuals with schizophrenia and suicidal attempts. The authors used social, cultural, statistical, medical history, and clinical data from their medical records as input data. The most significant factors in determining suicidal behavior were age, the results of the Childhood Trauma Questionnaire (CTQ) “emotional abuse” subscale, the total CTQ score, the duration of the disease, and the scores on the neuroticism scale in the NEO Five Factor Inventory (NEO-FFI) [63]. In another study, which also used machine learning techniques, the most significant predictors of suicide attempts were sexual abuse in childhood and the knowledge that one is suffering from a mental disorder [64].
An important aspect of the clinical manifestations of schizophrenia is aggressive, violent behavior. It is known that the risk of committing violent crimes in female and male patients suffering from the schizophrenic spectrum disease is 1 in 20 and 1 in 4, respectively [65]. Scientists from Switzerland tried to establish the factors associated with violent behavior using machine learning techniques. The authors concluded that a large number of stress factors affect the frequency of violent crimes by people with schizophrenia. The most important factors included social isolation in adulthood, involuntary psychiatric treatment, unemployment, estrangement from family, and failure in school. The model based on classification trees determined a person who had committed a violent crime with 91.57% accuracy [66].
Kanchanatawan et al. used neural network predictions to demonstrate that the severity of “negative” symptoms, symptoms of mannerism, arousal, and hostility are very accurate predictors of affective and psychosomatic symptoms in schizophrenia [67]. Models based on machine learning techniques can also predict the outcomes of schizophrenia. In a study by Lin et al., low scores on the quality-of-life scale were associated with the severity of “negative” and depressive symptoms and low results on the global functioning assessment scale were associated with the severity of “positive” and “negative” symptoms in schizophrenia. Cognitive impairment was also evaluated in that study. Researchers were able to predict changes over time in cognitive functions using machine learning techniques based on test results and the analysis of cognitive domains; the most significant predicting factor was the speed of information processing [68]. In another study, the significant factors able to predict the state of neurocognitive functions were memory disorders, executive dysfunction, as well as disorders of concentration and fluency of speech [69].
DISCUSSION
Our analysis of published literature included articles relating to the use of AI for the development of the diagnostic and prognostic models used in the context of schizophrenia. Diagnostic models are used to more accurately identify schizophrenia by analyzing EEG signals, MRI scans, mental abilities (speech, voice, emotions, visual art), and genetic and epigenetic information. Predictive models created with the help of AI technologies can be used for early identification of persons at high risk of psychosis, including the first episode of schizophrenia, as well as for predicting the outcomes of schizophrenia. Prognostic models are able to predict individual clinical symptoms and complications in schizophrenia.
Based on the information provided so far, it can be predicted that in the near future various methods may find wider application in psychiatric practice. Some algorithms have already been approved by the U.S. Food and Drug Administration (FDA) [70]. In this regard, many doctors and scientists are concerned about the ethical question of AI use in medicine and, particularly, in psychiatry. Researchers are concerned about the confidentiality of information, the accuracy of calculations, the safety of the use of algorithms, and possible disregard for the individual characteristics of a patient [70, 71].
AI algorithms, including those based on machine learning, are only as good as the data they were trained on [72]. If the training data is biased, incomplete, or of poor quality, then the operation of the AI system may be disrupted, which will lead to inaccurate or unreliable results. Therefore, it is very important to check the results obtained with the help of AI technologies using traditional diagnostic methods to ensure their accuracy. Also, models may be sensitive to input bias and models may make mistakes in situations that are very different from those on which they were trained, which makes the model less reliable. Therefore, some scientists propose to introduce tools into AI-based models that can shore up their reliability; for example, by comparing the training set against each new instance introduced into the model [73]. It can be difficult to understand the decisions made by some models based on machine learning techniques; e.g., deep neural networks. Such models are called “black box” models in the literature [74, 75]. In this regard, there arises a question on the ethics of entrusting the patient’s health to the “internal logic” of AI, which is not controlled by humans [76]. In our opinion, machine learning techniques should be used with extreme caution in clinical practice, only in conjunction with the main diagnostic tests, checking the results provided by AI and, of course, informing patients if recommendations based on the use of machine learning techniques are used.
The limitations of this review included a nonsystematic search for information, continuous inclusion of any type of studies, and lack of any assessment of the quality of the included studies. Also, the small sample size in a number of studies does not allow one to extrapolate the results to all people with schizophrenia.
There are several aspects of practical significance in the obtained results.
First, the use of models based on machine learning techniques in the diagnosis of schizophrenia makes it possible to achieve a more accurate and reliable qualification of this mental illness. An accurate diagnosis is key in providing the patient with adequate medical care.
Second, predictive models created using AI technologies can help with early identification of individuals at high risk of developing psychotic episodes, including the first episode of schizophrenia. This can be especially useful because early detection of schizophrenia can help prevent or reduce the severity of this mental illness. Moreover, prognostic models can predict the outcomes of schizophrenia, which can help clinicians and patients choose the most appropriate treatment and plan long-term care.
Third, the ability to predict individual clinical symptoms and complications of schizophrenia using prognostic models is of great importance for an individualized approach to treatment. This means that clinicians can predict what symptoms and complications may occur in specific patients and choose treatment according to their individual needs. This personalized approach can improve the effectiveness of treatment and clinical outcomes. However, no model can ensure a perfect result, which may be explained by the inability to include all patient features contributing to the final outcome in the prognostic or diagnostic model. In addition, models “overloaded” with input data can become unstable and produce poorer results compared to balanced models.
In general, the results of this review indicate the significant potential of machine learning techniques in the field of diagnosis and prediction of schizophrenia and its clinical features. These methods can significantly improve our understanding, diagnosis, and treatment of this psychiatric disorder, which ultimately can lead to an improvement in the lives of people with schizophrenia and lessen the state’s economic burden associated with this disease.
CONCLUSION
Machine learning techniques are used both to identify schizophrenia (diagnostic algorithms) and to predict the manifestation of the disease or the clinical features of a known illness (prognostic algorithms).
So far, the ethical issues associated with the use of these techniques remain unresolved and the clinical reliability of these models remains unclear, which limits, at this point, our ability to use these algorithms in clinical practice. Nevertheless, the use of machine learning algorithms remains one of the most promising areas in psychiatry, and it can significantly improve the effectiveness of diagnosis and treatment when dealing with schizophrenia.