Loading [MathJax]/jax/output/SVG/jax.js
Research article Special Issues

How artificial intelligence reduces human bias in diagnostics?

  • Accurate diagnostics of neurological disorders often rely on behavioral assessments, yet traditional methods rooted in manual observations and scoring are labor-intensive, subjective, and prone to human bias. Artificial Intelligence (AI), particularly Deep Neural Networks (DNNs), offers transformative potential to overcome these limitations by automating behavioral analyses and reducing biases in diagnostic practices. DNNs excel in processing complex, high-dimensional data, allowing for the detection of subtle behavioral patterns critical for diagnosing neurological disorders such as Parkinson's disease, strokes, or spinal cord injuries. This review explores how AI-driven approaches can mitigate observer biases, thereby emphasizing the use of explainable DNNs to enhance objectivity in diagnostics. Explainable AI techniques enable the identification of which features in data are used by DNNs to make decisions. In a data-driven manner, this allows one to uncover novel insights that may elude human experts. For instance, explainable DNN techniques have revealed previously unnoticed diagnostic markers, such as posture changes, which can enhance the sensitivity of behavioral diagnostic assessments. Furthermore, by providing interpretable outputs, explainable DNNs build trust in AI-driven systems and support the development of unbiased, evidence-based diagnostic tools. In addition, this review discusses challenges such as data quality, model interpretability, and ethical considerations. By illustrating the role of AI in reshaping diagnostic methods, this paper highlights its potential to revolutionize clinical practices, thus paving the way for more objective and reliable assessments of neurological disorders.

    Citation: Artur Luczak. How artificial intelligence reduces human bias in diagnostics?[J]. AIMS Bioengineering, 2025, 12(1): 69-89. doi: 10.3934/bioeng.2025004

    Related Papers:

    [1] Lei Chen, Kaiyu Chen, Bo Zhou . Inferring drug-disease associations by a deep analysis on drug and disease networks. Mathematical Biosciences and Engineering, 2023, 20(8): 14136-14157. doi: 10.3934/mbe.2023632
    [2] Bo Zhou, Bing Ran, Lei Chen . A GraphSAGE-based model with fingerprints only to predict drug-drug interactions. Mathematical Biosciences and Engineering, 2024, 21(2): 2922-2942. doi: 10.3934/mbe.2024130
    [3] Saranya Muniyappan, Arockia Xavier Annie Rayan, Geetha Thekkumpurath Varrieth . DTiGNN: Learning drug-target embedding from a heterogeneous biological network based on a two-level attention-based graph neural network. Mathematical Biosciences and Engineering, 2023, 20(5): 9530-9571. doi: 10.3934/mbe.2023419
    [4] Rishin Haldar, Swathi Jamjala Narayanan . A novel ensemble based recommendation approach using network based analysis for identification of effective drugs for Tuberculosis. Mathematical Biosciences and Engineering, 2022, 19(1): 873-891. doi: 10.3934/mbe.2022040
    [5] Wanying Xu, Xixin Yang, Yuanlin Guan, Xiaoqing Cheng, Yu Wang . Integrative approach for predicting drug-target interactions via matrix factorization and broad learning systems. Mathematical Biosciences and Engineering, 2024, 21(2): 2608-2625. doi: 10.3934/mbe.2024115
    [6] Xiujun Zhang, H. G. Govardhana Reddy, Arcot Usha, M. C. Shanmukha, Mohammad Reza Farahani, Mehdi Alaeiyan . A study on anti-malaria drugs using degree-based topological indices through QSPR analysis. Mathematical Biosciences and Engineering, 2023, 20(2): 3594-3609. doi: 10.3934/mbe.2023167
    [7] Huiqing Wang, Sen Zhao, Jing Zhao, Zhipeng Feng . A model for predicting drug-disease associations based on dense convolutional attention network. Mathematical Biosciences and Engineering, 2021, 18(6): 7419-7439. doi: 10.3934/mbe.2021367
    [8] Wangping Xiong, Kaiqi Wang, Shixiong Liu, Zhaoyang Liu, Yimin Zhu, Peng Liu, Ming Yang, Xian Zhou . Multiple prescription pattern recognition model based on Siamese network. Mathematical Biosciences and Engineering, 2023, 20(10): 18695-18716. doi: 10.3934/mbe.2023829
    [9] Banghua Yang, Xuelin Gu, Shouwei Gao, Ding Xu . Classification accuracy and functional difference prediction in different brain regions of drug abuser prefrontal lobe basing on machine-learning. Mathematical Biosciences and Engineering, 2021, 18(5): 5692-5706. doi: 10.3934/mbe.2021288
    [10] Yufeng Li, Chengcheng Liu, Weiping Zhao, Yufeng Huang . Multi-spectral remote sensing images feature coverage classification based on improved convolutional neural network. Mathematical Biosciences and Engineering, 2020, 17(5): 4443-4456. doi: 10.3934/mbe.2020245
  • Accurate diagnostics of neurological disorders often rely on behavioral assessments, yet traditional methods rooted in manual observations and scoring are labor-intensive, subjective, and prone to human bias. Artificial Intelligence (AI), particularly Deep Neural Networks (DNNs), offers transformative potential to overcome these limitations by automating behavioral analyses and reducing biases in diagnostic practices. DNNs excel in processing complex, high-dimensional data, allowing for the detection of subtle behavioral patterns critical for diagnosing neurological disorders such as Parkinson's disease, strokes, or spinal cord injuries. This review explores how AI-driven approaches can mitigate observer biases, thereby emphasizing the use of explainable DNNs to enhance objectivity in diagnostics. Explainable AI techniques enable the identification of which features in data are used by DNNs to make decisions. In a data-driven manner, this allows one to uncover novel insights that may elude human experts. For instance, explainable DNN techniques have revealed previously unnoticed diagnostic markers, such as posture changes, which can enhance the sensitivity of behavioral diagnostic assessments. Furthermore, by providing interpretable outputs, explainable DNNs build trust in AI-driven systems and support the development of unbiased, evidence-based diagnostic tools. In addition, this review discusses challenges such as data quality, model interpretability, and ethical considerations. By illustrating the role of AI in reshaping diagnostic methods, this paper highlights its potential to revolutionize clinical practices, thus paving the way for more objective and reliable assessments of neurological disorders.



    Disease is one of the greatest threats to human health. Many people die due to various diseases each year. With the development of medical science, several efforts have been given to various diseases [1,2,3], with an ultimate purpose of uncovering the underlying mechanism and the essential characteristics of different diseases to design efficient treatments. To date, several treatments have been designed to deal with different diseases, and drug is deemed as one of the most important parts. For fast indication of the utility of drugs, they are divided into several classes. Drugs in one class always have some common features. Correctly identifying the class of a candidate drug-like compound is helpful to uncover its effects. The classification procedure could be completed by traditional experiments. However, such methods are always inefficient and expensive. Designing quick and cheap methods that could accurately predict the class of a given drug is urgent. With the development of computer science, many advanced computational methods have been proposed in recent years and applied to deal with various practical problems, such as artificial neural networks [4,5,6], feature selection algorithms [7], statistical test methods [8,9,10]. These methods provide strong support for designing efficient methods.

    Designing computational methods to classify drugs have recently become relatively popular. The most representative studies focused on predicting drug classes in the Anatomical Therapeutic Chemical (ATC) classification system. Such system is recommended and maintained by the World Health Organization (WHO). It has five levels, and several classes are contained in each level. Many computational methods have been proposed to predict classes in the first level of the ATC classification system. Most of them were based on machine learning algorithms, such as deep learning algorithms [11,12,13,14], network embedding algorithms [15], and multi-label classification algorithms [15,16,17]. Meanwhile, several types of drug properties were adopted to build methods that include drug fingerprints [16,17] and chemical–chemical similarity/interaction information [13,18,19,20,21]. These methods provide helpful insights into investigating other drug classification systems.

    The Kyoto Encyclopedia of Genes and Genomes (KEGG) [22] recently reported a new drug classification system that classified drugs into the following 14 classes: (Ⅰ) anti-allergic agent; (Ⅱ) anti-inflammatory; (Ⅲ) antibacterial; (Ⅳ) antidiabetic agent; (Ⅴ) antifungal; (Ⅵ) antineoplastic; (Ⅶ) antiviral; (Ⅷ) cardiovascular agent; (Ⅸ) endocrine and hormonal agent; (Ⅹ) gastrointestinal agent; (Ⅺ) hypolipidemic agent; (Ⅻ) immunological agent; (XIII) neuropsychiatric agent; (XIV) ophthalmic agent. Under this system, drugs are roughly classified in accordance with the diseases they could treat. Identifying the class of any latent drug-like compound is helpful to preliminarily determine its effects to a certain type of disease. Targeted experiments for this type of disease could subsequently be conducted for further confirmation, which could accelerate the procedures for discovering novel drugs. Thus, for the first time, such a drug classification system was investigated in this study by developing a classification model, which could assign one of the above 14 classes to given drugs. A large drug network was constructed on the basis of chemical–chemical interaction (CCI) information reported in Search Tool for Interactions of Chemicals (STITCH) to extract informative drug features [23,24]. It was fed into a powerful network embedding algorithm called Node2vec [25] to produce drug features. These features were quite different from traditional drug fingerprint features, which could be obtained by considering each drug individually. The features adopted in this study abstracted drug associations, and they could overview each drug with all drugs as background, thus representing drugs at a system level. They were learnt by support vector machine (SVM) [26] or random forest (RF) [27] to construct the model. The cross-validation results indicated the good performance of the model. The model was also superior to those with classic drug fingerprint features or embedding features yielded by another network embedding algorithm. The proposed model adopted synthetic minority oversampling technique (SMOTE) [28] to tackle the imbalance problem, and the results suggested that it could balance the performance across different classes.

    Drugs and their information were retrieved from KEGG DRUG (https://www.genome.jp/kegg/drug/) [22]. According to https://www.kegg.jp/brite/br08332 (accessed on 10 January 2022), 820 drugs encoded by KEGG IDs were classified into 14 classes, as mentioned in Section 1 and listed in column 2 of Table 1. For convenience, these classes were tagged as C1C14. The correspondence between tags and classes could be found in columns 1 and 2 of Table 1. Among these drugs, only two belong to two classes, while the others are in one exact class. Thus, these two drugs were removed. As the CCI information was employed to build the model, where drugs are represented by PubChem IDs, the KEGG IDs with unavailable PubChem IDs were also removed. If the PubChem ID of one KEGG ID was not included in the drug network (Section 2.2), it was also discarded. Finally, 579 drugs encoded by KEGG IDs were obtained. These drugs were classified into 14 classes, as listed in column 2 of Table 1. The number of drugs in each class is also listed in Table 1. The detailed drugs in each class are provided in S1.

    Table 1.  Breakdown of the drug dataset.
    Tag Drug class Number of drugs
    C1 Anti-allergic agent 35
    C2 Anti-inflammatory 116
    C3 Antibacterial 99
    C4 Antidiabetic agent 25
    C5 Antifungal 21
    C6 Antineoplastic 51
    C7 Antiviral 44
    C8 Cardiovascular agent 54
    C9 Endocrine and hormonal agent 19
    C10 Gastrointestinal agent 29
    C11 Hypolipidemic agent 13
    C12 Immunological agent 8
    C13 Neuropsychiatric agent 37
    C14 Ophthalmic agent 28
    Total 579

     | Show Table
    DownLoad: CSV

    Recently, network is a popular form to investigate various complex biological and medical problems [15,29,30,31], because it could overview all objects (nodes in the network) and evaluate them at a system level. In the present study, network was utilized to organize drugs.

    The constructed network defined drugs reported in KEGG as nodes. The relationship between nodes should be determined to form the network. Several public databases provide the existing associations of drugs. Here, the CCI information reported in STITCH (http://stitch.embl.de/, version 4.0) was employed [23,24]. The file named "chemical_chemical.links.v4.0.tsv.gz" was downloaded from STITCH, which contains a large number of CCIs. Each CCI consists of two chemicals, represented by PubChem IDs, and one confidence score. The confidence score is obtained by measuring several aspects of chemicals, such as structures, reactions, activities, and literature descriptions. It ranges between 1 and 999 with a meaning that the higher the confidence score, the stronger the association between two chemicals. For formulation, the confidence score on the CCI between chemicals c1 and c2 was denoted as S(c1, c2). For the constructed network, two nodes were connected by an edge if their corresponding drugs could comprise a CCI with a confidence score larger than zero. Evidently, each edge in the network indicated a CCI. Each edge was assigned a weight, which was defined as S(c1, c2)/1000, to reflect different strengths of CCIs. Such network was denoted by Nd, and it contained 17,956 nodes and 3,134,797 edges.

    Encoding samples into continuous vectors that could be processed by most computer algorithms is one of the most important steps to build efficient classification models. As mentioned in Section 2.2, a drug network was constructed, which contained informative associations of drugs. This information could be used to encode drugs. In recent years, several network embedding algorithms, such as DeepWalk [32], Mashup [33], and LINE [34], have been proposed to assign continuous vectors to nodes in one or more networks. Here, Node2vec [25] was selected to extract drug features from Nd.

    Node2vec is a powerful network embedding algorithm. In fact, it could be deemed as an improved version of DeepWalk. Similar to DeepWalk, it samples many paths from a given network. The procedures of generating paths are greatly improved compared with those in DeepWalk. For each node u, Node2vec produces a predefined number of paths starting at u. Suppose that ni-1 is the (i-1)-th node in one path starting at u. This path is extended to the i-th node, denoted by ni, by selecting one neighbor of ni-1. The selection probability from ni-1 to any other node is defined as

    P(ni=w|ni1=v)={πvw/Zifw is adjacent to v0Otherwise, (1)

    where πvw stands for the transition probability from v to w, defined by πvw=αpq(t,w)wvw; Z is the sum of transition probabilities from v to all its neighbors; and wvw denotes the weight on edge connecting nodes v and w. Moreover, αpq(t,w) could be determined by

    αpq(t,w)={1/pif dtw=01if dtw=11/qif dtw=2, (2)

    where t is the (i-2)-th node in the path, and dtw represents the distance between t and w. For the path starting at u, it is iteratively extended by selecting a neighbor of the current end-point in accordance with the probabilities calculated using Eq (1) until the length of the path reaches the predefined length. After all paths are produced, they are fed into the word2vec algorithm with SkipGram to produce vectors of nodes, where the paths are deemed as sentences and nodes are considered as words.

    In this study, the program of Node2vec was retrieved from https://snap.stanford.edu/node2vec/. Such program was performed on Nd by using its default parameters. Afterwards, the feature vectors of 579 drugs were picked up to construct models.

    Besides the representation of samples, another key step to build efficient classification models is to select a proper classification algorithm. In this study, two classic classification algorithms were used: SVM [26] and RF [27]. These two algorithms have been widely used to set up classification models with excellent performance for investigating various biological [35,36,37,38,39] and medical problems [40,41,42,43].

    SVM is a statistical learning-based classification algorithm. Its original version could only process binary classification problems. Its idea is to determine an optimal hyperplane that could separate samples into two classes as perfect as possible. However, such hyperplane is not easy to obtain in the original feature subspace. It generally employs kernel trick to map samples into a high-dimension space, in which the hyperplane, designed to separate samples in two classes, could be easily constructed. A test sample is also mapped into the high-dimension space. The class of the test sample is determined in accordance with the side of the hyperplane it is located. Subsequently, SVM could tackle multi-class classification problems by using "one-versus-rest" or "one-versus-one".

    RF is another powerful and widely used classification algorithm, which is quite different from SVM. In fact, it is a type of ensemble algorithm. Several decision trees are constructed and comprise RF. Some features are randomly selected from all features to learn a decision tree. Samples are randomly selected, with replacement, from the original dataset. Then, a decision tree is built on the basis of these randomly selected features and samples. RF gives the prediction by majority voting on predictions generated by the decision trees it contains.

    In this study, the tools "SMO" and "RandomForest" in Weka were directly used [44], and they implemented the above mentioned SVM and RF, respectively. These tools were used with their default parameters.

    A total of 14 drug classes were involved in this study. Some classes contained much less samples than the others, implying the dataset was imbalanced. The classification model directly built on such dataset may provide excellent performance on the majority class and poor performance on the minority class. SMOTE [28] was employed to tackle such problem. For each class, except the largest, one sample is randomly selected, denoted by x. The k nearest neighbors of x in the same class are found, and one is randomly selected, say y. On the basis of x and y, a new sample z is generated, which is defined as the linear combination of x and y with randomly produced combination coefficients. Given that the new sample is highly related to x and y, it is placed into the same class of x and y. The above procedures are executed several times until the size of the minority class is the same as that of the largest class.

    This study adopted the tool "SMOTE" in Weka [44] to balance the investigated dataset, and default parameters were used.

    As a multi-class classification problem, the performance of the models could be measured by overall accuracy (ACC), which is defined as the proportion of all correctly predicted samples.

    Recall, precision, F-measure, and Matthew's correlation coefficient (MCC) [45] were also computed for each class as follows:

    Precision=TPTP+FP, (3)
    Recall=TPTP+FN, (4)
    Fmeasure=2×Precision×RecallPrecision+Recall, (5)
    MCC=TP×TNFP×FN(TN+FN)×(TN+FP)×(TP+FN)×(TP+FP), (6)

    where TP, FP, FN, and TN stand for true positive, false positive, false negative, and true negative, respectively. In multi-class classification, their specific definitions for one class are as follows: TP is the number of samples in the class that are also classified into this class, FP is the number of samples not in the class that are classified into this class, FN is the number of samples in the class that are classified into other classes, and TN is the number of samples not in the class that are classified into other classes. Eqs (3)–(6) could only assess the performance under a certain threshold. For full assessment, the receiver operating characteristic (ROC) and precision-recall (PR) curve analyses were further employed on the predicted results for each class. The area under the ROC curve, which was denoted by AUROC, and the PR curve, which was represented by AUPR, were calculated to fully evaluate the performance of different models on each class. For easy description, the six measurements for class Ci were denoted by precision(i), recall(i), F-measure(i), MCC(i), AUROC(i), and AUPR(i). Furthermore, the weighted precision, recall, F-measure, MCC, AUROC, and AUPR were calculated to fully evaluate the overall performance of all classification models.

    In this study, a novel model was proposed to classify drugs in accordance with the drug classification system recently reported in KEGG. Its construction and evaluation procedures are illustrated in Figure 1. In this section, detailed evaluation results were provided, and further comparisons were conducted.

    Figure 1.  Entire procedures for constructing and evaluating the model. Drugs were classified into 14 classes reported in KEGG DRUG. The chemical-chemical interaction information retrieved from STITCH was used to build a large drug network, from which drug features were extracted via Node2vec. These features indicated the associations between drugs, and they could overview each drug with all drugs as background. After the dataset was processed by the synthetic minority oversampling technique, it was fed into one classification algorithm to build the model. The model was finally assessed by 10-fold cross-validation. This figure was generated in PowerPoint.

    The proposed model adopted the drug features derived from a large drug network via Node2vec and SMOTE to tackle the imbalance problem. The best dimension of the features was found to be 128. Two classic classification algorithms (RF and SVM) were used as the prediction engine. For easy description, if RF (SVM) was selected as the prediction engine, the model was called RF (SVM) model. Each model was evaluated five times using a 10-fold cross-validation [46]. The confusion matrix of each model under each 10-fold cross-validation is provided in S2. The average performance is listed in Table 2.

    Table 2.  Overall performance of the models with different classification algorithms under 10-fold cross-validation.
    Classification algorithm SMOTE ACC Weighted Precision Weighted F-measure Weighted MCC Weighted AUROC Weighted AUPR
    Random forest 0.9486 0.9631 0.9511 0.9484 0.9958 0.9830
    Support vector machine 0.9583 0.9752 0.9640 0.9625 0.9830 0.9350
    Random forest 0.9410 - - - 0.9930 0.9710
    Support vector machine 0.9600 0.9610 0.9600 0.9550 0.9840 0.9320

     | Show Table
    DownLoad: CSV

    For the RF model, the ACC was 0.9486, and the weighted precision, F-measure, MCC, AUROC, and AUPR were 0.9631, 0.9511, 0.9484, 0.9958, and 0.9830, respectively. The result of each measurement was relatively high, suggesting the excellent performance of the RF model. The model's performance on 14 classes is provided in Table 3, and the ROC and PR curves under one 10-fold cross-validation are shown in Figure 2(A), (B), respectively. Table 3 shows that most measurements were higher than 0.9. The number of classes on which the recall values were higher than 0.9 was 12, and this numbers for precision, F-measure, MCC, AUPRO, and AUPR were 11, 12, 12, 14, and 13, respectively. The results indicated the good performance of the RF model on most classes, conforming to its overall performance. Careful examination of the measurements in Table 3 showed that the RF model provided the lowest performance on class C12 (immunological agent) for each measurement. For example, the recall on this class was only 0.2726. This class only contained eight drugs, which was not only the smallest among all 14 classes, but was considered the lowest performing class. Although the RF model adopted SMOTE to tackle the imbalance problem, the class sizes could still have influence. In the following text, we would elaborate that such influence has been decreased by SMOTE.

    Table 3.  Performance of the random forest model on each class under 10-fold cross-validation.
    Tag of class Recall(i) Precision(i) F-measure(i) MCC(i) AUROC(i) AUPR(i)
    C1 0.9052 0.9319 0.9141 0.9162 0.9975 0.9627
    C2 0.9550 0.9449 0.9469 0.9366 0.9972 0.9879
    C3 0.9983 0.9761 0.9865 0.9845 0.9998 0.9991
    C4 0.9750 0.9749 0.9690 0.9708 0.9994 0.9921
    C5 0.9676 1.0000 0.9804 0.9812 0.9943 0.9780
    C6 0.9473 0.9113 0.9179 0.9182 0.9915 0.9770
    C7 0.9434 0.9611 0.9450 0.9453 0.9932 0.9769
    C8 0.9377 0.9829 0.9548 0.9537 0.9940 0.9709
    C9 0.9191 0.9294 0.9499 0.9511 0.9972 0.9659
    C10 0.9132 0.8882 0.8858 0.8886 0.9969 0.9602
    C11 0.8870 0.8528 0.9582 0.9622 1.0000 1.0000
    C12 0.2726 0.5183 0.5834 0.5852 0.9127 0.7654
    C13 0.9781 0.9456 0.9552 0.9557 0.9997 0.9958
    C14 0.9831 0.9738 0.9768 0.9764 0.9998 0.9967

     | Show Table
    DownLoad: CSV
    Figure 2.  ROC and PR curves of two models on 14 classes under one 10-fold cross-validation. (A) ROC curves of the RF model; (B) PR curves of the RF model; (C) ROC curves of the SVM model; (D) PR curves of the SVM model. The two models provide nearly perfect ROC and PR curves on most classes, indicating that they could efficiently classify drugs. This figure was generated by matplotlib package in Python.

    As for the SVM model, the ACC reached 0.9583, which was slightly higher than that of the RF model. As for the other five measurements, they were 0.9752, 0.9640, 0.9625, 0.9830 and 0.9350 (Table 2), respectively. Compared with the measurements of RF model, SVM model yielded higher weighted precision, F-measure, MCC, whereas lower weighted AUROC and AUPR. These indicated that the RF and SVM model provided almost equal performance. The detailed performance of SVM model on fourteen classes is listed in Table 4. Furthermore, the ROC and PR curves of this model under one 10-fold cross-validation are illustrated in Figure 2(C), (D), respectively. Similar to the performance of RF model, most measurements listed in this table were higher than 0.9. The numbers of classes on which recall, precision, F-measure, MCC, AUROC and AUPR, respectively, were higher than 0.9 were 12, 13, 13, 13, 14 and 11, which were also similar to those of the RF model. Again, the performance of SVM model on the smallest class C12 (immunological agent) was also lowest for each measurement.

    Table 4.  Performance of the support vector machine model on each class under 10-fold cross-validation.
    Tag of class Recall(i) Precision(i) F-measure(i) MCC(i) AUROC(i) AUPR(i)
    C1 0.9444 0.9152 0.9211 0.9215 0.9934 0.8984
    C2 0.9635 0.9584 0.9593 0.9513 0.9882 0.9428
    C3 0.9967 0.9803 0.9877 0.9860 0.9982 0.9803
    C4 0.9717 1.0000 0.9831 0.9838 0.9982 0.9836
    C5 0.9676 0.9867 0.9704 0.9725 0.9916 0.9603
    C6 0.9556 0.9656 0.9549 0.9540 0.9857 0.9604
    C7 0.9384 0.9598 0.9407 0.9412 0.9700 0.9197
    C8 0.9569 0.9805 0.9650 0.9637 0.9905 0.9546
    C9 0.9109 1.0000 0.9756 0.9759 0.9964 0.9644
    C10 0.8567 0.9282 0.9049 0.9096 0.9623 0.8393
    C11 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000
    C12 0.7373 0.4740 0.6904 0.7134 0.9270 0.5556
    C13 0.9790 0.9843 0.9786 0.9788 0.9993 0.9814
    C14 0.9841 0.9944 0.9874 0.9875 0.9985 0.9874

     | Show Table
    DownLoad: CSV

    According to Table 2, the SVM model provided higher performance on four measurements than the RF model, whereas the RF model yielded higher performance on the other two measurements. The SVM model was slightly superior to the RF model. Several box plots were drawn on all measurements for the performance of the two models on 14 classes, as shown in Figure 3. For recall, the SVM model varied in a smaller range than the RF model. The same case occurred for F-measure, MCC, and AUROC. As for the remaining two measurements (precision and AUPR), the RF model changed in a smaller interval. On the whole, the stability of the performance of these two models on different classes was at the same level. However, the SVM model was slightly more stable. These results showed that the SVM model is a more suitable tool for classifying drugs than the RF model.

    Figure 3.  Box plot showing the performance change of RF and SVM models on 14 classes. (A) Performance change on recall; (B) Performance change on precision; (C) Performance change on F-measure; (D) Performance change on MCC; (E) Performance change on AUROC; (F) Performance change on AUPR. The SVM model provides more stable performance on 14 classes than the RF model, indicating that the former is a more suitable tool for drug classification. This figure was generated by GraphPad Prism.

    In this study, SMOTE was adopted to tackle the imbalance problem. The performance of the model without SMOTE must be investigated and compared with that of the model with SMOTE. In view of this, RF and SVM were directly applied on the features yielded by Node2vec to build the RF and SVM models without SMOTE. Their performance was also evaluated five times using a 10-fold cross-validation. The average performance is listed in Table 2. For the RF model without SMOTE, the ACC, weighted AUROC, and AUPR were 0.9410, 0.9930, and 0.9710, respectively. As no drugs were classified into C12, the precision, F-measure, and MCC for this class could not be computed, further inducing no results for weighted precision, F-measure, and MCC. According to the computed measurements, the performance of this model was only slightly lower than that of the RF model with SMOTE. As for the SVM model without SMOTE, the six measurements were 0.9600, 0.9610, 0.9600, 0.9550, 0.9840, and 0.9320, which were relatively similar to those of the SVM model with SMOTE. This finding suggested that the employment of SMOTE did not improve the overall performance of the model at all. However, the utilization of SMOTE was reflected not only on the improvement of the overall performance of the models but also on the balance of performance across different classes. Thus, the accuracy, i.e., recall, on the 14 classes yielded by the models with or without SMOTE was further investigated. A box plot was drawn for each model to clearly show the change in recall on all classes, as shown in Figure 4. When the same prediction engine was used (RF or SVM), the recall values of the model with SMOTE varied in a smaller interval than those of the model without SMOTE. This result implied that the employment of SMOTE balanced the performance across different classes. For example, on C12, the RF and SVM models without SMOTE produced recall values of 0.0000 and 0.5000, respectively, and these values were improved to 0.2726 and 0.7373, respectively, by employing SMOTE. For practical applications, the model with SMOTE could avoid the extremely good performance on the majority class and extremely low performance on the minority class.

    Figure 4.  Box plot showing the performance change of models with or without SMOTE on 14 classes. The models with SMOTE evidently provided more balanced performance on all classes than those without SMOTE. Through SMOTE, the drug samples in each class, including real and synthesized samples, are almost similar in number. The bias of the model based on the dataset processed by SMOTE could be reduced. This figure was generated by GraphPad Prism.

    In this study, the drug features derived from a network via Node2vec were used to build the model. Two other feature types were adopted to build models, which were compared with the proposed model.

    The first feature type contained features derived from the drug network Nd via another powerful network embedding algorithm, Mashup [33]. The dimension was also set to 128. RF and SVM were applied on these features to construct models. SMOTE was employed to tackle the imbalance problem. The models were also assessed five times using a 10-fold cross-validation. For clear description, they were called Mashup-based RF and SVM models, whereas the proposed models were termed as Node2vec-based RF and SVM models. The average performance of the Mashup-based models is provided in Table 5. When RF was selected as the prediction engine, the Node2vec-based model provided evidently higher performance on all measurements than the Mashup-based model. In detail, the improvement on ACC, weighted F-measure, MCC, and AUPR was about 0.05 and such improvement on weighted precision and AUROC was about 0.03 and 0.01, respectively. Meanwhile, the superiority of the Node2vec-based SVM model to the Mashup-based SVM model was not very evident. However, the Node2vec-based SVM model provided higher performance on five measurements, implied that it was slightly better than the Mashup-based model.

    Table 5.  Performance of models with other drug features.
    Drug features Classification algorithm ACC Weighted Precision Weighted F-measure Weighted MCC Weighted AUROC Weighted AUPR
    Drug features produced by Mashup Random forest 0.8891 0.9282 0.9069 0.9006 0.9819 0.9359
    Support vector machine 0.9526 0.9724 0.9617 0.9579 0.9880 0.9280
    Drug fingerprint features Random forest 0.8432 0.8768 0.8431 0.8366 0.9748 0.9107
    Support vector machine 0.8426 0.8791 0.8489 0.8420 0.9476 0.7943

     | Show Table
    DownLoad: CSV

    The second feature type included the drug fingerprint features, which were widely used to deal with various drug- or chemical-related problems [16,38,47,48,49]. Here, RDKit [50] was adopted to extract the ECFP fingerprints of all 579 drugs. The fingerprints of each drug were represented by a binary vector with dimension of 1024. These fingerprint features were fed into RF and SVM to construct models. SMOTE was also employed. The models were called fingerprint-based models. All models were evaluated five times using a 10-fold cross-validation. The average performance is provided in Table 5. The results showed that most measurements were lower than 0.9, indicating low performance of such models. Compared with the performance of the proposed model (Table 2), the proposed model clearly provided much higher performance.

    Finally, the ROC and PR curves on each class for some models were plotted. For the Node2vec-based model, the SVM model was selected because it was slightly better and more stable than the RF model. For the Mashup-based model, the SVM model was chosen as it was superior to the RF model (Table 5). As for the fingerprint-based model, the RF model was selected due to its superiority to the SVM model (Table 5). The ROC and PR curves of these models on all classes are shown in Figure 5. A notable detail that these curves were produced by one of the five 10-fold cross-validations. The areas under the ROC curves of the fingerprint-based RF model were much smaller than those of other two models. The same results were obtained for PR curves. As for the Node2vec-based and Mashup-based SVM models, the areas under the ROC or PR curves were relatively similar. However, the areas under the ROC or PR curves of the Mashup-based SVM model on some classes (e.g., C12) were relatively low, whereas they were improved by the Node2vec-based SVM model.

    Figure 5.  ROC and PR curves of some models on 14 classes. (A) ROC curves of the Node2vec-based SVM model; (B) PR curves of the Node2vec-based SVM model; (C) ROC curves of the Mashup-based SVM model; (D) PR curves of the Mashup-based SVM model; (E) ROC curves of the fingerprint-based RF model; (F) PR curves of the fingerprint-based RF model. Models with features derived from the drug network are superior to those with fingerprint features, suggesting network features are more related to and informative for drug classification than traditional drug fingerprint features. This figure was generated by matplotlib package in Python.

    Therefore, the proposed model (Node2vec-based model) was better than the models using other drug features, indicating its superiority in classifying drugs.

    In Section 3.3, the proposed model and two other models with different drug features were compared. The proposed model provided better performance than the two other models, suggesting that the features used in the proposed model were more efficient than those adopted in other models for drug classification. The t-SNE [51] was adopted to analyze three feature types used in the Node2vec-based, Mashup-based, and fingerprint-based models to further confirm these findings and provide intuitive evidence. Such tool could reduce the dimensions of features to two and display them in a 2D space, where samples in different classes are in different colors. The results of t-SNE are illustrated in Figure 6. The features derived from the drug network [Figure 6(A),(B)] could cluster drugs in different classes evidently better than the drug fingerprints [Figure 6(C)]. As for two feature types derived from the drug network, the features yielded by Node2vec [Figure 6(A)] could cluster drugs in different classes more compactly than those generated by Mashup. These results implied that the features derived from the drug network via Node2vec, which were used in the proposed model, were more helpful in classifying drugs. This finding was the main reason why the Node2vec-based model could provide better performance.

    Figure 6.  Visualization of three drug feature types in 2D space by using t-SNE. (A) Drug features derived from the drug network via Node2vec; (2) Drug features derived from the drug network via Mashup; (3) Drug fingerprint features. The drug features derived from the drug network via Node2vec could evidently cluster drugs in different classes the best, implying that such features are the excellent representations for drug classification. This figure was generated by matplotlib package in Python.

    In this study, a new drug classification system reported in KEGG was investigated by proposing classification models for assigning class in this system to any given drug. Informative drug features were derived from a large drug network via a powerful network embedding algorithm. The evaluation and comparison results indicated that the features were more related to the drug classification system in KEGG and more informative than classic fingerprint features. Employing more drug information (e.g., drug side effects and indications) or more complex and advanced feature learning algorithms (e.g., graph convolutional network and convolutional neural network) may be helpful to access more informative drug features, thereby building more powerful classification models. These aspects could be investigated in future work. Furthermore, the employment of SMOTE guaranteed that the performance on all classes was high. The model could be a useful tool to classify drugs, discover novel effects of existing drugs and determine the effects of candidate drug-like compounds.

    The authors declare there is no conflict of interest.


    Acknowledgments



    The author developed AI agents to work with them, like with a good MSc student. Agents helped to identify the most relevant literature, design a plan for the paper, implement suggested improvements, and draft paper sections and rewrite them based on comments provided by the author. The author assumes full responsibility for the accuracy of the content presented here.

    Conflict of interest



    The author has no conflicts of interest to declare.

    [1] Bakeman R, Quera V (2011) Sequential Analysis and Observational Methods for the Behavioral Sciences.Cambridge University Press. https://doi.org/10.1017/CBO9781139017343
    [2] Metz GA, Whishaw IQ (2002) Cortical and subcortical lesions impair skilled walking in the ladder rung walking test: a new task to evaluate fore-and hindlimb stepping, placing, and co-ordination. J Neurosci Meth 115: 169-179. https://doi.org/10.1016/S0165-0270(02)00012-2
    [3] Spano R (2005) Potential sources of observer bias in police observational data. Soc Sci Res 34: 591-617. https://doi.org/10.1016/j.ssresearch.2004.05.003
    [4] Asan O, Montague E (2014) Using video-based observation research methods in primary care health encounters to evaluate complex interactions. J Innov Health Inform 21: 161-170. https://doi.org/10.14236/jhi.v21i4.72
    [5] Moran RW, Schneiders AG, Major KM, et al. (2016) How reliable are functional movement screening scores? A systematic review of rater reliability. Brit J Sport Med 50: 527-536. https://doi.org/10.1136/bjsports-2015-094913
    [6] Mathis MW, Mathis A (2020) Deep learning tools for the measurement of animal behavior in neuroscience. Curr Opin Neurobiol 60: 1-11. https://doi.org/10.1016/j.conb.2019.10.008
    [7] Gautam R, Sharma M (2020) Prevalence and diagnosis of neurological disorders using different deep learning techniques: a meta-analysis. J Med Syst 44: 49. https://doi.org/10.1007/s10916-019-1519-7
    [8] Singh KR, Dash S (2023) Early detection of neurological diseases using machine learning and deep learning techniques: a review. Artif Intell Neurol Diso 2023: 1-24. https://doi.org/10.1016/B978-0-323-90277-9.00001-8
    [9] Arac A, Zhao P, Dobkin BH, et al. (2019) DeepBehavior: A deep learning toolbox for automated analysis of animal and human behavior imaging data. Front Syst Neurosci 13: 20. https://doi.org/10.3389/fnsys.2019.00020
    [10] Sewak M, Sahay SK, Rathore H (2020) An overview of deep learning architecture of deep neural networks and autoencoders. J Comput Theor Nanos 17: 182-188. https://doi.org/10.1166/jctn.2020.8648
    [11] Brattoli B, Büchler U, Dorkenwald M, et al. (2021) Unsupervised behaviour analysis and magnification (uBAM) using deep learning. Nat Mach Intell 3: 495-506. https://doi.org/10.1038/s42256-021-00326-x
    [12] ul Haq A, Li JP, Agbley BLY, et al. (2022) A survey of deep learning techniques based Parkinson's disease recognition methods employing clinical data. Expert Syst Appl 208: 118045. https://doi.org/10.1016/j.eswa.2022.118045
    [13] Nilashi M, Abumalloh RA, Yusuf SYM, et al. (2023) Early diagnosis of Parkinson's disease: a combined method using deep learning and neuro-fuzzy techniques. Comput Biol Chem 102: 107788. https://doi.org/10.1016/j.compbiolchem.2022.107788
    [14] Shahid AH, Singh MP (2020) A deep learning approach for prediction of Parkinson's disease progression. Biomed Eng Lett 10: 227-239. https://doi.org/10.1007/s13534-020-00156-7
    [15] Chintalapudi N, Battineni G, Hossain MA, et al. (2022) Cascaded deep learning frameworks in contribution to the detection of parkinson's disease. Bioengineering 9: 116. https://doi.org/10.3390/bioengineering9030116
    [16] Almuqhim F, Saeed F (2021) ASD-SAENet: a sparse autoencoder, and deep-neural network model for detecting autism spectrum disorder (ASD) using fMRI data. Front Comput Neurosci 15: 654315. https://doi.org/10.3389/fncom.2021.654315
    [17] Zhang L, Wang M, Liu M, et al. (2020) A survey on deep learning for neuroimaging-based brain disorder analysis. Front Neurosci 14: 779. https://doi.org/10.3389/fnins.2020.00779
    [18] Uddin MZ, Shahriar MA, Mahamood MN, et al. (2024) Deep learning with image-based autism spectrum disorder analysis: a systematic review. Eng Appl Artif Intel 127: 107185. https://doi.org/10.1016/j.engappai.2023.107185
    [19] Gupta C, Chandrashekar P, Jin T, et al. (2022) Bringing machine learning to research on intellectual and developmental disabilities: taking inspiration from neurological diseases. J Neurodev Disord 14: 28. https://doi.org/10.1186/s11689-022-09438-w
    [20] Saleh AY, Chern LH (2021) Autism spectrum disorder classification using deep learning. IJOE 17: 103-114. https://doi.org/10.3991/ijoe.v17i08.24603
    [21] Koppe G, Meyer-Lindenberg A, Durstewitz D (2021) Deep learning for small and big data in psychiatry. Neuropsychopharmacology 46: 176-190. https://doi.org/10.1038/s41386-020-0767-z
    [22] Gütter J, Kruspe A, Zhu XX, et al. (2022) Impact of training set size on the ability of deep neural networks to deal with omission noise. Front Remote Sens 3: 932431. https://doi.org/10.3389/frsen.2022.932431
    [23] Sturman O, von Ziegler L, Schläppi C, et al. (2020) Deep learning-based behavioral analysis reaches human accuracy and is capable of outperforming commercial solutions. Neuropsychopharmacology 45: 1942-1952. https://doi.org/10.1038/s41386-020-0776-y
    [24] He T, Kong R, Holmes AJ, et al. (2020) Deep neural networks and kernel regression achieve comparable accuracies for functional connectivity prediction of behavior and demographics. NeuroImage 206: 116276. https://doi.org/10.1016/j.neuroimage.2019.116276
    [25] Chen M, Li H, Wang J, et al. (2019) A multichannel deep neural network model analyzing multiscale functional brain connectome data for attention deficit hyperactivity disorder detection. Radiol Artif Intell 2: e190012. https://doi.org/10.1148/ryai.2019190012
    [26] Golshan HM, Hebb AO, Mahoor MH (2020) LFP-Net: a deep learning framework to recognize human behavioral activities using brain STN-LFP signals. J Neurosci Meth 335: 108621. https://doi.org/10.1016/j.jneumeth.2020.108621
    [27] Sutoko S, Masuda A, Kandori A, et al. (2021) Early identification of Alzheimer's disease in mouse models: Application of deep neural network algorithm to cognitive behavioral parameters. Iscience 24: 102198. https://doi.org/10.1016/j.isci.2021.102198
    [28] Tarigopula P, Fairhall SL, Bavaresco A, et al. (2023) Improved prediction of behavioral and neural similarity spaces using pruned DNNs. Neural Networks 168: 89-104. https://doi.org/10.1016/j.neunet.2023.08.049
    [29] Uyulan C, Ergüzel TT, Unubol H, et al. (2021) Major depressive disorder classification based on different convolutional neural network models: deep learning approach. Clin EEG Neurosci 52: 38-51. https://doi.org/10.1177/1550059420916634
    [30] Wen J, Thibeau-Sutre E, Diaz-Melo M, et al. (2020) Convolutional neural networks for classification of Alzheimer's disease: overview and reproducible evaluation. Med Image Anal 63: 101694. https://doi.org/10.1016/j.media.2020.101694
    [31] Karthik R, Menaka R, Johnson A, et al. (2020) Neuroimaging and deep learning for brain stroke detection-A review of recent advancements and future prospects. Comput Meth Prog Bio 197: 105728. https://doi.org/10.1016/j.cmpb.2020.105728
    [32] Iqbal MS, Heyat MBB, Parveen S, et al. (2024) Progress and trends in neurological disorders research based on deep learning. Comput Med Imag Grap 116: 102400. https://doi.org/10.1016/j.compmedimag.2024.102400
    [33] Kim S, Pathak S, Parise R, et al. (2024) The thriving influence of artificial intelligence in neuroscience. Application of Artificial Intelligence in Neurological Disorders. Singapore: Springer Nature Singapore 157-184. https://doi.org/10.1007/978-981-97-2577-9_9
    [34] Lima AA, Mridha MF, Das SC, et al. (2022) A comprehensive survey on the detection, classification, and challenges of neurological disorders. Biology 11: 469. https://doi.org/10.3390/biology11030469
    [35] Mulpuri RP, Konda N, Gadde ST, et al. (2024) Artificial intelligence and machine learning in neuroregeneration: a systematic review. Cureus 16: e61400. https://doi.org/10.7759/cureus.61400
    [36] Keserwani PK, Das S, Sarkar N (2024) A comparative study: prediction of parkinson's disease using machine learning, deep learning and nature inspired algorithm. Multimed Tools Appl 83: 69393-69441. https://doi.org/10.1007/s11042-024-18186-z
    [37] Fatima A, Masood S (2024) Machine learning approaches for neurological disease prediction: a systematic review. Expert Syst 41: e13569. https://doi.org/10.1111/exsy.13569
    [38] Surianarayanan C, Lawrence JJ, Chelliah PR, et al. (2023) Convergence of artificial intelligence and neuroscience towards the diagnosis of neurological disorders—a scoping review. Sensors 23: 3062. https://doi.org/10.3390/s23063062
    [39] Lombardi A, Diacono D, Amoroso N, et al. (2021) Explainable deep learning for personalized age prediction with brain morphology. Front Neurosci 15: 674055. https://doi.org/10.3389/fnins.2021.674055
    [40] Choo YJ, Chang MC (2022) Use of machine learning in stroke rehabilitation: a narrative review. Brain Neurorehab 15: e26. https://doi.org/10.12786/bn.2022.15.e26
    [41] Ryait H, Bermudez-Contreras E, Harvey M, et al. (2019) Data-driven analyses of motor impairments in animal models of neurological disorders. PLoS Biol 17: e3000516. https://doi.org/10.1371/journal.pbio.3000516
    [42] Nguyen HS, Ho DKN, Nguyen NN, et al. (2024) Predicting EGFR mutation status in non-small cell lung cancer using artificial intelligence: a systematic review and meta-analysis. Acad Radiol 31: 660-683. https://doi.org/10.1016/j.acra.2023.03.040
    [43] Zhang Y, Yao Q, Yue L, et al. (2023) Emerging drug interaction prediction enabled by a flow-based graph neural network with biomedical network. Nat Comput Sci 3: 1023-1033. https://doi.org/10.1038/s43588-023-00558-4
    [44] Le NQK (2023) Predicting emerging drug interactions using GNNs. Nat Comput Sci 3: 1007-1008. https://doi.org/10.1038/s43588-023-00555-7
    [45] Abed Mohammed A, Sumari P (2024) Hybrid k-means and principal component analysis (PCA) for diabetes prediction. Int J Comput Dig Syst 15: 1719-1728. https://doi.org/10.12785/ijcds/1501121
    [46] Mostafa F, Hasan E, Williamson M, et al. (2021) Statistical machine learning approaches to liver disease prediction. Livers 1: 294-312. https://doi.org/10.3390/livers1040023
    [47] Jackins V, Vimal S, Kaliappan M, et al. (2021) AI-based smart prediction of clinical disease using random forest classifier and Naive Bayes. J Supercomput 77: 5198-5219. https://doi.org/10.1007/s11227-020-03481-x
    [48] Cho G, Yim J, Choi Y, et al. (2019) Review of machine learning algorithms for diagnosing mental illness. Psychiat Invest 16: 262. https://doi.org/10.30773/pi.2018.12.21.2
    [49] Aljrees T (2024) Improving prediction of cervical cancer using KNN imputer and multi-model ensemble learning. Plos One 19: e0295632. https://doi.org/10.1371/journal.pone.0295632
    [50] Hajare S, Rewatkar R, Reddy KTV (2024) Design of an iterative method for enhanced early prediction of acute coronary syndrome using XAI analysis. AIMS Bioeng 11: 301-322. https://doi.org/10.3934/bioeng.2024016
    [51] Schjetnan AGP, Luczak A (2011) Recording large-scale neuronal ensembles with silicon probes in the anesthetized rat. J Vis Exp 56: e3282. https://doi.org/10.3791/3282-v
    [52] Luczak A, Narayanan NS (2005) Spectral representation-analyzing single-unit activity in extracellularly recorded neuronal data without spike sorting. J Neurosci Meth 144: 53-61. https://doi.org/10.1016/j.jneumeth.2004.10.009
    [53] Luczak A, Hackett TA, Kajikawa Y, et al. (2004) Multivariate receptive field mapping in marmoset auditory cortex. J Neurosci Meth 136: 77-85. https://doi.org/10.1016/j.jneumeth.2003.12.019
    [54] Luczak A (2010) Measuring neuronal branching patterns using model-based approach. Front Comput Neurosci 4: 135. https://doi.org/10.3389/fncom.2010.00135
    [55] Luczak A, Kubo Y (2022) Predictive neuronal adaptation as a basis for consciousness. Front Syst Neurosci 15: 767461. https://doi.org/10.3389/fnsys.2021.767461
    [56] Lepakshi VA (2022) Machine learning and deep learning based AI tools for development of diagnostic tools. Computational Approaches for Novel Therapeutic and Diagnostic Designing to Mitigate SARS-CoV-2 Infection.Academic Press 399-420. https://doi.org/10.1016/B978-0-323-91172-6.00011-X
    [57] Montavon G, Binder A, Lapuschkin S, et al. (2019) Layer-wise relevance propagation: an overview. Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Cham: Springer 193-209. https://doi.org/10.1007/978-3-030-28954-6_10
    [58] Nazir S, Dickson DM, Akram MU (2023) Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Comput Biol Med 156: 106668. https://doi.org/10.1016/j.compbiomed.2023.106668
    [59] Torabi R, Jenkins S, Harker A, et al. (2021) A neural network reveals motoric effects of maternal preconception exposure to nicotine on rat pup behavior: a new approach for movement disorders diagnosis. Front Neurosci 15: 686767. https://doi.org/10.3389/fnins.2021.686767
    [60] Shahtalebi S, Atashzar SF, Patel RV, et al. (2021) A deep explainable artificial intelligent framework for neurological disorders discrimination. Sci Rep 11: 9630. https://doi.org/10.1038/s41598-021-88919-9
    [61] Morabito FC, Ieracitano C, Mammone N (2023) An explainable artificial intelligence approach to study MCI to AD conversion via HD-EEG processing. Clin EEG Neurosci 54: 51-60. https://doi.org/10.1177/15500594211063662
    [62] Goodwin NL, Nilsson SRO, Choong JJ, et al. (2022) Toward the explainability, transparency, and universality of machine learning for behavioral classification in neuroscience. Curr Opin Neurobiol 73: 102544. https://doi.org/10.1016/j.conb.2022.102544
    [63] Lindsay GW (2024) Grounding neuroscience in behavioral changes using artificial neural networks. Curr Opin Neurobiol 84: 102816. https://doi.org/10.1016/j.conb.2023.102816
    [64] Dan T, Kim M, Kim WH, et al. (2023) Developing explainable deep model for discovering novel control mechanism of neuro-dynamics. IEEE T Med Imaging 43: 427-438. https://doi.org/10.1109/TMI.2023.3309821
    [65] Fellous JM, Sapiro G, Rossi A, et al. (2019) Explainable artificial intelligence for neuroscience: behavioral neurostimulation. Front Neurosci 13: 1346. https://doi.org/10.3389/fnins.2019.01346
    [66] Bartle AS, Jiang Z, Jiang R, et al. (2022) A critical appraisal on deep neural networks: bridge the gap between deep learning and neuroscience via XAI. HANDBOOK ON COMPUTER LEARNING AND INTELLIGENCE: Volume 2: Deep Learning, Intelligent Control and Evolutionary Computation 2022: 619-634. https://doi.org/10.1142/9789811247323_0015
    [67] Lemon RN (1997) Mechanisms of cortical control of hand function. Neuroscientist 3: 389-398. https://doi.org/10.1177/107385849700300612
    [68] Alaverdashvili M, Whishaw IQ (2013) A behavioral method for identifying recovery and compensation: hand use in a preclinical stroke model using the single pellet reaching task. Neurosci Biobehav R 37: 950-967. https://doi.org/10.1016/j.neubiorev.2013.03.026
    [69] Metz GAS, Whishaw IQ (2000) Skilled reaching an action pattern: stability in rat (Rattus norvegicus) grasping movements as a function of changing food pellet size. Behav Brain Res 116: 111-122. https://doi.org/10.1016/S0166-4328(00)00245-X
    [70] Faraji J, Gomez-Palacio-Schjetnan A, Luczak A, et al. (2013) Beyond the silence: bilateral somatosensory stimulation enhances skilled movement quality and neural density in intact behaving rats. Behav Brain Res 253: 78-89. https://doi.org/10.1016/j.bbr.2013.07.022
    [71] Sheu Y (2020) Illuminating the black box: interpreting deep neural network models for psychiatric research. Front Psychiatry 11: 551299. https://doi.org/10.3389/fpsyt.2020.551299
    [72] Fan FL, Xiong J, Li M, et al. (2021) On interpretability of artificial neural networks: a survey. IEEE T Radiat Plasma 5: 741-760. https://doi.org/10.1109/TRPMS.2021.3066428
    [73] Smucny J, Shi G, Davidson I (2022) Deep learning in neuroimaging: overcoming challenges with emerging approaches. Front Psychiatry 13: 912600. https://doi.org/10.3389/fpsyt.2022.912600
    [74] Kohlbrenner M, Bauer A, Nakajima S, et al. (2020) Towards best practice in explaining neural network decisions with LRP. 2020 International Joint Conference on Neural Networks (IJCNN).IEEE 1-7. https://doi.org/10.1109/IJCNN48605.2020.9206975
    [75] Farahani FV, Fiok K, Lahijanian B, et al. (2022) Explainable AI: a review of applications to neuroimaging data. Front Neurosci 16: 906290. https://doi.org/10.3389/fnins.2022.906290
    [76] Böhle M, Eitel F, Weygandt M, et al. (2019) Layer-wise relevance propagation for explaining deep neural network decisions in MRI-based Alzheimer's disease classification. Front Aging Neurosci 11: 456892. https://doi.org/10.3389/fnagi.2019.00194
    [77] Marques dos Santos JD, Marques dos Santos JP (2023) Path-weights and layer-wise relevance propagation for explainability of ANNs with fMRI data. International Conference on Machine Learning, Optimization, and Data Science. Cham: Springer Nature Switzerland 433-448. https://doi.org/10.1007/978-3-031-53966-4_32
    [78] Filtjens B, Ginis P, Nieuwboer A, et al. (2021) Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation. BMC Med Inform Decis Mak 21: 341. https://doi.org/10.1186/s12911-021-01699-0
    [79] Li H, Tian Y, Mueller K, et al. (2019) Beyond saliency: understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation. Image Vision Comput 83: 70-86. https://doi.org/10.1016/j.imavis.2019.02.005
    [80] Nam H, Kim JM, Choi W, et al. (2023) The effects of layer-wise relevance propagation-based feature selection for EEG classification: a comparative study on multiple datasets. Front Hum Neurosci 17: 1205881. https://doi.org/10.3389/fnhum.2023.1205881
    [81] Korda AI, Ruef A, Neufang S, et al. (2021) Identification of voxel-based texture abnormalities as new biomarkers for schizophrenia and major depressive patients using layer-wise relevance propagation on deep learning decisions. Psychiat Res-Neuroim 313: 111303. https://doi.org/10.1016/j.pscychresns.2021.111303
    [82] von Ziegler L, Sturman O, Bohacek J (2021) Big behavior: challenges and opportunities in a new era of deep behavior profiling. Neuropsychopharmacology 46: 33-44. https://doi.org/10.1038/s41386-020-0751-7
    [83] Marks M, Jin Q, Sturman O, et al. (2022) Deep-learning-based identification, tracking, pose estimation and behaviour classification of interacting primates and mice in complex environments. Nat Mach Intell 4: 331-340. https://doi.org/10.1038/s42256-022-00477-5
    [84] Bohnslav JP, Wimalasena NK, Clausing KJ, et al. (2021) DeepEthogram, a machine learning pipeline for supervised behavior classification from raw pixels. Elife 10: e63377. https://doi.org/10.7554/eLife.63377
    [85] Wang PY, Sapra S, George VK, et al. (2021) Generalizable machine learning in neuroscience using graph neural networks. Front Artif Intell 4: 618372. https://doi.org/10.3389/frai.2021.618372
    [86] Watson DS, Krutzinna J, Bruce IN, et al. (2019) Clinical applications of machine learning algorithms: beyond the black box. Bmj 364: l886. https://doi.org/10.2139/ssrn.3352454
    [87] Jain A, Salas M, Aimer O, et al. (2024) Safeguarding patients in the AI era: ethics at the forefront of pharmacovigilance. Drug Safety 48: 119-127. https://doi.org/10.1007/s40264-024-01483-9
    [88] Murdoch B (2021) Privacy and artificial intelligence: challenges for protecting health information in a new era. BMC Med Ethics 22: 1-5. https://doi.org/10.1186/s12910-021-00687-3
    [89] Ziesche S (2021) AI ethics and value alignment for nonhuman animals. Philosophies 6: 31. https://doi.org/10.3390/philosophies6020031
    [90] Bossert L, Hagendorff T (2021) Animals and AI. the role of animals in AI research and application-an overview and ethical evaluation. Technol Soc 67: 101678. https://doi.org/10.1016/j.techsoc.2021.101678
    [91] Gong Y, Liu G, Xue Y, et al. (2023) A survey on dataset quality in machine learning. Inform Software Tech 162: 107268. https://doi.org/10.1016/j.infsof.2023.107268
    [92] Bolaños LA, Xiao D, Ford NL, et al. (2021) A three-dimensional virtual mouse generates synthetic training data for behavioral analysis. Nat Methods 18: 378-381. https://doi.org/10.1038/s41592-021-01103-9
    [93] Lashgari E, Liang D, Maoz U (2020) Data augmentation for deep-learning-based electroencephalography. J Neurosci Methods 346: 108885. https://doi.org/10.1016/j.jneumeth.2020.108885
    [94] Barile B, Marzullo A, Stamile C, et al. (2021) Data augmentation using generative adversarial neural networks on brain structural connectivity in multiple sclerosis. Comput Meth Prog Bio 206: 106113. https://doi.org/10.1016/j.cmpb.2021.106113
    [95] Memar S, Jiang E, Prado VF, et al. (2023) Open science and data sharing in cognitive neuroscience with MouseBytes and MouseBytes+. Sci Data 10: 210. https://doi.org/10.1038/s41597-023-02106-1
    [96] Jleilaty S, Ammounah A, Abdulmalek G, et al. (2024) Distributed real-time control architecture for electrohydraulic humanoid robots. Robot Intell Automat 44: 607-620. https://doi.org/10.1108/RIA-01-2024-0013
    [97] Zhao J, Wang Z, Lv Y, et al. (2024) Data-driven learning for H∞ control of adaptive cruise control systems. IEEE Trans Veh Technol 73: 18348-18362. https://doi.org/10.1109/TVT.2024.3447060
    [98] Kelly CJ, Karthikesalingam A, Suleyman M, et al. (2019) Key challenges for delivering clinical impact with artificial intelligence. BMC Med 17: 1-9. https://doi.org/10.1186/s12916-019-1426-2
    [99] Kulkarni PA, Singh H (2023) Artificial intelligence in clinical diagnosis: opportunities, challenges, and hype. Jama 330: 317-318. https://doi.org/10.1001/jama.2023.11440
    [100] Choudhury A, Asan O (2020) Role of artificial intelligence in patient safety outcomes: systematic literature review. JMIR Med inf 8: e18599. https://doi.org/10.2196/18599
    [101] Ratwani RM, Sutton K, Galarraga JE (2024) Addressing AI algorithmic bias in health care. Jama 332: 1051-1052. https://doi.org/10.1001/jama.2024.13486
    [102] Chen C, Sundar SS (2024) Communicating and combating algorithmic bias: effects of data diversity, labeler diversity, performance bias, and user feedback on AI trust. Hum-Comput Interact 2024: 1-37. https://doi.org/10.1080/07370024.2024.2392494
    [103] Chen F, Wang L, Hong J, et al. (2024) Unmasking bias in artificial intelligence: a systematic review of bias detection and mitigation strategies in electronic health record-based models. J Am Medl Inform Assn 31: 1172-1183. https://doi.org/10.1093/jamia/ocae060
    [104] Ienca M, Ignatiadis K (2020) Artificial intelligence in clinical neuroscience: methodological and ethical challenges. AJOB Neurosci 11: 77-87. https://doi.org/10.1080/21507740.2020.1740352
    [105] Avberšek LK, Repovš G (2022) Deep learning in neuroimaging data analysis: applications, challenges, and solutions. Front Neuroimag 1: 981642. https://doi.org/10.3389/fnimg.2022.981642
  • This article has been cited by:

    1. Feiming Huang, Minfei Fu, JiaRui Li, Lei Chen, KaiYan Feng, Tao Huang, Yu-Dong Cai, Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores, 2023, 1871, 15709639, 140889, 10.1016/j.bbapap.2023.140889
    2. Yaochen Xu, FeiMing Huang, Wei Guo, KaiYan Feng, Lin Zhu, Zhenbing Zeng, Tao Huang, Yu-Dong Cai, Characterization of chromatin accessibility patterns in different mouse cell types using machine learning methods at single-cell resolution, 2023, 14, 1664-8021, 10.3389/fgene.2023.1145647
    3. Jingxin Ren, Wei Guo, Kaiyan Feng, Tao Huang, Yudong Cai, Identifying MicroRNA Markers That Predict COVID-19 Severity Using Machine Learning Methods, 2022, 12, 2075-1729, 1964, 10.3390/life12121964
    4. Jing Li, FeiMing Huang, QingLan Ma, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of genes related to immune enhancement caused by heterologous ChAdOx1–BNT162b2 vaccines in lymphocytes at single-cell resolution with machine learning methods, 2023, 14, 1664-3224, 10.3389/fimmu.2023.1131051
    5. Jian Lu, Mei Meng, XianChao Zhou, Shijian Ding, KaiYan Feng, Zhenbing Zeng, Tao Huang, Yu-Dong Cai, Identification of COVID-19 severity biomarkers based on feature selection on single-cell RNA-Seq data of CD8+ T cells, 2022, 13, 1664-8021, 10.3389/fgene.2022.1053772
    6. Jingxin Ren, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Kwang Gi Kim, Identification of Methylation Signatures and Rules for Sarcoma Subtypes by Machine Learning Methods, 2022, 2022, 2314-6141, 1, 10.1155/2022/5297235
    7. Jing Li, JingXin Ren, HuiPing Liao, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of dynamic gene expression profiles during sequential vaccination with ChAdOx1/BNT162b2 using machine learning methods, 2023, 14, 1664-302X, 10.3389/fmicb.2023.1138674
    8. Hao Li, Qinglan Ma, Jingxin Ren, Wei Guo, Kaiyan Feng, Zhandong Li, Tao Huang, Yu-Dong Cai, Immune responses of different COVID-19 vaccination strategies by analyzing single-cell RNA sequencing data from multiple tissues using machine learning methods, 2023, 14, 1664-8021, 10.3389/fgene.2023.1157305
    9. Jingxin Ren, Yuhang Zhang, Wei Guo, Kaiyan Feng, Ye Yuan, Tao Huang, Yu-Dong Cai, Identification of Genes Associated with the Impairment of Olfactory and Gustatory Functions in COVID-19 via Machine-Learning Methods, 2023, 13, 2075-1729, 798, 10.3390/life13030798
    10. Yaochen Xu, Qinglan Ma, Jingxin Ren, Lei Chen, Wei Guo, Kaiyan Feng, Zhenbing Zeng, Tao Huang, Yudong Cai, Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells, 2023, 13, 2075-1729, 1011, 10.3390/life13041011
    11. Liang-Sian Lin, Chen-Huan Kao, Yi-Jie Li, Hao-Hsuan Chen, Hung-Yu Chen, Improved support vector machine classification for imbalanced medical datasets by novel hybrid sampling combining modified mega-trend-diffusion and bagging extreme learning machine model, 2023, 20, 1551-0018, 17672, 10.3934/mbe.2023786
    12. Lei Chen, Kaiyu Chen, Bo Zhou, Inferring drug-disease associations by a deep analysis on drug and disease networks, 2023, 20, 1551-0018, 14136, 10.3934/mbe.2023632
    13. Jingxin Ren, XianChao Zhou, Ke Huang, Lei Chen, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of key genes associated with persistent immune changes and secondary immune activation responses induced by influenza vaccination after COVID-19 recovery by machine learning methods, 2024, 169, 00104825, 107883, 10.1016/j.compbiomed.2023.107883
    14. Bo Zhou, Bing Ran, Lei Chen, A GraphSAGE-based model with fingerprints only to predict drug-drug interactions, 2024, 21, 1551-0018, 2922, 10.3934/mbe.2024130
    15. Jing-Xin Ren, Qian Gao, Xiao-Chao Zhou, Lei Chen, Wei Guo, Kai-Yan Feng, Lin Lu, Tao Huang, Yu-Dong Cai, Identification of Gene Markers Associated with COVID-19 Severity and Recovery in Different Immune Cell Subtypes, 2023, 12, 2079-7737, 947, 10.3390/biology12070947
    16. Pijush Dutta, Shobhandeb Paul, Gour Gopal Jana, Arindam Sadhu, 2023, Hybrid Genetic Algorithm Random Forest algorithm (HGARF) for improving the missing value Imputation in Hepatitis Medical Dataset, 979-8-3503-1504-2, 01, 10.1109/ISDCS58735.2023.10153526
    17. Qing-Lan Ma, Fei-Ming Huang, Wei Guo, Kai-Yan Feng, Tao Huang, Yu-Dong Cai, Machine Learning Classification of Time since BNT162b2 COVID-19 Vaccination Based on Array-Measured Antibody Activity, 2023, 13, 2075-1729, 1304, 10.3390/life13061304
    18. JingXin Ren, Qian Gao, XianChao Zhou, Lei Chen, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of key gene expression associated with quality of life after recovery from COVID-19, 2024, 62, 0140-0118, 1031, 10.1007/s11517-023-02988-8
    19. Qinglan Ma, FeiMing Huang, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of Phase-Separation-Protein-Related Function Based on Gene Ontology by Using Machine Learning Methods, 2023, 13, 2075-1729, 1306, 10.3390/life13061306
    20. Lei Chen, Chenyu Zhang, Jing Xu, PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes, 2024, 25, 1471-2105, 10.1186/s12859-024-05665-1
    21. Yong Yang, Yuhang Zhang, Jingxin Ren, Kaiyan Feng, Zhandong Li, Tao Huang, Yudong Cai, Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods, 2023, 13, 2075-1729, 1876, 10.3390/life13091876
    22. Jing Xin Ren, Lei Chen, Wei Guo, Kai Yan Feng, Yu-Dong Cai, Tao Huang, Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods, 2024, 27, 13862073, 2921, 10.2174/0113862073266300231026103844
    23. Xiandong Lin, QingLan Ma, Lei Chen, Wei Guo, Zhiyi Huang, Tao Huang, Yu-Dong Cai, Identifying genes associated with resistance to KRAS G12C inhibitors via machine learning methods, 2023, 1867, 03044165, 130484, 10.1016/j.bbagen.2023.130484
    24. QingLan Ma, Lei Chen, KaiYan Feng, Wei Guo, Tao Huang, Yu-Dong Cai, Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning, 2024, 0006-2928, 10.1007/s10528-024-10712-w
    25. Lei Chen, Xiaoyu Zhao, PCDA-HNMP: Predicting circRNA-disease association using heterogeneous network and meta-path, 2023, 20, 1551-0018, 20553, 10.3934/mbe.2023909
    26. Ruoqi Zhang, Xiaoming Huang, Qiang Zhu, Weakly supervised salient object detection via image category annotation, 2023, 20, 1551-0018, 21359, 10.3934/mbe.2023945
    27. Xiaohong Li, Xianchao Zhou, Shijian Ding, Lei Chen, Kaiyan Feng, Hao Li, Tao Huang, Yu-Dong Cai, Identification of Transcriptome Biomarkers for Severe COVID-19 with Machine Learning Methods, 2022, 12, 2218-273X, 1735, 10.3390/biom12121735
    28. Fei Yuan, JingXin Ren, HuiPing Liao, Wei Guo, Lei Chen, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identification of Whole-Blood DNA Methylation Signatures and Rules Associated with COVID-19 Severity, 2023, 28, 2768-6701, 10.31083/j.fbl2811284
    29. Xiao Zhou, Shuyi Guo, Kunlun Xin, Zhenheng Tang, Xiaowen Chu, Guangtao Fu, Network Embedding: The Bridge Between Water Distribution Network Hydraulics and Machine Learning, 2024, 00431354, 123011, 10.1016/j.watres.2024.123011
    30. Amir Mohamed Talib, Areen Metib Al-Hgaish, Rodziah Binti Atan, Abdulaziz Alshammari, Fahad Omar Alomary, Razali Yaakob, Abdulaziz Alsahli, Mohd Hafeez Osman, Evaluating Critical Success Factors in AI-Driven Drug Discovery Using AHP: A Strategic Framework for Optimization, 2025, 13, 2169-3536, 42045, 10.1109/ACCESS.2025.3546925
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(479) PDF downloads(20) Cited by(0)

Figures and Tables

Figures(3)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog