
With the development of deep learning, medical image segmentation has become a promising technique for computer-aided medical diagnosis. However, the supervised training of the algorithm relies on a large amount of labeled data, and the private dataset bias generally exists in previous research, which seriously affects the algorithm's performance. In order to alleviate this problem and improve the robustness and generalization of the model, this paper proposes an end-to-end weakly supervised semantic segmentation network to learn and infer mappings. Firstly, an attention compensation mechanism (ACM) aggregating the class activation map (CAM) is designed to learn complementarily. Then the conditional random field (CRF) is introduced to prune the foreground and background regions. Finally, the obtained high-confidence regions are used as pseudo labels for the segmentation branch to train and optimize using a joint loss function. Our model achieves a Mean Intersection over Union (MIoU) score of 62.84% in the segmentation task, which is an effective improvement of 11.18% compared to the previous network for segmenting dental diseases. Moreover, we further verify that our model has higher robustness to dataset bias by improved localization mechanism (CAM). The research shows that our proposed approach improves the accuracy and robustness of dental disease identification.
Citation: Yue Li, Hongmei Jin, Zhanli Li. A weakly supervised learning-based segmentation network for dental diseases[J]. Mathematical Biosciences and Engineering, 2023, 20(2): 2039-2060. doi: 10.3934/mbe.2023094
[1] | Danilo T. Pérez-Rivera, Verónica L. Torres-Torres, Abraham E. Torres-Colón, Mayteé Cruz-Aponte . Development of a computational model of glucose toxicity in the progression of diabetes mellitus. Mathematical Biosciences and Engineering, 2016, 13(5): 1043-1058. doi: 10.3934/mbe.2016029 |
[2] | Kimberly Fessel, Jeffrey B. Gaither, Julie K. Bower, Trudy Gaillard, Kwame Osei, Grzegorz A. Rempała . Mathematical analysis of a model for glucose regulation. Mathematical Biosciences and Engineering, 2016, 13(1): 83-99. doi: 10.3934/mbe.2016.13.83 |
[3] | Lela Dorel . Glucose level regulation via integral high-order sliding modes. Mathematical Biosciences and Engineering, 2011, 8(2): 549-560. doi: 10.3934/mbe.2011.8.549 |
[4] | Huazhong Yang, Wang Li, Maojin Tian, Yangfeng Ren . A personalized multitasking framework for real-time prediction of blood glucose levels in type 1 diabetes patients. Mathematical Biosciences and Engineering, 2024, 21(2): 2515-2541. doi: 10.3934/mbe.2024111 |
[5] | Delong Cui, Hong Huang, Zhiping Peng, Qirui Li, Jieguang He, Jinbo Qiu, Xinlong Luo, Jiangtao Ou, Chengyuan Fan . Next-generation 5G fusion-based intelligent health-monitoring platform for ethylene cracking furnace tube. Mathematical Biosciences and Engineering, 2022, 19(9): 9168-9199. doi: 10.3934/mbe.2022426 |
[6] | Yu-Mei Han, Hui Yang, Qin-Lai Huang, Zi-Jie Sun, Ming-Liang Li, Jing-Bo Zhang, Ke-Jun Deng, Shuo Chen, Hao Lin . Risk prediction of diabetes and pre-diabetes based on physical examination data. Mathematical Biosciences and Engineering, 2022, 19(4): 3597-3608. doi: 10.3934/mbe.2022166 |
[7] | Mengfan Liu, Runkai Jiao, Qing Nian . Training method and system for stress management and mental health care of managers based on deep learning. Mathematical Biosciences and Engineering, 2022, 19(1): 371-393. doi: 10.3934/mbe.2022019 |
[8] | Anarina L. Murillo, Jiaxu Li, Carlos Castillo-Chavez . Modeling the dynamics of glucose, insulin, and free fatty acids with time delay: The impact of bariatric surgery on type 2 diabetes mellitus. Mathematical Biosciences and Engineering, 2019, 16(5): 5765-5787. doi: 10.3934/mbe.2019288 |
[9] | Micaela Morettini, Christian Göbl, Alexandra Kautzky-Willer, Giovanni Pacini, Andrea Tura, Laura Burattini . Former gestational diabetes: Mathematical modeling of intravenous glucose tolerance test for the assessment of insulin clearance and its determinants. Mathematical Biosciences and Engineering, 2020, 17(2): 1604-1615. doi: 10.3934/mbe.2020084 |
[10] | Ying Zhu, Lipeng Guo, Jixin Zou, Liwen Wang, He Dong, Shengbo Yu, Lijun Zhang, Jun Li, Xueling Qu . JQ1 inhibits high glucose-induced migration of retinal microglial cells by regulating the PI3K/AKT signaling pathway. Mathematical Biosciences and Engineering, 2022, 19(12): 13079-13092. doi: 10.3934/mbe.2022611 |
With the development of deep learning, medical image segmentation has become a promising technique for computer-aided medical diagnosis. However, the supervised training of the algorithm relies on a large amount of labeled data, and the private dataset bias generally exists in previous research, which seriously affects the algorithm's performance. In order to alleviate this problem and improve the robustness and generalization of the model, this paper proposes an end-to-end weakly supervised semantic segmentation network to learn and infer mappings. Firstly, an attention compensation mechanism (ACM) aggregating the class activation map (CAM) is designed to learn complementarily. Then the conditional random field (CRF) is introduced to prune the foreground and background regions. Finally, the obtained high-confidence regions are used as pseudo labels for the segmentation branch to train and optimize using a joint loss function. Our model achieves a Mean Intersection over Union (MIoU) score of 62.84% in the segmentation task, which is an effective improvement of 11.18% compared to the previous network for segmenting dental diseases. Moreover, we further verify that our model has higher robustness to dataset bias by improved localization mechanism (CAM). The research shows that our proposed approach improves the accuracy and robustness of dental disease identification.
Type 2 diabetes mellitus is one of the most prevalent, multifactorial chronic diseases with a high rate of disability and mortality, which badly demands lifelong cost-efficient personalised management [1,2]. Blood glucose control is essential to postpone disease progression and alleviate symptoms [3,4].
RT-CGM (Real-time continuous glucose monitor), a portable subcutaneous interstitial glucose detector, enables long-term glucose control for its benefits in less puncture and more holistic pictures of blood glucose [5]. However, this CGM system limits its leverage due to its gap in glucose prediction [6], which can be filled by glucose trajectory prediction, such as deep learning tools. In this way, people can take action beforehand to eliminate imminent events, such as hyperglycaemic crisis and hypoglycaemic coma [7,8].
LSTM-RNN (long short-term memory - recursive neural network), the deep learning model fitting long sequence data such as continuous glucose [9], has the potential competence to provide a more accurate glucose trajectory prediction [10]. Previously, Sadegh Mirshekarian et al. presented an RNN approach with LSTM units to learn a physiological model of blood glucose [11]. And Mario Munoz-Organero also proposed, implemented, validated and compared a new hybrid deep learning model to mimic the metabolic behaviour of physiological blood glucose methods [12]. These studies indicated that deep learning contained the power to predict health-related parameters. Besides, the differential equations for carbohydrate and insulin absorption in physiological models were also modelled using LSTM cells. Rabby et al. proposed a novel approach to predict blood glucose levels with a stacked LSTM based on a deep RNN model considering sensor fault [13]. Based on these studies, we believe that LSTM-RNN would improve glucose prediction in diabetic patients with a stronger prediction power.
Although the precise prediction of glucose among patients suffering from type 2 diabetes is necessary and many researchers have verified the accuracy of deep learning models [14], only limited studies have tested the detailed effects of these potentially burden-free, deep-learning-involved care in the community [15]. Besides, only a few studies paid attention to the ethnoracial disparity, not to mention the inter-personal variation of glucose management with the deep-learning models [16]. Furthermore, scarce studies cared for the mental conditions of patients who utilise these deep learning-based health management models. Hence, we would like to explore whether this personally developed deep-learning model supported glucose management would improve the health of type 2 diabetes, both physically and mentally.
In this study, we would apply a personalized glucose prediction model to test the merits of the deep learning-assisted personalised health management pattern, aimed to support the management strategies for type 2 diabetic patients in the community. Firstly, we represented the development of individual deep learning models for blood glucose trajectory prediction. Then we exhibited a participant receiving personalised diabetes self-management in a real-world scenario who finally showed well-controlled blood glucose data despite increased stress.
A 42-year man (the Han Chinese; living at home) with a history of type 2 diabetes for four years expressed interest in our management strategy in a review at Tianjin Medical University Metabolic Diseases Hospital. His body mass index (BMI) was 26.23 kg/m2. With fasting blood glucose (FBG) of 8.2 mmol/L and glycosylated haemoglobin (HbA1c) of 9.2 × 10-2 mmol/mol (8.4%), he was treated with Metformin 0.5 g three times a day. There was no history of diabetic retinopathy, neuropathy, carcinoma or any other comorbidities except a history of mild depression for four years. He was a non-smoker, and he seldom drank alcohol. His diabetes management aimed to control fasting glucose from 4.4 mmol/L to 7.0 mmol/L. With a clear consciousness and independent ability in daily life, the person agreed to participate and signed the consent for engagement and publication. In addition, the medical jargon was explained in the supplement file.
In this paradigm (Figure 1(a)), the person was armed with RT-CGM (FreeStyle® Libre, Abbott Diabetes Care Ltd.) and instructed to submit relevant data each day (for adjustment, Sec 0, day0–day3). The predicted trajectory by the personalised deep learning model, which was transferred from general glucose models, was not returned (for monitoring, Sec 1, day4–day8) until the ninth day, following the model application (Sec 2, day9–day13) and regular follow-up.
The individual LSTM-RNN model was transferred from naïve LSTM-RNN models (Prem) and developed from personal RT-CGM data. Naïve LSTM-RNN models were created and evaluated with 16 cases of CGM data (17,182 sets of glucose data) from the CGM system, whose interval was about 15 minutes. We chose three top models (Model a-c in Figure 1(a)) from ten replications of naïve models in case of the disparate features between cases in the naïve model development and the person applied the personalised prediction model. While in the following progressing and prediction sections, only the top model was employed to acquire personalised super-parameters, even though we tripled each forecast to assess the stability of the models. In these prediction models, the independent variables were a sequence of glucose obtained from CGM data, and the dependent variables were the glucose trajectory in the ensuing two hours. The core of the prediction models was iterated by Adam gradient optimization [17,18] and assessed by RMSE [19].
Personalised models were transferred (Tran) and fine-tuned (Tune) from naïve models by RT-CGM data in the adjustment section (Sec 0). And the individual LSTM-RNN models were progressed (Prog) during the monitoring section (Sec 1). Then the appendix 2-hour prediction was applied (Appl) on time for the 3rd section by three individual LSTM-RNN models.
Since blood glucose is commonly acknowledged in the short-term management of diabetic patients, some glucose-related parameters were chosen, including average glucose, daily hyperglycaemic time, daily time of high glucose and high blood glucose index [10,20]. Besides, fasting glucose, glycosylated haemoglobin, and BMI were used to contrast the lasting efficacy of this interference. Moreover, we also inspected psychological disturbance by the PAID (the Problem Area in Diabetes) scale, a reliable psychometric examiner for diabetes-related emotional distress [21]. Further description of the parameters used above was in the supplement file.
RMSE (Root Mean Square Error) was calculated to evaluate the model accuracy. The student's T test was applied for continuous data with Gaussian distribution (mean ± SEM), and the chi-square test was used for those non-Gaussian distribution data. As for the individual application results, data were reported directly for scarce cases.
The fine-tuned models had an average RMSE of 0.99 mmol/L for glucose prediction. And the simulation of on-time application was followed with an average RMSE of 0.98 mmol/L. The progressed models had a lower average RMSE (0.75 mmol/L) in on-time simulation.
Compared with before application (Sec 1), the daily average glucose of this participant was decreased with the incorporation of a personalised self-management strategy in daily glucose level (Sec 2) (8.68 mmol/L ± 0.24 mmol/L to 8.02 mmol/L ± 0.11 mmol/L, mean ± SEM, P < 0.05), especially at night monitoring during 18:00 to 6:00 the next day (8.99 mmol/L ± 0.34 mmol/L to 7.65 mmol/L ± 0.13 mmol/L, mean ± SEM, P < 0.01). The measurements are exhibited in Figure 2(a, b). Mean of daily difference tended to be lower (from 0.15 mmol/L ± 0.66 mmol/L to 0.05 mmol/L ± 0.35 mmol/L). Time in the target range (3.9 mmol/L to 10.0 mmol/L) increased from 19.40 hours to 21.70 hours per day, along with less high glucose (Alert, CGM in a range of 10.0 mmol/L to 13.9 mmol/L, 1.70 hours per day; Clinically significant 0.55 hour per day). The hyperglycaemic risk was alleviated for high blood glucose index (HBGI) has dropped from 12.99 × 102 to 9.17 × 102, with a mild decrease in glucose variation (standard deviation, from 1.81 mmol/L to 1.59 mmol/L). The measured results are shown in Figure 2(c–e).
Furthermore, the follow-up questionnaire in 3 months displayed a dropped BMI (24.69 kg/m2), less energy intake, more physical exercise, and better sleep. Besides, the participant had a lower FBG (6.7 mmol/L) and HbA1c 7.9 × 10-2 mol/mol, with a regimen of Trajenta 5.0 mg in the morning and Metformin 1.0 g at night. The measurements are shown in Figure 2(f–h) and Table 1. More diabetes-associated distress was revealed by PAID scale, with a total increase of 6.25%, especially in the diet facet (sense of dietary deprivation), attributing to more than half of the rise. Others were related to social support (less safety) and emotion, like loneliness and depression (Table 2).
Physical status | Baseline | Follow-up |
BMI (kg/m2) | 26.23 | 24.69 |
FBG (mmol/L) | 8.2 | 6.7 |
HbA1c (mol/mol) | 9.2 × 10-2 | 7.9 × 10-2 |
PAID (%) | Baseline | Follow-up |
Emotional | 26.25 | 27.50 |
Therapeutic | 12.50 | 12.50 |
Dietary | 18.75 | 22.50 |
Social supporting | 15.00 | 16.25 |
total | 72.50 | 78.75 |
Note: Diabetes-associated distress data assessed by PAID scale (Problem Areas In Diabetes) were compared between baseline and follow-up in three months [21]. |
Efficient and economical life-long glucose control is vital for people with diabetes in the community. This case depicts the construction of personalised deep learning models for on-time application. It suggests that deep learning customised glucose prediction may be a potential remedy for the CGM system in glucose forecast for life-long health management. And the application of the predicted system also requires the proper support from health care providers.
Personalised deep learning models are a potential supplement to CGM with kind of accuracy (Figure 2(b, c)). The individual may avoid abnormal hyperglycaemia glucose actively by more physical activity and less food intake, learning how to control his glucose appropriately. Moreover, the significant decrease of glucose (Figure 2(a–e)) and the mild change of glucose variation crossing two segments, along with the average daily RMSE (1.59 mmol/L) in the application segment, also support the applicability of deep learning in glucose prediction as insertion of the CGM system. However, the specific role of deep learning customised diabetes control still needs more cases engaged.
Deep learning customised glucose prediction can be more accurate with more data and advanced artificial intelligence techniques. Customised models built from 16 instances of CGM data and personal historical CGM data had a limited performance in this case. Considering the data reliance on prediction models, we believe that with more training data, multiple information involved in the era of big data and cutting-edge artificial intelligence techniques [23,24,25], the accuracy of deep prediction models will be higher, accomplishing individual healthcare step-by-step [26].
Deep learning customised glucose prediction with a CGM system may be a feasible personalised self-management strategy in the community. The participant has experienced a promising change in glucose after two sessions, which was sustained and with a lower BMI in 3 months, together with more physical activity and less food intake (Figure 2(f–h)). Besides, due to the limited accessibility of caregivers, the deep learning prediction provides optional guidance on how to act before the occurrence of abnormal glucose and to alleviate burdens in labour and economics.
Despite the promising results of this strategy mentioned above, attention must be paid to aspects such as stress. The participant suffered more stress both during the monitoring fortnight and 3-month follow-up. Researches indicate that the intervention efforts of diabetes stress were helpful in the comprehensive diabetes care, contributing to behaviour changes [27], elevated diabetes-associated stress might barricade the responsibility to beneficial intervention [28], and may increase depression burden [29]. The prolonged impact on glucose and behaviours, together with the condition of his mental health and stress, requires support from peers and healthcare providers [30,31]. Moreover, this strategy also demands a regular follow-up. Increasing the size of the participant samples, will help reinforce the certainty of the results., the one tailored for the personalised management of diabetes. Therefore, we would incorporate more patients to further research. We believe that more participants involved in more strictly designed clinical trials integrated with deep learning prediction would improve the outcome of patients with diabetes and march on the development of health management.
In summary, deep learning customised glucose prediction may be accessible to personalised health care in the long-term management of type 2 diabetes, for example, by aiding in the CGM system. This would be beneficial for people suffering from this chronic disease, since a promising outcome (i.e., a decrease of glucose into a safer range, as shown above) might also occur on them using this care pattern, though more cases should be involved to test the validity of this caring pattern and more clinical settings should also be tested. Furthermore, the diabetes stress should be emphasized too, which seems to require a periodical care from the healthcare providers and the family members of patients.
We thank the support from the China Postdoctoral International Exchange Program Academic Exchange Project, Science and Technology Program of Tianjin (18ZXZNSY00280) and the Tianjin Medical University college student Innovation training program.
The participant has no conflicts of interest to disclose. Written informed consent was obtained from the participant for collecting his real-world data and publication. The funders did not participate in the designing, data gathering and analysing, publicising, or preparing of the manuscript. Abbott Diabetes Care supported the CGM data, discounted continuous glucose monitoring system (device and sensor), and equipment guidance and real-time communication.
Except for general clinical glucose metrics like fasting glucose and glycosylated haemoglobin, body mass index was also applied to contrast the lasting efficacy of the intelligence-assisted health management.
Furthermore, continuous glucose parameters were included to evaluate the short-dated efficacy of glucose management, such as average glucose, daily hyperglycaemic time, daily time of high glucose and high blood glucose index.
Average glucose: the arithmetic average of glucose.
SD, standard deviation, evaluating the glucose variation.
Daily time of high glucose: the total time of glucose above range each day.
daily hyperglycaemic time: glucose multiplied by the time of glucose above range each day. The glucose above range were sectioned by clinical risk, alert (CGM data > 10.0 mmol/l, CGM data < 13.9 mmol/l) and clinically significant (CGM data ≥ 13.9 mmol/l).
High blood glucose index: a metric reflecting the risk of hyperglycaemia, calculated by functions below.
HBGI=∑22.77⋅f(xi)2n, f(xi)=ln(xi)1.084–5.381,iff(xi)≥0 for glucose readings x1, …, xn measured in mg/dl.
MODD, the mean of daily differences, evaluating intraday variability from all 24h intervals2.
Deep learning has leapt into the public view since the triumph of AlphaGo and got unceasing victories from diabetic retinopathy identification, medical events or outcomes prediction, and health care opmisation. The recurrent neural network (RNN), designed to sequence data, is powerful for long sequences after the incorporation of Long Short-Term Memory (LSTM), the one proposed to solve the "long-term dependencies" problem. We then described the LSTM-RNN models to predict glucose trajectories for diabetes management.
Continuous glucose data by CGM (continuous glucose monitor) system (FreeStyle® Libre, Abbott Diabetes Care Ltd.) from December 2014 to September 2017 were collected from16 cases (up to 15 days per case). These data were arranged as daily glucose from 0 to 24 O'clock, at an interval of about 15 mins. Missing values were imputed by the likelihood StructTS method on the R 3.4 platform.
Real-time monitoring data of this participant were obtained from CGM system (FreeStyle® Libre, Abbott Diabetes Care Ltd.) in September 2017. Fortnight average glucose from CGM was 8.9 mmol/L (160 mg/dL), with an estimated haemoglobin of 7.2% (55 mmol/mol). The distribution of glucose was 0 in Very Low, 0 in Low Alert, 77.0% in Target Range, 18.2% High Alert, and 4.8% in Very High4. About 23% of the glucose points were above the normal range. The utilisation rate of the CGM system was 99%, with an average of 53 times scans a day. No insulin was used during the two-weeks monitoring, which was assured by the assignment.
Model architecture referred to previous works. Input and output layers were 1-dimensional glucose data, and prediction of appendix 2-hour glucose by recurrence was iterated by Adam gradient optimization. The formula of naïve LSTM-RNN models was composed here where input gate, forget gate, and output gate were sigmoid function, while those of input and output block were hyperbolic tangent functions. The CGM data set was divided into two groups for the construction of naïve models (Prem), in which 14,878 (13 cases) in 17,182 data were put in the training set in python 3.6.
Block input: zt=tanh(Wz⋅[Rt−1,xt]+bz)
Input gate: it=σ(Wi⋅[Rt−1,xt]+bi)
Forget gate: ft=σ(Wf⋅[Rt−1,xt]+bf)
Cell: ct=zt⋅it+ct−1⋅ft
Output gate: ot=σ(Wo⋅[Rt−1,xt]+bo)
Block output: yt=tanh(ct)⋅ot
Rt=yt
Transferring model (Tran): Ⅰ, load three best models; train three days, test 1 day, chose superparameters with the best RMSE.
Fine-tuned model (Tune): Ⅱ, super-parameters fine-tuned based on those output from Ⅰ, and each selected 3 top models with ten replications; train three days, test 1 day, chose superparameters with the best RMSE.
Progressing model (Prog): Ⅲ, super-parameters fine-tuned based on the top model from Ⅱ, though each prediction was tested in triplicate to assess the accuracy. Each one was re-built for three times; train seven days, test 1day (all past personalised data).
On-time application (Appl): Ⅴ, super-parameters fine-tuned based on the top models from Ⅲ, though each prediction was tested in triplicate to assess the accuracy; train N-1 days (all past personalised data), test for 2-hour trajectory.
[1] |
R. K. Meleppat, M. V. Matham, L. K. Seah, An efficient phase analysis-based wavenumber linearization scheme for swept source optical coherence tomography systems, Laser Phys. Lett., 12 (2015), 055601. https://dx.doi.org/10.1088/1612-2011/12/5/055601 doi: 10.1088/1612-2011/12/5/055601
![]() |
[2] |
K. M. Ratheesh, L. K. Seah, V. M. Murukeshan, Spectral phase-based automatic calibration scheme for swept source-based optical coherence tomography systems, Phys. Med. Biol., 61 (2016), 7652. https://dx.doi.org/10.1088/0031-9155/61/21/7652 doi: 10.1088/0031-9155/61/21/7652
![]() |
[3] |
R. K. Meleppat, C. Shearwood, L. K. Seah, M. V. Matham, Quantitative optical coherence microscopy for the in situ investigation of the biofilm, J. Biomed. Opt., 21 (2016), 127002. https://doi.org/10.1117/1.JBO.21.12.127002 doi: 10.1117/1.JBO.21.12.127002
![]() |
[4] |
R. K. Meleppat, P. Prabhathan, S. L. Keey, M. V. Matham, Plasmon resonant silica-coated silver nanoplates as contrast agents for optical coherence tomography, J. Biomed. Nanotechnol., 12 (2016), 1929–1937. https://doi.org/10.1166/jbn.2016.2297 doi: 10.1166/jbn.2016.2297
![]() |
[5] |
W. J. Park, J. B. Park, History and application of artificial neural networks in dentistry, Eur. J. Dent., 12 (2018), 594–601. https://doi.org/10.4103/ejd.ejd_325_18 doi: 10.4103/ejd.ejd_325_18
![]() |
[6] | A. Rana, G. Yauney, L. C. Wong, O. Gupta, A. Muftu, P. Shah, Automated segmentation of gingival diseases from oral images, in 2017 IEEE Healthcare Innovations and Point of Care Technologies (HI-POCT), (2017), 144–147. https://doi.org/10.1109/HIC.2017.8227605 |
[7] |
X. Xu, C. Liu, Y. Zheng, 3D tooth segmentation and labeling using deep convolutional neural networks, IEEE Trans. Vis. Comput. Graph., 25 (2019), 2336–2348. https://doi.org/10.1109/TVCG.2018.2839685 doi: 10.1109/TVCG.2018.2839685
![]() |
[8] | S. Sivagami, P. Chitra, G. S. R. Kailash, S. R. Muralidharan, UNet architecture based dental panoramic image segmentation, in 2020 International Conference on Wireless Communications Signal Processing and Networking (WiSPNET), (2020), 187–191. https://doi.org/10.1109/WiSPNET48689.2020.9198370 |
[9] | S. Li, Z. Pang, W. Song, Y. Guo, W. You, A. Hao, et al., Low-shot learning of automatic dental plaque segmentation based on local-to-global feature fusion, in 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), (2020), 664–668. https://doi.org/10.1109/ISBI45749.2020.9098741 |
[10] | M. C. Kaya, G. B. Akar, Dental x-ray image segmentation using octave convolution neural network, in 2020 28th Signal Processing and Communications Applications Conference (SIU), (2020), 1–4. https://doi.org/10.1109/SIU49456.2020.9302495 |
[11] | W. Van Gansbeke, S. Vandenhende, S. Georgoulis, L. Van Gool, Unsupervised semantic segmentation by contrasting object mask proposals, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV), (2021), 10052–10062. https://doi.org/10.1109/ICCV48922.2021.00990 |
[12] | Z. Chen, T. Wang, X. Wu, X. S. Hua, H. Zhang, Q. Sun, Class re-activation maps for weakly-supervised semantic segmentation, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2022), 959–968. https://doi.org/10.1109/CVPR52688.2022.00104 |
[13] | J. Ahn, S. Kwak, Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 4981–4990. https://doi.org/10.1109/CVPR.2018.00523 |
[14] | J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2015), 3431–3440. https://doi.org/10.1109/CVPR.2015.7298965 |
[15] | J. Kim, J. Canny, Interpretable learning for self-driving cars by visualizing causal attention, in 2017 IEEE International Conference on Computer Vision (ICCV), (2017), 2961–2969. https://doi.org/10.1109/ICCV.2017.320 |
[16] | Z. Zhang, Y. Xie, F. Xing, M. McGough, L. Yang, MDNet: A semantically and visually interpretable medical image diagnosis network, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2017), 3549–3557. https://doi.org/10.1109/CVPR.2017.378 |
[17] | C. Song, Y. Huang, W. Ouyang, L. Wang, Box-driven class-wise region masking and filling rate guided loss for weakly supervised semantic segmentation, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 3131–3140. https://doi.org/10.1109/CVPR.2019.00325 |
[18] | M. Tang, F. Perazzi, A. Djelouah, I. Ben Ayed, C. Schroers, Y. Boykov, On regularized losses for weakly-supervised CNN segmentation, in European Conference on Computer Vision, (2018), 524–540. https://doi.org/10.1007/978-3-030-01270-0_31 |
[19] | K. K. Maninis, S. Caelles, J. Pont-Tuset, L. Van Gool, Deep extreme cut: From extreme points to object segmentation, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 616–625. https://doi.org/10.1109/CVPR.2018.00071 |
[20] | X. Zhang, Y. Wei, J. Feng, Y. Yang, T. S. Huang, Adversarial complementary learning for weakly supervised object localization, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 1325–1334. https://doi.org/10.1109/CVPR.2018.00144 |
[21] | Y. Wei, J. Feng, X. Liang, M. M. Cheng, Y. Zhao, S. Yan, Object region mining with adversarial erasing: A simple classification to semantic segmentation approach, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2017), 6488–6496. https://doi.org/10.1109/CVPR.2017.687 |
[22] | X. Wang, S. You, X. Li, H. Ma, Weakly-supervised semantic segmentation by iteratively mining common object features, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 1354–1362. https://doi.org/10.1109/CVPR.2018.00147 |
[23] | Z. Huang, X. Wang, J. Wang, W. Liu, J. Wang, Weakly-supervised semantic segmentation network with deep seeded region growing, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 7014–7023. https://doi.org/10.1109/CVPR.2018.00733 |
[24] | B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, A. Torralba, Learning deep features for discriminative localization, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016), 2921–2929. https://doi.org/10.1109/CVPR.2016.319 |
[25] | R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, D. Batra, Grad-cam: Visual explanations from deep networks via gradient-based localization, in 2017 IEEE International Conference on Computer Vision (ICCV), (2017), 618–626. https://doi.org/10.1109/ICCV.2017.74 |
[26] | K. Baek, M. Lee, H. Shim, Psynet: Self-supervised approach to object localization using point symmetric transformation, in Proceedings of the AAAI Conference on Artificial Intelligence, 34 (2020), 10451–10459. https://doi.org/10.1609/aaai.v34i07.6615 |
[27] | F. Wan, P. Wei, J. Jiao, Z. Han, Q. Ye, Min-entropy latent model for weakly supervised object detection, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018), 1297–1306. https://doi.org/10.1109/CVPR.2018.00141 |
[28] | Y. Wang, J. Zhang, M. Kan, S. Shan, X. Chen, Self-supervised equivariant attention mechanism for weakly supervised semantic segmentation, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2020), 12272–12281. https://doi.org/10.1109/CVPR42600.2020.01229 |
[29] | P. Krähenbühl, V. Koltun, Parameter learning and convergent inference for dense random fields, in Proceedings of the 30th International Conference on International Conference on Machine Learning, (2013), 513–521. |
[30] | K. Simonyan, A. Vedaldi, A. Zisserman, Deep in-side convolutional networks: Visualising image classification models and saliency maps, preprint, arXiv: 1312.6034 |
[31] |
T. Joy, A. Desmaison, T. Ajanthan, R. Bunel, M. Salzmann, P. Kohli, et al., Efficient relaxations for dense crfs with sparse higher-order potentials, SIAM J. Imaging Sci., 12 (2019), 287–318. https://doi.org/10.1137/18M1178104 doi: 10.1137/18M1178104
![]() |
[32] | E. D. Cubuk, B. Zoph, D. Mane, V. Vasudevan, Q. V. Le, Autoaugment: Learning augmentation policies from data, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 113–123. https://doi.org/10.1109/CVPR.2019.00020 |
[33] |
Z. Wu, C. Shen, A. Van Den Hengel, Wider or deeper: Revisiting the resnet model for visual recognition, Pattern Recognit., 90 (2019), 119–133. https://doi.org/10.1016/j.patcog.2019.01.006 doi: 10.1016/j.patcog.2019.01.006
![]() |
[34] | P. Goyal, P. Dollár, R. Girshick, P. Noordhuis, L. Wesolowski, A. Kyrola, et al., Accurate, large minibatch SGD: Training imageNet in 1 hour, preprint, arXiv.1706.02677 |
[35] | T. He, Z. Zhang, H. Zhang, Z. Zhang, J. Xie, M. Li, Bag of tricks for image classification with convolutional neural networks, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 558–567. https://doi.org/10.1109/CVPR.2019.00065 |
[36] | L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A. L. Yuille, Semantic image segmentation with deep convolutional nets and fully connected CRFs, preprint, arXiv.1412.7062 |
[37] | W. Shang, Z. Li, Y. Li, Identification of common oral disease lesions based on U-Net. in 2021 IEEE 3rd International Conference on Frontiers Technology of Information and Computer (ICFTIC), (2021), 194–200. https://doi.org/10.1109/ICFTIC54370.2021.9647420 |
[38] | T. Zhou, M. Zhang, J. Li, Regional semantic contrast and aggregation for weakly supervised semantic segmentation. in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2022), 4289–4299. https://doi.org/10.1109/CVPR52688.2022.00426 |
[39] |
Z. Wu, C. Shen, A. Van Den Hengel, Wider or deeper: Revisiting the resnet model for visual recognition, Pattern Recognit., 90 (2019), 119–133. https://doi.org/10.1016/j.patcog.2019.01.006 doi: 10.1016/j.patcog.2019.01.006
![]() |
[40] | K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, preprint, arXiv.1409.1556 |
1. | Ke Shang, Muhammad Asif, The Design of a Compound Neural Network-Based Economic Management Model for Advancing the Digital Economy, 2023, 35, 1546-2234, 1, 10.4018/JOEUC.330678 |
Physical status | Baseline | Follow-up |
BMI (kg/m2) | 26.23 | 24.69 |
FBG (mmol/L) | 8.2 | 6.7 |
HbA1c (mol/mol) | 9.2 × 10-2 | 7.9 × 10-2 |
PAID (%) | Baseline | Follow-up |
Emotional | 26.25 | 27.50 |
Therapeutic | 12.50 | 12.50 |
Dietary | 18.75 | 22.50 |
Social supporting | 15.00 | 16.25 |
total | 72.50 | 78.75 |
Note: Diabetes-associated distress data assessed by PAID scale (Problem Areas In Diabetes) were compared between baseline and follow-up in three months [21]. |
Physical status | Baseline | Follow-up |
BMI (kg/m2) | 26.23 | 24.69 |
FBG (mmol/L) | 8.2 | 6.7 |
HbA1c (mol/mol) | 9.2 × 10-2 | 7.9 × 10-2 |
PAID (%) | Baseline | Follow-up |
Emotional | 26.25 | 27.50 |
Therapeutic | 12.50 | 12.50 |
Dietary | 18.75 | 22.50 |
Social supporting | 15.00 | 16.25 |
total | 72.50 | 78.75 |
Note: Diabetes-associated distress data assessed by PAID scale (Problem Areas In Diabetes) were compared between baseline and follow-up in three months [21]. |