
Early screening for cervical cancer is a common form of cancer prevention. In microscopic images of cervical cells, the number of abnormal cells is small, and some abnormal cells are heavily stacked. Segmenting highly overlapping cells and identifying single cells within the overlaps remains a challenging task. Therefore, this paper proposes an object detection algorithm, Cell_yolo, to segment overlapping cells effectively and accurately. Cell_yolo adopts a simplified network structure and improves the max pooling operation so that image information is preserved to the greatest extent during pooling. Given the large number of overlapping cells in cervical cell images, a center-distance-based non-maximum suppression method is proposed to prevent the detection boxes of overlapping cells from being deleted by mistake. At the same time, the loss function is improved with the addition of a focal loss term to alleviate the imbalance of positive and negative samples during training. Experiments on a private dataset (BJTUCELL) verify that the Cell_yolo model has low computational complexity and high detection accuracy, and that it outperforms common network models such as YOLOv4 and Faster_RCNN.
Citation: Nengkai Wu, Dongyao Jia, Chuanwang Zhang, Ziqi Li. Cervical cell extraction network based on optimized yolo[J]. Mathematical Biosciences and Engineering, 2023, 20(2): 2364-2381. doi: 10.3934/mbe.2023111
[1] Asma A. Alhashmi, Manal Abdullah Alohali, Nazir Ahmad Ijaz, Alaa O. Khadidos, Omar Alghushairy, Ahmed Sayed. Bayesian optimization with deep learning based pepper leaf disease detection for decision-making in the agricultural sector. AIMS Mathematics, 2024, 9(7): 16826-16847. doi: 10.3934/math.2024816
[2] Mashael Maashi, Mohammed Abdullah Al-Hagery, Mohammed Rizwanullah, Azza Elneil Osman. Deep convolutional neural network-based Leveraging Lion Swarm Optimizer for gesture recognition and classification. AIMS Mathematics, 2024, 9(4): 9380-9393. doi: 10.3934/math.2024457
[3] Majdy M. Eltahir, Ghadah Aldehim, Nabil Sharaf Almalki, Mrim M. Alnfiai, Azza Elneil Osman. Reinforced concrete bridge damage detection using arithmetic optimization algorithm with deep feature fusion. AIMS Mathematics, 2023, 8(12): 29290-29306. doi: 10.3934/math.20231499
[4] Eman A. Al-Shahari, Marwa Obayya, Faiz Abdullah Alotaibi, Safa Alsafari, Ahmed S. Salama, Mohammed Assiri. Accelerating biomedical image segmentation using equilibrium optimization with a deep learning approach. AIMS Mathematics, 2024, 9(3): 5905-5924. doi: 10.3934/math.2024288
[5] Mashael M. Asiri, Abdelwahed Motwakel, Suhanda Drar. Robust sign language detection for hearing disabled persons by Improved Coyote Optimization Algorithm with deep learning. AIMS Mathematics, 2024, 9(6): 15911-15927. doi: 10.3934/math.2024769
[6] Wahida Mansouri, Amal Alshardan, Nazir Ahmad, Nuha Alruwais. Deepfake image detection and classification model using Bayesian deep learning with coronavirus herd immunity optimizer. AIMS Mathematics, 2024, 9(10): 29107-29134. doi: 10.3934/math.20241412
[7] Fatma S. Alrayes, Latifah Almuqren, Abdullah Mohamed, Mohammed Rizwanullah. Image encryption with leveraging blockchain-based optimal deep learning for Secure Disease Detection and Classification in a smart healthcare environment. AIMS Mathematics, 2024, 9(6): 16093-16115. doi: 10.3934/math.2024779
[8] Hanan T. Halawani, Aisha M. Mashraqi, Yousef Asiri, Adwan A. Alanazi, Salem Alkhalaf, Gyanendra Prasad Joshi. Nature-Inspired Metaheuristic Algorithm with deep learning for Healthcare Data Analysis. AIMS Mathematics, 2024, 9(5): 12630-12649. doi: 10.3934/math.2024618
[9] Olfa Hrizi, Karim Gasmi, Abdulrahman Alyami, Adel Alkhalil, Ibrahim Alrashdi, Ali Alqazzaz, Lassaad Ben Ammar, Manel Mrabet, Alameen E.M. Abdalrahman, Samia Yahyaoui. Federated and ensemble learning framework with optimized feature selection for heart disease detection. AIMS Mathematics, 2025, 10(3): 7290-7318. doi: 10.3934/math.2025334
[10] Khaled Tarmissi, Hanan Abdullah Mengash, Noha Negm, Yahia Said, Ali M. Al-Sharafi. Explainable artificial intelligence with fusion-based transfer learning on adverse weather conditions detection using complex data for autonomous vehicles. AIMS Mathematics, 2024, 9(12): 35678-35701. doi: 10.3934/math.20241693
Automatic Fruit Disease Detection (FDD) refers to the process of identifying the presence of diseases in fruit crops. Image analysis, sensor technology, and Machine Learning (ML) are some of the techniques through which such diseases can be detected. ML remains the most common approach to automatic fruit disease detection; it works by analyzing images of the fruit [1]. For this purpose, a large set of images of both diseased and healthy fruits is collected and used to train the ML model, which then learns the patterns in these images that are indicative of disease [2]. Sensor technology is another method used to collect data on fruit crops, including humidity, temperature, and other environmental readings that can confirm the presence of particular diseases [3]. The collected data can then be used to classify fruit as either diseased or healthy. An automated FDD system offers numerous benefits. One of the most important is accurate and rapid disease detection [4], which helps prevent disease and protect crop health. Automatic detection systems can monitor vast areas of crops, reducing the need for manual inspection. They are also more objective than human analysis, which minimizes errors and improves the precision of the detection process [5]. However, a few difficulties arise when deploying an automatic FDD system [6]. The need for high-quality, accurate data is among the most challenging issues: the model cannot detect disease accurately if the data used to train it is of poor quality or incomplete [7]. A further difficulty is the demand for robust algorithms that can handle variations in the dataset.
For instance, the method should accommodate different kinds of fruits that may have diverse features. Finally, automatic systems require reliable software and hardware infrastructure to function properly [8].
There has been a significant surge in the application of image processing and machine vision technologies to enrich the quality of fruit surface images, since such technologies provide huge benefits in areas where the human eye is not sensitive. The application of image processing and Computer Vision (CV) methods thus overcomes the challenges posed by subjective industrial quality-control approaches [9]. Although the performance of earlier ML algorithms is good enough, these methods rely heavily on the types of fruit-crop images used during training and testing. Their outcomes also depend on handcrafted feature extraction, which is labor-intensive. Additionally, such methods have been trained and tested only on smaller datasets, which raises the risk of biased estimates. An alternative that overcomes these issues is to leverage Deep Learning (DL) methods to develop a fruit classification and grading mechanism [10]. Such a DL-based technique should be capable of deriving the appropriate features automatically, without any need for manual intervention [11].
In this background, the current research paper introduces a novel Battle Royale Optimization with Feature Fusion Based Fruit Disease Grading and Classification (BROFF-FDGC) approach. In the proposed BROFF-FDGC approach, the Bilateral Filtering (BF) model is initially implemented for noise elimination. Besides, a compendium of DL methods, namely the Inception v3, NASNet, and Xception models, is utilized for feature extraction, with the Bayesian Optimization (BO) algorithm used as a hyperparameter optimizer. Moreover, the BROFF-FDGC model utilizes the stacked sparse autoencoder (SSAE) technique for classification purposes. Furthermore, the BRO technique is utilized for optimal hyperparameter tuning of the SSAE technique. The proposed BROFF-FDGC model was extensively validated through simulation, and the outcomes showcase the enriched performance of the model.
The remaining sections of the article are arranged as follows. Section 2 offers the literature review and Section 3 describes the proposed method. Then, Section 4 elaborates on the evaluation results and Section 5 concludes the work.
Majid et al. [12] examined an integrated DL architecture for fruit disease classification. At first, the researchers applied data augmentation, and two different kinds of features were derived. The first type comprised color and texture features, while the second type comprised DL features extracted through a pre-trained model; in general, a pre-trained model can be reused via Transfer Learning (TL). Both feature types were then fused through the maximal mean value of the serial technique, after which a harmonic threshold-based genetic algorithm was used to optimize the resultant fused vector.
Shah et al. [13] proposed a new computerized technique featuring Ant Colony Optimization (ACO)-based selection with the help of a DL technique. The presented technique had four basic steps: data augmentation to resolve the imbalanced dataset, fine-tuning of pre-trained DL models (MobileNet-V2 and NasNet Mobile), fusion of the derived deep features through matrix length, and selection of better features utilizing ACO together with a hybrid Neighbourhood Component Analysis (NCA) method. The best-selected features were then passed to multiple classifiers for final recognition. Mostafa et al. [14] leveraged a deep CNN (DCNN)-based data enhancement method that utilizes unsharp masking and color-histogram equalization to identify distinct guava plant species. In the presented technique, the data was first preprocessed and normalized. The study utilized five NN frameworks, namely ResNet-50, AlexNet, GoogLeNet, ResNet-101, and SqueezeNet, for identifying distinct guava plant species.
The authors in an earlier study [15] employed a two-phase DCNN methodology for citrus disease classification and plant disease detection by exploiting leaf images. The proposed methodology had two key stages: localization of the most affected regions using a Region Proposal Network (RPN), and classification of those regions to a particular disease using a classifier. In literature [16], the authors examined novel DL-based citrus disease recognition and classification models. An innovative DL-based AlexNet architecture was used for effectual disease detection. The Otsu method was executed for image segmentation, after which the AlexNet architecture was applied as a feature extractor. Afterwards, the Random Forest (RF) method was utilized for categorizing different types of citrus diseases. In addition, the Adaptive Gamma Correction (AGC) method was enforced to enhance the contrast of the citrus images. Nikhitha et al. [17] emphasized formulating a user-friendly device that can detect and grade disease levels. The Inception model utilized CNNs for the classification, and was retrained through the TL method. The presented mechanism even ranked the fruit depending on the infection percentage.
In literature [18], the author focused on grape diseases and presented a new structure for the identification and classification of selective diseases at the initial phase. A DL-based solution was embedded into a typical architecture for optimal performance. Three major stages were involved: (a) feature extraction after applying TL to pretrained deep models such as ResNet101 and AlexNet, (b) selection of the optimal features through the proposed Yager Entropy along with Kurtosis (YEaK) method, and (c) fusion of the stronger features using the presented approach, after which the features were passed to a classifier step utilizing the Least Squares Support Vector Machine (LS-SVM). Kejriwal et al. [19] intended to identify and detect foliar diseases with the help of apple leaf images. Involving a professional to detect the infections is not only time-consuming but also inefficient and expensive for large orchards. So, a technique was presented in which a group of three pre-trained DCNNs, namely InceptionResNetV2, ResNet101V2, and Xception, was used to categorize apple tree leaves as either infected or healthy across five disease classes.
In the current study, an innovative BROFF-FDGC method has been introduced for the identification and classification of fruit disease. The proposed BROFF-FDGC technique incorporates the following processes: BF-based noise removal, fusion-based feature extraction, BO-based hyperparameter tuning, SSAE classification, and BRO-based parameter optimization. Figure 1 represents the workflow of the BROFF-FDGC system.
The BF process is used to smoothen images while preserving fine details and edges [20]. BF is used in image processing to reduce noise and unwanted detail in an image while preserving its overall structure. BF replaces each pixel with a weighted average of its neighboring pixels, where each neighbor's weight is determined by both its spatial distance and its intensity difference from the center pixel. This enables the filter to preserve fine details and edges: a neighbor with a large spatial distance or a high intensity difference from the center pixel is given a lower weight, and so contributes less to the filtered value. BF is widely applied in medical imaging for smoothening images while preserving significant structures such as tissue boundaries and blood vessels.
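The weighting described above can be illustrated with a minimal NumPy sketch of a bilateral filter for a grayscale image. The function name, window radius, and sigma values below are illustrative choices, not the paper's implementation:

```python
import numpy as np

def bilateral_filter(img, radius=2, sigma_s=2.0, sigma_r=25.0):
    """Toy bilateral filter on a 2-D grayscale image.
    Each output pixel is a weighted mean of its neighbours, where the
    weight combines spatial distance (domain kernel) and intensity
    difference (range kernel), so edges are preserved while flat
    regions are smoothed."""
    H, W = img.shape
    padded = np.pad(img, radius, mode="edge")
    # Precompute the spatial (domain) Gaussian kernel once.
    ax = np.arange(-radius, radius + 1)
    xx, yy = np.meshgrid(ax, ax)
    spatial = np.exp(-(xx**2 + yy**2) / (2 * sigma_s**2))
    out = np.empty_like(img, dtype=float)
    for i in range(H):
        for j in range(W):
            patch = padded[i:i + 2*radius + 1, j:j + 2*radius + 1]
            # Range kernel: down-weight neighbours whose intensity
            # differs strongly from the centre pixel.
            rng_w = np.exp(-(patch - img[i, j])**2 / (2 * sigma_r**2))
            w = spatial * rng_w
            out[i, j] = (w * patch).sum() / w.sum()
    return out
```

Because a neighbor across a strong edge differs in intensity by far more than sigma_r, its weight is effectively zero, which is exactly why the edge survives the averaging.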
In this study, a fusion of DL architectures, namely the Inception v3, NASNet, and Xception models, is executed for the feature extraction process. Data fusion has been employed in a diverse range of CV and ML applications [21]. Feature fusion is a crucial task that incorporates multiple feature vectors; the present technique fuses the features and then applies the formulated entropy for selection.
The three feature vectors are defined as follows:
$f_{Inception\,1\times n}=\{Inception_{1\times 1},Inception_{1\times 2},Inception_{1\times 3},\cdots,Inception_{1\times n}\}$, (1)
$f_{NNet\,1\times m}=\{NNet_{1\times 1},NNet_{1\times 2},NNet_{1\times 3},\cdots,NNet_{1\times m}\}$, (2)
$f_{Xception\,1\times p}=\{Xception_{1\times 1},Xception_{1\times 2},Xception_{1\times 3},\cdots,Xception_{1\times p}\}$. (3)
Furthermore, the extracted features are combined into a single vector.
$Fused(features\ vector)_{1\times q}=\sum_{i=1}^{3}\{f_{Inception\,1\times n},f_{NNet\,1\times m},f_{Xception\,1\times p}\}$. (4)
Here, f refers to the fused vector (1 × 1186). Entropy is then applied to the fused vector to obtain the optimal Feature Selection (FS) outcome according to each feature's score. The FS technique can be expressed mathematically as shown in Eqs (5) and (6): entropy is used to select the 1186 highest-scoring features out of the 7835 fused features.
$B_{He}=-N\,He_b\sum_{i=1}^{n}p(f_i)$, (5)
$F_{select}=B_{He}(\max(f_i,1186))$. (6)
In these expressions, $p$ denotes the feature probability and $He$ signifies the entropy. The selected features are finally fed into the classifier to differentiate diseased from healthy images.
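The fusion and entropy-scored selection of Eqs (1)-(6) can be sketched as below. Since the exact scoring rule is not fully specified in the text, the per-feature entropy-style score used here is an illustrative stand-in, and the function names are hypothetical:

```python
import numpy as np

def fuse_features(f_inception, f_nasnet, f_xception):
    """Serial fusion (Eq (4)): concatenate the three deep feature
    vectors into one 1-D fused vector."""
    return np.concatenate([f_inception, f_nasnet, f_xception])

def entropy_select(features, k):
    """Illustrative entropy-scored selection: score each feature with
    an entropy-style term -p*log(p) of its normalised magnitude and
    keep the k highest-scoring entries (the paper keeps 1186 of 7835).
    This scoring is an assumption, not the paper's exact formula."""
    p = np.abs(features) / (np.abs(features).sum() + 1e-12)
    score = -p * np.log(p + 1e-12)
    idx = np.argsort(score)[::-1][:k]
    return features[np.sort(idx)]       # keep original ordering
```

A usage sketch with the dimensions quoted in the text: three vectors totaling 7835 entries are fused, then reduced to a 1 × 1186 selected vector.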
The InceptionV3 model outperforms the preceding Inception architectures through cost-effective computation [22]. The Inception module is the core component of the architecture: it enables deep networking and efficient computation through dimensionality reduction with stacked 1×1 convolutional layers. The key aim is to address computational cost and overfitting, among other problems. The fundamental idea behind the Inception module is to run filters of various dimensions in parallel rather than in series. The module places an additional 1×1 convolutional layer before the 3×3 and 5×5 convolutional layers, which makes the procedure robust and computationally inexpensive. The classifier part of the model, viz., the head, is replaced with dense layers sized 128×1 followed by 3×1 or 2×1 for ternary and binary classification, respectively. Afterwards, the model is fine-tuned on the input images to extract the best features. For training, the InceptionV3 model is fed input images sized 224×224×3; the input then passes through the successive Inception modules, which helps prevent overfitting and decreases the computational cost, before reaching the dense classification layers.
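The dimensionality-reduction trick at the heart of the Inception module, a 1×1 convolution that shrinks the channel depth before the expensive 3×3/5×5 filters, can be demonstrated in a few lines of NumPy. The shapes and channel counts below are illustrative, not InceptionV3's actual layer sizes:

```python
import numpy as np

def conv1x1(x, w):
    """1x1 convolution: a per-pixel linear map across channels.
    x: (H, W, C_in) feature map; w: (C_in, C_out) filter bank.
    Spatial size is untouched; only the channel depth changes, which
    is how the Inception module cuts the cost of the 3x3 and 5x5
    convolutions that follow it."""
    return np.tensordot(x, w, axes=([2], [0]))
```

For a 28×28×256 map mapped to 64 channels, a direct 3×3 convolution costs 28·28·3·3·256·64 multiply-adds, while a 1×1 reduction to 64 channels followed by the 3×3 convolution costs 28·28·256·64 + 28·28·3·3·64·64, roughly a 2.8× saving.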
NASNet is a framework created with the help of a neural architecture search method [23]. The search system is called Neural Architecture Search (NAS) because it exploits a controller NN to discover a better CNN structure for the given data. The NASNet variant exploited here (as in InstaCovNet19) was originally designed for the ImageNet dataset. The design uses two classes of convolutional cells, viz., the normal cell and the reduction cell. In particular, NASNet was trained on the ImageNet dataset, which comprises images from every walk of life. In this case, a pre-trained NASNet infrastructure is adopted, since the lack of a huge dataset necessitates the use of a pre-trained module. During fine-tuning, NASNet is provided with input images sized 224×224×3. The input images then pass through various reduction and normal cells that extract better features. Finally, the attained features are fed to two dense layers sized 128×1 and 3×1 for classification.
Xception stands for 'extreme Inception' and comprises 36 convolutional layers apart from the FC layer. Like the MobileNet model, Xception contains depthwise convolutional layers, and it includes 'shortcuts', in which the output of a certain layer is added to the output of the preceding layers. Unlike InceptionV3, the Xception model first maps the spatial correlations for every channel independently, and then applies a 1×1 pointwise convolution to capture the cross-channel correlations. The Xception model surpasses InceptionV3 on ImageNet classification. In this case, a pre-trained Xception module (trained on ImageNet) is used, again owing to the lack of a huge detection database. The classifier head of the architecture is replaced with dense layers sized 128×1 and 2×1 for binary classification, or 128×1 and 3×1 for ternary classification, respectively.
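The depthwise-then-pointwise factorization that defines Xception can be sketched in NumPy as follows. The tensor shapes are illustrative; note how few parameters the factorized form needs compared with a full 3×3 convolution over all channel pairs:

```python
import numpy as np

def depthwise_separable_conv(x, dw, pw):
    """Depthwise separable convolution, the building block of Xception:
    a per-channel 3x3 depthwise filter captures spatial structure for
    each channel independently, then a 1x1 pointwise filter mixes the
    channels. x: (H, W, C_in); dw: (3, 3, C_in); pw: (C_in, C_out)."""
    H, W, C = x.shape
    pad = np.pad(x, ((1, 1), (1, 1), (0, 0)))   # 'same' padding
    spatial = np.zeros_like(x, dtype=float)
    for i in range(3):
        for j in range(3):
            # Each channel is filtered only by its own 3x3 weights.
            spatial += pad[i:i + H, j:j + W, :] * dw[i, j, :]
    # Pointwise 1x1 convolution mixes channels.
    return np.tensordot(spatial, pw, axes=([2], [0]))
```

For 16 input and 32 output channels, the factorized form needs 3·3·16 + 16·32 = 656 weights versus 3·3·16·32 = 4608 for a full convolution, which is the source of Xception's efficiency.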
The BO technique is exploited for the optimal hyperparameter adjustment process. BO functions by constructing a posterior distribution of the objective function, viz., a Gaussian process, that best describes the function to be optimized [24]. As the number of observations increases, the posterior sharpens, and it becomes more apparent which regions of the parameter space are worth exploring. The BO technique comprises two key elements: a statistical model of the objective function and an acquisition function for determining the next sample point. The acquisition function proposes sample points in the search region and drives the tradeoff between exploration and exploitation: exploitation samples points where the statistical model predicts a high objective score (a large posterior mean), while exploration samples points where the forecast uncertainty σ(x) is high. The objective is first estimated on an initial space-filling design, frequently containing randomly selected points, and the remainder of the budget of N function evaluations is then allocated iteratively, as demonstrated in Algorithm 1. For the BO algorithm, the time complexity is O(n³), where n represents the observation count. The time complexity for DL is O(w·m·e), where w represents the weight count, m indicates the number of learning instances, and e indicates the number of epochs. In this study, hyperparameter tuning was conducted based on the attention module: the filter counts exploited in every attention layer can be enhanced, and the optimization technique determines a better combination of filter numbers for every attention layer.
Algorithm 1. Bayesian optimization algorithm.
Place a Gaussian process prior on f.
Observe f at n0 points based on a space-filling experimental design.
Set n = n0.
While n ≤ N do
Update the posterior probability distribution on f using all available data;
Let xn be a maximizer of the acquisition function over x, where the acquisition function is computed using the current posterior distribution;
Observe yn = f(xn);
Increment n;
End
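A minimal, self-contained sketch of Algorithm 1 is given below, using a small Gaussian-process posterior with an RBF kernel and an upper-confidence-bound acquisition. The kernel length-scale, jitter, grid resolution, and UCB form are illustrative choices, not the study's exact configuration:

```python
import numpy as np

def rbf(a, b, ls=0.2):
    """Squared-exponential kernel between two 1-D point sets."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / ls) ** 2)

def bayes_opt(f, bounds, n0=4, N=15, noise=1e-4):
    """1-D Bayesian optimisation in the spirit of Algorithm 1:
    observe f at n0 space-filling points, then repeatedly refit the
    GP posterior and evaluate f at the maximiser of an upper-
    confidence-bound acquisition (posterior mean + 2 * sd)."""
    lo, hi = bounds
    X = np.linspace(lo, hi, n0)                  # space-filling design
    y = np.array([f(x) for x in X])
    grid = np.linspace(lo, hi, 200)              # candidate points
    for _ in range(N - n0):
        K = rbf(X, X) + noise * np.eye(len(X))   # posterior update
        Ks = rbf(grid, X)
        mu = Ks @ np.linalg.solve(K, y)
        var = 1.0 - np.einsum('ij,ji->i', Ks, np.linalg.solve(K, Ks.T))
        ucb = mu + 2.0 * np.sqrt(np.clip(var, 0.0, None))
        xn = grid[int(np.argmax(ucb))]           # acquisition maximiser
        X = np.append(X, xn)
        y = np.append(y, f(xn))                  # observe y_n = f(x_n)
    i = int(np.argmax(y))
    return X[i], y[i]
```

Each loop iteration is one pass of the While body in Algorithm 1: the O(n³) cost quoted in the text comes from the linear solve against the n×n kernel matrix K.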
In order to identify the fruit diseases, the SSAE classifier is used. An AE is an NN with multiple Hidden Layers (HLs) used for unsupervised feature learning [25]. The basic design applies one or more NN layers to map the input dataset to the output vectors. Furthermore, the AE is applied to reduce the data feature dimensionality and can characterize both linear and non-linear transformations. The unlabeled input vector is subjected to weighted mapping to obtain the HL output vector, as formulated below:
$y_i=f_\theta(x_j)=S\left(\sum_{j=1}^{N}W_{ij}x_j+b_i\right)$. (7)
Here, $y_i$ is the activation value of the HL, $W_{ij}$ indicates the weight coefficient, $b_i$ represents the offset vector of the HL, and $S(\cdot)$ is the activation function. The reconstruction error between the input and the output is measured as:
$L(x_i,y_i)=\frac{1}{2}\|x_i-y_i\|^2$. (8)
The weight parameters from the input layer to the HL are θ = {W, b}, and the weight parameters from the HL to the output layer are represented as θ′ = {W′, b′}. Figure 2 shows the architecture of the SSAE model.
Once a certain number of neurons is acquired in the HL, they are treated as features so as to reduce the data dimensionality. Because of the large number of neurons in the HL, a sparsity constraint is added when training the network to extract valuable features.
$J_S=J+\beta\sum_{j=1}^{s_2}KL(\rho\|\hat{\rho}_j)$, (9)
here, ρ indicates the sparsity parameter; $s_2$ denotes the number of neurons in the HL; β characterizes the penalty factor that controls sparsity; and $KL(\rho\|\hat{\rho}_j)$ indicates the difference between ρ and $\hat{\rho}_j$, the average activation of the j-th hidden neuron.
Typically, a single SAE is not sufficient for training. Therefore, the current study adopts a stacking technique in which every HL is separately trained with unsupervised SAE learning; the trained layers are then linked to produce a stacked network, and the extracted feature vectors serve as the input to the SSAE.
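The sparse cost of Eqs (7)-(9) can be sketched for a single AE as follows. The layer sizes, the sigmoid activation, and the β and ρ values are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sae_forward(X, W1, b1, W2, b2):
    """Forward pass of one sparse AE: encode to the hidden layer
    (Eq (7)), decode back, and return hidden activations plus the
    reconstruction."""
    H = sigmoid(X @ W1 + b1)        # y_i = S(sum_j W_ij x_j + b_i)
    Xr = sigmoid(H @ W2 + b2)       # decoder with parameters theta'
    return H, Xr

def sae_cost(X, Xr, H, beta=3.0, rho=0.05):
    """Sparse cost J_S (Eq (9)): mean squared reconstruction error
    (Eq (8)) plus a KL(rho || rho_hat_j) penalty that pushes the
    average activation of every hidden unit towards the target rho."""
    J = 0.5 * np.mean(np.sum((X - Xr) ** 2, axis=1))
    rho_hat = H.mean(axis=0)        # per-unit average activation
    kl = (rho * np.log(rho / rho_hat)
          + (1 - rho) * np.log((1 - rho) / (1 - rho_hat)))
    return J + beta * kl.sum()
```

The KL term vanishes when the average activation of every hidden unit equals ρ and grows as the units become more active, which is what forces the sparse code.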
To raise the classification effectiveness of the SSAE system, the BRO technique is utilized in the current study. Rahkar Farshi (2020) introduced the BRO algorithm based on battle royale video games, in which a player must exploit and explore the surroundings to survive [26]. Every player outside the safe region takes damage and is exposed to the risk of being eliminated from the battle.
The initial population of the BRO technique is made up of n randomly generated individuals in a D-dimensional region. $x_i^t$ (i ∈ [1, n]) and $x_{i,dam}^t$ represent the location and the damage level of the i-th player in the t-th iteration, respectively. In each pairwise encounter, the player in the better location is termed the winner; it stays in its original location and its damage level is fixed at 0. The player in the worse position gets damaged during the mutual attack and is named the loser. The loser's damage level is updated as demonstrated in Eq (10). Furthermore, based on the global best location and its present location, the loser's position is updated as demonstrated in Eq (11), so that it can defend itself and attack the enemy from another side.
$x_{i,dam}^{t+1}=x_{i,dam}^{t}+1$, (10)
$x_{i}^{t+1}=x_{i}^{t}+r(x_{best}-x_{i}^{t})$. (11)
In this expression, r indicates a random coefficient (r ∈ [0, 1]). Once a player becomes the winner, its damage level returns to 0. In order to emphasize exploration, once a player's cumulative damage level exceeds the predetermined threshold (thre = 3), the player dies and respawns randomly within the current safe region, with its damage level reset to 0, as given below.
$x_{i}^{t+1}=r(ub_d-lb_d)+lb_d$. (12)
In Eq (12), $lb_d$ and $ub_d$ denote the lower and upper boundaries of the safe region. Furthermore, during the iteration procedure, once the iteration count exceeds the area-update threshold (Δ), the safe region shrinks around the global best location as its center; the lower and upper limits are demonstrated as follows:
$lb_d=x_{best}-SD(\bar{x}_d)$, (13)
$ub_d=x_{best}+SD(\bar{x}_d)$, (14)
here, $SD(\bar{x}_d)$ denotes the standard deviation of the population in dimension d and $x_{best}$ indicates the current optimum solution. If $lb_d$ or $ub_d$ exceeds the corresponding boundary of the solution space, it is reset to that boundary. The threshold Δ is then enlarged as given below.
$\Delta=\Delta+\mathrm{round}(\Delta/2)$. (15)
The initial value of Δ is derived from $\log_{10}(T_{max})$, where $T_{max}$ represents the maximal iteration count. The pseudocode of the BRO algorithm is shown in Algorithm 2.
Algorithm 2. Pseudocode of the BRO algorithm.
Start
Randomly initialize the population ($x_n$) and each parameter;
Shrink = ceil(log10($T_{max}$));
Δ = round($T_{max}$ / Shrink);
For t = 1 : $T_{max}$
For each player $x_i^t$, find the nearest player ($x_j^t$) by computing the Euclidean distance;
d = i; v = j;
If $f(x_i^t) < f(x_j^t)$
d = j; v = i;
End if
If $x_{d,dam}^t$ < thre
Update the damage level and the location of the loser;
Else
The loser respawns in the current safe region;
$x_{d,dam}^t$ = 0;
End if
Re-evaluate the fitness value of $x_d^t$;
$x_{v,dam}^t$ = 0;
If t ≥ Δ
Update $ub_d$ and $lb_d$;
Update the threshold (Δ);
End if
If $lb_d$ or $ub_d$ exceeds the lower or upper limit of the solution space, reset it to the original $lb_d$ or $ub_d$;
Record the best individual and its fitness value.
End
The BRO algorithm employs a Fitness Function (FF) to assess the effectiveness of the classifier, returning a positive value that represents the quality of each candidate solution. Here, the FF corresponds to the classifier error rate, which is to be minimized:
$fitness(x_i)=ClassifierErrorRate(x_i)=\dfrac{\text{No. of misclassified instances}}{\text{Total no. of instances}}\times 100$. (16)
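Under these definitions, Algorithm 2 can be sketched as a runnable function. The fitness argument is any callable to be minimized, such as the error rate of Eq (16); the population size, iteration budget, and the clipping of the shrinking safe region to the solution-space bounds are illustrative choices:

```python
import numpy as np

def bro_minimize(fitness, dim, lb, ub, n=20, T=100, thre=3, seed=0):
    """Illustrative Battle Royale Optimizer following Algorithm 2."""
    rng = np.random.default_rng(seed)
    lbd = np.full(dim, float(lb)); ubd = np.full(dim, float(ub))
    X = rng.uniform(lbd, ubd, size=(n, dim))
    dam = np.zeros(n, dtype=int)
    fit = np.apply_along_axis(fitness, 1, X)
    best = X[np.argmin(fit)].copy(); fbest = float(fit.min())
    shrink = int(np.ceil(np.log10(T)))
    delta = round(T / shrink)
    for t in range(1, T + 1):
        for i in range(n):
            dists = np.linalg.norm(X - X[i], axis=1)
            dists[i] = np.inf
            j = int(np.argmin(dists))            # nearest opponent
            d, v = (i, j) if fit[i] > fit[j] else (j, i)   # d = loser
            if dam[d] < thre:
                dam[d] += 1                                 # Eq (10)
                r = rng.random(dim)
                X[d] = X[d] + r * (best - X[d])             # Eq (11)
            else:                                           # respawn
                X[d] = rng.random(dim) * (ubd - lbd) + lbd  # Eq (12)
                dam[d] = 0
            dam[v] = 0                            # winner damage reset
            fit[d] = fitness(X[d])
            if fit[d] < fbest:
                fbest = float(fit[d]); best = X[d].copy()
        if t >= delta:
            sd = X.std(axis=0)
            lbd = np.clip(best - sd, lb, ub)                # Eq (13)
            ubd = np.clip(best + sd, lb, ub)                # Eq (14)
            delta += round(delta / 2)                       # Eq (15)
    return best, fbest
```

On a toy problem such as the 2-D sphere function, the population quickly collapses around the global best as the safe region shrinks, mirroring the closing play area of the game.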
The performance of the proposed BROFF-FDGC system, in terms of FDD, was validated using the CASC IFW Database [27]. Table 1 provides the details of the dataset and Figure 3 shows some sample images. The images were collected from the Internal Feeding Worm (IFW) dataset of the Comprehensive Automation for Specialty Crops (CASC) research project after obtaining prior consent from Purdue University. The dataset contains images of four apple cultivars: Golden Delicious, Fuji, York, and Red Delicious. Every cultivar has images of defective and healthy apples in different phases of ripening. The images of damaged apples show noticeable dark spots on the outer skin caused by internal feeding worms. Individual apple images were cropped to 120 × 120 pixels for image processing.
Class | No. of Instances |
NON-HEALTHY | 3800 |
HEALTHY | 2058 |
Total No. of Instances | 5858 |
The confusion matrices generated by the BROFF-FDGC methodology on the FDD process are shown in Figure 4. The simulation values specify that the BROFF-FDGC approach correctly classified the healthy and damaged apples.
Table 2 offers a brief overview of the FDD results accomplished by the BROFF-FDGC methodology with an 80:20 split of the training (TR) phase and testing (TS) phase. Figure 5 exhibits the overall FDD performance of the BROFF-FDGC system on 80% of the TR phase. The simulation values indicate that the BROFF-FDGC technique classified non-healthy and healthy apple images properly. In addition, it is observed that the BROFF-FDGC technique attained an average accu_bal of 96.93%, sens_y of 96.93%, spec_y of 96.93%, F_score of 96.87%, and an MCC of 93.73%.
Class | Accu_bal | Sens_y | Spec_y | F_score | MCC
Training Phase (80%) | |||||
NON-HEALTHY | 97.63 | 97.63 | 96.23 | 97.79 | 93.73 |
HEALTHY | 96.23 | 96.23 | 97.63 | 95.94 | 93.73 |
Average | 96.93 | 96.93 | 96.93 | 96.87 | 93.73 |
Testing Phase (20%) | |||||
NON-HEALTHY | 98.68 | 98.68 | 96.13 | 98.29 | 95.13
HEALTHY | 96.13 | 96.13 | 98.68 | 96.83 | 95.13 |
Average | 97.40 | 97.40 | 97.40 | 97.56 | 95.13 |
Figure 6 demonstrates the overall FDD examination outcomes achieved by the BROFF-FDGC technique on 20% of the TS set. The results point out that the BROFF-FDGC model can recognize non-healthy and healthy apple images accurately. Furthermore, it is also observed that the BROFF-FDGC technique accomplished an average accu_bal of 97.40%, sens_y of 97.40%, spec_y of 97.40%, F_score of 97.56%, and an MCC of 95.13%.
Table 3 shows the comprehensive FDD analytical outcomes attained by the BROFF-FDGC technique with a 70:30 split of the TR set and TS set. Figure 7 shows the overall FDD analysis outcomes of the BROFF-FDGC system on 70% of the TR set. The obtained outcomes infer that the BROFF-FDGC technique identified non-healthy and healthy apple images accurately. Furthermore, it is also noted that the BROFF-FDGC approach accomplished an average accu_bal of 98.02%, sens_y of 98.02%, spec_y of 98.02%, F_score of 97.93%, and an MCC of 95.87%.
Class | Accu_bal | Sens_y | Spec_y | F_score | MCC
Training Phase (70%) | |||||
NON-HEALTHY | 98.36 | 98.36 | 97.68 | 98.56 | 95.87 |
HEALTHY | 97.68 | 97.68 | 98.36 | 97.30 | 95.87 |
Average | 98.02 | 98.02 | 98.02 | 97.93 | 95.87 |
Testing Phase (30%) | |||||
NON-HEALTHY | 98.40 | 98.40 | 98.58 | 98.79 | 96.69 |
HEALTHY | 98.58 | 98.58 | 98.40 | 97.89 | 96.69 |
Average | 98.49 | 98.49 | 98.49 | 98.34 | 96.69 |
Figure 8 shows the overall FDD examination outcomes produced by the BROFF-FDGC method on 30% of the TS set. The outcomes indicate that the BROFF-FDGC method detected non-healthy as well as healthy apple images appropriately. Besides, it is observed that the BROFF-FDGC methodology accomplished an average accu_bal of 98.49%, sens_y of 98.49%, spec_y of 98.49%, F_score of 98.34%, and an MCC of 96.69%.
The training accuracy (TRAC) and validation accuracy (VDAC) curves of the BROFF-FDGC system for the FDD task are shown in Figure 9. The outcomes exhibit that the BROFF-FDGC method produced consistently high TRAC and VDAC values, with the TRAC reaching the highest level.
The training loss (TRLS) and validation loss (VDLS) curves of the BROFF-FDGC technique for the FDD task are shown in Figure 10. The outcomes infer that the BROFF-FDGC system achieved better results with low TRLS and VDLS values, attaining the least VDLS over the course of training.
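A common way to act on such curves is to checkpoint the model at the epoch where the validation loss (VDLS) bottoms out. A minimal sketch; the per-epoch history values here are hypothetical, not the paper's actual training record:

```python
def best_epoch(history):
    """Return the epoch number whose validation loss (vdls) is lowest."""
    return min(history, key=lambda h: h["vdls"])["epoch"]

# Hypothetical history as a list of per-epoch records
history = [
    {"epoch": 1, "trac": 0.90, "vdac": 0.88, "trls": 0.40, "vdls": 0.45},
    {"epoch": 2, "trac": 0.95, "vdac": 0.93, "trls": 0.22, "vdls": 0.27},
    {"epoch": 3, "trac": 0.97, "vdac": 0.94, "trls": 0.15, "vdls": 0.21},
]
```

Selecting by the minimum VDLS (rather than the maximum TRAC) guards against keeping an overfitted checkpoint.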
Figure 11 shows the precision-recall (PR) curve plotted from the values produced by the BROFF-FDGC system on the test database. The outcomes denote that the BROFF-FDGC method attains improved PR performance on both classes.
A comprehensive ROC inspection of the BROFF-FDGC technique on the test database is shown in Figure 12. The simulation values imply that the BROFF-FDGC methodology discriminates well between the two classes.
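The area under the ROC curve can also be computed without plotting at all: it equals the Mann-Whitney probability that a randomly chosen positive sample is scored above a randomly chosen negative one. A short sketch; the labels and scores below are illustrative, not taken from the experiments:

```python
def roc_auc(labels, scores):
    """ROC AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

This O(P·N) pairwise form is fine for a sketch; library implementations sort the scores once instead.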
Table 4 compares the BROFF-FDGC system with recent models [28]. Figure 13 exhibits the comparative results achieved by the BROFF-FDGC approach in terms of accuracy (accuy). The simulation values indicate that the BROFF-FDGC technique yielded the maximum accuy of 98.49%, while the ResNet-50, DenseNet-121, NASNetA, EfficientNetB0, EfficientNetB1, and EfficientNetB2 models attained lower accuy values of 91.94%, 96.96%, 96.79%, 96.53%, 98.06%, and 98.15%, respectively.
Methods | Sensy | Specy | Accuy |
BROFF-FDGC | 98.49 | 98.49 | 98.49 |
ResNet-50 | 86.65 | 86.53 | 91.94 |
DenseNet-121 | 99.01 | 97.54 | 96.96 |
NASNetA | 97.39 | 98.26 | 96.79 |
EfficientNetB0 | 97.71 | 98.59 | 96.53 |
EfficientNetB1 | 97.69 | 99.91 | 98.06 |
EfficientNetB2 | 97.88 | 98.34 | 98.15 |
Figure 14 exhibits the comparative results of the BROFF-FDGC methodology in terms of sensy and specy. With regard to sensy, the BROFF-FDGC technique produced a sensy of 98.49%, whereas the ResNet-50, DenseNet-121, NASNetA, EfficientNetB0, EfficientNetB1, and EfficientNetB2 techniques achieved sensy values of 86.65%, 99.01%, 97.39%, 97.71%, 97.69%, and 97.88%, respectively.
Similarly, in terms of specy, the BROFF-FDGC method accomplished a specy of 98.49%, while the ResNet-50, DenseNet-121, NASNetA, EfficientNetB0, EfficientNetB1, and EfficientNetB2 techniques achieved specy values of 86.53%, 97.54%, 98.26%, 98.59%, 99.91%, and 98.34%, respectively. Taken together with its highest accuy, the BROFF-FDGC technique offers the best overall balance of sensy, specy, and accuy for FDD among the compared DL models.
In the current study, a novel BROFF-FDGC methodology was presented for the detection and classification of fruit diseases. The proposed method integrates BF-based noise removal, fusion-based feature extraction, BO-based hyperparameter tuning, SSAE classification, and BRO-based parameter optimization. A fusion of DL models, namely Inception v3, NASNet, and Xception, is used for feature extraction, and the BRO algorithm performs the hyperparameter tuning of the SSAE classifier. The BROFF-FDGC algorithm was extensively validated through simulation, and the obtained outcomes demonstrated its superior performance compared with the other methods. In the future, hybrid metaheuristic optimization algorithms can be designed to further improve FDD performance.
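As a rough illustration of the fusion step named above: each backbone (Inception v3, NASNet, Xception) produces an embedding vector, and the embeddings are joined into a single feature vector for the downstream classifier. The per-vector L2 normalization used here is an assumption of this sketch, not a detail given in the text:

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit L2 norm (zero vectors pass through unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def fuse_features(*embeddings):
    # Normalizing each backbone's embedding first keeps any single
    # model from dominating the fused vector by scale alone.
    fused = []
    for vec in embeddings:
        fused.extend(l2_normalize(vec))
    return fused
```

The fused vector's length is simply the sum of the backbone embedding lengths, so the classifier input size is fixed once the backbones are chosen.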
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
The authors declare that they have no conflicts of interest. The manuscript was written through the contributions of all authors. All authors have approved the final version of the manuscript.
Class | No. of Instances |
NON-HEALTHY | 3800 |
HEALTHY | 2058 |
Total No. of Instances | 5858 |