With the abundance of raw data generated from various sources including social networks, big data has become essential in acquiring, processing, and analyzing heterogeneous data from multiple sources for real-time applications. In this paper, we propose a big data framework suitable for pre‑processing and classification of image as well as text analytics by employing two key workflows, called big data (BD) pipeline and machine learning (ML) pipeline. Our unique end-to-end workflow integrates data cleansing, data integration, data transformation and data reduction processes, followed by various analytics using suitable machine learning techniques. Further, our model is the first of its kind to augment facial recognition with sentiment analysis in a distributed big data framework. The implementation of our model uses state-of-the-art distributed technologies to ingest, prepare, process and analyze big data for generating actionable data insights by employing relevant ML algorithms such as k-NN, logistic regression and decision tree. In addition, we demonstrate the application of our big data framework to facial recognition system using open sources by developing a prototype as a use case. We also employ sentiment analysis on non-repetitive semi structured public data (text) such as user comments, image tagging, and other information associated with the facial images. We believe our work provides a novel approach to intersect Big Data, ML and Face Recognition and would create new research to alleviate some of the challenges associated with big data processing in real world applications.
Citation: Suriya Priya R Asaithambi, Sitalakshmi Venkatraman, Ramanathan Venkatraman. Proposed big data architecture for facial recognition using machine learning[J]. AIMS Electronics and Electrical Engineering, 2021, 5(1): 68-92. doi: 10.3934/electreng.2021005
[1] | Yaw Chang, Wei Feng, Michael Freeze, Xin Lu, Charles Smith . Elimination, permanence, and exclusion in a competition model under Allee effects. AIMS Mathematics, 2023, 8(4): 7787-7805. doi: 10.3934/math.2023391 |
[2] | Guillaume Cantin, Cristiana J. Silva . Complex network near-synchronization for non-identical predator-prey systems. AIMS Mathematics, 2022, 7(11): 19975-19997. doi: 10.3934/math.20221093 |
[3] | Haiping Pan, Yiqiu Mao . Transition and bifurcation analysis for chemotactic systems with double eigenvalue crossings. AIMS Mathematics, 2023, 8(10): 24681-24698. doi: 10.3934/math.20231258 |
[4] | Lini Fang, N'gbo N'gbo, Yonghui Xia . Almost periodic solutions of a discrete Lotka-Volterra model via exponential dichotomy theory. AIMS Mathematics, 2022, 7(3): 3788-3801. doi: 10.3934/math.2022210 |
[5] | M. Adel, M. M. Khader, Hijaz Ahmad, T. A. Assiri . Approximate analytical solutions for the blood ethanol concentration system and predator-prey equations by using variational iteration method. AIMS Mathematics, 2023, 8(8): 19083-19096. doi: 10.3934/math.2023974 |
[6] | Yinyin Wu, Fanfan Chen, Qingchi Ma, Dingbian Qian . Subharmonic solutions for degenerate periodic systems of Lotka-Volterra type with impulsive effects. AIMS Mathematics, 2023, 8(9): 20080-20096. doi: 10.3934/math.20231023 |
[7] | Nazmul Sk, Bapin Mondal, Abhijit Sarkar, Shyam Sundar Santra, Dumitru Baleanu, Mohamed Altanji . Chaos emergence and dissipation in a three-species food web model with intraguild predation and cooperative hunting. AIMS Mathematics, 2024, 9(1): 1023-1045. doi: 10.3934/math.2024051 |
[8] | Xuechao Zhang, Yuhan Hu, Shichang Lu, Haomiao Guo, Xiaoyan Wei, Jun li . Dynamic analysis and control of online information dissemination model considering information beneficiaries. AIMS Mathematics, 2025, 10(3): 4992-5020. doi: 10.3934/math.2025229 |
[9] | Li Wang, Hui Zhang, Suying Liu . On the existence of almost periodic solutions of impulsive non-autonomous Lotka-Volterra predator-prey system with harvesting terms. AIMS Mathematics, 2022, 7(1): 925-938. doi: 10.3934/math.2022055 |
[10] | Iqra Batool, Naim Bajcinca . Stability analysis of a multiscale model including cell-cycle dynamics and populations of quiescent and proliferating cells. AIMS Mathematics, 2023, 8(5): 12342-12372. doi: 10.3934/math.2023621 |
With the abundance of raw data generated from various sources including social networks, big data has become essential in acquiring, processing, and analyzing heterogeneous data from multiple sources for real-time applications. In this paper, we propose a big data framework suitable for pre‑processing and classification of image as well as text analytics by employing two key workflows, called big data (BD) pipeline and machine learning (ML) pipeline. Our unique end-to-end workflow integrates data cleansing, data integration, data transformation and data reduction processes, followed by various analytics using suitable machine learning techniques. Further, our model is the first of its kind to augment facial recognition with sentiment analysis in a distributed big data framework. The implementation of our model uses state-of-the-art distributed technologies to ingest, prepare, process and analyze big data for generating actionable data insights by employing relevant ML algorithms such as k-NN, logistic regression and decision tree. In addition, we demonstrate the application of our big data framework to facial recognition system using open sources by developing a prototype as a use case. We also employ sentiment analysis on non-repetitive semi structured public data (text) such as user comments, image tagging, and other information associated with the facial images. We believe our work provides a novel approach to intersect Big Data, ML and Face Recognition and would create new research to alleviate some of the challenges associated with big data processing in real world applications.
In the United States of America, human papillomavirus (HPV) is the most common sexually transmitted infection (STI) in males and females [8]. Most sexually active males and females will get at least one type of HPV infection at some point in their lives [5]. In the United States, about 79 million are currently infected with HPV and about 14 million people become newly infected each year [8]. There are more than 150 different types of HPV [7]. Health problems related to HPV include genital warts and cancer. Most people infected with genital HPV do not know they are infected and never develop symptoms or health problems from it. Some people find out they have HPV when they get genital warts. Females may find out they have HPV when they get an abnormal Pap test result during cervical cancer screening. Others may only find out once they have developed more serious problems from HPV, such as cancer [5]. Most HPV infections cause no symptoms and are not clinically significant, but persistent infection can lead to disease or cancer.
Mathematical epidemic models have been used to study HPV infections in various populations. For example, Alsaleh and Gummel [1] in a recent paper, used a deterministic model to assess the impact of vaccination on both high-risk and low-risk HPV infection types. Ribassin-Majed and Clemencon [14] used a deterministic mathematical model to assess the impact of vaccination on non-cancer causing HPV (6/11) in French males and females. Lee and Tameru [13] used a deterministic model to assess the impact of HPV on cervical cancer in African American females (AAF). In all these studies, the HPV models have a constant recruitment function for the demographic equations.
In this paper, we use a two-sex HPV model with "fitted" logistic demographics to study the HPV disease dynamics in AAF and African American males (AAM) of 16 years and older. Using US Census Bureau data for AAF and AAM populations, we illustrate that the "fitted" logistic demographic equation captures the African American (AA) population better than the constant recruitment demographic equation. We compute the basic reproduction number,
The paper is organized as follows: In Section 2, we introduce a demographic equation for AAF (respectively, AAM) and we "fit" it to the US Census Bureau data of AAF (respectively, AAM) of 16 years and older. We introduce, in Section 3, a two-sex African American HPV model. In Section 4, we study disease-free equilibria and compute the basic reproduction number
In [14], Ribassin-Majed et al. used a HPV model with constant recruitment rate in the demographic equation to study HPV disease dynamics in male and female populations of France. In the absence of the HPV disease, the demographic equation of their model is the following ordinary differential equation:
dNdt=Λ−μN, | (1) |
where
In this paper, we use logistic models that are "fitted" to the
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAF population 16 years and older [12,16] | 13,825,055 | 14,041,520 | 14,259,413 | 14,473,927 | 14,707,490 | 14,952,963 | 15,224,330 | 15,486,244 | 15,743,096 | 15,992,822 | 16,176,048 | 16,471,449 | 16,696,303 | 16,918,225 | 17,139,986 |
AAF total population [12,16] | 18,787,192 | 19,013,351 | 19,229,855 | 19,434,349 | 19,653,829 | 19,882,081 | 20,123,789 | 20,374,894 | 20,626,043 | 20,868,282 | 21,045,595 | 21,320,013 | 21,543,051 | 21,767,521 | 21,988,307 |
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAM population 16 years and older [12,16] | 11,909,507 | 12,124,810 | 12,332,791 | 12,518,252 | 12,756,370 | 12,996,123 | 13,266,163 | 13,517,841 | 13,765,707 | 14,006,594 | 14,181,655 | 14,490,027 | 14,724,637 | 14,950,933 | 15,176,189 |
AAM total population [12,16] | 17,027,514 | 17,249,678 | 17,454,795 | 17,631,747 | 17,856,753 | 18,079,607 | 18,319,259 | 18,560,639 | 18,803,371 | 19,033,988 | 19,260,298 | 19,487,042 | 19,719,238 | 19,945,997 | 20,169,931 |
From Tables 1 and 2, we note that both male and female populations of African Americans of 16 years and older as well as the total populations exhibit increasing trends from 2000 to 2014. In the next section, we use a logistic differential equation model to "capture" the AA population data of Tables 1 and 2.
In [3], Brauer and Castillo-Chavez used the logistic equation "fitted" to United States Census Bureau data to model the total United States population. We use the same approach to "fit" the solution of the following logistic equation to the AAF population of 16 years and older (see Table 1) and the AAM population of 16 years and older (see Table 2).
dNidt=(ri−μi)Ni(1−NiKi(ri−μi)/ri),t≥0, | (2) |
where index
Ni(t)=0andNi(t)=Ki(ri−μi)/ri1+(Ki(ri−μi)/riN0i−1)e−(ri−μi)t. | (3) |
Let
Rdi=riμi. |
K∗i=Ki(ri−μi)ri,ast→∞, |
and the population persists. However, when
Equation (2) gives the per capita growth rate,
dNi/dtNi=(ri−μi)(1−NiKi(ri−μi)/ri). | (4) |
Using the
Fitting the line to the curve gives
Using our estimates, we express the nontrivial solution (3) of the logistic growth model for AAF of 16 years and older as
Nf(t)=52,967,1171+2.772176873e−0.021298978t, | (5) |
and for AAM of 16 years and older as
Nm(t)=99,484,6731+7.205050059e−0.019877926t, | (6) |
where year 2001 is taken as
The plot of the data of AAF of 16 years or older and solution (5) in Figure 1 show that our "fitted" model captures the AAF data of Table 1. Similarly, the plot of the data of AAM of 16 years or older and solution (6) in Figure 2 show that our "fitted" model captures the AAM data of Table 2.
When the population in equation (1) consists only of AAF (
Ni(t)=Λiμi+(N0i−Λiμi)e−μit. | (7) |
Using the initial condition
Nf(t)=2,315,038,811−2,300,997,291e−0.007266t, | (8) |
and
Nm(t)=1,887,409,019−1,875,284,209e−0.008227t. | (9) |
In Figure 1, we compare the "fitted" solution (8) of Model (1), and our "fitted" solution (5) of Model (2), to the US Census Bureau data in Table 1. Figure 1 shows that Model (2), the "fitted" logistic model, captures better the
Similarly, Figure 2 shows that, as in the female population, the "fitted" solution (9) of Model (1) over estimates the AAM US Census Bureau data while our "fitted" solution (5) of Model (2) captures it.
To study the HPV dynamics in male and female African American populations of 16 years and older, we assume that the total AAF population (respectively, total AAM population) of 16 years and older is governed by Model (2) with
{dSfdt=rfNf(1−NfKf)−σfSfImNf+Nm+δIf−μfSf,dSmdt=rmNm(1−NmKm)−σmSmIfNf+Nm+δIm−μmSm,dIfdt=σfSfImNf+Nm−(δ+μf)If,dImdt=σmSmIfNf+Nm−(δ+μm)Im, | (10) |
where
Parameter (per day) | Description | Reference |
Death rate for AAF population | [6] | |
Death rate for AAM population | [6] | |
Clearance rate | [11] | |
Intrinsic growth rate for AAF population | Estimated | |
Intrinsic growth rate for AAM population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Infection rate for AAF population | [1] | |
Infection rate for AAM population | [1] |
Notice that since
Nf(t)=Sf(t)+If(t)andNm(t)=Sm(t)+Im(t), |
adding the
Consequently, in Model (10), the total AAF and AAM populations, governed by our "fitted" logistic equations (5) and (6) are bounded. We will study Model (10) with the parameter values listed in Table 3 and with the initial conditions listed in Table 4.
= | 8,618,960 | |
= | 7,119,370 | |
= | 5,422,560 | |
= | 5,005,440 |
Notice that in Table 4,
In this section, we show that Model (10) is well-posed. In particular, we obtain that all orbits are nonnegative and there is no population explosion in Model (10).
Theorem 3.1. All solutions of Model (10) are nonnegative and bounded.
Proof. Consider the following nonnegative initial conditions
If
Similarly, if
If
Similarly, if
Recall that
Let
Then Model (10) is equivalent to
dXdt=F(X). |
In AAF population,
Unlike in [14], it is possible for Model (10) to exhibit up to four disease-free equilibrium points (DFEs), where
Since
Since
To determine the stability of
J|Pf0=[μf−rf02μf−rf+δ−σf0rm−μm02μm−rm+δ00−(δ+μf)σf000−(δ+μm)]. |
Similarly, to determine the stability of
J|Pm0=[rf−μf0rf+δ00μm−rm−σm2μm−rm+δ00−(δ+μf)000σm−(δ+μm)]. |
To determine the stability of
{dIfdt=σf(K∗f−If)ImK∗f+K∗m−(δ+μf)If,dImdt=σm(K∗m−Im)IfK∗f+K∗m−(δ+μm)Im,dSfdt=rfK∗f(1−K∗fKf)−σfSfImK∗f+K∗m+δIf−μfSf,dSmdt=rmK∗m(1−K∗mKm)−σmSmIfK∗f+K∗m+δIm−μmSm. | (11) |
Using the next generation matrix method [17] we obtain the following two matrices
F=[σf(K∗f−If)ImK∗f+K∗mσm(K∗m−Im)IfK∗f+K∗m00] |
andV=[(δ+μf)If(δ+μm)Im−rfK∗f(1−K∗fKf)+σfSfImK∗f+K∗m−δIf+μfSf−rmK∗m(1−K∗mKm)+σmSmIfK∗f+K∗m−δIm+μmSm]. |
Then, using the Jacobian matrices of
DF(Qfm)=[F000]andDV(Qfm)=[V0WU], |
where
Hence,
FV−1=1K∗f+K∗m[0σfK∗fδ+μmσmK∗mδ+μf0]. |
By the next generation matrix method [17], the reproduction number for Model (10),
R0=ρ(FV−1)=√R0fR0m,where |
R0f=σfK∗f(δ+μf)(K∗f+K∗m)andR0m=σmK∗m(δ+μm)(K∗f+K∗m). |
When
It is interesting to note that in [13], Lee and Tameru, obtained
To find an effective mitigation strategy that seeks to reduce HPV infection in AA population within the shortest time possible, in the next section, we use sensitivity analysis to study the impact of each model parameter on
Sensitivity indices are used to measure the relative change in a state variable when a parameter changes. Typically, the normalized forward sensitivity index of a variable to a parameter is defined as the ratio of the relative change in the variable to the relative change in the parameter. When the variable is a differential function of the parameter, the sensitive index may be alternatively defined using partial derivatives [4,10,18,19].
Definition 4.1([4,10,18,19]). The normalized forward sensitivity index of a variable,
Υuq:=∂u∂q×qu. |
We use Definition 4.1 to derive the sensitivity indices of the basic reproduction number
Increasing (respectively, decreasing) the clearance rate,
To illustrate the impact of HPV on AAF and AAM populations of 16 years and older, we simulate Model (10) with the parameter values listed in Table 3 and the initial conditions listed in Table 4.
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
0.1526 | 4 | |
-0.1526 | 5 | |
-0.0631 | 6 | |
0.0586 | 7 | |
-0.0561 | 8 | |
0.0520 | 9 |
Simulations of our HPV Model (10) are performed using Matlab software, and are illustrated in Figure 4. Figure 4 (a) shows that susceptible population of AAF of 16 years and older,
To protect against HPV infections, HPV vaccines are available for males and females. Gardasil and Cervarix are two HPV vaccines that have market approval in many countries. Next, we introduce an extension of Model (10) with vaccinated male and female classes. We will use the extended model to study the impact of vaccination on Figure 4.
To introduce HPV vaccination in AAF and AAM populations of Model (10), we let
As in [14], we divide the AA population into eight compartments.
{dSfdt=rf(1−pf)Nf(1−NfKf)−σfSfIm+IvmNf+Nm+δIf−μfSf,dSvfdt=pfrfNf(1−NfKf)−(1−τ)σfSvfIm+IvmNf+Nm+δIvf−μfSvf,dSmdt=rm(1−pm)Nm(1−NmKm)−σmSmIf+IvfNf+Nm+δIm−μmSm,dSvmdt=pmrmNm(1−NmKm)−(1−τ)σmSvmIf+IvfNf+Nm+δIvm−μmSvm,dIfdt=σfSfIm+IvmNf+Nm−(δ+μf)If,dIvfdt=(1−τ)σfSvfIm+IvmNf+Nm−(δ+μf)Ivf,dImdt=σmSmIf+IvfNf+Nm−(δ+μm)Im,dIvmdt=(1−τ)σmSvmIf+IvfNf+Nm−(δ+μm)Ivm, | (12) |
where
= | 5,257,566 | |
= | 3,361,394 | |
= | 5,667,019 | |
= | 1,452,351 | |
= | 5,086,421 | |
= | 336,139 | |
= | 4,860,205 | |
= | 145,235 |
Note that in Table 6,
Proceeding exactly as in Theorem 3.1, we obtain the following result.
Theorem 5.1. All solutions of Model (12) are nonnegative and bounded.
Notice that when all vaccinated classes are missing
From Model (12), the demographic equations for the female and male total populations are respectively the following:
{dNfdt=rfNf(1−NfKf)−μfNf,dNmdt=rmNm1−NmKm)−μmNm. | (13) |
The equilibrium points of Model (13) are
As in Model (10), to state the "limiting" system of Model (12), we replace
{dSfdt=rf(1−pf)K∗f(1−K∗fKf)−σfSfIm+IvmK∗f+K∗m+δIf−μfSf,dSmdt=rm(1−pm)K∗m(1−K∗mKm)−σmSmIf+IvfK∗f+K∗m+δIm−μmSm,dIfdt=σfSfIm+IvmK∗f+K∗m−(δ+μf)If,dIvfdt=(1−τ)σf(K∗f−Sf−If−Ivf)Im+IvmK∗f+K∗m−(δ+μf)Ivf,dImdt=σmSmIf+IvfK∗f+K∗m−(δ+μm)Im,dIvmdt=(1−τ)σm(K∗m−Sm−Im−Ivm)If+IvfK∗f+K∗m−(δ+μm)Ivm, | (14) |
DFE of System (14) is
F=[σfSfIm+IvmK∗f+K∗m(1−τ)σf(K∗f−Sf−If−Ivf)Im+IvmK∗f+K∗mσmSmIf+IvfK∗f+K∗m(1−τ)σm(K∗m−Sm−Im−Ivm)If+IvfK∗f+K∗m00] |
andV=[(δ+μf)If(δ+μf)Ivf(δ+μm)Im(δ+μm)Ivm−rf(1−pf)K∗f(1−K∗fKf)+σfSfIm+IvmK∗f+K∗m−δIf+μfSf−rm(1−pm)K∗m(1−K∗mKm)+σmSmIf+IvfK∗f+K∗m−δIm+μmSm]. |
Let
DF(Q)=[F000]andDV(Q)=[V0WU], |
where
F=[00σf(1−pf)K∗fK∗f+K∗mσf(1−pf)K∗fK∗f+K∗m00(1−τ)σfpfK∗fK∗f+K∗m(1−τ)σfpfK∗fK∗f+K∗mσm(1−pm)K∗mK∗f+K∗mσm(1−pm)K∗mK∗f+K∗m00(1−τ)σmpmK∗mK∗f+K∗m(1−τ)σmpmK∗mK∗f+K∗m00], |
V=[δ+μf0000δ+μf0000δ+μm0000δ+μm],U=[μf00μm] |
andW=[−δ0σf(1−pf)K∗fK∗f+K∗mσf(1−pf)K∗fK∗f+K∗mσm(1−pm)K∗mK∗f+K∗mσm(1−pm)K∗mK∗f+K∗m−δ0]. |
Hence,
FV−1=[00σf(1−pf)K∗f(δ+μm)(K∗f+K∗m)σf(1−pf)K∗f(δ+μm)(K∗f+K∗m)00(1−τ)σfpfK∗f(δ+μm)(K∗f+K∗m)(1−τ)σfpfK∗f(δ+μm)(K∗f+K∗m)σm(1−pm)K∗m(δ+μf)(K∗f+K∗m)σm(1−pm)K∗m(δ+μf)(K∗f+K∗m)00(1−τ)σmpmK∗m(δ+μf)(K∗f+K∗m)(1−τ)σmpmK∗m(δ+μf)(K∗f+K∗m)00]. |
By the next generation matrix method [17], the reproduction number for Model (12),
Rv0=ρ(FV−1)=√Rv0fRv0m,where |
Rv0f=(1−τpf)σfK∗f(δ+μf)(K∗f+K∗m)andRv0m=(1−τpm)σmK∗m(δ+μm)(K∗f+K∗m). |
Rv0f=(1−τpf)R0fandRv0m=(1−τpm)R0m. |
Since
Thus, adopting a HPV vaccination program decreases the basic reproduction number,
Using the parameter values of Table 3,
In the next section, we use sensitivity analysis to illustrate the impact of model parameters on
We use Definition 4.1 to derive the sensitivity indices of the basic reproduction number
From Table 7 and Figure 5, increasing (respectively, decreasing) the clearance rate,
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
-0.3829 | 4 | |
-0.2704 | 5 | |
0.1526 | 6 | |
-0.1526 | 7 | |
-0.1124 | 8 | |
-0.0631 | 9 | |
0.0586 | 10 | |
-0.0561 | 11 | |
0.0520 | 12 |
To illustrate the impact of HPV on AAF and AAM populations of 16 years and older when a vaccination program is applied with
Simulations of our HPV Model (12) are performed using Matlab software, and are illustrated in Figure 6. Figures 6 (a-b) show that susceptible population of AAF of 16 years and older,
To study the impact of the presence of the vaccinated class on the results of Figure 4, we simulate Model (12) using initial conditions in Table 8. For these simulations of Model (12), we keep all the parameter values at their current values in Figure 4, where
![]() | |||
5,257,566 | 4,309,480 | 2,585,688 | |
3,361,394 | 4,309,480 | 6,033,272 | |
5,667,019 | 3,559,685 | 2,135,811 | |
1,452,351 | 3,559,685 | 4,983,559 | |
5,086,421 | 4,991,612 | 4,819,233 | |
336,139 | 430,948 | 603,327 | |
4,860,205 | 4,649,472 | 4,507,084 | |
145,235 | 355,969 | 498,356 |
Note that in Figure 6 and Table 6,
In AA population,
Furthermore, in both AAF and AAM populations, Figures 9 and 10 show that the number of infected populations is lower when the population is under a vaccination policy than when the population is not being vaccinated. Thus, HPV vaccines that provide partial immunity to both AAF and AAM populations of 16 years and older not only lower the number of HPV infectives but increase the number of susceptibles in both female and male populations.
Furthermore, we obtained in Figures 7-10 that the increase (respectively, decrease) in the susceptible (respectively, HPV infective) populations is larger when a bigger proportion of the population is vaccinated.
We use a two-sex HPV model with "fitted" logistic demographics to study HPV disease dynamics in AAF and AAM populations of 16 years and older. In agreement with Lee and Tameru [13], we obtained that in AA population,
Using sensitivity analysis on
● Increasing (respectively, decreasing) the clearance rate,
● Increasing (respectively, decreasing) the infection rate of the AAF population,
● Increasing (respectively, decreasing) the infection rate of the AAM population,
In the second part of the paper, we extended our model to include vaccination classes in both male and female AA populations of 16 years and older. We obtained that in AA population when the vaccination program is implemented,
● Increasing (respectively, decreasing) the clearance rate,
● Increasing (respectively, decreasing) the infection rate of the AAF population,
● Increasing (respectively, decreasing) the infection rate of the AAM population,
● Increasing (respectively, decreasing) the success rate of vaccination,
● Increasing (respectively, decreasing) the proportion of HPV vaccinated females,
Furthermore, using the extended model with vaccination we obtained the following results:
● Adopting a vaccination policy lowers HPV infections in both AAF and AAM populations.
● Vaccinating a larger proportion of AAF and AAM populations leads to fewer cases of HPV infections in the vaccinated population.
This research was partially supported by National Science Foundation under grant DUE-1439758.
[1] |
Chen M, Mao S, Liu Y (2014) Big data: A survey. Mobile networks and applications 19: 171-209. doi: 10.1007/s11036-013-0489-0
![]() |
[2] | McAfee A, Brynjolfsson E, Davenport TH, et al. (2012) Big data: the management revolution. Harvard business review 90: 60-68. |
[3] | Venkatraman R, Venkatraman S (2019) Big Data Infrastructure, Data Visualisation and Challenges. Proceedings of the 3rd International Conference on Big Data and Internet of Things, 13-17. |
[4] |
Labrinidis A, Jagadish HV (2012) Challenges and opportunities with big data. Proceedings of the VLDB Endowment 5: 2032-2033. doi: 10.14778/2367502.2367572
![]() |
[5] |
Venkatraman S, Venkatraman R (2019) Big data security challenges and strategies. AIMS MATHEMATICS 4: 860-879. doi: 10.3934/math.2019.3.860
![]() |
[6] | Masi I, Wu Y, Hassner T, et al. (2018) Deep face recognition: A survey. 2018 31st SIBGRAPI conference on graphics, patterns and images (SIBGRAPI), 471-478. |
[7] | Singh A, Bhadani R (2020) Mobile Deep Learning with TensorFlow Lite, ML Kit and Flutter. Packt Publishing. |
[8] |
Zhu Y, Jiang Y (2020) Optimization of face recognition algorithm based on deep learning multi feature fusion driven by big data. Image Vision Comput 104: 104023. doi: 10.1016/j.imavis.2020.104023
![]() |
[9] | Reddy KS, Krishna VV, Kumar VV (2016) A Method for Facial Recognition Based On Local Features. International Journal of Mathematics and Computation 27: 98-109. |
[10] | Qateef JS, Kazm AA (2016) Facial expression recognition via mapreduce assisted k-nearest neighbor algorithm. International Journal of Computer Science and Information Security 14: 170. |
[11] |
Sirovich L, Kirby M (1987) Low-dimensional procedure for the characterization of human faces. Josa a 4: 519-524. doi: 10.1364/JOSAA.4.000519
![]() |
[12] | Turk MA, Pentland AP (1991) Face recognition using eigenfaces. Proceedings. 1991 IEEE computer society conference on computer vision and pattern recognition, 586-587. IEEE Computer Society. |
[13] |
Bruce V, Young A (1986) Understanding face recognition. British journal of psychology 77: 305-327. doi: 10.1111/j.2044-8295.1986.tb02199.x
![]() |
[14] | Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition. |
[15] |
He X, Yan S, Hu Y, et al. (2005) Face recognition using Laplacianfaces. IEEE T Pattern Anal 27: 328-340. doi: 10.1109/TPAMI.2005.55
![]() |
[16] | Deng J, Guo J, Xue N, et al. (2019) Arcface: Additive angular margin loss for deep face recognition. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4690-4699. |
[17] | Zhou E, Cao Z, Yin Q (2015) Naive-deep face recognition: Touching the limit of LFW benchmark or not? arXiv preprint arXiv: 150104690. |
[18] | Wang H, Wang Y, Zhou Z, et al. (2018) Cosface: Large margin cosine loss for deep face recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 5265-5274. |
[19] | Wen Y, Zhang K, Li Z, et al. (2016) A discriminative feature learning approach for deep face recognition. European conference on computer vision, 499-515. |
[20] | Deng J, Zhou Y, Zafeiriou S (2017) Marginal loss for deep face recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 60-68. |
[21] | Ding H, Zhou SK, Chellappa R (2017) Facenet2expnet: Regularizing a deep face recognition net for expression recognition. 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), 118-126. |
[22] | Wang F, Chen L, Li C, et al. (2018) The devil of face recognition is in the noise. Proceedings of the European Conference on Computer Vision (ECCV), 765-780. |
[23] | Benhlima L (2018) Big data management for healthcare systems: architecture, requirements, and implementation. Advances in bioinformatics 2018. |
[24] |
Asaithambi SPR, Venkatraman R, Venkatraman S (2020) MOBDA: Microservice-Oriented Big Data Architecture for Smart City Transport Systems. Big Data and Cognitive Computing 4: 17. doi: 10.3390/bdcc4030017
![]() |
[25] | Costa C, Santos MY (2016) BASIS: A big data architecture for smart cities. 2016 SAI Computing Conference (SAI), 1247-1256. |
[26] |
He X, Wang K, Huang H, et al. (2018) QoE-driven big data architecture for smart city. IEEE Commun Mag 56: 88-93. doi: 10.1109/MCOM.2018.1700231
![]() |
[27] | Lopez D, Manogaran G (2016) Big data architecture for climate change and disease dynamics. The human element of big data: issues, analytics, and performance, 301-331. |
[28] |
Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Communications of the ACM 51: 107-113. doi: 10.1145/1327452.1327492
![]() |
[29] | Peralta D, Del Río S, Ramírez-Gallego S, et al. (2015) Evolutionary feature selection for big data classification: A mapreduce approach. Math Probl Eng 2015. |
[30] |
Gao W, Zhao X, Gao Z, et al. (2019) 3D Face Reconstruction From Volumes of Videos Using a Mapreduce Framework. IEEE Access 7: 165559-165570. doi: 10.1109/ACCESS.2019.2938671
![]() |
[31] |
Mahmoud SM, Habeeb RS (2019) Analysis of Large Set of Images Using MapReduce Framework. International Journal of Modern Education and Computer Science 11: 47. doi: 10.5815/ijmecs.2019.12.05
![]() |
[32] | Apache Spark™. A unified analytics engine for large-scale data processing. |
[33] | Hazarika AV, Ram GJSR, Jain E (2017) Performance comparision of Hadoop and spark engine. 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC), 671-674. |
[34] | Zaharia M, Chowdhury M, Das T, et al. (2012) Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. 9th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 12), 15-28. |
[35] |
Luengo J, García S, Herrera F (2012) On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl Inf Syst 32: 77-108. doi: 10.1007/s10115-011-0424-2
![]() |
[36] |
Batista GE, Monard MC (2003) An analysis of four missing data treatment methods for supervised learning. Appl Artif Intell 17: 519-533. doi: 10.1080/713827181
![]() |
[37] | Wilson DL (1972) Asymptotic properties of nearest neighbor rules using edited data. IEEE T Syst Man Cy B, 408-421. |
[38] |
Sánchez JS, Barandela R, Marqués AI, et al. (2003) Analysis of new techniques to obtain quality training sets. Pattern Recogn Lett 24: 1015-1022. doi: 10.1016/S0167-8655(02)00225-8
![]() |
[39] |
Garcia S, Derrac J, Cano J, et al. (2012) Prototype selection for nearest neighbor classification: Taxonomy and empirical study. IEEE T Pattern Anal 34: 417-435. doi: 10.1109/TPAMI.2011.142
![]() |
[40] |
Triguero I, Derrac J, Garcia S, et al. (2011) A taxonomy and experimental study on prototype generation for nearest neighbor classification. IEEE T Syst Man Cy C 42: 86-100. doi: 10.1109/TSMCC.2010.2103939
![]() |
[41] |
García-Gil D, Luengo J, García S, et al. (2019) Enabling smart data: noise filtering in big data classification. Inform Sciences 479: 135-152. doi: 10.1016/j.ins.2018.12.002
![]() |
[42] |
Xue B, Zhang M, Browne WN, et al. (2015) A survey on evolutionary computation approaches to feature selection. IEEE T Evolut Comput 20: 606-626. doi: 10.1109/TEVC.2015.2504420
![]() |
[43] | Navot A, Shpigelman L, Tishby N, et al. (2005) Nearest neighbor based feature selection for regression and its application to neural activity. Advances in neural information processing systems 18: 996-1002. |
[44] | Ramírez-Gallego S, García S, Xiong N, et al. (2018) BELIEF: A distance-based redundancy-proof feature selection method for Big Data. arXiv preprint arXiv: 180405774. |
[45] |
Triguero I, Peralta D, Bacardit J, et al. (2015) MRPR: a MapReduce solution for prototype reduction in big data classification. Neurocomputing 150: 331-345. doi: 10.1016/j.neucom.2014.04.078
![]() |
[46] | García-Gil D, Ramírez-Gallego S, García S, et al. (2018) On the Use of Random Discretization and Dimensionality Reduction in Ensembles for Big Data. International Conference on Hybrid Artificial Intelligence Systems, 15-26. |
[47] | Triguero I, Galar M, Vluymans S, et al. (2015) Evolutionary undersampling for imbalanced big data classification. 2015 IEEE Congress on Evolutionary Computation (CEC), 715-722. |
[48] | Triguero I, Galar M, Merino D, et al. (2016) Evolutionary undersampling for extremely imbalanced big data classification under apache spark. 2016 IEEE Congress on Evolutionary Computation (CEC), 640-647. |
[49] |
Ramírez-Gallego S, García S, Benítez JM, et al. (2018) A distributed evolutionary multivariate discretizer for big data processing on apache spark. Swarm Evol Comput 38: 240-250. doi: 10.1016/j.swevo.2017.08.005
![]() |
[50] |
Maillo J, Triguero I, Herrera F (2015) A mapreduce-based k-nearest neighbor approach for big data classification. 2015 IEEE Trustcom/BigDataSE/ISPA 2: 167-172. doi: 10.1109/Trustcom.2015.577
![]() |
[51] |
Maillo J, Ramírez S, Triguero I, et al. (2017) kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data. Knowl-Based Syst 117: 3-15. doi: 10.1016/j.knosys.2016.06.012
![]() |
[52] |
Deng Z, Zhu X, Cheng D, et al. (2016) Efficient kNN classification algorithm for big data. Neurocomputing 195: 143-148. doi: 10.1016/j.neucom.2015.08.112
![]() |
[53] |
Gallego A-J, Calvo-Zaragoza J, Valero-Mas JJ, et al. (2018) Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation. Pattern Recogn 74: 531-543. doi: 10.1016/j.patcog.2017.09.038
![]() |
[54] |
Wang F, Wang Q, Nie F, et al. (2018) Efficient tree classifiers for large scale datasets. Neurocomputing 284: 70-79. doi: 10.1016/j.neucom.2017.12.061
![]() |
1. | O.M. Ogunmiloro, Modelling the Human Papilloma Virus Transmission in a Bisexually Active Host Community, 2020, 13, 20710216, 80, 10.14529/mmp200207 | |
2. | Mo’tassem Al-arydah, Two-Sex Logistic Model for Human Papillomavirus and Optimal Vaccine, 2021, 1793-5245, 2150011, 10.1142/S179352452150011X | |
3. | Xinwei Wang, Haijun Peng, Boyang Shi, Dianheng Jiang, Sheng Zhang, Biaosong Chen, Optimal vaccination strategy of a constrained time-varying SEIR epidemic model, 2019, 67, 10075704, 37, 10.1016/j.cnsns.2018.07.003 | |
4. | Shasha Gao, Maia Martcheva, Hongyu Miao, Libin Rong, The impact of vaccination on human papillomavirus infection with disassortative geographical mixing: a two-patch modeling study, 2022, 84, 0303-6812, 10.1007/s00285-022-01745-z | |
5. | Mo'tassem Al‐arydah, Mathematical modeling for relation between parents' health education and vaccine uptake, 2022, 0170-4214, 10.1002/mma.8860 | |
6. | Shasha Gao, Maia Martcheva, Hongyu Miao, Libin Rong, A two-sex model of human papillomavirus infection: Vaccination strategies and a case study, 2022, 536, 00225193, 111006, 10.1016/j.jtbi.2022.111006 | |
7. | Ramziya Rifhat, Zhidong Teng, Lei Wang, Ting Zeng, Liping Zhang, Kai Wang, Dynamical behavior and density function of a stochastic model of HPV infection and cervical cancer with a case study for Xinjiang, China, 2023, 00160032, 10.1016/j.jfranklin.2023.06.008 | |
8. | Praveen Kumar Rajan, Murugesan Kuppusamy, Abdullahi Yusuf, A fractional-order modeling of human papillomavirus transmission and cervical cancer, 2023, 2363-6203, 10.1007/s40808-023-01843-x | |
9. | Najat Ziyadi, A discrete-time nutrients-phytoplankton-oysters mathematical model of a bay ecosystem*, 2023, 17, 1751-3758, 10.1080/17513758.2023.2242720 | |
10. | Tuan Anh Phan, Farhana Sarower, Jinqiao Duan, Jianjun Paul Tian, Stochastic dynamics of human papillomavirus delineates cervical cancer progression, 2023, 87, 0303-6812, 10.1007/s00285-023-02018-z | |
11. | Shayidan Abuduwaili, Lei Wang, Zhidong Teng, Abidan Ailawaer, Ramziya Rifhat, Estimating real-time reproduction number for HPV infection in Xinjiang, China, 2025, 2025, 2731-4235, 10.1186/s13662-025-03896-x | |
12. | M. Arunkumar, Praveen Kumar Rajan, K. Murugesan, Mathematical Analysis of an Optimal Control Problem for Mitigating HPV Transmission and Cervical Cancer Progression through Educational Campaigns, 2025, 2731-8095, 10.1007/s40995-025-01804-2 |
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAF population 16 years and older [12,16] | 13,825,055 | 14,041,520 | 14,259,413 | 14,473,927 | 14,707,490 | 14,952,963 | 15,224,330 | 15,486,244 | 15,743,096 | 15,992,822 | 16,176,048 | 16,471,449 | 16,696,303 | 16,918,225 | 17,139,986 |
AAF total population [12,16] | 18,787,192 | 19,013,351 | 19,229,855 | 19,434,349 | 19,653,829 | 19,882,081 | 20,123,789 | 20,374,894 | 20,626,043 | 20,868,282 | 21,045,595 | 21,320,013 | 21,543,051 | 21,767,521 | 21,988,307 |
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAM population 16 years and older [12,16] | 11,909,507 | 12,124,810 | 12,332,791 | 12,518,252 | 12,756,370 | 12,996,123 | 13,266,163 | 13,517,841 | 13,765,707 | 14,006,594 | 14,181,655 | 14,490,027 | 14,724,637 | 14,950,933 | 15,176,189 |
AAM total population [12,16] | 17,027,514 | 17,249,678 | 17,454,795 | 17,631,747 | 17,856,753 | 18,079,607 | 18,319,259 | 18,560,639 | 18,803,371 | 19,033,988 | 19,260,298 | 19,487,042 | 19,719,238 | 19,945,997 | 20,169,931 |
Parameter (per day) | Description | Reference |
Death rate for AAF population | [6] | |
Death rate for AAM population | [6] | |
Clearance rate | [11] | |
Intrinsic growth rate for AAF population | Estimated | |
Intrinsic growth rate for AAM population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Infection rate for AAF population | [1] | |
Infection rate for AAM population | [1] |
= | 8,618,960 | |
= | 7,119,370 | |
= | 5,422,560 | |
= | 5,005,440 |
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
0.1526 | 4 | |
-0.1526 | 5 | |
-0.0631 | 6 | |
0.0586 | 7 | |
-0.0561 | 8 | |
0.0520 | 9 |
= | 5,257,566 | |
= | 3,361,394 | |
= | 5,667,019 | |
= | 1,452,351 | |
= | 5,086,421 | |
= | 336,139 | |
= | 4,860,205 | |
= | 145,235 |
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
-0.3829 | 4 | |
-0.2704 | 5 | |
0.1526 | 6 | |
-0.1526 | 7 | |
-0.1124 | 8 | |
-0.0631 | 9 | |
0.0586 | 10 | |
-0.0561 | 11 | |
0.0520 | 12 |
![]() | |||
5,257,566 | 4,309,480 | 2,585,688 | |
3,361,394 | 4,309,480 | 6,033,272 | |
5,667,019 | 3,559,685 | 2,135,811 | |
1,452,351 | 3,559,685 | 4,983,559 | |
5,086,421 | 4,991,612 | 4,819,233 | |
336,139 | 430,948 | 603,327 | |
4,860,205 | 4,649,472 | 4,507,084 | |
145,235 | 355,969 | 498,356 |
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAF population 16 years and older [12,16] | 13,825,055 | 14,041,520 | 14,259,413 | 14,473,927 | 14,707,490 | 14,952,963 | 15,224,330 | 15,486,244 | 15,743,096 | 15,992,822 | 16,176,048 | 16,471,449 | 16,696,303 | 16,918,225 | 17,139,986 |
AAF total population [12,16] | 18,787,192 | 19,013,351 | 19,229,855 | 19,434,349 | 19,653,829 | 19,882,081 | 20,123,789 | 20,374,894 | 20,626,043 | 20,868,282 | 21,045,595 | 21,320,013 | 21,543,051 | 21,767,521 | 21,988,307 |
2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | |
AAM population 16 years and older [12,16] | 11,909,507 | 12,124,810 | 12,332,791 | 12,518,252 | 12,756,370 | 12,996,123 | 13,266,163 | 13,517,841 | 13,765,707 | 14,006,594 | 14,181,655 | 14,490,027 | 14,724,637 | 14,950,933 | 15,176,189 |
AAM total population [12,16] | 17,027,514 | 17,249,678 | 17,454,795 | 17,631,747 | 17,856,753 | 18,079,607 | 18,319,259 | 18,560,639 | 18,803,371 | 19,033,988 | 19,260,298 | 19,487,042 | 19,719,238 | 19,945,997 | 20,169,931 |
Parameter (per day) | Description | Reference |
Death rate for AAF population | [6] | |
Death rate for AAM population | [6] | |
Clearance rate | [11] | |
Intrinsic growth rate for AAF population | Estimated | |
Intrinsic growth rate for AAM population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Carrying capacity for AAF population | Estimated | |
Infection rate for AAF population | [1] | |
Infection rate for AAM population | [1] |
= | 8,618,960 | |
= | 7,119,370 | |
= | 5,422,560 | |
= | 5,005,440 |
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
0.1526 | 4 | |
-0.1526 | 5 | |
-0.0631 | 6 | |
0.0586 | 7 | |
-0.0561 | 8 | |
0.0520 | 9 |
= | 5,257,566 | |
= | 3,361,394 | |
= | 5,667,019 | |
= | 1,452,351 | |
= | 5,086,421 | |
= | 336,139 | |
= | 4,860,205 | |
= | 145,235 |
Parameter | Sensitivity index of | Order of Importance |
-0.9915 | 1 | |
0.5000 | 2 | |
0.5000 | 3 | |
-0.3829 | 4 | |
-0.2704 | 5 | |
0.1526 | 6 | |
-0.1526 | 7 | |
-0.1124 | 8 | |
-0.0631 | 9 | |
0.0586 | 10 | |
-0.0561 | 11 | |
0.0520 | 12 |
![]() | |||
5,257,566 | 4,309,480 | 2,585,688 | |
3,361,394 | 4,309,480 | 6,033,272 | |
5,667,019 | 3,559,685 | 2,135,811 | |
1,452,351 | 3,559,685 | 4,983,559 | |
5,086,421 | 4,991,612 | 4,819,233 | |
336,139 | 430,948 | 603,327 | |
4,860,205 | 4,649,472 | 4,507,084 | |
145,235 | 355,969 | 498,356 |