Review Topical Sections

Proposed big data architecture for facial recognition using machine learning

  • Received: 03 November 2020 Accepted: 11 January 2021 Published: 03 February 2021
  • With the abundance of raw data generated from various sources including social networks, big data has become essential in acquiring, processing, and analyzing heterogeneous data from multiple sources for real-time applications. In this paper, we propose a big data framework suitable for pre‑processing and classification of image as well as text analytics by employing two key workflows, called big data (BD) pipeline and machine learning (ML) pipeline. Our unique end-to-end workflow integrates data cleansing, data integration, data transformation and data reduction processes, followed by various analytics using suitable machine learning techniques. Further, our model is the first of its kind to augment facial recognition with sentiment analysis in a distributed big data framework. The implementation of our model uses state-of-the-art distributed technologies to ingest, prepare, process and analyze big data for generating actionable data insights by employing relevant ML algorithms such as k-NN, logistic regression and decision tree. In addition, we demonstrate the application of our big data framework to facial recognition system using open sources by developing a prototype as a use case. We also employ sentiment analysis on non-repetitive semi structured public data (text) such as user comments, image tagging, and other information associated with the facial images. We believe our work provides a novel approach to intersect Big Data, ML and Face Recognition and would create new research to alleviate some of the challenges associated with big data processing in real world applications.

    Citation: Suriya Priya R Asaithambi, Sitalakshmi Venkatraman, Ramanathan Venkatraman. Proposed big data architecture for facial recognition using machine learning[J]. AIMS Electronics and Electrical Engineering, 2021, 5(1): 68-92. doi: 10.3934/electreng.2021005

    Related Papers:

    [1] Yaw Chang, Wei Feng, Michael Freeze, Xin Lu, Charles Smith . Elimination, permanence, and exclusion in a competition model under Allee effects. AIMS Mathematics, 2023, 8(4): 7787-7805. doi: 10.3934/math.2023391
    [2] Guillaume Cantin, Cristiana J. Silva . Complex network near-synchronization for non-identical predator-prey systems. AIMS Mathematics, 2022, 7(11): 19975-19997. doi: 10.3934/math.20221093
    [3] Haiping Pan, Yiqiu Mao . Transition and bifurcation analysis for chemotactic systems with double eigenvalue crossings. AIMS Mathematics, 2023, 8(10): 24681-24698. doi: 10.3934/math.20231258
    [4] Lini Fang, N'gbo N'gbo, Yonghui Xia . Almost periodic solutions of a discrete Lotka-Volterra model via exponential dichotomy theory. AIMS Mathematics, 2022, 7(3): 3788-3801. doi: 10.3934/math.2022210
    [5] M. Adel, M. M. Khader, Hijaz Ahmad, T. A. Assiri . Approximate analytical solutions for the blood ethanol concentration system and predator-prey equations by using variational iteration method. AIMS Mathematics, 2023, 8(8): 19083-19096. doi: 10.3934/math.2023974
    [6] Yinyin Wu, Fanfan Chen, Qingchi Ma, Dingbian Qian . Subharmonic solutions for degenerate periodic systems of Lotka-Volterra type with impulsive effects. AIMS Mathematics, 2023, 8(9): 20080-20096. doi: 10.3934/math.20231023
    [7] Nazmul Sk, Bapin Mondal, Abhijit Sarkar, Shyam Sundar Santra, Dumitru Baleanu, Mohamed Altanji . Chaos emergence and dissipation in a three-species food web model with intraguild predation and cooperative hunting. AIMS Mathematics, 2024, 9(1): 1023-1045. doi: 10.3934/math.2024051
    [8] Xuechao Zhang, Yuhan Hu, Shichang Lu, Haomiao Guo, Xiaoyan Wei, Jun li . Dynamic analysis and control of online information dissemination model considering information beneficiaries. AIMS Mathematics, 2025, 10(3): 4992-5020. doi: 10.3934/math.2025229
    [9] Li Wang, Hui Zhang, Suying Liu . On the existence of almost periodic solutions of impulsive non-autonomous Lotka-Volterra predator-prey system with harvesting terms. AIMS Mathematics, 2022, 7(1): 925-938. doi: 10.3934/math.2022055
    [10] Iqra Batool, Naim Bajcinca . Stability analysis of a multiscale model including cell-cycle dynamics and populations of quiescent and proliferating cells. AIMS Mathematics, 2023, 8(5): 12342-12372. doi: 10.3934/math.2023621
  • With the abundance of raw data generated from various sources including social networks, big data has become essential in acquiring, processing, and analyzing heterogeneous data from multiple sources for real-time applications. In this paper, we propose a big data framework suitable for pre‑processing and classification of image as well as text analytics by employing two key workflows, called big data (BD) pipeline and machine learning (ML) pipeline. Our unique end-to-end workflow integrates data cleansing, data integration, data transformation and data reduction processes, followed by various analytics using suitable machine learning techniques. Further, our model is the first of its kind to augment facial recognition with sentiment analysis in a distributed big data framework. The implementation of our model uses state-of-the-art distributed technologies to ingest, prepare, process and analyze big data for generating actionable data insights by employing relevant ML algorithms such as k-NN, logistic regression and decision tree. In addition, we demonstrate the application of our big data framework to facial recognition system using open sources by developing a prototype as a use case. We also employ sentiment analysis on non-repetitive semi structured public data (text) such as user comments, image tagging, and other information associated with the facial images. We believe our work provides a novel approach to intersect Big Data, ML and Face Recognition and would create new research to alleviate some of the challenges associated with big data processing in real world applications.



    1. Introduction

    In the United States of America, human papillomavirus (HPV) is the most common sexually transmitted infection (STI) in males and females [8]. Most sexually active males and females will get at least one type of HPV infection at some point in their lives [5]. In the United States, about 79 million are currently infected with HPV and about 14 million people become newly infected each year [8]. There are more than 150 different types of HPV [7]. Health problems related to HPV include genital warts and cancer. Most people infected with genital HPV do not know they are infected and never develop symptoms or health problems from it. Some people find out they have HPV when they get genital warts. Females may find out they have HPV when they get an abnormal Pap test result during cervical cancer screening. Others may only find out once they have developed more serious problems from HPV, such as cancer [5]. Most HPV infections cause no symptoms and are not clinically significant, but persistent infection can lead to disease or cancer.

    Mathematical epidemic models have been used to study HPV infections in various populations. For example, Alsaleh and Gummel [1] in a recent paper, used a deterministic model to assess the impact of vaccination on both high-risk and low-risk HPV infection types. Ribassin-Majed and Clemencon [14] used a deterministic mathematical model to assess the impact of vaccination on non-cancer causing HPV (6/11) in French males and females. Lee and Tameru [13] used a deterministic model to assess the impact of HPV on cervical cancer in African American females (AAF). In all these studies, the HPV models have a constant recruitment function for the demographic equations.

    In this paper, we use a two-sex HPV model with "fitted" logistic demographics to study the HPV disease dynamics in AAF and African American males (AAM) of 16 years and older. Using US Census Bureau data for AAF and AAM populations, we illustrate that the "fitted" logistic demographic equation captures the African American (AA) population better than the constant recruitment demographic equation. We compute the basic reproduction number, R0 , and perform sensitivity analysis on R0. We obtain that in the AA population R0<1. In addition, we use an extension of the model with vaccination classes to assess the impact of vaccination on the AAF and AAM populations.

    The paper is organized as follows: In Section 2, we introduce a demographic equation for AAF (respectively, AAM) and we "fit" it to the US Census Bureau data of AAF (respectively, AAM) of 16 years and older. We introduce, in Section 3, a two-sex African American HPV model. In Section 4, we study disease-free equilibria and compute the basic reproduction number R0. In Section 5, we introduce the model with vaccination. In Section 6, we study disease-free equilibria and compute the basic reproduction number Rv0 for HPV model with vaccination. We summarize our results in Section 7.


    2. Demographic equations

    In [14], Ribassin-Majed et al. used a HPV model with constant recruitment rate in the demographic equation to study HPV disease dynamics in male and female populations of France. In the absence of the HPV disease, the demographic equation of their model is the following ordinary differential equation:

    dNdt=ΛμN, (1)

    where N is total population of males or females and for both males and females μ is the constant per capita mortality rate and Λ is the constant recruitment rate. In [13], Lee and Tameru used Model (1) as the demographic equation for their single sex HPV model in African Americans.

    In this paper, we use logistic models that are "fitted" to the 20002014 US Census Bureau population data of AAF and AAM of 16 years and older (see Tables 1-2) as the demographic equations for AAF and AAM in a HPV model.

    Table 1. 2000 to 2014 US Census Bureau AAF population data.
    2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014
    AAF population 16 years and older [12,16] 13,825,055 14,041,520 14,259,413 14,473,927 14,707,490 14,952,963 15,224,330 15,486,244 15,743,096 15,992,822 16,176,048 16,471,449 16,696,303 16,918,225 17,139,986
    AAF total population [12,16] 18,787,192 19,013,351 19,229,855 19,434,349 19,653,829 19,882,081 20,123,789 20,374,894 20,626,043 20,868,282 21,045,595 21,320,013 21,543,051 21,767,521 21,988,307
     | Show Table
    DownLoad: CSV
    Table 2. 2000 to 2014 US Census Bureau AAM population data.
    2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014
    AAM population 16 years and older [12,16] 11,909,507 12,124,810 12,332,791 12,518,252 12,756,370 12,996,123 13,266,163 13,517,841 13,765,707 14,006,594 14,181,655 14,490,027 14,724,637 14,950,933 15,176,189
    AAM total population [12,16] 17,027,514 17,249,678 17,454,795 17,631,747 17,856,753 18,079,607 18,319,259 18,560,639 18,803,371 19,033,988 19,260,298 19,487,042 19,719,238 19,945,997 20,169,931
     | Show Table
    DownLoad: CSV

    From Tables 1 and 2, we note that both male and female populations of African Americans of 16 years and older as well as the total populations exhibit increasing trends from 2000 to 2014. In the next section, we use a logistic differential equation model to "capture" the AA population data of Tables 1 and 2.


    2.1. Demographic equation for AAF and AAM

    In [3], Brauer and Castillo-Chavez used the logistic equation "fitted" to United States Census Bureau data to model the total United States population. We use the same approach to "fit" the solution of the following logistic equation to the AAF population of 16 years and older (see Table 1) and the AAM population of 16 years and older (see Table 2).

    dNidt=(riμi)Ni(1NiKi(riμi)/ri),t0, (2)

    where index i=f refers to the female and index i=m refers to the male. Ni(t) is the total population of AAF of 16 years and older at time t if i=f, respectively, the total population of AAM of 16 years and older at time t if i=m. The constant μi is the death rate of population, ri is the intrinsic growth rate of population and Ki is the carrying capacity.

    N0i is the size of the population at time t=0, where year 2001 in Tables 1 and 2 is t=0. The solutions of (2) are

    Ni(t)=0andNi(t)=Ki(riμi)/ri1+(Ki(riμi)/riN0i1)e(riμi)t. (3)

    Let

    Rdi=riμi.

    Rdi is the demographic threshold of the population. When the constant intrinsic growth rate, ri, is bigger than the per capita mortality rate of population, μi, then Rdi>1. However, Rdi<1 when ri is less than μi. Consequently, for every N0i>0, when Rdi>1 then the nontrivial solution Ni(t) approaches the "death" adjusted carrying capacity,

    Ki=Ki(riμi)ri,ast,

    and the population persists. However, when Rdi<1 then Ni(t)0 as t and the population goes extinct.


    2.2. AAF and AAM "fitted" logistic demographic equations

    Equation (2) gives the per capita growth rate, dNi/dtNi, for the total AAF population of 16 years and older when i=f and the total AAM population of 16 years and older when i=m. That is, if Ni(t)0, then

    dNi/dtNi=(riμi)(1NiKi(riμi)/ri). (4)

    Using the 20022014 US Census Bureau data of Table 1 (respectively, Table 2), we estimate the values of dNf/dt (respectively, dNm/dt) by symmetric differences. We then graph the per capita growth rate, Y=dNf/dtNf (respectively, Y=dNm/dtNm) versus the AAF (respectively, AAM) population. Then the line with equation Y=0.0212989784.02117×1010Nf (respectively, Y=0.0198779261.9980893×1010Nm) "fits" the resulting curve. Therefore, the AAF (respectively, AAM) population of 16 years and older exhibits a logistic behavior (see Figures 1 and 2, respectively).

    Figure 1. The logistic demographic model "fits" 20002014 US Census Bureau data of AAF population of 16 years and older, while constant recruitment Model (1) over estimates the AAF population.
    Figure 2. The logistic demographic model "fits" 20002014 US Census Bureau data of AAM population of 16 years and older, while constant recruitment Model (1) over estimates the AAM population.

    Fitting the line to the curve gives rfμf=0.021298978>0 and rf/Kf=4.02117×1010 (respectively, rmμm=0.019877926>0 and rm/Km=1.9980893×1010). So, we estimate the intrinsic growth rate of AAF (respectively, AAM) to be rf=0.028564978 (respectively, rm=0.028104926) and the carrying capacity of AAF (respectively, AAM) is Kf=71,036,484 (respectively, Km=140,659,009). From our estimates of rf and μf we note that Rdf>1 and Kf=52,967,117 (rounded). Similarly, from our estimates of rm and μm we note that Rdm>1 and Km=99,484,673 (rounded).

    Using our estimates, we express the nontrivial solution (3) of the logistic growth model for AAF of 16 years and older as

    Nf(t)=52,967,1171+2.772176873e0.021298978t, (5)

    and for AAM of 16 years and older as

    Nm(t)=99,484,6731+7.205050059e0.019877926t, (6)

    where year 2001 is taken as t=0.

    The plot of the data of AAF of 16 years or older and solution (5) in Figure 1 show that our "fitted" model captures the AAF data of Table 1. Similarly, the plot of the data of AAM of 16 years or older and solution (6) in Figure 2 show that our "fitted" model captures the AAM data of Table 2.


    2.3. AAF and AAM "fitted" logistic equation versus constant recruitment model

    When the population in equation (1) consists only of AAF (i=f) or of AAM (i=f) of 16 years and older, then the solution is

    Ni(t)=Λiμi+(N0iΛiμi)eμit. (7)

    Using the initial condition N0f=14,041,520 [12] (respectively, N0m=12,124,810 [12]), the parameters Λf=16,821,072 (52% of AA population are female [2,13]) and μf=0.007266 [6] (respectively, Λm=15,527,714 (48% of AA population are male [2,13]) and μm=0.008227 [6]), solution (7) becomes

    Nf(t)=2,315,038,8112,300,997,291e0.007266t, (8)

    and

    Nm(t)=1,887,409,0191,875,284,209e0.008227t. (9)

    In Figure 1, we compare the "fitted" solution (8) of Model (1), and our "fitted" solution (5) of Model (2), to the US Census Bureau data in Table 1. Figure 1 shows that Model (2), the "fitted" logistic model, captures better the 20002014 US Census Bureau data of the AAF population of 16 years and older than Model (1), the constant recruitment demographic equation used in the France study [14] and in the study of Lee and Tameru [13].

    Similarly, Figure 2 shows that, as in the female population, the "fitted" solution (9) of Model (1) over estimates the AAM US Census Bureau data while our "fitted" solution (5) of Model (2) captures it.


    3. A two-sex African American HPV model

    To study the HPV dynamics in male and female African American populations of 16 years and older, we assume that the total AAF population (respectively, total AAM population) of 16 years and older is governed by Model (2) with i=f (respectively, Model (2) with i=m). As in [14], we divide the population into four compartments: Sf= susceptible AAF population, Sm= susceptible AAM population, If= HPV infected AAF population, Im= HPV infected AAM population. It is known that females and males recover from HPV at about the same rate [11]. We now introduce the following mathematical model that uses our "fitted" logistic demographics for the total population.

    {dSfdt=rfNf(1NfKf)σfSfImNf+Nm+δIfμfSf,dSmdt=rmNm(1NmKm)σmSmIfNf+Nm+δImμmSm,dIfdt=σfSfImNf+Nm(δ+μf)If,dImdt=σmSmIfNf+Nm(δ+μm)Im, (10)

    where Nf=Sf+If>0, Nm=Sm+Im>0 and the model parameters with their values are listed in Table 3.

    Table 3. Table of model parameters.
    Parameter (per day)DescriptionReference
    μf=0.007266Death rate for AAF population[6]
    μm=0.008227Death rate for AAM population[6]
    δ=0.9Clearance rate[11]
    rf=0.028564978Intrinsic growth rate for AAF populationEstimated
    rm=0.028104926Intrinsic growth rate for AAM populationEstimated
    Kf=71,036,484Carrying capacity for AAF populationEstimated
    Km=140,659,009Carrying capacity for AAF populationEstimated
    σf=0.5Infection rate for AAF population[1]
    σm=0.4Infection rate for AAM population[1]
     | Show Table
    DownLoad: CSV

    Notice that since

    Nf(t)=Sf(t)+If(t)andNm(t)=Sm(t)+Im(t),

    adding the Sfequation to the Ifequation (respectively, adding the Smequation to the Imequation) gives the AAF demographic equation (2) with i=f (respectively, AAM demographic equation (2) with i=m).

    Consequently, in Model (10), the total AAF and AAM populations, governed by our "fitted" logistic equations (5) and (6) are bounded. We will study Model (10) with the parameter values listed in Table 3 and with the initial conditions listed in Table 4.

    Table 4. Initial conditions for HPV Model (10).
    Sf(0)=8,618,960
    Sm(0)=7,119,370
    If(0)=5,422,560
    Im(0)=5,005,440
     | Show Table
    DownLoad: CSV

    Notice that in Table 4, Sf(0)+If(0)=Nf(0) of Table 1 (respectively, Sm(0)+Im(0)=Nm(0) of Table 2), where t=0 is year 2001.


    3.1. Boundedness of Orbits

    In this section, we show that Model (10) is well-posed. In particular, we obtain that all orbits are nonnegative and there is no population explosion in Model (10).

    Theorem 3.1. All solutions of Model (10) are nonnegative and bounded.

    Proof. Consider the following nonnegative initial conditions Sf(0)=S0f0, Sm(0)=S0m0, If(0)=I0f0 and Im(0)=I0m0.

    If I0f=0, then from the Ifequation of Model (10), we obtain that dIfdt|t=0=σfS0fI0mN0f+N0m0. Hence, If(t)0 for all t0 [15].

    Similarly, if I0m=0, then from the Imequation of Model (10), we obtain that dImdt|t=0=σmS0mI0fN0f+N0m0. Hence, Im(t)0 for all t0 [15].

    If S0f=0, then from the Sfequation of Model (10), we obtain that dSfdt|t=0=rfI0f(1I0fKf)+δI0f. I0fKf. Hence, Sf(t)0 for all t0.

    Similarly, if S0m=0, then from the Smequationn of Model (10), we obtain that dSmdt|t=0=rmI0m(1I0mKm)+δI0m. I0mKm. Hence, Sm(t)0 for all t0.

    Recall that Nf(t) is bounded, Sf(t)+If(t)=Nf(t), Sf(t)0 and If(t)0. Hence, Sf and If are bounded. Similarly, Sm and Im are bounded.

    Let X=[SfSmIfIm] and F(X)=[rfNf(1NfKf)σfSfImNf+Nm+δIfμfSfrmNm(1NmKm)σmSmIfNf+Nm+δImμmSmσfSfImNf+Nm(δ+μf)IfσmSmIfNf+Nm(δ+μm)Im].

    Then Model (10) is equivalent to

    dXdt=F(X).

    In AAF population, Rdf>1 implies that for t>0, Nf(t)>0 and limtNf(t)=Kf when Nf(0)>0. Similarly, in AAM population, Rdm>1 implies that for t>0, Nm(t)>0 and limtNm(t)=Km when Nm(0)>0. Hence, Nf(t)+Nm(t)>0 in Model (10) and F is C1. Consequently, with our parameters from Table 3 and initial conditions from Table 4, Model (10) has a unique nonnegative solution for t0.


    4. Disease-free equilibria, stability and basic reproduction number (R0)

    Unlike in [14], it is possible for Model (10) to exhibit up to four disease-free equilibrium points (DFEs), where Rdf>1 and Rdm>1.

    Since Rdf>1 and Rdm>1, then Model (10) has four DFEs, P00=(0,0,0,0), Pf0=(Kf,0,0,0), Pm0=(0,Km,0,0), and a DFE with both susceptible AAF and AAM with no HPV infected individuals, Pfm=(Kf,Km,0,0).

    Since Rdf>1 and Rdm>1, P00 is unstable.

    To determine the stability of Pf0, we compute the Jacobian matrix at Pf0.

    J|Pf0=[μfrf02μfrf+δσf0rmμm02μmrm+δ00(δ+μf)σf000(δ+μm)].

    J|Pf0 has the following four eigenvalues: λ1=μfrf, λ2=rmμm, λ3=(σ+μf) and λ4=(σ+μm). All model parameters are positive so λ3<0 and λ4<0. Hence, the stability of Pf0 is determined by the sign of μfrf and rmμm. Rdm>1 implies rm>μm>0 and λ2>0. Hence, Pf0 is unstable.

    Similarly, to determine the stability of Pm0, we compute the Jacobian matrix at Pm0.

    J|Pm0=[rfμf0rf+δ00μmrmσm2μmrm+δ00(δ+μf)000σm(δ+μm)].

    J|Pm0 has the following five eigenvalues: λ1=rfμf, λ2=rmμm, λ3=(σ+μf) and λ4=(σ+μm). Hence, the stability of Pm0 is determined by the sign of rfμf and rmμm. Rdf>1 implies rf>μf>0 and λ1>0. Hence, Pm0 is unstable.

    To determine the stability of Pfm, we recall that in this case, for N0f>0 and N0m>0, limtNf(t)=Kf and limtNm(t)=Km. Consequently, Model (10) reduces to the following "limiting" system:

    {dIfdt=σf(KfIf)ImKf+Km(δ+μf)If,dImdt=σm(KmIm)IfKf+Km(δ+μm)Im,dSfdt=rfKf(1KfKf)σfSfImKf+Km+δIfμfSf,dSmdt=rmKm(1KmKm)σmSmIfKf+Km+δImμmSm. (11)

    Using the next generation matrix method [17] we obtain the following two matrices

    F=[σf(KfIf)ImKf+Kmσm(KmIm)IfKf+Km00]
    andV=[(δ+μf)If(δ+μm)ImrfKf(1KfKf)+σfSfImKf+KmδIf+μfSfrmKm(1KmKm)+σmSmIfKf+KmδIm+μmSm].

    Then, using the Jacobian matrices of F and V evaluated at the DFE of (11), Qfm=(0,0,Kf,Km), we obtain the following matrices,

    DF(Qfm)=[F000]andDV(Qfm)=[V0WU],

    where F=[0σfKfKf+KmσmKmKf+Km0],V=[δ+μf00δ+μm],W=[δσfKfKf+KmσmKmKf+Kmδ] andU=[μf00μm].

    Hence,

    FV1=1Kf+Km[0σfKfδ+μmσmKmδ+μf0].

    By the next generation matrix method [17], the reproduction number for Model (10), R0, is the spectral radius of the next generation matrix, ρ(FV1), and

    R0=ρ(FV1)=R0fR0m,where 
    R0f=σfKf(δ+μf)(Kf+Km)andR0m=σmKm(δ+μm)(Kf+Km).

    R0f (respectively, R0m) is directly proportional to the product of the infection rate of AAF population and the proportion of AAF population at the death-adjusted equilibrium point (respectively, the infection rate of AAM population and the proportion of AAM population at the death-adjusted equilibrium point) and inversely proportional to the sum of clearance rate and death rate for AAF population (respectively, the sum of clearance rate and death rate for AAM population).

    When rf>μf and rm>μm, then R0<1 implies the DFE, Pfm, is locally asymptotically stable, whereas R0>1 implies Pfm is unstable [17]. Using the parameter values in Table 3, it is easy to see that R0f=0.1915<1 and R0m=0.2874<1. Hence, R0=0.2346<1. That is, in our Model (10) the DFE Pfm, is locally asymptotically stable.

    It is interesting to note that in [13], Lee and Tameru, obtained R0=0.519798<1 with a one sex HPV model under a constant recruitment. Since their model over estimates the AA population, their R0 is bigger than 0.2346. However, both models predict that R0<1 and HPV infection cannot get started in a fully susceptible AA population [13].

    To find an effective mitigation strategy that seeks to reduce HPV infection in AA population within the shortest time possible, in the next section, we use sensitivity analysis to study the impact of each model parameter on R0.


    4.1. Sensitivity analysis of R0

    Sensitivity indices are used to measure the relative change in a state variable when a parameter changes. Typically, the normalized forward sensitivity index of a variable to a parameter is defined as the ratio of the relative change in the variable to the relative change in the parameter. When the variable is a differential function of the parameter, the sensitive index may be alternatively defined using partial derivatives [4,10,18,19].

    Definition 4.1([4,10,18,19]). The normalized forward sensitivity index of a variable, u, that depends differentiably on a parameter, q, is defined as:

    Υuq:=uq×qu.

    We use Definition 4.1 to derive the sensitivity indices of the basic reproduction number R0 and we evaluate them using parameter values of Table 3.

    Increasing (respectively, decreasing) the clearance rate, δ, by 1% will decrease (respectively, increase) the value of R0 by about 0.99%. Increasing (respectively, decreasing) the infection rate of the AAF population, σf, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%. Increasing (respectively, decreasing) the infection rate of the AAM population, σm, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%.


    4.2. African American male and female HPV model simulations

    To illustrate the impact of HPV on AAF and AAM populations of 16 years and older, we simulate Model (10) with the parameter values listed in Table 3 and the initial conditions listed in Table 4.

    Table 5. Normalized sensitivity indices and order of importance of R0 to the nine parameters in Table (3).
    ParameterSensitivity index of R0Order of Importance
    δ-0.99151
    σf0.50002
    σm0.50003
    Kf0.15264
    Km-0.15265
    rm-0.06316
    μm0.05867
    μf-0.05618
    rf0.05209
     | Show Table
    DownLoad: CSV
    Figure 3. Normalized sensitivity indices of R0 are evaluated at values of the parameters of Table (3). The most sensitive parameters for R0 are the clearance rate, δ, the infection rate of the AAF population, σf, and the infection rate of the AAM population, σm. While the least sensitive parameters are the intrinsic growth rate for AAF population, rf, the intrinsic growth rate for AAM population, rm, the death rate of AAF population, μf, and the death rate for AAM population, μm.

    Simulations of our HPV Model (10) are performed using Matlab software, and are illustrated in Figure 4. Figure 4 (a) shows that susceptible population of AAF of 16 years and older, Sf, increases over time. Similarly, Figure 4 (b) shows that susceptible population of AAM of 16 years and older, Sm, increases over time. In the total AA population, R0<1 and as expected, Figure 4 (c) shows that the AAF HPV infected population of 16 years and older, If, decreases monotonically with time and Figure 4 (d) shows that the AAM HPV infected population of 16 years and older, Im, decreases monotonically with time.

    Figure 4. African American male and female HPV model simulations.

    To protect against HPV infections, HPV vaccines are available for males and females. Gardasil and Cervarix are two HPV vaccines that have market approval in many countries. Next, we introduce an extension of Model (10) with vaccinated male and female classes. We will use the extended model to study the impact of vaccination on Figure 4.


    5. A two-sex African American HPV model with vaccination

    To introduce HPV vaccination in AAF and AAM populations of Model (10), we let pf (respectively, pm) denote the proportion of HPV vaccinated females (respectively, males). For both males and females, we let τ denote the success rate of the vaccine. In AA population, τ=90% [14].

    As in [14], we divide the AA population into eight compartments. Sf= non vaccinated susceptible AAF population, Svf= vaccinated susceptible AAF population, Sm= non vaccinated susceptible AAM population, Svm= vaccinated susceptible AAM population, If= non vaccinated HPV infected AAF population, Ivf= vaccinated HPV infected AAF population, Im= non vaccinated HPV infected AAM population, Ivm= vaccinated HPV infected AAM population. Then Model (10) with vaccination in both AAF and AAM of 16 years and older becomes the following model.

    {dSfdt=rf(1pf)Nf(1NfKf)σfSfIm+IvmNf+Nm+δIfμfSf,dSvfdt=pfrfNf(1NfKf)(1τ)σfSvfIm+IvmNf+Nm+δIvfμfSvf,dSmdt=rm(1pm)Nm(1NmKm)σmSmIf+IvfNf+Nm+δImμmSm,dSvmdt=pmrmNm(1NmKm)(1τ)σmSvmIf+IvfNf+Nm+δIvmμmSvm,dIfdt=σfSfIm+IvmNf+Nm(δ+μf)If,dIvfdt=(1τ)σfSvfIm+IvmNf+Nm(δ+μf)Ivf,dImdt=σmSmIf+IvfNf+Nm(δ+μm)Im,dIvmdt=(1τ)σmSvmIf+IvfNf+Nm(δ+μm)Ivm, (12)

    where Nf=Sf+Svf+If+Ivf>0 and Nm=Sm+Svm+Im+Ivm>0 and the model parameters and their values are listed in Table 3. We will study Model (12) with parameter values listed in Table 3 and with the initial conditions listed in Table 6, where pf=39%, pm=20.4% and τ=90%.

    Table 6. Initial conditions for HPV Model (12).
    Sf(0)=5,257,566
    Svf(0)=3,361,394
    Sm(0)=5,667,019
    Svm(0)=1,452,351
    If(0)=5,086,421
    Ivf(0)=336,139
    Im(0)=4,860,205
    Ivm(0)=145,235
     | Show Table
    DownLoad: CSV

    Note that in Table 6, Sf(0)+Svf(0)+If(0)+Ivf(0)=Nf(0) of Table 1 (respectively, Sm(0)+Svm(0)+Im(0)+Ivm(0)=Nm(0) of Table 2) where t=0 is year 2001.

    Proceeding exactly as in Theorem 3.1, we obtain the following result.

    Theorem 5.1. All solutions of Model (12) are nonnegative and bounded.


    6. Disease-free equilibrium, stability and basic reproduction number Rv0

    Notice that when all vaccinated classes are missing (Svf=Svm=Ivf=Ivm=0) and pf=pm=τ=0, then Model (12) reduces to Model (10).

    From Model (12), the demographic equations for the female and male total populations are respectively the following:

    {dNfdt=rfNf(1NfKf)μfNf,dNmdt=rmNm1NmKm)μmNm. (13)

    The equilibrium points of Model (13) are (Nf,Nm)=(0,0), (0,Km), (Kf,0) and (Kf,Km). Since Rdf>1 and Rdm>1, (Nf,Nm)=(0,0), (0,Km) and (Kf,0) are unstable and (Kf,Km) is asymptotically stable.

    As in Model (10), to state the "limiting" system of Model (12), we replace Nf by Kf and Nm by Km. Since Nf=Sf+Svf+If+Ivf=Kf and Nm=Sm+Svm+Im+Ivm=Km, the "limiting" system for Model (12) is the following system of equations.

    {dSfdt=rf(1pf)Kf(1KfKf)σfSfIm+IvmKf+Km+δIfμfSf,dSmdt=rm(1pm)Km(1KmKm)σmSmIf+IvfKf+Km+δImμmSm,dIfdt=σfSfIm+IvmKf+Km(δ+μf)If,dIvfdt=(1τ)σf(KfSfIfIvf)Im+IvmKf+Km(δ+μf)Ivf,dImdt=σmSmIf+IvfKf+Km(δ+μm)Im,dIvmdt=(1τ)σm(KmSmImIvm)If+IvfKf+Km(δ+μm)Ivm, (14)

    DFE of System (14) is (Sf,Sm,If,Ivf,Im,Ivm)=((1pf)Kf,(1pm)Km,0,0,0,0). Using the next generation matrix method [17] we obtain the following two matrices

    F=[σfSfIm+IvmKf+Km(1τ)σf(KfSfIfIvf)Im+IvmKf+KmσmSmIf+IvfKf+Km(1τ)σm(KmSmImIvm)If+IvfKf+Km00]
    andV=[(δ+μf)If(δ+μf)Ivf(δ+μm)Im(δ+μm)Ivmrf(1pf)Kf(1KfKf)+σfSfIm+IvmKf+KmδIf+μfSfrm(1pm)Km(1KmKm)+σmSmIf+IvfKf+KmδIm+μmSm].

    Let Q=(If,Ivf,Im,Ivm,Sf,Sm)=(0,0,0,0,(1pf)Kf,(1pm)Km). Evaluating the Jacobian matrices of F and V at Q, we obtain the following matrices

    DF(Q)=[F000]andDV(Q)=[V0WU],

    where

    F=[00σf(1pf)KfKf+Kmσf(1pf)KfKf+Km00(1τ)σfpfKfKf+Km(1τ)σfpfKfKf+Kmσm(1pm)KmKf+Kmσm(1pm)KmKf+Km00(1τ)σmpmKmKf+Km(1τ)σmpmKmKf+Km00],
    V=[δ+μf0000δ+μf0000δ+μm0000δ+μm],U=[μf00μm]
    andW=[δ0σf(1pf)KfKf+Kmσf(1pf)KfKf+Kmσm(1pm)KmKf+Kmσm(1pm)KmKf+Kmδ0].

    Hence,

    FV1=[00σf(1pf)Kf(δ+μm)(Kf+Km)σf(1pf)Kf(δ+μm)(Kf+Km)00(1τ)σfpfKf(δ+μm)(Kf+Km)(1τ)σfpfKf(δ+μm)(Kf+Km)σm(1pm)Km(δ+μf)(Kf+Km)σm(1pm)Km(δ+μf)(Kf+Km)00(1τ)σmpmKm(δ+μf)(Kf+Km)(1τ)σmpmKm(δ+μf)(Kf+Km)00].

    By the next generation matrix method [17], the reproduction number for Model (12), Rv0, is the spectral radius of the next generation matrix, ρ(FV1), and

    Rv0=ρ(FV1)=Rv0fRv0m,where 
    Rv0f=(1τpf)σfKf(δ+μf)(Kf+Km)andRv0m=(1τpm)σmKm(δ+μm)(Kf+Km).

    Rv0<1 implies DFE ((1pf)Kf,pfKf,(1pm)Km,pmKm,0,0,0,0), is locally asymptotically stable, whereas Rv0>1 implies DFE is unstable [17].

    Rv0f=(1τpf)R0fandRv0m=(1τpm)R0m.

    Since 0<1τpf<1 and 0<1τpm<1, Rv0<R0<1.

    Thus, adopting a HPV vaccination program decreases the basic reproduction number, R0, in AA population.

    Using the parameter values of Table 3, τ=90% [14], pf=39% [9] and pm=20.4% [9], we obtain that Rv0f=0.1243 and Rv0m=0.2346. Hence, Rv0=0.1708.

    In the next section, we use sensitivity analysis to illustrate the impact of model parameters on Rv0.


    6.1. Sensitivity analysis of Rv0

    We use Definition 4.1 to derive the sensitivity indices of the basic reproduction number Rv0 and we evaluate them using, τ=90%, pf=39%, pm=20.4% and parameter values of Table (3).

    From Table 7 and Figure 5, increasing (respectively, decreasing) the clearance rate, δ, by 1% will decrease (respectively, increase) the value of Rv0 by about 0.99%. Increasing (respectively, decreasing) the infection rate of the AAF population, σf, by 1% will increase (respectively, decrease) the value of Rv0 by about 0.5%. Increasing (respectively, decreasing) the infection rate of the AAM population, σm, by 1% will increase (respectively, decrease) the value of Rv0 by about 0.5%. Increasing (respectively, decreasing) the success rate of vaccination, τ, by 1% will decrease (respectively, increase) the value of Rv0 by about 0.38%. Increasing (respectively, decreasing) the proportion of HPV vaccinated females, pf, by 1% will decrease (respectively, increase) the value of Rv0 by about 0.27%.

    Table 7. Normalized sensitivity indices and order of importance of Rv0 to model parameters.
    ParameterSensitivity index of Rv0Order of Importance
    δ-0.99151
    σf0.50002
    σm0.50003
    τ-0.38294
    pf-0.27045
    Kf0.15266
    Km-0.15267
    pm-0.11248
    rm-0.06319
    μm0.058610
    μf-0.056111
    rf0.052012
     | Show Table
    DownLoad: CSV
    Figure 5. Normalized sensitivity indices of Rv0 are evaluated at values of model parameters. The most sensitive parameters for Rv0 are the clearance rate, δ, the infection rate of the AAF population, σf, and the infection rate of the AAM population, σm, the success rate of HPV vaccine, τ, and the proportion of HPV vaccinated females, pf. While the least sensitive parameters are the intrinsic growth rate for AAF population, rf, the intrinsic growth rate for AAM population, rm, the death rate of AAF population, μf, and the death rate for AAM population, μm.

    6.2. African American male and female HPV model with vaccination simulations

    To illustrate the impact of HPV on AAF and AAM populations of 16 years and older when a vaccination program is applied with τ=90%, pf=39% and pm=20.4%, we simulate Model (12) with the parameter values listed in Table 3 and the initial conditions given in Table 6.

    Simulations of our HPV Model (12) are performed using Matlab software, and are illustrated in Figure 6. Figures 6 (a-b) show that susceptible population of AAF of 16 years and older, Sf (respectively, vaccinated AAF of 16 years and older Svf), increases over time. Similarly, Figures 6 (c-d) show that susceptible population of AAM of 16 years and older, Sm (respectively, vaccinated AAM of 16 years and older Svm), increases over time. In the total population of African American population, Rv0<1 and as expected, Figures 6 (e-f) show that the AAF HPV infected population of 16 years and older, If (respectively, vaccinated AAF of 16 years and older Ivf), decreases monotonically with time and Figures 6 (g-h) show that the AAM HPV infected population of 16 years and older, Im (respectively, vaccinated AAM of 16 years and older Ivm), decreases monotonically with time.

    Figure 6. African American male and female HPV model with vaccination simulations.

    To study the impact of the presence of the vaccinated class on the results of Figure 4, we simulate Model (12) using initial conditions in Table 8. For these simulations of Model (12), we keep all the parameter values at their current values in Figure 4, where τ=90%, while we vary pf and pm.

    Table 8. Initial conditions for HPV model.
    pf=39%
    pm=20.4%
    pf=50%
    pm=50%
    pf=70%
    pm=70%
    Sf(0)5,257,5664,309,4802,585,688
    Svf(0)3,361,3944,309,4806,033,272
    Sm(0)5,667,0193,559,6852,135,811
    Svm(0)1,452,3513,559,6854,983,559
    If(0)5,086,4214,991,6124,819,233
    Ivf(0)336,139430,948603,327
    Im(0)4,860,2054,649,4724,507,084
    Ivm(0)145,235355,969498,356
     | Show Table
    DownLoad: CSV

    Note that in Figure 6 and Table 6, Sf(0)+Svf(0) (respectively, If(0)+Ivf(0)) is the same as Sf(0) (respectivrly, If(0)) of Table 4 and Figure 4. Similarly, in Figure 6 and Table 6, Sm(0)+Svm(0) (respectively, Im(0)+Ivm(0)) is the same as Sm(0) (respectivrly, Im(0)) of Table 4 and Figure 4.


    6.3. Impact of HPV vaccination

    In AA population, Rv0<R0<1. Consequently, with and without vaccination, the susceptible population increases while the infective population decreases over time. In Figures 7 and 8, we illustrate that in both AAF and AAM populations, the increase in the susceptible populations is higher when a vaccination policy is adopted than when the population is not being vaccinated.

    Figure 7. Susceptible AAF population.
    Figure 8. Susceptible AAM population.

    Furthermore, in both AAF and AAM populations, Figures 9 and 10 show that the number of infected populations is lower when the population is under a vaccination policy than when the population is not being vaccinated. Thus, HPV vaccines that provide partial immunity to both AAF and AAM populations of 16 years and older not only lower the number of HPV infectives but increase the number of susceptibles in both female and male populations.

    Figure 9. HPV Infected AAF population.
    Figure 10. HPV Infected AAM population.

    Furthermore, we obtained in Figures 7-10 that the increase (respectively, decrease) in the susceptible (respectively, HPV infective) populations is larger when a bigger proportion of the population is vaccinated.


    7. Conclusion

    We use a two-sex HPV model with "fitted" logistic demographics to study HPV disease dynamics in AAF and AAM populations of 16 years and older. In agreement with Lee and Tameru [13], we obtained that in AA population, R0<1 and HPV cannot get started in a fully susceptible AA population.

    Using sensitivity analysis on R0, we obtained the following results:

    ● Increasing (respectively, decreasing) the clearance rate, δ, by 1% will decrease (respectively, increase) the value of R0 by about 0.99%.

    ● Increasing (respectively, decreasing) the infection rate of the AAF population, σf, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%.

    ● Increasing (respectively, decreasing) the infection rate of the AAM population, σm, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%.

    In the second part of the paper, we extended our model to include vaccination classes in both male and female AA populations of 16 years and older. We obtained that in AA population when the vaccination program is implemented, Rv0<1. Using sensitivity analysis on Rv0, we obtained the following results:

    ● Increasing (respectively, decreasing) the clearance rate, δ, by 1% will decrease (respectively, increase) the value of R0 by about 0.99%.

    ● Increasing (respectively, decreasing) the infection rate of the AAF population, σf, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%.

    ● Increasing (respectively, decreasing) the infection rate of the AAM population, σm, by 1% will increase (respectively, decrease) the value of R0 by about 0.5%.

    ● Increasing (respectively, decreasing) the success rate of vaccination, τ, by 1% will decrease (respectively, increase) the value of R0 by about 0.38%.

    ● Increasing (respectively, decreasing) the proportion of HPV vaccinated females, pf, by 1% will decrease (respectively, increase) the value of R0 by about 0.27%.

    Furthermore, using the extended model with vaccination we obtained the following results:

    ● Adopting a vaccination policy lowers HPV infections in both AAF and AAM populations.

    ● Vaccinating a larger proportion of AAF and AAM populations leads to fewer cases of HPV infections in the vaccinated population.


    Acknowledgments

    This research was partially supported by National Science Foundation under grant DUE-1439758.




    [1] Chen M, Mao S, Liu Y (2014) Big data: A survey. Mobile networks and applications 19: 171-209. doi: 10.1007/s11036-013-0489-0
    [2] McAfee A, Brynjolfsson E, Davenport TH, et al. (2012) Big data: the management revolution. Harvard business review 90: 60-68.
    [3] Venkatraman R, Venkatraman S (2019) Big Data Infrastructure, Data Visualisation and Challenges. Proceedings of the 3rd International Conference on Big Data and Internet of Things, 13-17.
    [4] Labrinidis A, Jagadish HV (2012) Challenges and opportunities with big data. Proceedings of the VLDB Endowment 5: 2032-2033. doi: 10.14778/2367502.2367572
    [5] Venkatraman S, Venkatraman R (2019) Big data security challenges and strategies. AIMS MATHEMATICS 4: 860-879. doi: 10.3934/math.2019.3.860
    [6] Masi I, Wu Y, Hassner T, et al. (2018) Deep face recognition: A survey. 2018 31st SIBGRAPI conference on graphics, patterns and images (SIBGRAPI), 471-478.
    [7] Singh A, Bhadani R (2020) Mobile Deep Learning with TensorFlow Lite, ML Kit and Flutter. Packt Publishing.
    [8] Zhu Y, Jiang Y (2020) Optimization of face recognition algorithm based on deep learning multi feature fusion driven by big data. Image Vision Comput 104: 104023. doi: 10.1016/j.imavis.2020.104023
    [9] Reddy KS, Krishna VV, Kumar VV (2016) A Method for Facial Recognition Based On Local Features. International Journal of Mathematics and Computation 27: 98-109.
    [10] Qateef JS, Kazm AA (2016) Facial expression recognition via mapreduce assisted k-nearest neighbor algorithm. International Journal of Computer Science and Information Security 14: 170.
    [11] Sirovich L, Kirby M (1987) Low-dimensional procedure for the characterization of human faces. Josa a 4: 519-524. doi: 10.1364/JOSAA.4.000519
    [12] Turk MA, Pentland AP (1991) Face recognition using eigenfaces. Proceedings. 1991 IEEE computer society conference on computer vision and pattern recognition, 586-587. IEEE Computer Society.
    [13] Bruce V, Young A (1986) Understanding face recognition. British journal of psychology 77: 305-327. doi: 10.1111/j.2044-8295.1986.tb02199.x
    [14] Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition.
    [15] He X, Yan S, Hu Y, et al. (2005) Face recognition using Laplacianfaces. IEEE T Pattern Anal 27: 328-340. doi: 10.1109/TPAMI.2005.55
    [16] Deng J, Guo J, Xue N, et al. (2019) Arcface: Additive angular margin loss for deep face recognition. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4690-4699.
    [17] Zhou E, Cao Z, Yin Q (2015) Naive-deep face recognition: Touching the limit of LFW benchmark or not? arXiv preprint arXiv: 150104690.
    [18] Wang H, Wang Y, Zhou Z, et al. (2018) Cosface: Large margin cosine loss for deep face recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 5265-5274.
    [19] Wen Y, Zhang K, Li Z, et al. (2016) A discriminative feature learning approach for deep face recognition. European conference on computer vision, 499-515.
    [20] Deng J, Zhou Y, Zafeiriou S (2017) Marginal loss for deep face recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 60-68.
    [21] Ding H, Zhou SK, Chellappa R (2017) Facenet2expnet: Regularizing a deep face recognition net for expression recognition. 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), 118-126.
    [22] Wang F, Chen L, Li C, et al. (2018) The devil of face recognition is in the noise. Proceedings of the European Conference on Computer Vision (ECCV), 765-780.
    [23] Benhlima L (2018) Big data management for healthcare systems: architecture, requirements, and implementation. Advances in bioinformatics 2018.
    [24] Asaithambi SPR, Venkatraman R, Venkatraman S (2020) MOBDA: Microservice-Oriented Big Data Architecture for Smart City Transport Systems. Big Data and Cognitive Computing 4: 17. doi: 10.3390/bdcc4030017
    [25] Costa C, Santos MY (2016) BASIS: A big data architecture for smart cities. 2016 SAI Computing Conference (SAI), 1247-1256.
    [26] He X, Wang K, Huang H, et al. (2018) QoE-driven big data architecture for smart city. IEEE Commun Mag 56: 88-93. doi: 10.1109/MCOM.2018.1700231
    [27] Lopez D, Manogaran G (2016) Big data architecture for climate change and disease dynamics. The human element of big data: issues, analytics, and performance, 301-331.
    [28] Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Communications of the ACM 51: 107-113. doi: 10.1145/1327452.1327492
    [29] Peralta D, Del Río S, Ramírez-Gallego S, et al. (2015) Evolutionary feature selection for big data classification: A mapreduce approach. Math Probl Eng 2015.
    [30] Gao W, Zhao X, Gao Z, et al. (2019) 3D Face Reconstruction From Volumes of Videos Using a Mapreduce Framework. IEEE Access 7: 165559-165570. doi: 10.1109/ACCESS.2019.2938671
    [31] Mahmoud SM, Habeeb RS (2019) Analysis of Large Set of Images Using MapReduce Framework. International Journal of Modern Education and Computer Science 11: 47. doi: 10.5815/ijmecs.2019.12.05
    [32] Apache Spark™. A unified analytics engine for large-scale data processing.
    [33] Hazarika AV, Ram GJSR, Jain E (2017) Performance comparision of Hadoop and spark engine. 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC), 671-674.
    [34] Zaharia M, Chowdhury M, Das T, et al. (2012) Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. 9th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 12), 15-28.
    [35] Luengo J, García S, Herrera F (2012) On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl Inf Syst 32: 77-108. doi: 10.1007/s10115-011-0424-2
    [36] Batista GE, Monard MC (2003) An analysis of four missing data treatment methods for supervised learning. Appl Artif Intell 17: 519-533. doi: 10.1080/713827181
    [37] Wilson DL (1972) Asymptotic properties of nearest neighbor rules using edited data. IEEE T Syst Man Cy B, 408-421.
    [38] Sánchez JS, Barandela R, Marqués AI, et al. (2003) Analysis of new techniques to obtain quality training sets. Pattern Recogn Lett 24: 1015-1022. doi: 10.1016/S0167-8655(02)00225-8
    [39] Garcia S, Derrac J, Cano J, et al. (2012) Prototype selection for nearest neighbor classification: Taxonomy and empirical study. IEEE T Pattern Anal 34: 417-435. doi: 10.1109/TPAMI.2011.142
    [40] Triguero I, Derrac J, Garcia S, et al. (2011) A taxonomy and experimental study on prototype generation for nearest neighbor classification. IEEE T Syst Man Cy C 42: 86-100. doi: 10.1109/TSMCC.2010.2103939
    [41] García-Gil D, Luengo J, García S, et al. (2019) Enabling smart data: noise filtering in big data classification. Inform Sciences 479: 135-152. doi: 10.1016/j.ins.2018.12.002
    [42] Xue B, Zhang M, Browne WN, et al. (2015) A survey on evolutionary computation approaches to feature selection. IEEE T Evolut Comput 20: 606-626. doi: 10.1109/TEVC.2015.2504420
    [43] Navot A, Shpigelman L, Tishby N, et al. (2005) Nearest neighbor based feature selection for regression and its application to neural activity. Advances in neural information processing systems 18: 996-1002.
    [44] Ramírez-Gallego S, García S, Xiong N, et al. (2018) BELIEF: A distance-based redundancy-proof feature selection method for Big Data. arXiv preprint arXiv: 180405774.
    [45] Triguero I, Peralta D, Bacardit J, et al. (2015) MRPR: a MapReduce solution for prototype reduction in big data classification. Neurocomputing 150: 331-345. doi: 10.1016/j.neucom.2014.04.078
    [46] García-Gil D, Ramírez-Gallego S, García S, et al. (2018) On the Use of Random Discretization and Dimensionality Reduction in Ensembles for Big Data. International Conference on Hybrid Artificial Intelligence Systems, 15-26.
    [47] Triguero I, Galar M, Vluymans S, et al. (2015) Evolutionary undersampling for imbalanced big data classification. 2015 IEEE Congress on Evolutionary Computation (CEC), 715-722.
    [48] Triguero I, Galar M, Merino D, et al. (2016) Evolutionary undersampling for extremely imbalanced big data classification under apache spark. 2016 IEEE Congress on Evolutionary Computation (CEC), 640-647.
    [49] Ramírez-Gallego S, García S, Benítez JM, et al. (2018) A distributed evolutionary multivariate discretizer for big data processing on apache spark. Swarm Evol Comput 38: 240-250. doi: 10.1016/j.swevo.2017.08.005
    [50] Maillo J, Triguero I, Herrera F (2015) A mapreduce-based k-nearest neighbor approach for big data classification. 2015 IEEE Trustcom/BigDataSE/ISPA 2: 167-172. doi: 10.1109/Trustcom.2015.577
    [51] Maillo J, Ramírez S, Triguero I, et al. (2017) kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data. Knowl-Based Syst 117: 3-15. doi: 10.1016/j.knosys.2016.06.012
    [52] Deng Z, Zhu X, Cheng D, et al. (2016) Efficient kNN classification algorithm for big data. Neurocomputing 195: 143-148. doi: 10.1016/j.neucom.2015.08.112
    [53] Gallego A-J, Calvo-Zaragoza J, Valero-Mas JJ, et al. (2018) Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation. Pattern Recogn 74: 531-543. doi: 10.1016/j.patcog.2017.09.038
    [54] Wang F, Wang Q, Nie F, et al. (2018) Efficient tree classifiers for large scale datasets. Neurocomputing 284: 70-79. doi: 10.1016/j.neucom.2017.12.061
  • This article has been cited by:

    1. O.M. Ogunmiloro, Modelling the Human Papilloma Virus Transmission in a Bisexually Active Host Community, 2020, 13, 20710216, 80, 10.14529/mmp200207
    2. Mo’tassem Al-arydah, Two-Sex Logistic Model for Human Papillomavirus and Optimal Vaccine, 2021, 1793-5245, 2150011, 10.1142/S179352452150011X
    3. Xinwei Wang, Haijun Peng, Boyang Shi, Dianheng Jiang, Sheng Zhang, Biaosong Chen, Optimal vaccination strategy of a constrained time-varying SEIR epidemic model, 2019, 67, 10075704, 37, 10.1016/j.cnsns.2018.07.003
    4. Shasha Gao, Maia Martcheva, Hongyu Miao, Libin Rong, The impact of vaccination on human papillomavirus infection with disassortative geographical mixing: a two-patch modeling study, 2022, 84, 0303-6812, 10.1007/s00285-022-01745-z
    5. Mo'tassem Al‐arydah, Mathematical modeling for relation between parents' health education and vaccine uptake, 2022, 0170-4214, 10.1002/mma.8860
    6. Shasha Gao, Maia Martcheva, Hongyu Miao, Libin Rong, A two-sex model of human papillomavirus infection: Vaccination strategies and a case study, 2022, 536, 00225193, 111006, 10.1016/j.jtbi.2022.111006
    7. Ramziya Rifhat, Zhidong Teng, Lei Wang, Ting Zeng, Liping Zhang, Kai Wang, Dynamical behavior and density function of a stochastic model of HPV infection and cervical cancer with a case study for Xinjiang, China, 2023, 00160032, 10.1016/j.jfranklin.2023.06.008
    8. Praveen Kumar Rajan, Murugesan Kuppusamy, Abdullahi Yusuf, A fractional-order modeling of human papillomavirus transmission and cervical cancer, 2023, 2363-6203, 10.1007/s40808-023-01843-x
    9. Najat Ziyadi, A discrete-time nutrients-phytoplankton-oysters mathematical model of a bay ecosystem*, 2023, 17, 1751-3758, 10.1080/17513758.2023.2242720
    10. Tuan Anh Phan, Farhana Sarower, Jinqiao Duan, Jianjun Paul Tian, Stochastic dynamics of human papillomavirus delineates cervical cancer progression, 2023, 87, 0303-6812, 10.1007/s00285-023-02018-z
    11. Shayidan Abuduwaili, Lei Wang, Zhidong Teng, Abidan Ailawaer, Ramziya Rifhat, Estimating real-time reproduction number for HPV infection in Xinjiang, China, 2025, 2025, 2731-4235, 10.1186/s13662-025-03896-x
    12. M. Arunkumar, Praveen Kumar Rajan, K. Murugesan, Mathematical Analysis of an Optimal Control Problem for Mitigating HPV Transmission and Cervical Cancer Progression through Educational Campaigns, 2025, 2731-8095, 10.1007/s40995-025-01804-2
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(6231) PDF downloads(402) Cited by(8)

Article outline

Figures and Tables

Figures(15)  /  Tables(2)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog