Loading [MathJax]/jax/output/SVG/jax.js
Research article

Using multiple linear regression for biochemical oxygen demand prediction in water


  • Biochemical oxygen demand (BOD) is an important water quality measurement but takes five days or more to obtain. This may result in delays in taking corrective action in water treatment. Our goal was to develop a BOD predictive model that uses other water quality measurements that are quicker than BOD to obtain; namely pH, temperature, nitrogen, conductivity, dissolved oxygen, fecal coliform, and total coliform. Principal component analysis showed that the data spread was in the direction of the BOD eigenvector. The vectors for pH, temperature, and fecal coliform contributed the greatest to data variation, and dissolved oxygen negatively correlated to BOD. K-means clustering suggested three clusters, and t-distributed stochastic neighbor embedding showed that BOD had a strong influence on variation in the data. Pearson correlation coefficients indicated that the strongest positive correlations were between BOD, and fecal and total coliform, as well as nitrogen. The largest negative correlation was between dissolved oxygen, and BOD. Multiple linear regression (MLR) using fecal, and total coliform, dissolved oxygen, and nitrogen to predict BOD, and training/test data of 80%/20% and 90%/10% had performance indices of RMSE = 2.21 mg/L, r = 0.48 and accuracy of 50.1%, and RMSE = 2.18 mg/L, r = 0.54 and an accuracy of 55.5%, respectively. BOD prediction was better than previous MLR models. Increasing the percentage of the training set above 80% improved the model accuracy but did not significantly impact its prediction. Thus, MLR can be used successfully to estimate BOD in water using other water quality measurements that are quicker to obtain.

    Citation: Isaiah Kiprono Mutai, Kristof Van Laerhoven, Nancy Wangechi Karuri, Robert Kimutai Tewo. Using multiple linear regression for biochemical oxygen demand prediction in water[J]. Applied Computing and Intelligence, 2024, 4(2): 125-137. doi: 10.3934/aci.2024008

    Related Papers:

    [1] Min Zhang, Yi Wang, Yan Li . Reducibility and quasi-periodic solutions for a two dimensional beam equation with quasi-periodic in time potential. AIMS Mathematics, 2021, 6(1): 643-674. doi: 10.3934/math.2021039
    [2] Xinyu Guan, Nan Kang . Stability for Cauchy problem of first order linear PDEs on Tm with forced frequency possessing finite uniform Diophantine exponent. AIMS Mathematics, 2024, 9(7): 17795-17826. doi: 10.3934/math.2024866
    [3] Xin Liu, Yan Wang . Averaging principle on infinite intervals for stochastic ordinary differential equations with Lévy noise. AIMS Mathematics, 2021, 6(5): 5316-5350. doi: 10.3934/math.2021314
    [4] Zayd Hajjej, Menglan Liao . General stability for a system of coupled quasi-linear and linear wave equations and with memory term. AIMS Mathematics, 2023, 8(12): 30668-30682. doi: 10.3934/math.20231567
    [5] Zhensheng Yu, Peixin Li . An active set quasi-Newton method with projection step for monotone nonlinear equations. AIMS Mathematics, 2021, 6(4): 3606-3623. doi: 10.3934/math.2021215
    [6] Muhammad Aslam Noor, Khalida Inayat Noor, Bandar B. Mohsen . Some new classes of general quasi variational inequalities. AIMS Mathematics, 2021, 6(6): 6406-6421. doi: 10.3934/math.2021376
    [7] Noor Alam, Mohammad Safi Ullah, Jalil Manafian, Khaled H. Mahmoud, A. SA. Alsubaie, Hamdy M. Ahmed, Karim K. Ahmed, Soliman Al Khatib . Bifurcation analysis, chaotic behaviors, and explicit solutions for a fractional two-mode Nizhnik-Novikov-Veselov equation in mathematical physics. AIMS Mathematics, 2025, 10(3): 4558-4578. doi: 10.3934/math.2025211
    [8] Huanhuan Zhang, Jia Mu . Periodic problem for non-instantaneous impulsive partial differential equations. AIMS Mathematics, 2022, 7(3): 3345-3359. doi: 10.3934/math.2022186
    [9] Muhammad Afzal, Tariq Ismaeel, Azhar Iqbal Kashif Butt, Zahid Farooq, Riaz Ahmad, Ilyas Khan . On the reducibility of a class of almost-periodic linear Hamiltonian systems and its application in Schrödinger equation. AIMS Mathematics, 2023, 8(3): 7471-7489. doi: 10.3934/math.2023375
    [10] Saudia Jabeen, Bandar Bin-Mohsin, Muhammad Aslam Noor, Khalida Inayat Noor . Inertial projection methods for solving general quasi-variational inequalities. AIMS Mathematics, 2021, 6(2): 1075-1086. doi: 10.3934/math.2021064
  • Biochemical oxygen demand (BOD) is an important water quality measurement but takes five days or more to obtain. This may result in delays in taking corrective action in water treatment. Our goal was to develop a BOD predictive model that uses other water quality measurements that are quicker than BOD to obtain; namely pH, temperature, nitrogen, conductivity, dissolved oxygen, fecal coliform, and total coliform. Principal component analysis showed that the data spread was in the direction of the BOD eigenvector. The vectors for pH, temperature, and fecal coliform contributed the greatest to data variation, and dissolved oxygen negatively correlated to BOD. K-means clustering suggested three clusters, and t-distributed stochastic neighbor embedding showed that BOD had a strong influence on variation in the data. Pearson correlation coefficients indicated that the strongest positive correlations were between BOD, and fecal and total coliform, as well as nitrogen. The largest negative correlation was between dissolved oxygen, and BOD. Multiple linear regression (MLR) using fecal, and total coliform, dissolved oxygen, and nitrogen to predict BOD, and training/test data of 80%/20% and 90%/10% had performance indices of RMSE = 2.21 mg/L, r = 0.48 and accuracy of 50.1%, and RMSE = 2.18 mg/L, r = 0.54 and an accuracy of 55.5%, respectively. BOD prediction was better than previous MLR models. Increasing the percentage of the training set above 80% improved the model accuracy but did not significantly impact its prediction. Thus, MLR can be used successfully to estimate BOD in water using other water quality measurements that are quicker to obtain.



    Suppose the quasi-periodic linear system

    dxdt=A(ω1t,ω2t,,ωrt)x (1.1)

    in which tR, xCr, A(ω1t,ω2t,,ωrt) is quasi-periodic(q-p) time dependent r×r matrix and the basic frequencies ω1,,ωr are rational independent.

    The system (1.1) is said to be reducible, if there exists a so called quasi-periodic Lyapunov-Perron (L-P) transformation x=P(ω1t,,ωrt)y, so that the transformed system is a linear system with constant coefficients. We call the transformation x=P(ω1t,,ωrt)y is quasi-periodic L-P transformation, if P(t) is non singular and P, P1 and ˙P are quasi-periodic and are bounded in tR.

    Many researchers have discussed the reducibility problems for quasi-periodic linear systems. For r=1, i.e., the periodic case, the well known Floquet theorem states that there always exists a periodic change of variables x=P(ω1t)y so that the system ˙x=A(ω1t)x is reducible to a constant coefficient system ˙y=By,˙φ=ω, where B is a constant matrix. For r>1, i.e., quasi-periodic case, there is an example in [1] which shows that the system (1.1) is not always reducible. Earlier for q-p case, Coppel [2] proved that a linear differential equation with bounded coefficient matrix is pseudo-autonomous if and only if it is almost reducible and Johnson and Sell [3] showed that if (1.1) satisfies the full spectrum assumption, then there is a quasi-periodic linear change of variables x=P(ω1t,,ωrt)y that transforms (1.1) to a constant coefficient system ˙y=By, where B is a constant matrix. Their results failed for the pure imaginary spectrum [4].

    The first reducibility result by KAM method was given by Dinaburg and Sinai [5] who proved that the linear Schrödinger equation d2xdt2+q(ω1t,ω2t,,ωrt)x=λx is reducible for 'most' large enough λ in measure sense, where ω is fixed satisfying the Diophantine condition: |k,ω|>α1|k|τ, 0kZr, where α,τ are positive constants. See also Rüssmann [6] for a refined result.

    In 1992 Jorba and Simó [7] considered the following linear differential system

    dxdt=(A+λˉQ+λ2Q(ω1t,,ωrt))x,xRd, (1.2)

    in which A,ˉQ are constant diagonal matrices, and Q is an analytic q-p matrix having r basic frequencies, and with a small parameter λ. Using the KAM method, They proved that there exists a positive measure Cantor subset E(0,λ0), λ01 such that for any λE, the system (1.2) is reducible, provided that the following non-degeneracy conditions

    |αi(λ)αj(λ)|>δ>0,|ddλ(αi(λ)αj(λ))|>χ>0,1i<jd (1.3)

    where αi(λ), 1im, are the eigenvalues of ˉA=A+λˉQ. In 1999, Xu [8] improved the result for the weaker non-degeneracy conditions.

    Eliasson [9] considered the following linear Shrödinger equation

    d2xdt2+(λ+Q(ωt)x=0.

    For almost all λ(a,b), the full measure reducibility result is proved in a Lebesgue measure sense provided that Q is small. On the other hand, Krikorian [10] generalized the work for linear systems with coefficients in so(3). Then, Eliasson [11] discussed the full measure reducibility result for the following parameter dependent systems

    dxdt=(A(λ)+Q(ω1t,,ωrt,λ))x, (1.4)

    in which tR, xCd, a constant matrix A of dimension d×d, the parameter λ(a,b), and an analytic mapping Q:Tr×(a,b)gl(m,C), a Diophantine vector (ω1,,ωr) and for sufficiently small |Q|.

    He and You [12] proved the positive measure reducibility result for the following quasi-periodic skew-product systems: dxdt=(A(λ)+Q(φ,λ))x,˙φ=ω, close to constant. The result is proved by using KAM method, under weaker non-resonant conditions and non-degeneracy conditions.

    All the above mentioned results only discuss the reducibility of linear systems with the Diophantine condition

    |k,ω|α1|k|τ,0kZd, (1.5)

    where α>1 and τ>d1.

    In our problem, we are going to focussed on the Brjuno-Rüssmann condition (see [13,14]) which is slightly weaker than the Diophantine condition (1.5), if the frequencies ω=(ω1,,ωd) satisfy

    |k,ω|α1Δ(|k|),0kZd, (1.6)

    where α>1 and some Rüssmann approximation function Δ. These are continuous, increasing and unbounded functions Δ:[0,+)[1,+) such that Δ(0)=1 and

    +1lnΔ(t)t2dt<.

    Remark: If we have Δ(t)=tτ, then the Brjuno-Rüssmann conditions (1.6) becomes the Diophantine conditions (1.5).

    Furthermore, in this article we will generalize the result of He and You [12] for quasi-periodic linear systems using Brjuno-Rüssmann non-resonant condition which is slightly weaker than the Diophantine condition.

    This article is organized as: at the end of Section 1, the statement of the main result is given and in Section 2 proof of the main result is given.

    To state our main result, we now give some definitions and results.

    Definition 1.1. ([15,16])

    A vector ωRd is Brjuno if the following condition is satisfied

    n=12nln(1Ωn)<,Ωn=minνZd,0<|ν|2n|ω,ν|.

    The set of Brjuno vectors is of full Lebesgue measure. In particular, it contains all Diophantine vectors. Conversely, there are vectors that are Brjuno and are not Diophantine.

    This article aims to discuss the positive measure reducibility for q-p linear systems like (1.4) proposed by He and You [12]. The existed positive measure reducibility is discussed by using the Diophantine conditions, but we will discuss the positive measure reducibility using the Brjuno-Rüssmann condition.

    Equivalently, for the system (1.4), we suppose the following skew-product system

    dxdt=(A(λ)+Q(φ,λ))x,˙φ=ω, (1.7)

    where xCr, the parameter λΛ=(a,b), A is a r×r constant matrix, and Q(φ,λ) is an analytic mapping from Tr×(a,b) to gl(m,C), (ω1,ω2,,ωn) is a Brjuno vector and |Q| is sufficiently small.

    In our discussion, we will use the following equivalent formulation of reducibility:

    Consider

    dZdt=b(t)Z (1.8)

    be an analytic q-p linear system. For the skew-product system, it can be rewritten as:

    dZdt=B(φ)Z,˙φ=ω, (1.9)

    where b,B are in the Lie algebra g=g(m,C) and their solutions have values in the Lie group G=GL(m,C). For a complex neighbourhood Wh(Tr) if B is an analytic on Wh(Tr), then we represent BCωh(Tr,g). It is said that the analytic g-valued functions B1,B2Cωh(Tr,g) are conjugated, if a L-P transformation G-valued function PCωh(Tr,G), s.t. for the solutions Z1,Z2 corresponding to B1,B2, we have the following relation

    Z2=P(φ)Z1

    and the conjugate relation can be denoted by:

    B1B2(modP).

    It is easy to prove that B1B2(modP) can equivalently be written in the form of following equality

    B2=DωPP1+PB1P1, (1.10)

    where Dω=φ˙φ denotes the derivative in the direction of frequency vector ω. B1 is known to be reducible if it conjugates to a constant B2.

    In our article, we shall prove that, for any λΛ=(a,b), where λ is the parameter and Λ is a positive measure set, then a L-P transformation P(φ), such that the system A+Q(φ) is transformed into a constant system A.

    For the positive measure reducibility, we will use the non-degeneracy conditions (or the transverse conditions as in Eliasson and Krikorian terminology). Without loss of generality, let's suppose a block-diagonal matrix A(λ)=diag(A1(λ),,As(λ)) with

    dist(σ(Ai),σ(Aj))>ϱ>0,forij,

    where σ(Ai) represents the eigenvalues set for Ai. Let (see in [12] for definition)

    Jij(k,λ)=ik,ωIlilj+(IliAj(λ)ATi(λ)Ilj),
    J(k,λ)=ik,ωIn2+(InA(λ)AT(λ)In),
    dij(k,λ)=det[ik,ωIlilj+(IliAj(λ)ATi(λ)Ilj)].

    For the skew-product system (1.7), by using Lemmas 1.1 and 1.2 in [12], we set for k,ωR

    gij(k,λ)={αuσ(Ai),βvσ(Aj)(ik,ω(αu(λ)βv(λ)),ij;αu,αvσ(Ai),uv(ik,ω(αu(λ)αv(λ)),i=j.

    Remark: It is easily seen that if ACω(Λ,g) and the division of σ(A) is sufficiently separated, then all gij are analytic functions of λ, 1i,js.

    For the proof of this remark (see in [17]).

    Thus, we assume the following:

    Non-degeneracy Conditions: There exist an integer d0 and ς0 such that

    max0ld|lλlgij(k,λ)|>ς,for all1i,js (1.11)

    uniformly hold λΛ and k,ωR.

    Remark: The condition (1.11) will assure that the small denominator condition always holds for the "most" parameter λ. Here, we take only those k in which |k,ω|2δ0, because for the large enough |k,ω|, we always see that the matrix ik,ωIlilj(IliAj(λ)ATi(λ)Ilj) is automatically non-singular and the small denominator condition is satisfied. It can easily be seen that the condition (1.11) is weaker than the non-degeneracy condition (1.3) used by Jorba and Simó.

    Moreover, the property that gij(k,λ) depends analytically on λ can be preserved under small perturbations, and at each iterative step, we will preserve the non-degeneracy conditions.

    To state the main result, consider Q as an analytic g-valued function that can be defined on a complex neighbourhood of Tr×Λ:

    Wh(Tr×Λ)={(ϑ,λ)Cr×Λ|dist(ϑ,Tr)<h},

    where λΛ=(a,b). Defined the norm of Q as:

    ||Q||h=max0ldsup(ϑ,λ)Wh(Tr×Λ)|lQλl|

    similarly

    ||A||=max0ldsupλΛ|lA(λ)λl|

    where |||| denotes the matrix norm.

    Theorem 1.1. Consider the skew-product system (1.7) in which ω is a fixed Brjuno vector and it satisfies the Brjuno-Rüssmann condition (1.6) and A(λ) satisfies the non-degeneracy condition (1.11), and there exists K>0 such that ||A||K. Then there exist ε>0,h>0, such that if ||Q(,)||h=ε1<ε, the measure of the set of parameter λs for which the system (1.7) is non-reducible is no larger than CL(10ε1)c, with some positive constants C,c, and L denotes the length of the parameter interval Λ.

    Theorem 1.1 will be proven by KAM iteration. At each iterative step, we have a L-P transformation close to identity as

    P(φ)=I+Z(φ), (2.1)

    where Z(φ)Cωh(Tr,g), P(φ)Cωh(Tr,G) and by using the L-P transformation (2.1), the quasi-periodic system dxdt=(A+Q)x is changed into

    dxdt=(DωPP1+P(A+Q)P1)x.

    Since Z is very small and in the expansion form P1 can be written as:

    P1=IZ+Z2+O(||Z||3).

    So, we have

    DωPP1+P(A+Q)P1=DωZ(IZ+Z2+O(||Z||3))+(I+Z)(A+Q)(IZ+Z2+O(||Z||3))=A+DωZ+[Z,A]+QDωZZ+[Z,Q]+AZ2ZAZ+O(||Z||3). (2.2)

    In general, we have to find a small Z in which the transformed system is still of the form dxdt=(A++Q+)x, where A+ is block-diagonal as A and Q+ is much smaller than Q.

    To do this, we have to calculate Z solving the following linearized equation

    DωZ[A,Z]=Q (2.3)

    where [A,Z]=AZZA and to prove

    Q+=DωZZ+[Z,Q]+AZ2ZAZ+O(||Z||3)

    is more smaller.

    In this subsection, we will solve the linearized equation, for this we need the following:

    Definition: Let u=(u1,,um)Tm. Its norm is denoted by ||u|| and is defined as:

    ||u||=max1im|ui|.

    Definition: For a m×m matrix S=(sij), its operator norm is denoted by ||S|| and is equivalent to m×max|sij|.

    Notation: Let FCωh(Tr×Λ,g) and its Fourier series is F=kZrFkeik,φ, then the kth Fourier coefficients of F denoted by Fk, given by Fk=Treik,φF(φ)dφ.

    Remark 2.1. For FCωh(Tr×Λ,g), we have

    |Fk||F|he|k|h.

    Note. For kZd, we denote |k|=dn=1|kn|. Similarly, for a function f, its modulus is denoted by |f|.

    Throughout the discussion, to simplify notations, the letters c,C denote different positive constants.

    By substituting the Fourier series expansions of Z,Q into the Eq (2.3), and then by equating the corresponding Fourier coefficients on both sides, we obtain

    ik,ωZk(AZkZkA)=Qk. (2.4)

    suppose that the eigenvalues of the linear operator ik,ωId+[A,] in the left part are

    ik,ω(αiαj),1i,jn,αi,αjσ(A).

    The eigenvalues will be αiαj for k=0. As the considered matrix A=diag(A1,,As) is a block-diagonal with different blocks Ai,Aj and each block have different eigenvalues, .i.e. αuβv if αuAi,βvAj for ij, from conclusions as seen from other researchers [12,17,18,19,20], we see that the matrix IliAjATiIlj is non-singular if ij.

    In block-diagonal form, let Qk can be written as (Qkij), where Qkij is a matrix of order li×lj, 1i,js and li,lj are the orders of matrices Ai,Aj respectively.

    Now, for k=0, we solve the equation (2.4). Suppose

    Qd0=(Q011,,Q0ss)

    and

    Q0=Q0Qd0.

    For k=0, the equation (2.4) can be written as

    AZ0Z0A=Q0 (2.5)

    Equation (2.5) can not be solved completely because the eigenvalues involved the multiplicity. However, the following equation

    AZ0Z0A=Q0

    has a solution Z0=(Z0ij) with Z0ii=0 and

    AiZ0ijZ0ijAj=Q0ij,forij

    has the unique solution Z0ij.

    Moreover, we have the estimate [12]

    ||J1ij(0,λ)||maxij||[IljAi(λ)ATj(λ)Ili]1||cnKliljϱliljC(ϱ,n)Klilj, (2.6)

    and

    max0lr||lλlJ1ij(0,λ)||=max1lr||lλl(adJijdetJij)||C(ϱ,n,r)K(lilj)2

    as dist(σ(Ai(λ)),σ(Aj(λ)))>ϱ>0, for ij. Moreover, we get

    max0lr||lλlZ0(λ)||Cmax0lr||lλl(J1ij(0,λ)||||lλlQ0(λ)||C(ϱ,n,r)Kn4max0lr||lλlQ0(λ)||. (2.7)

    Now, we solve the Eq (2.4) for k0. From Lemma 3.2 as seen in [12], the solution of (2.4) is equivalent to the solution of the following vector equation

    J(k,λ)Zk(λ)=Qk(λ) (2.8)

    By using corollaries [12], Eq (2.8) is solvable the matrix J(k,λ) is invertible. Suppose P=I+Zk is a L-P transformation. Then by using the L-P transformation, the new system becomes

    dxdt=(A++Q+)x

    where

    A+=A+Qd0Q+=DωZZ1+[Z,Q]+AZ2ZAZ+O(||Z||3) (2.9)

    Since A and Qd0 are block-diagonal matrices, therefore A+ is also a block-diagonal. Next, we will show that in a smaller domain Q+ is much smaller and the non-degeneracy condition is satisfied by A+.

    Estimation of Q+.

    First of all, we estimate Zk. Actually, to control the solution of Zk, we need the following small denominator condition, .i.e. if there exist N>0 such that i,j

    |gij(k,λ)|N1Δ(|k|),1i,js. (2.10)

    where Δ is an approximation function as defined above.

    In order to estimate Zk, we need to estimate the operator J1ij(k,λ) for k0.

    Lemma 2.1. For k0 and the small denominator conditions (2.10) are satisfied by all parameters λ, then we have

    ||J1ij(k,λ)||cKliljN(Δ(|k|))lilj,ij, (2.11)
    ||J1(k,λ)||cKn2αnNn2(Δ(|k|))n2, (2.12)
    max0lr||lλlJ1(k,λ)||cKn4α2rnN2rn2(Δ(|k|))2rn2. (2.13)

    where c denotes constant.

    Proof. Since Jij is a non-singular matrix, so its inverse is defined as J1ij=adJij/detJij. By the small denominator conditions (2.10), we have

    |J(k,λ)|=|[ik,ωIn2+(InA(λ)AT(λ)In)]|(N1)n2(α1Δ(|k|))n

    and

    |Jij(k,λ)|=|[ik,ωIlilj+(IliAj(λ)ATi(λ)Ilj)]|(N1)lilj(α1Δ(|k|))li

    using the definition of the norm ||Jij|| and the small denominator condition (2.10), the estimate (2.11)can be found easily. Also as detJ=1i,jsdetJij, similarly we can calculate the estimations (2.12) and (2.13).

    For k0, from Eq (2.8), we have

    Zk(λ)=J1(k,λ)Qk(λ) (2.14)

    as Zk,Qk are the transpose of Zk and Qk respectively, therefore it is easy to prove ||Zk||=||Zk||,||Qk||=||Qk|| (see in [12] for the proof).

    In our article, we represent F(λ) a λ-dependent matrix as:

    |F(λ)|=max0lr||lF(λ)λl||.

    Since QCωh(Tr×Λ,g), then by the Remark 2.1, we have

    |Qk||Q|he|k|h.

    As a result, for k0 and for any 0<ˉh<h, we have

    |Zk(λ)||J1(k,λ)||Qk(λ)|CKn4α2rnN2rn2(Δ(|k|))2rn2|Q|he|k|h

    or

    |Zk(λ)|CKn4α2rnN2rn2(Δ(|k|))2rn2|Q|he|k|(hˉh)e|k|ˉh. (2.15)

    In particular, take an approximation function Δ(t)=etδ,δ<1, which satisfy the Brjuno-Rüssmann condition (1.6), since the function etδ2rn2et(hˉh) has the maximal value at t=(2rn2δhˉh)1δ1, one has

    |Zk(λ)|CKn4α2rnN2rn2|Q|he[2rn2(2rn2δhˉh)δδ1(2rn2δhˉh)1δ1(hˉh)]e|k|ˉhC(n,r,δ,α)Kn4N2rn2[|Q|h(hˉh)δ2δ1δ1|Q|h(hˉh)δδ1]e|k|ˉh. (2.16)

    Consider

    Z(t,λ)=kZrZk(λ)eik,t

    choose h:0<h<ˉh s.t. if ˉhh=hh<1. So, using the Lemma 4 in [7], we obtain

    |Z|hkZr|Zk|e|k|hCKn4|Q0|+CKn4N2rn2[|Q|h(hˉh)δ2δ1δ1|Q|h(hˉh)δδ1]kZr{0}e(ˉhh)|k|CKn4N2rn2[|Q|h(hˉh)δ2δ1δ1|Q|h(hˉh)δδ1](2ˉhh)me(ˉhh)m2C(n,r,δ,α,m)Kn4N2rn2[1(hh)δ2δ1δ1+m1(hh)δδ1+m]|Q|h. (2.17)

    Let s=δ2δ1δ1+m, and s=δδ1+m, we get

    |Z|hCKn4N2rn2[1(hh)s1(hh)s]|Q|h. (2.18)

    similarly, we can find

    |DωZ|hCKn4N2rn2[1(hh)s+11(hh)s+1]|Q|h
    |DωZZ|hCKn4N2rn2[1(hh)2s+11(hh)2s+1]|Q|2h
    |AZ2|h=|ZAZ|hCKn4N2rn2[1(hh)2s1(hh)2s]|Q|2h
    |[Z,Q]|h2|Z|h|Q|hCKn4N2rn2[1(hh)s1(hh)s]|Q|2h

    Hence, from Eq (2.9), we get

    |Q+|hCK2n4+1N2r+1n2[1(hh)2s+11(hh)2s+1]|Q|2h. (2.19)

    Verification of the non-degeneracy conditions for A+.

    Since

    A+=A+Qd0=diag(A1+Q011,,As+Q0ss).

    Let

    D+ij(k,λ)=det[ik,ωIlilj+(Ili(Aj(λ)+Q0jj(λ))(ATi(λ)+QT0ii(λ))Ij)].

    The new determinant D+ij is analytic with respect to λ as well.

    The above determinant can be rewritten as

    D+ij(k,λ)=Dij(k,λ)+Yij(k,λ).

    where Dij(k,λ)=det[ik,ωIlilj+(IliAj(λ)ATi(λ))Ij)] and Yij(k,λ) is a summary of 2lilj1 determinants denoted by yt(k,λ)(1t2lilj1). Furthermore, there exist at least one column in each determinant yt such that the entries in this column are either 0 or of the form cd, where c and d are entries of Q0jj and Q0ii respectively.

    As |Qd0|h|Q|h<ε, we get

    |lλlD+ij(k,λ)|C|A|ε,for1lr.

    similarly,

    |lλl(g+ij(k,λ)gij(k,λ))|C|A|ε,for1lr. (2.20)

    So, we have

    |lλlg+ij(k,λ)|ςC|A|εςCKε=ς,for1lr. (2.21)

    The proof is obvious. Note that, here we only need to choose such ks so that |(k,λ)| is not large enough, .i.e., |(k,λ)|CK, where |A|K, because for large enough |(k,λ)|, the matrix J(k,λ) becomes automatically non-singular. So, when |(k,λ)| has large values, then J+(k,λ) becomes naturally non-singular and no need to preserve non-degenerate property.

    Alternatively, we know from the perturbation theory of matrices that the continuous change of eigenvalues depends on the entries, and by Ostrowski theorem (see [21]), the distance between eigenvalues of any two blocks can be estimated as

    minijdist(σ(A+i),σ(A+j))=ϱ+>ϱcε1n.

    Now, we summarize the above discussions in the following conclusion.

    Conclusion 1.

    Consider Λ subset of (a, b) be some parameter segment, a one parameter family of constant elements ACω(Λ,g), and QCωh(Tr×Λ,g) be the perturbation. Suppose that there exist K,ε,N>0 s.t.

    |A|K,|Q|h<ε,

    for all λΛ, the non-degeneracy conditions (1.11) and the small denominator conditions (2.10) hold.

    Then, h>0 and a map ZCωh(Tr×Λ,g), and

    A+Cω(Λ,g)
    Q+Cωh(Tr×Λ,g),

    such that

    1) A+=A+Qd0,A++Q+A+Q

    2) We have the estimation (2.19), .i.e. |Q+|hCK2n4+1N2r+1n2[1(hh)2s+11(hh)2s+1]|Q|2h.

    3) We have preserved the non-degeneracy conditions.i.e., max0lr|lλlg+ij(k,λ)|ς.

    4) ϱ+>ϱcε1n,K+<K+ε.

    In this subsection, we will prove that the perturbation Q goes to zero very quickly provided that the small divisor conditions hold.

    First of all, consider the following two iterative sequences:

    hm=(12+12m)h1, (2.22)
    Nm=((65)m+1ηhm1hm)γ=(h1)γ2mγ((65)m+1η)γ (2.23)

    where γr is a constant, and η will be considered as in the following lemma

    Lemma 2.2. There exist positive constants η<1,b, s.t., if ε1 is sufficiently small, then m1

    εmηbe(65)m,
    Km2m1K1.

    Proof. Suppose that if we do this up to mth step, we have

    |Qm|hmεmηbe(65)m

    and

    KmKm1+εm12m1K1.

    By induction, we need to prove that

    |Qm+1|hm+1ηbe(65)m+1 (2.24)

    and

    Km+12mK1. (2.25)

    Indeed Eq (2.25) is satisfied as

    Km+1Km+εmKm+ηbe(65)mKm+12Km22m1K1=2mK1.

    And from Eq (2.19), we have

    εm+1CK2n4+1mN2r+1n2m[1(hmhm+1)2s+11(hmhm+1)2s+1]ε2m.

    To prove Eq (2.24), we need

    CK2n4+1mN2r+1n2m[1(hmhm+1)2s+11(hmhm+1)2s+1]η2be(65)2mηbe(65)m+1.

    Then by using Eqs (2.22) and (2.25), we have

    CK2n4+11h(2s+1)12m(2n4+1)+(m+1)(2s+1)N2r+1n2mη2be(4/5)(65)m1. (2.26)

    Let Rm(η)=N2r+1n2mηb1, if we choose

    b>2r+1n2γ+1, (2.27)

    then by Eq (2.23) we see that for smaller value of η, the value of Rm also goes smaller. Now, firstly we set η=η0<1. As the sequence

    2m(2n4+1)+(m+1)(2s+1)+mrRm(η0)e(4/5)(65)m,

    is bounded from above, let's denote its maximum by ˉβ. In order to satisfy Eq (2.26), it is enough to choose η s.t.

    CK2n4+11h(2s+1)1ˉβη1.

    Thus, define

    ηmin{CK(2n4+1)1h2s+11ˉβ1,η0},

    and so we obtained the Eq (2.26). If we choose η=(10ε1)1/b, then it is enough to take

    ε1min{CKb(2n4+1)1hb(2s+1)110βb,ηbe65}. (2.28)

    Hence, the proof of lemma is finished.

    From Eq (2.18), it can be seen that the sequence |Zm|hm converges to 0 with super-exponential velocity, then by the transformation Pm=I+Zm, we have PmI, and so the composition of transformations PmPm1P1 will also be convergent. On the other hand, from conclusion 1, we have

    ςmςm1CKmεm,

    so

    ςmςC1im1Kiεiς2, (2.29)

    for small enough ε1. Thus, the preservation of the non-degeneracy conditions is proved. By the way, for small enough ε1, we also have the estimate

    ϱmϱC1im1ε1niϱ2. (2.30)

    In this subsection, we will show that the set of parameters satisfying the small denominator conditions is of the large Lebesgue measure. In the end, we estimate the measure of the removed parameter set. At the mth step, for i,j,1i,js, we denote the removed set as:

    Rmkij={λ:|gmij(k,λ)|N1mΔ(|k|)}

    and consider

    Rmk=1i,jsRmkij,
    Rm=0kZrRmk.

    To calculate the estimate for the measure of Rmkij, the following lemma is needed:

    Lemma 2.3. Consider g(x) is a CM function on the closure ˉI, where IR1 is an interval of length L. Let Ih={x:|g(x)|h,h>0}. If for some constant r>0,|g(M)(x)|r for xI, then |Ih|cLh1/M, where |Ih| denotes the Lebesgue measure of Ih and constant c=2(2+3++M+r1).

    For the proof, see [22].

    Then, let L denotes the length of the parameter interval Λ, and using above Lemma 2.3, we obtain

    mes(Rmkij)cL(N1mΔ(|k|))1/r

    where c=2(2+3++r+2/ς), as gmij(k,λ)Cm(Λ) and using the non-degeneracy conditions and Eq (2.30). Thus,

    mes(Rm)Cn2LN1rm0kZr(1Δ(|k|))1/r.

    For Δ(|k|)=e|k|δ,δ<1, we have

    mes(Rm)Cn2LN1rm0kZre|k|δ/rC(n,r,δ,ς)LN1rm.

    By Eq (2.23), Nm>2mγηγ, we have

    N1rmηγr12mγr.

    Therefore, for η=(10ε1)1b and γr, one has

    mes(m=1Rm)CLηγrm=12mγrCLηγrC(n,r,δ,ς,γ,ϱ)L(10ε1)c,where,c=γbr.

    Hence, the proof of the main result is completed.

    In this article, we discussed the positive measure reducibility for quasi-periodic linear systems and proved that the system (1.7) is reduced to a constant coefficient system. The result was proved for a Brjuno vector ω and small parameter λ by using the KAM method, Brjuno-Rüssmann condition and non-degeneracy condition.

    The authors extend their appreciation to the Yibin University, Yibin, China.

    The authors declare no conflicts of interest in this paper.



    [1] T. Ahmad, K. Ahmad, M. Alam, Sustainable management of water treatment sludge through 3'R' concept, J. Clean. Prod., 124 (2016), 1–13. http://dx.doi.org/10.1016/j.jclepro.2016.02.073 doi: 10.1016/j.jclepro.2016.02.073
    [2] R. O. Carey, K. W. Migliaccio, Contribution of wastewater treatment plant effluents to nutrient dynamics in aquatic systems: a review, Environ. Manage., 44 (2009), 205–217. http://dx.doi.org/10.1007/s00267-009-9309-5 doi: 10.1007/s00267-009-9309-5
    [3] G. Crini, E. Lichtfouse, Advantages and disadvantages of techniques used for wastewater treatment, Environ. Chem. Lett., 17 (2019), 145–155. http://dx.doi.org/10.1007/s10311-018-0785-9 doi: 10.1007/s10311-018-0785-9
    [4] B. E. Igere, A. I. Okoh, U. U. Nwodo, Wastewater treatment plants and release: the vase of odin for emerging bacterial contaminants, resistance and determinant of environmental wellness, Emerging Contaminants, 6 (2020), 212–224. http://dx.doi.org/10.1016/j.emcon.2020.05.003 doi: 10.1016/j.emcon.2020.05.003
    [5] C. Holeton, P. A. Chambers, L. Grace, Wastewater release and its impacts on canadian waters, Can. J. Fish. Aquat. Sci., 68 (2011), 1836–1859. http://dx.doi.org/10.1139/f2011-096 doi: 10.1139/f2011-096
    [6] R. Jha, C. Ojha, K. Bhatia, Development of refined bod and do models for highly polluted kali river in india, J. Environ. Eng., 133 (2007), 839–852. http://dx.doi.org/10.1061/(ASCE)0733-9372(2007)133:8(839) doi: 10.1061/(ASCE)0733-9372(2007)133:8(839)
    [7] P. Yu, J. Cao, V. Jegatheesan, X. Du, A real-time bod estimation method in wastewater treatment process based on an optimized extreme learning machine, Appl. Sci., 9 (2019), 523. http://dx.doi.org/10.3390/app9030523 doi: 10.3390/app9030523
    [8] K. S. Ooi, Z. Y. Chen, P. E. Poh, J. Cui, Bod5 prediction using machine learning methods, Water Supply, 22 (2022), 1168–1183. http://dx.doi.org/10.2166/ws.2021.202 doi: 10.2166/ws.2021.202
    [9] M. I. Jordan, T. M. Mitchell, Machine learning: trends, perspectives, and prospects, Science, 349 (2015), 255–260. http://dx.doi.org/10.1126/science.aaa8415 doi: 10.1126/science.aaa8415
    [10] K. B. Newhart, R. W. Holloway, A. S. Hering, T. Y. Cath, Data-driven performance analyses of wastewater treatment plants: a review, Water Research, 157 (2019), 498–513. http://dx.doi.org/10.1016/j.watres.2019.03.030 doi: 10.1016/j.watres.2019.03.030
    [11] D. Wang, S. Thunéll, U. Lindberg, L. Jiang, J. Trygg, M. Tysklind, et al., A machine learning framework to improve effluent quality control in wastewater treatment plants, Sci. Total Environ., 784 (2021), 147138. http://dx.doi.org/10.1016/j.scitotenv.2021.147138 doi: 10.1016/j.scitotenv.2021.147138
    [12] A. E. Bilali, A. Taleb, Prediction of irrigation water quality parameters using machine learning models in a semi-arid environment, Journal of the Saudi Society of Agricultural Sciences, 19 (2020), 439–451. http://dx.doi.org/10.1016/j.jssas.2020.08.001 doi: 10.1016/j.jssas.2020.08.001
    [13] O. P. Okeke, I. I. Aminu, A. Rotimi, B. Najashi, M. Jibril, A. S. Ibrahim, et al., Performance analysis and control of wastewater treatment plant using adaptive neuro-fuzzy inference system (ANFIS) and multi-linear regression (MLR) techniques, GSC Advanced Engineering and Technology, 4 (2022), 001–016. http://dx.doi.org/10.30574/gscaet.2022.4.2.0033 doi: 10.30574/gscaet.2022.4.2.0033
    [14] V. Nourani, G. Elkiran, S. Abba, Wastewater treatment plant performance analysis using artificial intelligence–-an ensemble approach, Water Sci. Technol., 78 (2018), 2064–2076. http://dx.doi.org/10.2166/wst.2018.477 doi: 10.2166/wst.2018.477
    [15] Y. El Hammoudani, F. Dimane, Assessing behavior and fate of micropollutants during wastewater treatment: statistical analysis, Environ. Eng. Res., 26 (2021), 200359. http://dx.doi.org/10.4491/eer.2020.359 doi: 10.4491/eer.2020.359
    [16] S. Rahmat, W. A. H. Altowayti, N. Othman, S. M. Asharuddin, F. Saeed, S. Basurra, et al., Prediction of wastewater treatment plant performance using multivariate statistical analysis: a case study of a regional sewage treatment plant in melaka, malaysia, Water, 14 (2022), 3297. http://dx.doi.org/10.3390/w14203297 doi: 10.3390/w14203297
    [17] H. Z. Abyaneh, Evaluation of multivariate linear regression and artificial neural networks in prediction of water quality parameters, J. Environ. Health Sci. Engineer., 12 (2014), 40. http://dx.doi.org/10.1186/2052-336X-12-40 doi: 10.1186/2052-336X-12-40
    [18] K. S. Kumar, P. S. Kumar, M. J. R. Babu, Performance evaluation of waste water treatment plant, International Journal of Engineering Science and Technology, 2 (2010), 7785–7796.
    [19] U. Agrawal, Water quality data, Kaggle, 2020. Available from: https://www.kaggle.com/datasets/utcarshagrawal/water-quality-data.
    [20] D. Kobak, P. Berens, The art of using t-sne for single-cell transcriptomics, Nat. Commun., 10 (2019), 5416. http://dx.doi.org/10.1038/s41467-019-13056-x doi: 10.1038/s41467-019-13056-x
    [21] F. Pedregosa, Scikit-learn: machine learning in python fabian, J. Mach. Learn. Res., 12 (2011), 2825.
    [22] A. Zollanvari, Supervised learning in practice: the first application using scikit-learn, In: Machine learning with Python: theory and implementation, Cham: Springer, 2023,111–131. http://dx.doi.org/10.1007/978-3-031-33342-2_4
    [23] M. A. Razi, K. Athappilly, A comparative predictive analysis of neural networks (NNs), nonlinear regression and classification and regression tree (CART) models, Expert Syst. Appl., 29 (2005), 65–74. http://dx.doi.org/10.1016/j.eswa.2005.01.006 doi: 10.1016/j.eswa.2005.01.006
    [24] D. Chicco, M. J. Warrens, G. Jurman, The coefficient of determination r-squared is more informative than smape, mae, mape, mse and rmse in regression analysis evaluation, PeerJ Comput. Sci., 7 (2021), e623. http://dx.doi.org/10.7717/peerj-cs.623 doi: 10.7717/peerj-cs.623
    [25] A. Rechkemmer, M. Yin, When confidence meets accuracy: exploring the effects of multiple performance indicators on trust in machine learning models, Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, 2022,535. http://dx.doi.org/10.1145/3491102.3501967 doi: 10.1145/3491102.3501967
    [26] G. I. Webb, M. J. Pazzani, D. Billsus, Machine learning for user modeling, User Model. User-Adap., 11 (2001), 19–29. http://dx.doi.org/10.1023/A:1011117102175 doi: 10.1023/A:1011117102175
    [27] J. L. Lerios, M. V. Villarica, Pattern extraction of water quality prediction using machine learning algorithms of water reservoir, Int. J. Mech. Eng. Rob. Res., 8 (2019), 992–997. http://dx.doi.org/10.18178/ijmerr.8.6.992-997 doi: 10.18178/ijmerr.8.6.992-997
    [28] P. M. Kroonenberg, Applied multiway data analysis, Hoboken: John Wiley & Sons, 2008.
    [29] P. Fränti, R. Mariescu-Istodor, A. Akram, M. Satokangas, E. Reissell, Can we optimize locations of hospitals by minimizing the number of patients at risk? BMC Health Serv. Res., 23 (2023), 415. http://dx.doi.org/10.1186/s12913-023-09375-x doi: 10.1186/s12913-023-09375-x
    [30] T. M. Kodinariya, P. R. Makwana, Review on determining number of cluster in k-means clustering, International Journal of Advance Research in Computer Science and Management Studies, 1 (2013), 90–95.
    [31] E. Dogan, B. Sengorur, R. Koklu, Modeling biological oxygen demand of the melen river in turkey using an artificial neural network technique, J. Environ. Manage., 90 (2009), 1229–1235. http://dx.doi.org/10.1016/j.jenvman.2008.06.004 doi: 10.1016/j.jenvman.2008.06.004
    [32] P. Schober, C. Boer, L. A. Schwarte, Correlation coefficients: appropriate use and interpretation, Anesth. Analg., 126 (2018), 1763–1768. http://dx.doi.org/10.1213/ANE.0000000000002864 doi: 10.1213/ANE.0000000000002864
    [33] W. Cui, Z. Sun, H. Ma, S. Wu, The correlation analysis of atmospheric model accuracy based on the pearson correlation criterion, IOP Conf. Ser.: Mater. Sci. Eng., 780 (2020), 032045. http://dx.doi.org/10.1088/1757-899X/780/3/032045 doi: 10.1088/1757-899X/780/3/032045
    [34] G. K. Uyanık, N. Güler, A study on multiple linear regression analysis, Procedia-Social and Behavioral Sciences, 106 (2013), 234–240. http://dx.doi.org/10.1016/j.sbspro.2013.12.027 doi: 10.1016/j.sbspro.2013.12.027
    [35] A. Rácz, D. Bajusz, K. Héberger, Effect of dataset size and train/test split ratios in qsar/qspr multiclass classification, Molecules, 26 (2021), 1111. http://dx.doi.org/10.3390/molecules26041111 doi: 10.3390/molecules26041111
    [36] D. Güçlü, Ş. Dursun, Artificial neural network modelling of a large-scale wastewater treatment plant operation, Bioprocess Biosyst. Eng., 33 (2010), 1051–1058. http://dx.doi.org/10.1007/s00449-010-0430-x doi: 10.1007/s00449-010-0430-x
    [37] M. M. Hamed, M. G. Khalafallah, E. A. Hassanien, Prediction of wastewater treatment plant performance using artificial neural networks, Environ. Modell. Softw., 19 (2004), 919–928. http://dx.doi.org/10.1016/j.envsoft.2003.10.005 doi: 10.1016/j.envsoft.2003.10.005
    [38] S. A. Rebuffi, S. Gowal, D. A. Calian, F. Stimberg, O. Wiles, T. A. Mann, Data augmentation can improve robustness, Proceedings of the 35th International Conference on Neural Information Processing Systems, 2024, 29935–29948.
    [39] S. R. Shams, A. Jahani, S. Kalantary, M. Moeinaddini, N. Khorasani, The evaluation on artificial neural networks (ANN) and multiple linear regressions (MLR) models for predicting SO2 concentration, Urban Clim., 37 (2021), 100837. http://dx.doi.org/10.1016/j.uclim.2021.100837 doi: 10.1016/j.uclim.2021.100837
    [40] Q. Zhao, P. Fränti, Wb-index: a sum-of-squares based index for cluster validity, Data Knowl. Eng., 92 (2014), 77–89. http://dx.doi.org/10.1016/j.datak.2014.07.008 doi: 10.1016/j.datak.2014.07.008
    [41] P. Fränti, Efficiency of random swap clustering, J. Big Data, 5 (2018), 13. http://dx.doi.org/10.1186/s40537-018-0122-y doi: 10.1186/s40537-018-0122-y
    [42] P. Fränti, S. Sieranoja, K. Wikström, T. Laatikainen, Clustering diagnoses from 58 million patient visits in finland between 2015 and 2018, JMIR Med. Inform., 10 (2022), e35422. http://dx.doi.org/10.2196/35422 doi: 10.2196/35422
  • This article has been cited by:

    1. Muhammad Afzal, Tariq Ismaeel, Azhar Iqbal Kashif Butt, Zahid Farooq, Riaz Ahmad, Ilyas Khan, On the reducibility of a class of almost-periodic linear Hamiltonian systems and its application in Schrödinger equation, 2023, 8, 2473-6988, 7471, 10.3934/math.2023375
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(791) PDF downloads(34) Cited by(0)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog