1.
Introduction
Consider the constrained system
where the function F:Ψ⟷Rn is a continuous, monotone, and Ψ⊂Rn is nonempty, closed and convex. The monotonicity of F implies
This problem has applications in financial forecasting problems [1] and compressive sensing [2]. The Newton method [3], quasi-Newton methods [4], and conjugate gradient (CG) methods [5] are among the most often used approaches for solving it. When dealing with large-scale nonsmooth algebraic systems, CG methods are seen to be the most successful. The iterative technique for the CG algorithm is as follows:
the vector dk is given by
where Fk=F(xk), and βk is the CG formula that differentiates CG methods. One of the most efficient CG formulas with restarting feature is the Polak-Ribiˊere-Polyak (PRP) [6] formula defined by
where yk=Fk+1−Fk. Despite the good features of the formula (1.5), it doesn't satisfy the sufficient descent condition (SDC). This opens up the possibility for future investigations on the modification of the PRP formula to satisfy the SDC. There are different PRP algorithms for solving the general monotone system of nonlinear equations in the literature. The first PRP algorithm for solving monotone algebraic systems was proposed by Cheng [7] where he extended the default PRP using the hyperplane technique discussed in [8]. Subsequently, Yu [9] proposed the derivative-free PRP for a large-scale monotone system that uses the backtracking line search procedure. Ahookhosh [10] proposed a descent three-term PRP algorithm for solving large-scale nonlinear monotone equations that combine extended PRP formula with the projection technique. Yuan and Zhang [11] proposed another descent-modified PRP CG algorithm for monotone nonlinear equations. Sabi'u and Gadu [12]proposed a projection-based hybrid FR and PRP algorithm for the monotone system. Others are; the improved three-term PRP CG method [13], spectral modified Polak–Ribiére–Polyak projection conjugate gradient method for solving monotone systems of nonlinear equations [14], the new hybrid algorithm for solving large-scale monotone nonlinear equations [15], and the accelerated CG algorithm for solving nonlinear monotone equations and image restoration problems [16].
Moreover, for the convex-constrained system (1.1), Jinkui [17] proposed a spectral PRP projection algorithm for solving nonlinear monotone equations with convex constraints. Followed by; the projection-based PRP-like algorithm [18], the modified spectral PRP conjugate gradient projection method for solving large-scale monotone equations and its application in compressed sensing [19], the new hybrid prpfr CG method for solving nonlinear monotone equations and image restoration problems [20], the modified PRP CG projection method for solving large scale nonlinear convex constrained monotone equations [21], the modified PRP CG method with weaker conditions [22], the PRP-like algorithm for monotone operator equations [23], and the modified PRP-type conjugate gradient projection algorithm for solving large-scale monotone nonlinear equations with convex constraint [24]. In this work, we will propose a scaled PRP CG gradient algorithm for constrained nonlinear systems in which the scaling parameters are determined by approaching the quasi-Newton direction and exploiting the advantages of the well-known Barzilai-Borwein technique. The appealing features of our algorithm are: (i) derivative and matrix-free algorithm, (ii) exploiting best out of the PRP formula numerically and theoretically in contrast to conventional scaled CG algorithms, (iii) the application of the proposed algorithm in solving the two-joint planar robotic manipulator problems, and finally, (iv) the proposed directions will satisfy the SDC independent of the line search procedure.
The next section will discuss the proposed algorithm using the scaling strategy, Section 3 contains the global convergence of the method. The numerical experiment will be presented in Section 4, and the last section will contain the concluding remarks.
2.
A scaled PRP CG method
The scaling strategy is proved to be an efficient strategy for enhancing numerical and theoretical performances of the CG methods, see [25,26]. In line with this, we scaled the PRP CG (SPRP) formula to get
such that |γ|≤1. We are going to propose two effective ways of computing the parameter γ at each iteration.
2.1. Based on quasi-Newton method
We are going to propose an optimal choice for the scaling parameter γ at every iteration by tending the proposed direction to approach the general quasi-Newton direction. Starting with the quasi-Newton equation
where Bk+1 is a positive definite and symmetric Jacobian approximate. Now, the general quasi-Newton direction is as follows:
where the matrix Hk+1 is the inverse of Bk+1. Now, using (1.4) and (2.1), the SPRPR search direction can be written as
In order to incorporate Jacobian approximation information in the quasi-Newton direction into our scheme, since the quasi-Newton schemes employ the actual Jacobian approximations, from (2.3), (2.4), and also by the virtue of equality relation, we get
Multiplying (2.5) by −Bk+1sTk to obtain
Solving for γ after eliminating the matrix Bk+1 in (2.6) by using (2.2) to get
We further achieved |γ1k|≤1 by using the Polak-Ribiˊere-Polyak non-negative restriction to obtain
2.2. Based on Barzilai-Borwein approach
We will again propose another optimal choice for the scaling parameter γ by using some prominent features of the Barzilai-Borwein approach. The Barzilai-Borwein [27] proposed some prevalent choices for the scaling parameters given by
By considering Perry's point of view, the scaled SPRP search direction can be written as
where ^Hk+1 is defined by
Now, we aim at taking the advantage of the Barzilai-Borwein [27] approach to propose the parameter γ by minimizing
where ‖.‖F stands for the Frobenius matrix norm. Moreover, if we assigned Q=^Hk+1−ωkI and utilized the relation ‖Q‖2F=Trace(QTQ), we arrived at
In addition, we ensure that all of the prevalent choices for the Barzilai-Borwein scaling parameters are properly integrated into our scheme by suggesting that
with 0<ωmin<ωmax<∞. We proposed the following modified version of (2.13) that satisfies |γ2k|≤1 as
Furthermore, to guarantee that the sufficient descent condition is fulfilled independent of the line search process, we provided the following modified SPRP directions
and,
where
Below we present the proposed algorithm.
Algorithm 2.1. The scaled PRP projection-based CG algorithm (SPRPCG)
Step 0. Start by initializing: ϵ≥0, b∈(0,1), θ>0, τ>0, a>0, 0<ωmin≤ωmax and x0∈Rn. Set k=0 and d0=−F0.
Step 1. If ‖Fk‖≤ϵ, terminate, otherwise continue with Step 2.
Step 2. Determine the scaled PRP CG direction dk via (2.16) or (2.17), where sk=uk−xk, and yk=F(uk)−Fk+bsk.
Step 3. Set uk=xk+αkdk and compute αk=max{τθi:i=0,1,2,⋯} satisfying
Step 4. If uk∈Ψ and ‖F(uk)‖=0 stop, otherwise
where A is the projection operator, and
Step 5. Set k=k+1 and repeat from Step 1.
3.
Convergence analysis
This section describes the global convergence of the proposed algorithm under the Lipschitz continuous and monotonicity assumptions on F.
Lemma 3.1. For all k≥0, the line search (2.19) is well-defined.
Proof. Suppose there exists k0≥0 such that for i=0,1,2,⋯,
Let i→∞ in (3.1), we have
Since the proposed directions satisfiy the sufficient condition, we have
Thus, the relations (3.2) and (3.3) yield a contradiction. Hence, the proof is complete.
Lemma 3.2. [25] If the sequences {xk} and {vk} are produced by the SPRPCG algorithm, then for some M>0 we have
Proof. Now, from the line search (2.19), we have
Assume that x∗∈Ω such that F(x∗)=0, using the monotonicity of F, we have
Using (3.5) and (3.6) to get
By the definition of vk, and the Cauchy Schwarz inequality in (3.7), we obtain
thus, we have
Therefore, {‖xk−x∗‖} is a decreasing sequence and hence convergent. Now, utilizing (3.9) and the Lispchitz continuity of F, we get
By (2.19), monotonicity of F, and the Cauchy Schwarz inequality, we obtain
which gives
From (3.12), we see that the sequence {uk} is bounded. Also, from (3.8), we obtain
This implies
The proof is now complete.
Theorem 3.1. The SPRPCG algorithm converges, i.e.,
Proof. Assume that (3.15) is not true, i.e., there exists q>0 such that
Again from the Lipschitz continuinity of F it is easy to see that ‖yk‖≤(L+a)‖sk‖=N, where L is assumed to be the Lipschitz constant. Thus,
Now, let h=2MN‖q2‖, therefore from (3.17) we get
Hence, for Z=max{‖d1‖,‖d2‖,⋯,‖dk0‖,M1−h+‖dk0‖}, we obtain the boundedness of the proposed direction. Similarly, we can show that (2.17) is also bounded. From the sufficient condition and the Cauchy-Schwarz inequality we get
From (3.19) we obtain
Therefore, from (3.14) and (3.20) we have
Suppose α′k doesn't safies the line search procedure (2.19), i.e.,
It is obvious that {xk} is bounded and has an accumulation point ˆx and an infinite index set D1 such that limk∈D1xk=ˆx. It also follows that {dk}k∈D1 is bounded. Therefore, there exist an infinite index set D2⊂D1 with an accumulation point ˆd such that limk∈D2dk=ˆd. Now, taking limit in (3.22), we obtain
Also taking the limit in (3.19), we get
The last two inequalities give a contradiction and the proof is complete.
4.
Numerical experiment
The proposed scaled PRP algorithm is numerically compared to the MPRPA [24], DFSP [26], and STCGM [25] algorithms, using published values in the compared papers. Furthermore, we initialized the following variables for the SPRPCG algorithm: b=0.2, a=0.0001, θ=0.99, τ=1, ωmin=0.0001, ωmax=104, Ψ=Rn+, and ϵ=10−10. Matlab14 is used to run all algorithms on an Intel Core i5 8th Generation personal computer. We carried out our experiment using the following test problems:
Problem 4.1. Reference [25] F(xi)=2xi−sin|xi|, for i=1,2,3,…,n, and Ω=Rn+.
Problem 4.2. Reference [25] Fi(x)=4xi+(xi+1−2xi)−x2i+13, for i=1,2,…,n−1, Fn(x)=4xn+(xn−1−2xn)−x2n−13, and Ω=Rn+.
Problem 4.3. Reference [26] Fi(x)=exi−1,i=1,2,…,n, and Ω=Rn+.
Problem 4.4. Reference [25] F1(x)=cos(x1)−9+3x1+8exp(x2) Fi(x)=cos(xi)−9+3xi+8exp(xi−2), for i=2,3,…,n, and Ω=Rn+
Problem 4.5. Reference [25] F1(x)=ex1−1, Fi(x)=exi+xi−1−1,i=2,3,⋯,n−1, and Ω=Rn+.
From Tables 1–5, we used eight initial points, namely; x10=(2,2,…,2), x20=(1,12,13,…,1n), x30=(1,1,…,1), x40=(1n,2n,…,1), x50=(n−1n,n−2n,…,n−1), x60=(2,22,23,…,2n), x70=(1−1,1−12,1−13,…,1−1n) and x80=(−3,−3,…,−3). In which the terms ITER, FVAL, TIME, and NORM stand for the number of iterations, function evaluation, computing time and the norm of Fk at the stopping criteria respectively.
When compared to the MPRP, STCGM, and DFSP algorithms, the proposed approach solved the Problem 1 with the lowest number of ITER and FVAL. However, for the computing time, the MPRP algorithm won initially and failed to STCGM as the dimension increases. Likewise for Problems 2–5. Moreover, it is remarkable to note that the MPRP algorithm is competing with the STCGM algorithm for computing time with respect to all the considered test problems. Finally, the proposed algorithm outperforms the MPRPA, STCGM, and DFSP algorithms for ITER, and FVAL, but competes with the STCGM algorithm with respect to TIME.
Furthermore, we used the well-known Dolan and Morˊe [28] performance profile to produce a more exact numerical comparison using the bulk data in all the tables. Figure 1 three subfigures indicate that the SPRPCG algorithm with two directions reflects the upper left curves with regards to 1a and 1b. However, for the 1c, the SPRPCG initially won in the fractions of 0 to approximately 0.9, and later failed to STCGM algorithm. As a result, when compared to the MPRPA, STCGM, and DFSP algorithms, the SPRPCG algorithm has fewer iterations, and function evaluations on average and competes with the STCGM algorithm for CPU time.
4.1. The motion control of robotic manipulator
IIn this part, we will tackle the motion control of a two-joint planar robotic manipulator using the SPRP projection-based algorithm. The following discrete-time kinematics equation describes the position level of the two-joint planar robot manipulator
where the kinematics function f(.) is defined by
in which, mj for j=1,2 is the length of the j−th rod, p1=cos(η1), r1=sin(η1), p2=cos(η1+η2), and r2=sin(η1+η2). Also, the vector σk∈R2 stands for the end effector position vector, and the joint angle vector is represented by ηk∈R2, for more detail about the kinematics equation we refer readers to [29] and the references therein. In our experiment we aimed at solving the series of optimization problems at a time interval tk∈[0,tf] defined as follows:
In this experiment, mj=1 is assigned for j=1,2, and the end effector is regulated to trace a Lissajous curve as provided by
For SPRP algorithm, we initialized b=0.2, a=0.08, θ=0.2, τ=1, ωmin=0.0001, ωmax=104, η0=[0,π3], and the end task duration tf=10s. We further divided the task duration [0,10] into 200 equal parts.
Figure 2 depicts the experimental findings generated by the SPRP algorithm, including the robot trajectories synthesized by the SPRP algorithm, the end plots of the effector trajectory and the intended route, and the error of the SPRP algorithm on both the vertical x and y axes. It is not difficult to see from Figures 2a and 2b that the SPRP algorithm completes the specified task satisfactorily. Furthermore, Figures 2c and 2d show that the resultant error is around 10−5.
5.
Conclusions
A robust descent-scaled PRP projection-based technique for solving monotone nonlinear problems with convex constraints was given. The monotone and Lipschitz continuous assumptions are applied to accomplish the proposed algorithm's global convergence. A detailed numerical comparison with the related algorithms indicated that the proposed technique is much more efficient than the existing algorithms. Furthermore, the proposed technique is used to solve the motion control problem of a two-joint planar robotic manipulator with extremely little error. Finally, the provided technique may be used for various problems, including solving unconstrained optimization problems, generic algebraic nonlinear systems, and so on.
Acknowledgments
This research received funding support from the NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation, (grant number B05F650018).
Conflict of interest
The authors declare no conflict of interest.