Feasible robust Liu estimator to combat outliers and multicollinearity effects in restricted semiparametric regression model

W. B. Altukhaes; M. Roozbeh; N. A. Mohamed

doi:10.3934/math.20241519

AIMS Mathematics

2024, Volume 9, Issue 11: 31581-31606. doi: 10.3934/math.20241519

Research article

1. Institute of Mathematical Sciences, Faculty of Science, Universiti Malaya, Malaysia
2. Department of Mathematics, College of Science and Humanities, Shaqra University, Shaqra 11961, Saudi Arabia
3. Department of Statistics, Faculty of Mathematics, Statistics and Computer Sciences, Semnan University, Semnan, Iran

Received: 12 July 2024 Revised: 25 September 2024 Accepted: 07 October 2024 Published: 06 November 2024
MSC : 62G08, 62G35, 62J05, 62J07

Regression analysis frequently encounters two issues: multicollinearity among the explanatory variables, and the existence of outliers in the data set. Multicollinearity in the semiparametric regression model inflates the variance of the ordinary least-squares estimator. Furthermore, multicollinearity may lead to wide confidence intervals for the individual parameters and even produce estimates with wrong signs. On the other hand, as is well known, the ordinary least-squares estimator is extremely sensitive to outliers, and it may be completely corrupted by even a single outlier in the data. Due to such drawbacks of the least-squares method, a robust Liu estimator based on the least trimmed squares (LTS) method for the regression parameters is introduced under some linear restrictions on the whole parameter space of the linear part in a semiparametric model. Considering that the covariance matrix of the error terms is usually unknown in practice, the feasible forms of the proposed estimators are substituted, and their asymptotic distributional properties are derived. Moreover, necessary and sufficient conditions for the superiority of the Liu-type estimators over their counterparts for choosing the biasing Liu parameter d are extracted. The performance of the feasible type of robust Liu estimators is compared with the classical ones in constrained semiparametric regression models using extensive Monte Carlo simulation experiments and a real data example.


Citation: W. B. Altukhaes, M. Roozbeh, N. A. Mohamed. Feasible robust Liu estimator to combat outliers and multicollinearity effects in restricted semiparametric regression model[J]. AIMS Mathematics, 2024, 9(11): 31581-31606. doi: 10.3934/math.20241519



1. Introduction

The semiparametric regression model (SRM) is an appropriate tool for modeling a data set when the relationship between the dependent variable and some of the explanatory variables is linear parametric, but the form of its dependence on the other explanatory variables is not known ^[1,2,3]. Consider the set of observations $(y_{1}, \boldsymbol{x}_{1}^{\top}, t_{1}), \ldots, (y_{n}, \boldsymbol{x}_{n}^{\top}, t_{n})$, which conform to the semiparametric regression model defined by

$y_{i} = \boldsymbol{x}_{i}^{\top}\boldsymbol{\beta} + f\left(t_{i}\right) + \varepsilon_{i}, \;\;\;\; i = 1, 2, \ldots, n,$

(1.1)

where $y_{i}$ is the value of the response variable for the $i$th observation, $\boldsymbol{x}_{i}^{\top} = \left(x_{i1}, \ldots, x_{ip}\right)$ represents a vector of explanatory variables, $\boldsymbol{\beta} = (\beta_{1}, \ldots, \beta_{p})^{\top}$ denotes a vector of unknown parameters, and the $t_{i}$'s are the observed points, which lie within the bounds of the domain $D\subset \mathbb{R}$ ^[4,5,6]. It is generally assumed that the unknown function $f(.)$ is smooth, while the $\varepsilon_{i}$'s are random errors that are considered to be independent of $(\boldsymbol{x}_{i}, t_{i})$.

Since semiparametric regression models combine parametric and nonparametric components, the response variable depends linearly on some explanatory variables and nonlinearly on the others, as in (1.1); this makes them more flexible than standard linear regression models ^[7,8,9]. There are different ways to estimate $\boldsymbol{\beta}$ and $f(.)$. Some of the most important methods were introduced in ^[10,11].

The presence of nearly linear dependency among the columns of the design matrix $\boldsymbol{X} = (\boldsymbol{x}_{1}, \ldots, \boldsymbol{x}_{n})^{\top}$ is known as multicollinearity, and it is an issue that might arise in regression analysis. In this case, the matrix $\boldsymbol{S} = \boldsymbol{X}^{\top}\boldsymbol{X}$ has one or more small eigenvalues, causing the estimated regression coefficients to be large in absolute value. The condition number is an effective measure for detecting the presence of multicollinearity: under multicollinearity, the matrix $\boldsymbol{S}$ is ill-conditioned because its condition number becomes extremely large. Multicollinearity makes the ordinary least-squares estimator (OLSE) perform badly. It may also cause confidence intervals to be too wide, both for the individual parameters and for their linear combinations, which may lead to inaccurate predictions. Shrinkage estimators are widely used as an effective remedy for the issues arising from multicollinearity ^[12,13,14,15]. In this study, the shrinkage estimator suggested by Liu ^[16] is applied to solve the problem of multicollinearity. Liu ^[16] combined the Stein-type estimator with the conventional ordinary ridge regression estimator to derive the Liu estimator, as described in ^[17,18]. Alternative approaches to addressing multicollinearity can be found in ^[19,20,21].
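As a small numerical illustration of the condition-number diagnostic described above, the following sketch compares a nearly collinear design with a well-conditioned one. The data are simulated and all names (`X_collinear`, `condition_number`, and so on) are hypothetical; the condition number is taken here as the ratio of the largest to the smallest eigenvalue of $\boldsymbol{X}^{\top}\boldsymbol{X}$, which is one common convention.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
z = rng.normal(size=n)

# Simulated designs (illustrative only): the first two columns of
# X_collinear are almost identical, X_independent has unrelated columns.
X_collinear = np.column_stack([z, z + 1e-3 * rng.normal(size=n), rng.normal(size=n)])
X_independent = rng.normal(size=(n, 3))


def condition_number(X):
    """Ratio of the largest to the smallest eigenvalue of S = X'X."""
    eigvals = np.linalg.eigvalsh(X.T @ X)  # returned in ascending order
    return eigvals[-1] / eigvals[0]


cond_collinear = condition_number(X_collinear)
cond_independent = condition_number(X_independent)
print(f"collinear design:   {cond_collinear:.3e}")
print(f"independent design: {cond_independent:.3e}")
```

The near-duplicate column produces one very small eigenvalue, which is what drives the condition number up.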

Besides the multicollinearity problem, another typical issue that arises in regression analysis is the presence of outliers, which are observations that do not follow the pattern of the main bulk of the data. Outliers can cause problems such as inflated sums of squares, biased estimates, and distorted p-values. To combat these problems, robust regression methods are used. The ordinary least-squares estimator is known to be strongly affected by outliers, so the least trimmed squares (LTS) approach is used to estimate both components of the SRM in this research.

The breakdown point of an estimator is the fundamental measure used to evaluate its robustness. It refers to the largest fraction of outlying observations (up to 50 percent) that the estimator can tolerate before the estimate can be carried arbitrarily far from its uncontaminated value. In computational geometry, the investigation of effective algorithms for robust estimation methods has been an important field of study. Several researchers have examined the robust least median of squares (LMS) method, which fits the hyperplane that minimizes the median of the squared residuals ^[22]. Although the LMS estimator has been the subject of most publications on robust estimation in the field of linear models, Rousseeuw and Leroy ^[23] noted that LMS is not the optimal option due to its statistical properties. They asserted that least trimmed squares is the better alternative because LTS and LMS have the same breakdown point, approximately 50%, but the objective function of LTS is smoother than that of LMS. Also, since LTS converges more quickly and is asymptotically normally distributed ^[22], it has superior statistical efficiency. For these reasons, LTS is a better choice as a starting point for two-step robust estimators such as the MM-estimator ^[24].
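To make the LTS idea concrete, the sketch below minimizes the sum of the $h$ smallest squared residuals in a plain linear regression. It is only an illustration under stated assumptions: the search uses random elemental starts followed by simple concentration steps (refit on the $h$ best-fitting points), which is a heuristic chosen here for brevity and not necessarily the algorithm used in this paper; the data, the trimming constant `h`, and the function names are hypothetical.

```python
import numpy as np


def lts_objective(beta, X, y, h):
    """Sum of the h smallest squared residuals (the LTS criterion)."""
    r2 = np.sort((y - X @ beta) ** 2)
    return r2[:h].sum()


def lts_fit(X, y, h, n_starts=50, n_csteps=20, seed=0):
    """Heuristic LTS fit: random p-point starts plus concentration steps."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    best_beta, best_obj = None, np.inf
    for _ in range(n_starts):
        subset = rng.choice(n, size=p, replace=False)
        beta = np.linalg.lstsq(X[subset], y[subset], rcond=None)[0]
        for _ in range(n_csteps):
            # Keep the h observations with the smallest squared residuals and refit.
            keep = np.argsort((y - X @ beta) ** 2)[:h]
            beta = np.linalg.lstsq(X[keep], y[keep], rcond=None)[0]
        obj = lts_objective(beta, X, y, h)
        if obj < best_obj:
            best_beta, best_obj = beta, obj
    return best_beta


# Simulated example: true line y = 1 + 2x, with 20% gross outliers.
rng = np.random.default_rng(0)
n = 100
x = np.sort(rng.uniform(0, 10, size=n))
y = 1.0 + 2.0 * x + 0.1 * rng.normal(size=n)
y[-20:] = -20.0  # contaminate the 20 observations with the largest x
X = np.column_stack([np.ones(n), x])

beta_ols = np.linalg.lstsq(X, y, rcond=None)[0]
beta_lts = lts_fit(X, y, h=75)
print("OLS slope:", beta_ols[1])
print("LTS slope:", beta_lts[1])
```

In this contaminated sample the least-squares slope is pulled far from 2, while the trimmed fit stays close to it, because the 20 outliers fall outside the 75 retained observations.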

The main focus of this paper is to study a feasible generalized robust Liu estimator in a restricted semiparametric regression model. The article is organized as follows: Section 2 contains the classical estimator of a restricted semiparametric regression model based on the kernel method. After reviewing the concepts of the Liu and least trimmed squares approaches in a semiparametric regression model, a new feasible robust Liu estimator in a restricted semiparametric regression model is suggested, and its asymptotic bias and covariance are derived in Section 3. Based on the obtained results, the feasible generalized robust Liu estimator is compared with the classical one in terms of the mean squared error. In Section 4, the efficiencies of the proposed estimators are assessed through Monte Carlo simulation experiments as well as a real-world data example. Finally, the main findings are summarized in Section 5.

2. Feasible type of the classical estimators in RSRM

In the restricted setting, the classical estimators are required to satisfy certain restrictions. Let us examine the semiparametric regression model

$\boldsymbol{y} = \boldsymbol{X}\boldsymbol{\beta }+f\left(\boldsymbol{t}\right)+\boldsymbol{\varepsilon },$

(2.1)

where $\boldsymbol{y} = (y_{1}, \ldots, y_{n})^{\top}$, $\boldsymbol{X} = (\boldsymbol{x}_{1}, \ldots, \boldsymbol{x}_{n})^{\top}$, $f\left(\boldsymbol{t}\right) = (f(t_{1}), \ldots, f(t_{n}))^{\top}$, and $\boldsymbol{\varepsilon} = (\varepsilon_{1}, \ldots, \varepsilon_{n})^{\top}$.

Generally, we assume that $\boldsymbol{\varepsilon}$ is a vector of disturbances that follows a distribution with $E\left(\boldsymbol{\varepsilon}\right) = {\bf{0}}$ and $E\left(\boldsymbol{\varepsilon}\boldsymbol{\varepsilon}^{\top}\right) = \sigma^{2}\boldsymbol{V}$, where $\sigma^{2}$ is an unknown parameter and $\boldsymbol{V}$ is a symmetric and positive definite matrix. To estimate the linear part of model (2.1), we first remove the nonparametric effect by detrending. If $\boldsymbol{\beta}$ were known, a natural nonparametric estimator of $f(.)$ would be

$\widehat{f}(\boldsymbol{t}) = k(\boldsymbol{t})(\boldsymbol{y}-\boldsymbol{X}\boldsymbol{\beta}),$

in which $k(.)$ is a kernel function. Following ^[25], by substituting $\widehat{f}(\boldsymbol{t})$ for $f\left(\boldsymbol{t}\right)$ in Eq (2.1), the model may be reduced to

$\widetilde{\boldsymbol{y}} = \widetilde{\boldsymbol{X}}\boldsymbol{\beta} + \boldsymbol{\varepsilon},$

(2.2)

where $\widetilde{\boldsymbol{y}} = \left(\boldsymbol{I}_{n}-\boldsymbol{K}\right)\boldsymbol{y}$, $\widetilde{\boldsymbol{X}} = \left(\boldsymbol{I}_{n}-\boldsymbol{K}\right)\boldsymbol{X}$, and $\boldsymbol{K}$ is the smoother matrix with $\left(i, j\right)$-th component $K_{\omega}\left(t_{i}, t_{j}\right)$, in which $K_{\omega}\left(\cdot\right)$ is a kernel function of order $m$ with bandwidth parameter $\omega$. Model (2.2) can be transformed into a standard regression model by premultiplying both sides by $\boldsymbol{V}^{-1/2}$:

$\breve{\boldsymbol{y}} = \breve{\boldsymbol{X}}\boldsymbol{\beta} + \breve{\boldsymbol{\varepsilon}}, \quad E\left(\breve{\boldsymbol{\varepsilon}}\right) = {\bf{0}}, \quad E\left(\breve{\boldsymbol{\varepsilon}}\breve{\boldsymbol{\varepsilon}}^{\top}\right) = \sigma^{2}\boldsymbol{I}_{n},$

where $\breve{\boldsymbol{y}} = \boldsymbol{V}^{-1/2}\widetilde{\boldsymbol{y}}$, $\breve{\boldsymbol{X}} = \boldsymbol{V}^{-1/2}\widetilde{\boldsymbol{X}}$, and $\breve{\boldsymbol{\varepsilon}} = \boldsymbol{V}^{-1/2}\boldsymbol{\varepsilon}$. Now, the estimation of $\boldsymbol{\beta}$ is performed using the generalized least-squares estimator (GLSE), which is known to be the best linear unbiased estimator

$\begin{array}{c}\widehat{\boldsymbol{\beta}}_{GLS} = \arg\min_{\boldsymbol{\beta}} \left(\breve{\boldsymbol{y}}-\breve{\boldsymbol{X}}\boldsymbol{\beta}\right)^{\top}\left(\breve{\boldsymbol{y}}-\breve{\boldsymbol{X}}\boldsymbol{\beta}\right) \\ = \left(\breve{\boldsymbol{X}}^{\top}\breve{\boldsymbol{X}}\right)^{-1}\breve{\boldsymbol{X}}^{\top}\breve{\boldsymbol{y}} \\ = \boldsymbol{C}^{-1}\widetilde{\boldsymbol{X}}^{\top}\boldsymbol{V}^{-1}\widetilde{\boldsymbol{y}}, \end{array}$

(2.3)

where $\boldsymbol{C} = {\widetilde {\boldsymbol{X}}}^{\mathrm{\top }}{\boldsymbol{V}}^{-1}\widetilde {\boldsymbol{X}}$ .
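The two steps above (detrending with the smoother matrix and then applying generalized least squares) can be sketched numerically as follows. This is an illustration under assumptions that are not taken from the paper: the smoother matrix is built from Nadaraya-Watson weights with a Gaussian kernel, the bandwidth is fixed by hand, $\boldsymbol{V}$ is treated as a known diagonal matrix, and the data are simulated. The function names are hypothetical.

```python
import numpy as np


def smoother_matrix(t, bandwidth):
    """Kernel smoother matrix K with Gaussian-kernel weights; each row sums to one."""
    diff = (t[:, None] - t[None, :]) / bandwidth
    W = np.exp(-0.5 * diff**2)
    return W / W.sum(axis=1, keepdims=True)


def gls_estimate(X_tilde, y_tilde, V):
    """beta_GLS = C^{-1} X~' V^{-1} y~ with C = X~' V^{-1} X~, as in Eq (2.3)."""
    V_inv_X = np.linalg.solve(V, X_tilde)   # V^{-1} X~ (V is symmetric)
    C = X_tilde.T @ V_inv_X
    return np.linalg.solve(C, V_inv_X.T @ y_tilde)


# Simulated partially linear data: y = X beta + sin(2 pi t) + error.
rng = np.random.default_rng(0)
n = 300
beta_true = np.array([1.0, -2.0])
t = np.sort(rng.uniform(size=n))
X = rng.normal(size=(n, 2))
v = rng.uniform(0.5, 2.0, size=n)           # assumed-known variances, V = diag(v)
y = X @ beta_true + np.sin(2 * np.pi * t) + 0.2 * np.sqrt(v) * rng.normal(size=n)

K = smoother_matrix(t, bandwidth=0.05)
X_tilde = X - K @ X                          # (I - K) X
y_tilde = y - K @ y                          # (I - K) y
beta_gls = gls_estimate(X_tilde, y_tilde, np.diag(v))
print(beta_gls)
```

Solving linear systems with `np.linalg.solve` avoids forming $\boldsymbol{V}^{-1}$ and $\boldsymbol{C}^{-1}$ explicitly, which matters once $\boldsymbol{C}$ becomes ill-conditioned.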

In applications, the matrix $\boldsymbol{V}$ is unknown, so $\widehat{\boldsymbol{\beta}}_{GLS}$ in Eq (2.3) cannot be computed. To solve this issue, we use a two-stage procedure and implement a feasible generalized least-squares estimator (FGLSE) by replacing the unknown matrix $\boldsymbol{V}$ with a suitable estimator ^[26], $\boldsymbol{S} = \frac{1}{n-p}\left(\widetilde{\boldsymbol{y}}-\widetilde{\boldsymbol{X}}\widehat{\boldsymbol{\beta}}_{LS}\right)\left(\widetilde{\boldsymbol{y}}-\widetilde{\boldsymbol{X}}\widehat{\boldsymbol{\beta}}_{LS}\right)^{\top}$, which is a consistent estimator, as follows:

${\widehat{\boldsymbol{\beta }}}_{FGLS} = {\boldsymbol{C}}_{F}^{-1}{\widetilde {\boldsymbol{X}}}^{{\top }}{\boldsymbol{S}}^{-1}\widetilde {\boldsymbol{y}},$

(2.4)

where $\boldsymbol{C}_{F} = \widetilde{\boldsymbol{X}}^{\top}\boldsymbol{S}^{-1}\widetilde{\boldsymbol{X}}$, and $\widehat{\boldsymbol{\beta}}_{LS}$ is the ordinary least-squares estimator $\left(\widetilde{\boldsymbol{X}}^{\top}\widetilde{\boldsymbol{X}}\right)^{-1}\widetilde{\boldsymbol{X}}^{\top}\widetilde{\boldsymbol{y}}$. As demonstrated in Zellner ^[26], $\widehat{\boldsymbol{\beta}}_{FGLS} = \widehat{\boldsymbol{\beta}}_{GLS}+O_{p}\left(n^{-1}\right)$, and consequently $\sqrt{n}\left(\widehat{\boldsymbol{\beta}}_{FGLS}-\boldsymbol{\beta}\right)$ and $\sqrt{n}\left(\widehat{\boldsymbol{\beta}}_{GLS}-\boldsymbol{\beta}\right)$ have the same asymptotic normal distribution, so that $\mathrm{Var}\left(\widehat{\boldsymbol{\beta}}_{FGLS}\right) = \boldsymbol{C}_{F}^{-1}+o\left(n^{-1}\right)$, where $O_{p}\left(n^{-1}\right)$ denotes a quantity of order $n^{-1}$ in probability and $o\left(n^{-1}\right)$ denotes a quantity of smaller order than $n^{-1}$.

Interestingly, another method to handle the strong and extremely strong multicollinearity problems is to obtain the estimators under particular constraints on the unknown parameters, which may be exact or stochastic (see ^[27,28,29] for more details). By applying some constraints on the parameter space of the linear part, Durbin ^[30], Theil and Goldberger ^[31], and Theil ^[32] proposed the ordinary mixed estimator (OME) for the vector of the regression coefficient. Assume that we had prior knowledge regarding $\beta$ in the sense of non-stochastic exact constraints ^[33,34,35], as follows:

$\boldsymbol{R}\boldsymbol{\beta} = \boldsymbol{r},$

where $\boldsymbol{R}$ is a known $q\times p$ matrix of prior information of rank $q < p$ and $\boldsymbol{r}$ is a known $q\times 1$ vector. This restriction should come from an outside source, for example external information or expert knowledge. When the regression parameters are restricted non-stochastically by such independent prior information, we provide the tools needed to compute the risk of the estimators, so that the performance of the new constrained estimators can be compared with that of the classical estimators under certain conditions. We show that, when the linear restrictions hold, the constrained estimators outperform the classical ones in terms of risk. Such non-sample information (a prior constraint on the parameters) is often introduced into the model as constraints. Because restricted estimation performs better than the usual unrestricted estimation in this setting, the restricted semiparametric regression model (RSRM) is fitted to the data set in this work. The feasible generalized least-squares restricted estimator (FGLSRE) is derived by imposing the linear restriction as follows:

$\begin{array}{c} {\widehat{\boldsymbol{\beta }}}_{FGLSR} = {argmin}_{\boldsymbol{\beta }}{\left(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }\right)}^{{\top }}{\boldsymbol{S}}^{-1}(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }) \\ \;\;{\rm{s.t.}}\;\; \boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r} \\ = {\widehat{\boldsymbol{\beta }}}_{FGLS}-{\boldsymbol{C}}_{F}^{{\bf{-1}}}{\boldsymbol{R}}^{{\top }}{\left(\boldsymbol{R}{\boldsymbol{C}}_{F}^{{\bf{-1}}}{\boldsymbol{R}}^{{\top }}\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{r}\right). \end{array}$

(2.5)
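The closed form in Eq (2.5) can be checked numerically with a short sketch. It assumes that an invertible, positive definite matrix is available in the role of $\boldsymbol{S}$ (here a simulated diagonal matrix), and the restriction, the data, and the function name `restricted_fgls` are hypothetical choices made for the illustration.

```python
import numpy as np


def restricted_fgls(X_tilde, y_tilde, S, R, r):
    """Restricted estimator of Eq (2.5): project beta_FGLS onto {beta : R beta = r}."""
    S_inv_X = np.linalg.solve(S, X_tilde)             # S^{-1} X~
    C_F = X_tilde.T @ S_inv_X                         # X~' S^{-1} X~
    beta_fgls = np.linalg.solve(C_F, S_inv_X.T @ y_tilde)
    C_inv_Rt = np.linalg.solve(C_F, R.T)              # C_F^{-1} R'
    correction = C_inv_Rt @ np.linalg.solve(R @ C_inv_Rt, R @ beta_fgls - r)
    return beta_fgls, beta_fgls - correction


# Simulated, already detrended data (illustrative only).
rng = np.random.default_rng(2)
n, p = 50, 3
X_tilde = rng.normal(size=(n, p))
y_tilde = X_tilde @ np.array([1.0, 2.0, -3.0]) + 0.1 * rng.normal(size=n)
S = np.diag(rng.uniform(0.5, 2.0, size=n))            # assumed positive definite
R = np.array([[1.0, 1.0, 1.0]])                       # hypothetical restriction: coefficients sum to zero
r = np.array([0.0])

beta_fgls, beta_fglsr = restricted_fgls(X_tilde, y_tilde, S, R, r)
print("unrestricted R beta:", R @ beta_fgls)
print("restricted   R beta:", R @ beta_fglsr)
```

The restricted estimate satisfies $\boldsymbol{R}\boldsymbol{\beta} = \boldsymbol{r}$ up to rounding error, whereas the unrestricted one generally does not.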

As is known, the estimated covariance matrix of $\widehat{\boldsymbol{\beta}}_{FGLS}$ is equal to $\sigma^{2}\boldsymbol{C}_{F}^{-1}$, so the FGLSE and its covariance matrix are strongly influenced by the properties of the matrix $\boldsymbol{C}_{F}$. The FGLS estimators become susceptible to various errors when $\boldsymbol{C}_{F}$ is ill-conditioned. For instance, some of the estimated regression coefficients might have incorrect signs or be statistically insignificant, and the estimators may be unstable, with wide confidence intervals for individual parameters. Making valid statistical inferences becomes challenging in the presence of these errors, and so a biased estimation technique is introduced and utilized for the RSRM under the multicollinearity problem.

3. Feasible robust Liu estimator in RSRM

Multicollinearity leads to $\boldsymbol{X}^{\top}\boldsymbol{X}$ being ill-conditioned with a large condition number. When the signal-to-noise ratio $\boldsymbol{\beta}^{\top}\boldsymbol{\beta}/\sigma^{2}$ is small and the condition number of $\boldsymbol{X}^{\top}\boldsymbol{X}$ is large, the least-squares estimator is most severely affected by multicollinearity. In this situation, the high level of noise in the data is amplified by $\left(\boldsymbol{X}^{\top}\boldsymbol{X}\right)^{-1}$, making the least-squares estimator highly unstable. To overcome this drawback, Hoerl and Kennard ^[36] proposed the ridge estimator $\widehat{\boldsymbol{\beta}}_{k} = \left(\boldsymbol{X}^{\top}\boldsymbol{X}+k\boldsymbol{I}\right)^{-1}\boldsymbol{X}^{\top}\boldsymbol{y}$ in the standard linear regression model $\boldsymbol{y} = \boldsymbol{X}\boldsymbol{\beta}+\boldsymbol{\varepsilon}$ with $E\left(\boldsymbol{\varepsilon}\right) = {\bf{0}}$ and $E\left(\boldsymbol{\varepsilon}\boldsymbol{\varepsilon}^{\top}\right) = \sigma^{2}\boldsymbol{I}$, and it has become the most commonly used method for combating the multicollinearity problem that causes the least-squares estimator to fail. Indeed, the ridge method addresses multicollinearity by adding a small non-stochastic constant $k$ to the diagonal elements of $\boldsymbol{X}^{\top}\boldsymbol{X}$ to decrease its condition number. In practice, the biasing parameter $k$ in the ridge approach is typically rather small. The condition number of $\boldsymbol{X}^{\top}\boldsymbol{X}+k\boldsymbol{I}$ is a decreasing function of $k$, so large values of $k$ are needed to reduce it substantially. Because of this, the small $k$ selected in practice may not be large enough to resolve severe multicollinearity in $\boldsymbol{X}^{\top}\boldsymbol{X}$, and the resulting ridge estimate may still be unstable since $\boldsymbol{X}^{\top}\boldsymbol{X}+k\boldsymbol{I}$ remains ill-conditioned. Furthermore, despite its practical effectiveness, the ridge estimator is a complicated function of $k$. Although the Stein-type estimator $\widehat{\boldsymbol{\beta}}_{c} = c\left(\boldsymbol{X}^{\top}\boldsymbol{X}\right)^{-1}\boldsymbol{X}^{\top}\boldsymbol{y}$ is a linear function of $c$, the shrinkage of each element of $\widehat{\boldsymbol{\beta}}_{c}$ is the same. To address these issues, Liu ^[12] proposed a new biased estimator $\widehat{\boldsymbol{\beta}}_{d} = \left(\boldsymbol{X}^{\top}\boldsymbol{X}+\boldsymbol{I}\right)^{-1}\left(\boldsymbol{X}^{\top}\boldsymbol{y}+d\widehat{\boldsymbol{\beta}}\right)$ by combining the advantages of the ridge and Stein-type estimators, which effectively solved the problem of ill-conditioning in the standard regression model, where $0 < d < 1$ is a biasing parameter and $\widehat{\boldsymbol{\beta}} = \left(\boldsymbol{X}^{\top}\boldsymbol{X}\right)^{-1}\boldsymbol{X}^{\top}\boldsymbol{y}$. It is obvious that when $d = 1$, $\widehat{\boldsymbol{\beta}}_{d} = \widehat{\boldsymbol{\beta}}$.
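The ridge and Liu estimators in the standard linear model can be written in a few lines, which also makes the remark about $d = 1$ easy to verify. The sketch below uses simulated collinear data, and the function names `ols`, `ridge`, and `liu` are hypothetical.

```python
import numpy as np


def ols(X, y):
    """Least-squares estimator (X'X)^{-1} X'y."""
    return np.linalg.solve(X.T @ X, X.T @ y)


def ridge(X, y, k):
    """Ridge estimator (X'X + kI)^{-1} X'y."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + k * np.eye(p), X.T @ y)


def liu(X, y, d):
    """Liu estimator (X'X + I)^{-1} (X'y + d * beta_OLS)."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + np.eye(p), X.T @ y + d * ols(X, y))


# Simulated design with two nearly collinear columns (illustrative only).
rng = np.random.default_rng(3)
n = 100
z = rng.normal(size=n)
X = np.column_stack([z, z + 1e-2 * rng.normal(size=n), rng.normal(size=n)])
y = X @ np.array([1.0, 1.0, 0.5]) + rng.normal(size=n)

beta_ols = ols(X, y)
beta_ridge = ridge(X, y, k=0.5)
beta_liu = liu(X, y, d=0.5)
print("OLS  :", beta_ols)
print("ridge:", beta_ridge)
print("Liu  :", beta_liu)
```

Because $\boldsymbol{X}^{\top}\boldsymbol{y} = \boldsymbol{X}^{\top}\boldsymbol{X}\widehat{\boldsymbol{\beta}}$, the Liu estimator equals $\left(\boldsymbol{X}^{\top}\boldsymbol{X}+\boldsymbol{I}\right)^{-1}\left(\boldsymbol{X}^{\top}\boldsymbol{X}+d\boldsymbol{I}\right)\widehat{\boldsymbol{\beta}}$, so it is linear in $d$ and shrinks the least-squares estimate whenever $d < 1$.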

According to ^[12], the mean squared error (MSE) of the Liu estimator is given by

$\mathrm{M}\mathrm{S}\mathrm{E}\left({\widehat{\boldsymbol{\beta }}}_{d}\right) = {\sigma }^{2}\sum _{j = 1}^{p}\frac{{\left({\lambda }_{j}+d\right)}^{2}}{{\lambda }_{j}{\left({\lambda }_{j}+1\right)}^{2}}+{\left(d-1\right)}^{2}\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}} ,$

(3.1)

where $\alpha_{j}$ is the $j$th element of $\boldsymbol{\alpha} = \boldsymbol{\varGamma}^{\top}\boldsymbol{\beta}$ and $\boldsymbol{\varGamma}$ is an orthogonal matrix such that $\boldsymbol{C}_{F} = \boldsymbol{\varGamma}\boldsymbol{\varLambda}\boldsymbol{\varGamma}^{\top}$, in which $\boldsymbol{\varLambda} = \mathrm{diag}\left(\lambda_{1}, \ldots, \lambda_{p}\right)$ contains the eigenvalues of the matrix $\boldsymbol{C}_{F}$. Consequently, the biasing parameter $d$ is chosen by minimizing $\mathrm{MSE}\left(\widehat{\boldsymbol{\beta}}_{d}\right)$ as follows:

$\widehat{d} = 1-{\widehat{\sigma }}_{FGLS}^{2}\left(\frac{\sum _{j = 1}^{p}\frac{1}{{\lambda }_{j}({\lambda }_{j}+1)}}{\sum _{j = 1}^{p}\frac{{\widehat{\alpha }}_{jFGLS}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}}\right) ,$

(3.2)

where $\widehat{\sigma}_{FGLS}^{2}$ and $\widehat{\alpha}_{jFGLS}$ are the unbiased estimators of $\sigma^{2}$ and $\alpha_{j}$ based on FGLSE, respectively, i.e., $\widehat{\sigma}_{FGLS}^{2} = \frac{1}{n-p}\left(\widetilde{\boldsymbol{y}}-\widetilde{\boldsymbol{X}}\widehat{\boldsymbol{\beta}}_{FGLS}\right)^{\top}\boldsymbol{S}^{-1}\left(\widetilde{\boldsymbol{y}}-\widetilde{\boldsymbol{X}}\widehat{\boldsymbol{\beta}}_{FGLS}\right)$ and $\widehat{\boldsymbol{\alpha}}_{FGLS} = \boldsymbol{\varGamma}^{\top}\widehat{\boldsymbol{\beta}}_{FGLS}$.
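The estimate $\widehat{d}$ in Eq (3.2) depends only on the eigen-decomposition of $\boldsymbol{C}_{F}$, the coefficient estimate, and the variance estimate. A minimal sketch, with a hypothetical function name and toy inputs chosen so that the arithmetic can be followed by hand:

```python
import numpy as np


def liu_d_hat(C_F, beta_hat, sigma2_hat):
    """Plug-in biasing parameter of Eq (3.2)."""
    lam, Gamma = np.linalg.eigh(C_F)        # C_F = Gamma diag(lam) Gamma'
    alpha_hat = Gamma.T @ beta_hat
    numerator = np.sum(1.0 / (lam * (lam + 1.0)))
    denominator = np.sum(alpha_hat**2 / (lam + 1.0) ** 2)
    return 1.0 - sigma2_hat * numerator / denominator


# Toy inputs: C_F = I (so lambda_1 = lambda_2 = 1), beta_hat = (2, 2)', sigma2_hat = 0.5.
d_hat = liu_d_hat(np.eye(2), np.array([2.0, 2.0]), 0.5)
print(d_hat)
```

For these toy inputs the numerator sum is $1/2 + 1/2 = 1$ and the denominator sum is $4/4 + 4/4 = 2$, so $\widehat{d} = 1 - 0.5 \cdot 1/2 = 0.75$.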

Following ^[37,38,39], the feasible generalized least-squares Liu estimator (FGLSLE) can be defined as follows:

$\begin{array}{c} \widehat{\boldsymbol{\beta}}_{FGLSL}\left(d\right) = \left(\widetilde{\boldsymbol{X}}^{\top}\boldsymbol{S}^{-1}\widetilde{\boldsymbol{X}}+\boldsymbol{I}\right)^{-1}\left(\widetilde{\boldsymbol{X}}^{\top}\boldsymbol{S}^{-1}\widetilde{\boldsymbol{X}}+d\boldsymbol{I}\right)\widehat{\boldsymbol{\beta}}_{FGLS} \\ = \left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)^{-1}\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right)\widehat{\boldsymbol{\beta}}_{FGLS} \\ = \boldsymbol{F}_{d}\widehat{\boldsymbol{\beta}}_{FGLS}, \quad 0\le d\le 1, \end{array}$

(3.3)

where ${\boldsymbol{F}}_{d} = ({\boldsymbol{C}}_{F}+\boldsymbol{I}{)}^{-1}\left({\boldsymbol{C}}_{F}+d\boldsymbol{I}\right)$ .

Based on the fact that $\boldsymbol{F}_{d}$ and $\boldsymbol{C}_{F}^{-1}$ commute, the feasible generalized least-squares restricted Liu estimator (FGLSRLE) can be obtained for the RSRM as follows ^[40,41,42]:

$\begin{array}{c} {\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right) = {argmin}_{\boldsymbol{\beta }}{\left(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }\right)}^{{\top }}{\boldsymbol{S}}^{-1}\left(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }\right)+{\left(d{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{\beta }\right)}^{{\top }}\left(d{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{\beta }\right) \\ {\rm{s.t.}} \;\; \boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r} \\ = {\boldsymbol{F}}_{d}{\widehat{\boldsymbol{\beta }}}_{FGLS}-{\boldsymbol{F}}_{d}{\boldsymbol{C}}_{F}^{{\bf{-1}}}{\boldsymbol{R}}^{\mathrm{\top }}{\left(\boldsymbol{R}{\boldsymbol{C}}_{F}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{r}\right) . \end{array}$

(3.4)
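Equations (3.3) and (3.4) translate directly into matrix code. The following sketch is an illustration only: it assumes an invertible, positive definite matrix in the role of $\boldsymbol{S}$, uses simulated detrended data, and the function name `fgls_liu_estimators` and the restriction are hypothetical.

```python
import numpy as np


def fgls_liu_estimators(X_tilde, y_tilde, S, R, r, d):
    """Return (FGLSLE, FGLSRLE, FGLSRE) following Eqs (3.3), (3.4), and (2.5)."""
    p = X_tilde.shape[1]
    S_inv_X = np.linalg.solve(S, X_tilde)
    C_F = X_tilde.T @ S_inv_X
    beta_fgls = np.linalg.solve(C_F, S_inv_X.T @ y_tilde)

    # F_d = (C_F + I)^{-1} (C_F + d I)
    F_d = np.linalg.solve(C_F + np.eye(p), C_F + d * np.eye(p))

    # Restriction correction C_F^{-1} R' (R C_F^{-1} R')^{-1} (R beta_FGLS - r)
    C_inv_Rt = np.linalg.solve(C_F, R.T)
    correction = C_inv_Rt @ np.linalg.solve(R @ C_inv_Rt, R @ beta_fgls - r)

    beta_fglsl = F_d @ beta_fgls                        # Eq (3.3)
    beta_fglsrl = F_d @ beta_fgls - F_d @ correction    # Eq (3.4)
    beta_fglsr = beta_fgls - correction                 # Eq (2.5)
    return beta_fglsl, beta_fglsrl, beta_fglsr


# Simulated, already detrended data (illustrative only).
rng = np.random.default_rng(4)
n, p = 60, 3
X_tilde = rng.normal(size=(n, p))
y_tilde = X_tilde @ np.array([1.0, 2.0, -3.0]) + 0.1 * rng.normal(size=n)
S = np.diag(rng.uniform(0.5, 2.0, size=n))              # assumed positive definite
R = np.array([[1.0, 1.0, 1.0]])                         # hypothetical restriction
r = np.array([0.0])

liu_half, rliu_half, restricted = fgls_liu_estimators(X_tilde, y_tilde, S, R, r, d=0.5)
_, rliu_one, _ = fgls_liu_estimators(X_tilde, y_tilde, S, R, r, d=1.0)
print("FGLSRLE(d=0.5):", rliu_half)
print("FGLSRLE(d=1)  :", rliu_one)
```

Since $\boldsymbol{F}_{1} = \boldsymbol{I}$, the restricted Liu estimator at $d = 1$ coincides with the restricted estimator of Eq (2.5), and for any $d$ it equals $\boldsymbol{F}_{d}$ applied to that restricted estimator.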

Lemma 3.1. If β satisfies the linear restriction Rβ = r, then the properties (bias, covariance, and mean squared error) of the suggested estimator can be calculated directly as follows:

$\mathrm{B}\mathrm{i}\mathrm{a}\mathrm{s}\left({\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)\right) = -\left(\boldsymbol{I}-{\boldsymbol{F}}_{d}\right)\boldsymbol{\beta }+o\left({n}^{-\frac{1}{2}}\right) ,$

(3.5)

$\mathrm{C}\mathrm{o}\mathrm{v}\left({\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)\right) = {\sigma }^{2}{\boldsymbol{F}}_{d}\boldsymbol{H}{\boldsymbol{F}}_{d}^{\mathrm{\top }}+o\left({n}^{-1}\right) ,$

(3.6)

$\mathrm{M}\mathrm{S}\mathrm{E}\left({\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)\right) = {\sigma }^{2}tr\left({\boldsymbol{F}}_{d}\boldsymbol{H}{\boldsymbol{F}}_{d}^{\mathrm{\top }}\right)+{\boldsymbol{\beta }}^{\mathrm{\top }}{\left({\boldsymbol{F}}_{d}-\boldsymbol{I}\right)}^{\mathrm{\top }}\left({\boldsymbol{F}}_{d}-\boldsymbol{I}\right)\boldsymbol{\beta }+o\left({n}^{-1}\right) ,$

(3.7)

where $\boldsymbol{H} = {\boldsymbol{C}}_{F}^{-1}\left(\boldsymbol{I}-{\boldsymbol{R}}^{\mathrm{\top }}{\left(\boldsymbol{R}{\boldsymbol{C}}_{F}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}\right)}^{-1}\boldsymbol{R}{\boldsymbol{C}}_{F}^{-1}\right)$ .

Theorem 3.1. The MSE of FGLSRLE under the linear restriction Rβ = r can be given by

$MSE\left({\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)\right) = {\sigma }^{2}\sum _{j = 1}^{p}\frac{{\left({\lambda }_{j}+d\right)}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}{m}_{jj}+{\left(d-1\right)}^{2}\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}+o\left({n}^{-1}\right) ,$

(3.8)

where ${m}_{jj}$ is the jth diagonal element of the matrix $\boldsymbol{M} = {\boldsymbol{\varGamma }}^{{\top }}\boldsymbol{H}\boldsymbol{\varGamma }$ .

Proof. Using $\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)^{-1} = \boldsymbol{\varGamma}\left(\boldsymbol{\varLambda}+\boldsymbol{I}\right)^{-1}\boldsymbol{\varGamma}^{\top}$ and $\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right) = \boldsymbol{\varGamma}\left(\boldsymbol{\varLambda}+d\boldsymbol{I}\right)\boldsymbol{\varGamma}^{\top}$, we can write

$tr\left({\boldsymbol{F}}_{d}\boldsymbol{H}{\boldsymbol{F}}_{d}^{\mathrm{\top }}\right) = tr\left(({\boldsymbol{C}}_{F}+\boldsymbol{I}{)}^{-1}\left({\boldsymbol{C}}_{F}+d\boldsymbol{I}\right)\boldsymbol{H}\left({\boldsymbol{C}}_{F}+d\boldsymbol{I}\right)({\boldsymbol{C}}_{F}+\boldsymbol{I}{)}^{-1}\right)$

$= tr\left(\boldsymbol{\varGamma }(\boldsymbol{\varLambda }+\boldsymbol{I}{)}^{-1}{\boldsymbol{\varGamma }}^{{\top }}\boldsymbol{\varGamma }\left(\boldsymbol{\varLambda }+d\boldsymbol{I}\right){\boldsymbol{\varGamma }}^{{\top }}\boldsymbol{H}\boldsymbol{\varGamma }\left(\boldsymbol{\varLambda }+d\boldsymbol{I}\right){\boldsymbol{\varGamma }}^{{\top }}\boldsymbol{\varGamma }(\boldsymbol{\varLambda }+\boldsymbol{I}{)}^{-1}{\boldsymbol{\varGamma }}^{{\top }}\right)$

$= tr\left((\boldsymbol{\varLambda }+\boldsymbol{I}{)}^{-2}{\left(\boldsymbol{\varLambda }+d\boldsymbol{I}\right)}^{2}{\boldsymbol{\varGamma }}^{{\top }}\boldsymbol{H}\boldsymbol{\varGamma }\right)$

$= \sum _{j = 1}^{p}\frac{{\left({\lambda }_{j}+d\right)}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}{m}_{jj} .$

Also, from $({\boldsymbol{C}}_{F}+\boldsymbol{I}{)}^{-2} = \boldsymbol{\varGamma }(\boldsymbol{\varLambda }+\boldsymbol{I}{)}^{-2}{\boldsymbol{\varGamma }}^{{\top }}$ , we have

$\boldsymbol{\beta}^{\top}\left(\boldsymbol{F}_{d}-\boldsymbol{I}\right)^{\top}\left(\boldsymbol{F}_{d}-\boldsymbol{I}\right)\boldsymbol{\beta} = \boldsymbol{\alpha}^{\top}\boldsymbol{\varGamma}^{\top}\left(\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)^{-1}\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right)-\boldsymbol{I}\right)^{\top}\left(\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)^{-1}\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right)-\boldsymbol{I}\right)\boldsymbol{\varGamma}\boldsymbol{\alpha}$

$= \boldsymbol{\alpha}^{\top}\boldsymbol{\varGamma}^{\top}\left(\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right)-\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)\right)\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)^{-2}\left(\left(\boldsymbol{C}_{F}+d\boldsymbol{I}\right)-\left(\boldsymbol{C}_{F}+\boldsymbol{I}\right)\right)\boldsymbol{\varGamma}\boldsymbol{\alpha}$

$= \left(d-1\right)^{2}\boldsymbol{\alpha}^{\top}\boldsymbol{\varGamma}^{\top}\boldsymbol{\varGamma}\left(\boldsymbol{\varLambda}+\boldsymbol{I}\right)^{-2}\boldsymbol{\varGamma}^{\top}\boldsymbol{\varGamma}\boldsymbol{\alpha}$

$= {\left(d-1\right)}^{2}\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}} .$

So, the proof is completed. ■
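The identity established in the proof can also be checked numerically: the leading terms of the matrix expression (3.7) and of the spectral expression (3.8) should agree for any symmetric positive definite $\boldsymbol{C}_{F}$. The sketch below uses randomly generated stand-ins for $\boldsymbol{C}_{F}$, $\boldsymbol{R}$, $\boldsymbol{\beta}$, $\sigma^{2}$, and $d$; these values are arbitrary and serve only as a sanity check.

```python
import numpy as np

rng = np.random.default_rng(1)
p, q, d, sigma2 = 4, 2, 0.6, 1.5

A = rng.normal(size=(p, p))
C_F = A @ A.T + p * np.eye(p)      # arbitrary symmetric positive definite stand-in
R = rng.normal(size=(q, p))        # arbitrary restriction matrix of rank q
beta = rng.normal(size=p)
I = np.eye(p)

C_inv = np.linalg.inv(C_F)
H = C_inv @ (I - R.T @ np.linalg.inv(R @ C_inv @ R.T) @ R @ C_inv)
F_d = np.linalg.solve(C_F + I, C_F + d * I)

# Leading terms of the matrix form (3.7).
mse_matrix = sigma2 * np.trace(F_d @ H @ F_d.T) + beta @ (F_d - I).T @ (F_d - I) @ beta

# Leading terms of the spectral form (3.8).
lam, Gamma = np.linalg.eigh(C_F)
alpha = Gamma.T @ beta
m = np.diag(Gamma.T @ H @ Gamma)   # m_jj, the diagonal of M = Gamma' H Gamma
mse_spectral = (
    sigma2 * np.sum((lam + d) ** 2 / (lam + 1) ** 2 * m)
    + (d - 1) ** 2 * np.sum(alpha**2 / (lam + 1) ** 2)
)
print(mse_matrix, mse_spectral)
```

Both numbers agree up to floating-point error, which confirms that only the diagonal of $\boldsymbol{M}$ enters the variance term.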

As an important result of Theorem 3.1, the optimal value of the biasing parameter d can be obtained by differentiating the MSE function of GLSRLE as a function of d (same as $g$ (d)) with respect to d, and solve it by setting the derivative equal to zero to extract the optimal value of d. Via direct calculation, we have

${g}'\left(d\right) = \frac{\partial \mathrm{MSE}\left({\widehat{\boldsymbol{\beta }}}_{d}\right)}{\partial d} = 2{\sigma }^{2}\sum _{j = 1}^{p}\frac{\left({\lambda }_{j}+d\right)}{{\left({\lambda }_{j}+1\right)}^{2}}{m}_{jj}+2\left(d-1\right)\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}} = 0 .$

So, it can be written

$\left({\sigma }^{2}\sum \limits_{j = 1}^{p}\frac{{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}+\sum \limits_{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}\right)d = \sum \limits_{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}-{\sigma }^{2}\sum \limits_{j = 1}^{p}\frac{{\lambda }_{j}{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}$

$\Rightarrow d = \frac{\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}-{\sigma }^{2}\sum _{j = 1}^{p}\frac{{\lambda }_{j}{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}}{{\sigma }^{2}\sum _{j = 1}^{p}\frac{{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}+\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}}$

$= \frac{\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}\pm {\sigma }^{2}\sum _{j = 1}^{p}\frac{{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}-{\sigma }^{2}\sum _{j = 1}^{p}\frac{{\lambda }_{j}{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}}{{\sigma }^{2}\sum _{j = 1}^{p}\frac{{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}+\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}}$

$= 1-{\sigma }^{2}\frac{\sum _{j = 1}^{p}\frac{{m}_{jj}}{\left({\lambda }_{j}+1\right)}}{\sum _{j = 1}^{p}\frac{{\alpha }_{j}^{2}+{\sigma }^{2}{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}} .$

Since ${g}''\left(d\right) = 2\sum _{j = 1}^{p}\frac{{\sigma }^{2}{m}_{jj}+{\alpha }_{j}^{2}}{{\left({\lambda }_{j}+1\right)}^{2}}$ is positive for all values of d, the stationary point obtained above minimizes the MSE function of GLSRLE. For practical purposes, the following estimator of the optimal d can be used in applications:

$\widehat{d} = 1-{\widehat{\sigma }}_{FGLSR}^{2}\left(\frac{\sum _{j = 1}^{p}\frac{{m}_{jj}}{({\lambda }_{j}+1)}}{\sum _{j = 1}^{p}\frac{{\widehat{\alpha }}_{jFGLSR}^{2}+{\widehat{\sigma }}_{FGLSR}^{2}{m}_{jj}}{{\left({\lambda }_{j}+1\right)}^{2}}}\right) ,$

(3.9)

where ${\widehat{\sigma }}_{FGLSR}^{2}$ and ${\widehat{\alpha }}_{jFGLSR}$ are the unbiased estimators of ${\sigma }^{2}$ and ${\alpha }_{j}$ based on GLSRE, respectively, i.e., ${\widehat{\sigma }}_{FGLSR}^{2} = \frac{1}{n-(p+q)}{\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLSR}\right)}^{\top }{\boldsymbol{S}}^{-1}\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLSR}\right)$ and ${\widehat{\boldsymbol{\alpha }}}_{FGLSR} = {\boldsymbol{\varGamma }}^{\top }{\widehat{\boldsymbol{\beta }}}_{FGLSR}$ , in which

${\widehat{\boldsymbol{\beta }}}_{FGLSR} = {\widehat{\boldsymbol{\beta }}}_{FGLS}-{\boldsymbol{C}}_{F}^{{\bf{-1}}}{\boldsymbol{R}}^{\mathrm{\top }}{\left(\boldsymbol{R}{\boldsymbol{C}}_{F}^{{\bf{-1}}}{\boldsymbol{R}}^{\mathrm{\top }}\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{r}\right) .$

(3.10)
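The closed form for the optimal d derived above can be checked numerically. The following Python sketch assumes that the spectral quantities $\lambda_j$, $m_{jj}$, $\alpha_j$ and $\sigma^2$ are already available as plain lists and numbers; the function names and the toy values are illustrative only and are not taken from the paper. It evaluates $g(d)$, its derivative, and the minimizer, so that one can confirm that $g'(d)$ vanishes at the reported value.

```python
def liu_mse(d, lam, m, alpha, sigma2):
    """g(d): variance part plus squared-bias part, in the notation of the text."""
    var = sigma2 * sum((l + d) ** 2 * mj / (l + 1) ** 2 for l, mj in zip(lam, m))
    bias2 = (d - 1) ** 2 * sum(a ** 2 / (l + 1) ** 2 for l, a in zip(lam, alpha))
    return var + bias2


def liu_mse_derivative(d, lam, m, alpha, sigma2):
    """g'(d), obtained by differentiating g term by term."""
    var_part = 2 * sigma2 * sum((l + d) * mj / (l + 1) ** 2 for l, mj in zip(lam, m))
    bias_part = 2 * (d - 1) * sum(a ** 2 / (l + 1) ** 2 for l, a in zip(lam, alpha))
    return var_part + bias_part


def optimal_d(lam, m, alpha, sigma2):
    """d = 1 - sigma^2 * sum(m_jj / (lam_j + 1)) / sum((alpha_j^2 + sigma^2 m_jj) / (lam_j + 1)^2)."""
    num = sum(mj / (l + 1) for l, mj in zip(lam, m))
    den = sum((a ** 2 + sigma2 * mj) / (l + 1) ** 2 for l, mj, a in zip(lam, m, alpha))
    return 1 - sigma2 * num / den


# Toy inputs (hypothetical): one small eigenvalue mimics a near-collinear direction.
lam = [5.0, 1.0, 0.05]
m = [0.2, 1.0, 20.0]
alpha = [1.0, -2.0, 0.5]
sigma2 = 1.0
d_star = optimal_d(lam, m, alpha, sigma2)
```

In practice the unknown $\sigma^2$ and $\alpha_j$ would be replaced by their estimates, which is exactly what (3.9) does.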

As was mentioned earlier, outlying observations can significantly corrupt the least-squares estimators, and all of the estimators based on them, because of their large impact on the objective function. Robust regression is a broad term that encompasses various estimation approaches. Least trimmed squares is a robust regression method introduced by ^[43]. LTS tackles this issue by minimizing the sum of the h smallest squared residuals, which amounts to discarding a specified percentage of the most extreme observations. In this case, h serves as a threshold, and the proportion of the outlying data is represented by the ratio α = (n − h)/n.

Typically, the value of h can be taken as h = [[n(1 − α)]], where [[x]] stands for the ceiling of x. Some other authors suggest taking $h = \left[n/2\right]+\left[(p+1)/2\right]$ , $h = \left[n\left(1-\alpha \right)\right]+\left[\alpha \left(p+1\right)\right]$ , or $h = \left[n\left(1-\alpha \right)\right]+1$ (see ^[44]). In principle, the LTS estimator is obtained by computing the least-squares fit for each of the $\left(\begin{array}{c}n\\ h\end{array}\right)$ subsets of size h of the index set {1, ..., n} and retaining the fit with the smallest objective value. Thus, for large sample sizes, finding the global minimum of the LTS objective function is expensive in both time and memory. To accelerate the search for the LTS fit, we use an analogue of the FAST-LTS algorithm extended by ^[24].
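To give a feeling for why an exhaustive search over all h-subsets is impractical, the short Python sketch below computes the subset size from the ceiling rule and counts the candidate subsets. The value n = 150 matches the sample size used later in the simulations, while α = 0.25 and the function name are illustrative choices.

```python
import math


def lts_subset_size(n, alpha):
    """h = ceil(n * (1 - alpha)): number of observations retained by LTS."""
    return math.ceil(n * (1 - alpha))


n, alpha = 150, 0.25
h = lts_subset_size(n, alpha)
# Number of h-subsets an exhaustive LTS search would have to fit.
n_subsets = math.comb(n, h)
```

Even at this moderate sample size the count has dozens of digits, which is the motivation for approximate search strategies.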

Let ${z}_{i}$ represent the indicator variable that signifies whether or not observation $i$ is regarded as a normal observation. The optimization problem of a feasible robust estimator based on the LTS approach in RSRM can be developed as follows:

$\begin{array}{c} \min_{\boldsymbol{\beta }, \boldsymbol{z}}\psi \left(\boldsymbol{\beta }, \boldsymbol{z}\right) = {\left(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }\right)}^{\top }{\boldsymbol{S}}^{-\frac{1}{2}}\boldsymbol{Z}{\boldsymbol{S}}^{-\frac{1}{2}}\left(\widetilde {\boldsymbol{y}}-\widetilde {\boldsymbol{X}}\boldsymbol{\beta }\right) \\ s.t. \;\;\boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r} , \\ {\boldsymbol{e}}^{\top }\boldsymbol{z} = h , \\ {z}_{i}\in \left\{0, 1\right\}, \; i = 1, \dots , n, \end{array}$

(3.11)

where $\boldsymbol{Z}$ is the diagonal matrix with diagonal elements $\boldsymbol{z} = {({z}_{1}, \dots , {z}_{n})}^{\top }$ , $\boldsymbol{e} = {(1, \dots , 1)}_{n\times 1}^{\top }$ , and $h$ is a positive integer. The resultant estimator is the feasible generalized least trimmed squares restricted estimator (FGLTSRE), which is given by

${\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right) = {\widehat{\boldsymbol{\beta }}}_{FGLTS}\left(\boldsymbol{z}\right)-{{\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}{\left({\boldsymbol{R}{\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLTS}\left(\boldsymbol{z}\right)-\boldsymbol{r}\right) ,$

(3.12)

where ${\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right) = {\widetilde {\boldsymbol{X}}}^{\top }{\boldsymbol{S}}^{-1/2}\boldsymbol{Z}{\boldsymbol{S}}^{-1/2}\widetilde {\boldsymbol{X}}$ and ${\widehat{\boldsymbol{\beta }}}_{FGLTS}\left(\boldsymbol{z}\right) = {{\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)}^{-1}{\widetilde {\boldsymbol{X}}}^{\top }{\boldsymbol{S}}^{-1/2}\boldsymbol{Z}{\boldsymbol{S}}^{-1/2}\widetilde {\boldsymbol{y}}$ .
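The role of the indicator vector z can be illustrated with the concentration step (C-step) on which FAST-LTS-type algorithms are built: given a current fit, keep the h observations with the smallest squared residuals, refit on them, and repeat until the retained subset stops changing. The Python sketch below is a deliberately simplified illustration and not the algorithm of the paper: it assumes a straight-line model, takes S = I, ignores the restriction Rβ = r and the nonparametric part, and uses a single starting fit, whereas practical implementations use many random starts. All names and data are hypothetical.

```python
def ols_line(xs, ys):
    """Ordinary least-squares intercept and slope for a straight line."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def c_step(xs, ys, beta, h):
    """Keep the h observations with the smallest squared residuals, then refit."""
    b0, b1 = beta
    r2 = [(y - b0 - b1 * x) ** 2 for x, y in zip(xs, ys)]
    keep = sorted(range(len(xs)), key=lambda i: r2[i])[:h]
    z = [0] * len(xs)  # z_i = 1 marks an observation treated as normal
    for i in keep:
        z[i] = 1
    return ols_line([xs[i] for i in keep], [ys[i] for i in keep]), z


def lts_line(xs, ys, h, max_iter=50):
    """Iterate C-steps from the full-sample OLS fit until the subset is stable."""
    beta, z = ols_line(xs, ys), [1] * len(xs)
    for _ in range(max_iter):
        new_beta, new_z = c_step(xs, ys, beta, h)
        converged = new_z == z
        beta, z = new_beta, new_z
        if converged:
            break
    return beta, z


# Toy data: exact line y = 2 + 3x with four shifted observations.
xs = [float(i) for i in range(20)]
ys = [2.0 + 3.0 * x for x in xs]
for i in (2, 7, 12, 17):
    ys[i] += 40.0
(b0_hat, b1_hat), z_hat = lts_line(xs, ys, h=16)
```

On this toy data the retained subset excludes the four shifted points, so the trimmed fit recovers the underlying line, while the full-sample least-squares intercept is pulled upward.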

Now, we combine the robust estimators obtained above with the Liu idea to obtain a novel feasible robust Liu estimator that is resistant to both multicollinearity and outliers in the data set. The feasible generalized least trimmed squares restricted Liu estimator (FGLTSRLE) for RSRM, based on a two-stage estimator for d and β, is defined as follows:

$\begin{array}{l} {\widehat{\sigma }}_{FGLTSR}^{2} = \frac{1}{n-(p+q)}{\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right)}^{\top }{\boldsymbol{S}}^{-\frac{1}{2}}\boldsymbol{Z}{\boldsymbol{S}}^{-\frac{1}{2}}\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right), \\ {\widehat{d}}_{LTS} = 1-{\widehat{\sigma }}_{FGLTSR}^{2}\left(\frac{\sum _{j = 1}^{p}\frac{{m}_{jj}\left(\boldsymbol{z}\right)}{{\lambda }_{j}\left(\boldsymbol{z}\right)+1}}{\sum _{j = 1}^{p}\frac{{\widehat{\alpha }}_{jFGLTSR}^{2}\left(\boldsymbol{z}\right)+{\widehat{\sigma }}_{FGLTSR}^{2}{m}_{jj}\left(\boldsymbol{z}\right)}{{\left({\lambda }_{j}\left(\boldsymbol{z}\right)+1\right)}^{2}}}\right), \\ {\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left({\widehat{d}}_{LTS}, \boldsymbol{z}\right) = {\boldsymbol{F}}_{{\widehat{d}}_{LTS}}\left(\boldsymbol{z}\right){\widehat{\boldsymbol{\beta }}}_{FGLTS}\left(\boldsymbol{z}\right)-{\boldsymbol{F}}_{{\widehat{d}}_{LTS}}\left(\boldsymbol{z}\right){{\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\top }{\left(\boldsymbol{R}{{\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\top }\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLTS}\left(\boldsymbol{z}\right)-\boldsymbol{r}\right), \end{array}$

(3.13)

where ${\lambda }_{j}\left(\boldsymbol{z}\right)$ is the jth eigenvalue of matrix ${\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right) = \boldsymbol{\varGamma }\left(\boldsymbol{z}\right)\boldsymbol{\varLambda }\left(\boldsymbol{z}\right){\boldsymbol{\varGamma }\left(\boldsymbol{z}\right)}^{\top }$ , ${m}_{jj}\left(\boldsymbol{z}\right)$ is the jth diagonal element of the matrix $\boldsymbol{M}\left(\boldsymbol{z}\right) = {\boldsymbol{\varGamma }\left(\boldsymbol{z}\right)}^{\top }\boldsymbol{H}\left(\boldsymbol{z}\right)\boldsymbol{\varGamma }\left(\boldsymbol{z}\right)$ in which $\boldsymbol{H}\left(\boldsymbol{z}\right) = {\boldsymbol{C}\left(\boldsymbol{z}\right)}^{-1}\left(\boldsymbol{I}-{\boldsymbol{R}}^{\top }{\left(\boldsymbol{R}{\boldsymbol{C}\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\top }\right)}^{-1}\boldsymbol{R}{\boldsymbol{C}\left(\boldsymbol{z}\right)}^{-1}\right)$ , ${\widehat{\alpha }}_{jFGLTSR}\left(\boldsymbol{z}\right)$ is the jth element of ${\widehat{\boldsymbol{\alpha }}}_{FGLTSR}\left(\boldsymbol{z}\right) = {\boldsymbol{\varGamma }\left(\boldsymbol{z}\right)}^{\top }{\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)$ , and ${\boldsymbol{F}}_{{\widehat{d}}_{LTS}}\left(\boldsymbol{z}\right) = {\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+{\widehat{d}}_{LTS}\boldsymbol{I}\right)$ .
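Because ${\boldsymbol{C}}_{F}(\boldsymbol{z})$ is diagonalized by the orthogonal matrix $\boldsymbol{\varGamma}(\boldsymbol{z})$, the Liu shrinkage matrix has a simple spectral form. The identity below is a short worked step in the notation already introduced, stated for a generic d and assuming that ${\boldsymbol{C}}_{F}(\boldsymbol{z})$ is positive definite; it may help to see how the shrinkage acts direction by direction.

```latex
\boldsymbol{F}_{d}(\boldsymbol{z})
  = \bigl(\boldsymbol{C}_{F}(\boldsymbol{z})+\boldsymbol{I}\bigr)^{-1}
    \bigl(\boldsymbol{C}_{F}(\boldsymbol{z})+d\boldsymbol{I}\bigr)
  = \boldsymbol{\varGamma}(\boldsymbol{z})\,
    \mathrm{diag}\!\left(
      \frac{\lambda_{1}(\boldsymbol{z})+d}{\lambda_{1}(\boldsymbol{z})+1},\dots,
      \frac{\lambda_{p}(\boldsymbol{z})+d}{\lambda_{p}(\boldsymbol{z})+1}
    \right)
    \boldsymbol{\varGamma}(\boldsymbol{z})^{\top}
```

For 0 < d < 1 every factor lies strictly between d and 1: directions with a small eigenvalue (the near-collinear ones) are shrunk almost by the factor d, while directions with a large eigenvalue are left nearly unchanged.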

Theorem 3.2. The estimated MSE of the suggested estimator (3.13) under the linear restriction Rβ = r is given by

$\widehat{\mathrm{MSE}}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left({\widehat{d}}_{LTS}, \boldsymbol{z}\right)\right) = {\widehat{\sigma }}_{FGLTSR}^{2}\sum _{j = 1}^{p}\frac{{\left({\lambda }_{j}\left(\boldsymbol{z}\right)+{\widehat{d}}_{LTS}\right)}^{2}}{{\left({\lambda }_{j}\left(\boldsymbol{z}\right)+1\right)}^{2}}{m}_{jj}\left(\boldsymbol{z}\right)+{\left({\widehat{d}}_{LTS}-1\right)}^{2}\sum _{j = 1}^{p}\frac{{\widehat{\alpha }}_{jFGLTSR}^{2}\left(\boldsymbol{z}\right)}{{\left({\lambda }_{j}\left(\boldsymbol{z}\right)+1\right)}^{2}}+o\left({n}^{-1}\right) .$

(3.14)

Proof. The proof directly follows by mimicking the proof of Theorem 3.1. ■

Lemma 3.2. The covariance matrix of estimator ${\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)$ is smaller than the covariance matrix of estimator ${\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)$ if and only if

$0 < \tau < \underset{{\mu }_{\boldsymbol{ii}} < 0}{\mathrm{min}}\left|\frac{1}{{\mu }_{ii}}\right| ,$

(3.15)

where $\tau = \frac{1}{1+d}$ and the ${\mu }_{ii}$ 's are the eigenvalues of matrix $\boldsymbol{H}{\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{N}\left(\boldsymbol{z}\right)$ , in which $\boldsymbol{N}\left(\boldsymbol{z}\right) = {\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)+\boldsymbol{H}\left(\boldsymbol{z}\right){\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)$ .

Proof. The covariance matrix of the mentioned estimators can be written as

$\mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right) = {\sigma }^{2}{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right){{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }+o\left({n}^{-1}\right) ,$

$\mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right) = \mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d = 1, \boldsymbol{z}\right)\right) = {\sigma }^{2}\boldsymbol{H}\left(\boldsymbol{z}\right)+o\left({n}^{-1}\right) .$

So, the difference ${\Delta }^{*} = \mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right)-\mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right)$ can be expressed as follows:

${\Delta }^{*} = \mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right)-\mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right)$

$= {\sigma }^{2}\left(\boldsymbol{H}\left(\boldsymbol{z}\right)-{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right){{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }\right)$

$= {\sigma }^{2}{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\left({{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{H}\left(\boldsymbol{z}\right){\left({{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }\right)}^{-1}-\boldsymbol{H}\left(\boldsymbol{z}\right)\right){{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }$

$= {\sigma }^{2}{\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}\left(\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)-\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+d\boldsymbol{I}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+d\boldsymbol{I}\right)\right){\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}$

$= {\sigma }^{2}\left(1-{d}^{2}\right){\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}\left(\boldsymbol{H}\left(\boldsymbol{z}\right)+\frac{1}{1+d}\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)+\boldsymbol{H}\left(\boldsymbol{z}\right){\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\right)\right){\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}$

$= {\sigma }^{2}\left(1-{d}^{2}\right)\boldsymbol{U}\left(\boldsymbol{z}\right)\left(\boldsymbol{H}\left(\boldsymbol{z}\right)+\tau \boldsymbol{N}\left(\boldsymbol{z}\right)\right)\boldsymbol{U}\left(\boldsymbol{z}\right) ,$

where $\boldsymbol{U}\left(\boldsymbol{z}\right) = {\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}$ , $\tau = \frac{1}{1+d}$ , and $\boldsymbol{N}\left(\boldsymbol{z}\right) = {\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)+\boldsymbol{H}\left(\boldsymbol{z}\right){\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)$ is a symmetric matrix. Since $\boldsymbol{H}\left(\boldsymbol{z}\right) = \boldsymbol{L}{\left(\boldsymbol{z}\right)}^{\mathrm{\top }}\boldsymbol{L}\left(\boldsymbol{z}\right)$ , in which

$\boldsymbol{L}\left(\boldsymbol{z}\right) = {\left({\boldsymbol{C}}_{F}{\left(\boldsymbol{z}\right)}^{-1}-{\boldsymbol{C}}_{F}{\left(\boldsymbol{z}\right)}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}{\left(\boldsymbol{R}{\boldsymbol{C}}_{F}^{-1}{\boldsymbol{R}}^{\mathrm{\top }}\right)}^{-1}\boldsymbol{R}{\boldsymbol{C}}_{F}{\left(\boldsymbol{z}\right)}^{-1}\right)}^{1/2} ,$

and $\mathrm{R}\mathrm{a}\mathrm{n}\mathrm{k}\left(\boldsymbol{L}\left(\boldsymbol{z}\right)\right) = p-q < n$ , then $\boldsymbol{H}\left(\boldsymbol{z}\right)$ is a positive definite matrix. Therefore, a nonsingular matrix Q exists such that ${\boldsymbol{Q}}^{\mathrm{\top }}\boldsymbol{H}\left(\boldsymbol{z}\right)\boldsymbol{Q} = \boldsymbol{I}$ and ${\boldsymbol{Q}}^{\mathrm{\top }}\boldsymbol{N}\left(\boldsymbol{z}\right)\boldsymbol{Q} = \boldsymbol{P}\left(\boldsymbol{z}\right)$ , where $\boldsymbol{P}\left(\boldsymbol{z}\right)$ is a diagonal matrix and its diagonal elements are the roots of the polynomial equation $\left|\boldsymbol{H}{\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{N}\left(\boldsymbol{z}\right)-\mu \boldsymbol{I}\right| = 0$ (see Graybill ^[45], pp. 408; and Harville ^[46] pp. 563), and so we have

${\Delta }^{*} = {\sigma }^{2}\left(1-{d}^{2}\right)\boldsymbol{U}\left(\boldsymbol{z}\right){\left({\boldsymbol{Q}}^{\mathrm{\top }}\right)}^{-1}\left({\boldsymbol{Q}}^{\mathrm{\top }}\boldsymbol{H}\left(\boldsymbol{z}\right)\boldsymbol{Q}+\tau {\boldsymbol{Q}}^{\mathrm{\top }}\boldsymbol{N}\left(\boldsymbol{z}\right)\boldsymbol{Q}\right){\boldsymbol{Q}}^{-1}\boldsymbol{U}\left(\boldsymbol{z}\right)$

$= {\sigma }^{2}\left(1-{d}^{2}\right)\boldsymbol{U}\left(\boldsymbol{z}\right){\left({\boldsymbol{Q}}^{\mathrm{\top }}\right)}^{-1}\left(\boldsymbol{I}+\tau \boldsymbol{P}\left(\boldsymbol{z}\right)\right){\boldsymbol{Q}}^{-1}\boldsymbol{U}\left(\boldsymbol{z}\right) ,$

where $\boldsymbol{I}+\tau \boldsymbol{P}\left(\boldsymbol{z}\right) = \mathrm{d}\mathrm{i}\mathrm{a}\mathrm{g}(1+\tau {\mu }_{11}, \dots, 1+\tau {\mu }_{pp})$ . And, since $\boldsymbol{N}\left(\boldsymbol{z}\right) = {\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right)+\boldsymbol{H}\left(\boldsymbol{z}\right){\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)\ne {\bf{0}}$ , then at least one of diagonal elements of $\boldsymbol{P}\left(\boldsymbol{z}\right)$ is nonzero. Assume ${\mu }_{ii} < 0$ for at least one i. Then, the positive definiteness of $\boldsymbol{I}+\tau \boldsymbol{P}\left(\boldsymbol{z}\right)$ is ensured by

$0 < \tau < \underset{{\mu }_{\boldsymbol{ii}} < 0}{\mathrm{min}}\left|\frac{1}{{\mu }_{ii}}\right| .$

As a result, $1+\tau {\mu }_{ii} > 0$ for every i = 1, ..., p, so $\boldsymbol{I}+\tau \boldsymbol{P}\left(\boldsymbol{z}\right)$ is a positive definite matrix and, therefore, so is ${\Delta }^{*}$ . It is now clear that ${\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)$ has a smaller variance than ${\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)$ if and only if condition (3.15) holds. ■
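As a quick sanity check of the factorization used in the proof, consider the purely illustrative scalar case in which ${\boldsymbol{C}}_{F}(\boldsymbol{z})$ and $\boldsymbol{H}(\boldsymbol{z})$ are replaced by positive numbers c and h (these scalars are assumptions introduced only for this check). Then $N = 2ch$, $U = (c+1)^{-1}$ and $F_d = (c+d)/(c+1)$, and both routes give the same difference:

```latex
\Delta^{*}
  = \sigma^{2} h \left(1 - \frac{(c+d)^{2}}{(c+1)^{2}}\right)
  = \sigma^{2} h \, \frac{(1-d)(1+d+2c)}{(c+1)^{2}},
\qquad
\sigma^{2}(1-d^{2})\, U \left(h + \frac{2ch}{1+d}\right) U
  = \sigma^{2} h \, \frac{(1-d)(1+d+2c)}{(c+1)^{2}} .
```

In this scalar case the single eigenvalue $\mu = 2c$ is positive, so there is no negative eigenvalue to restrict τ, and the difference is positive for every 0 < d < 1.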

Next, we provide the necessary and sufficient condition under which the FGLTSRLE in RSRM is preferable to the FGLTSRE in the sense of the mean squared error matrix (MSEM). The following lemma is required for the proof of the forthcoming theorem.

Lemma 3.3. (Farebrother ^[47]) Let A be a p×p positive definite matrix, b be a p×1 nonzero vector, and δ be a positive scalar. Then, δA−bb^⊤ is non-negative definite if and only if $\boldsymbol{b}^{\top }\boldsymbol{A}^{-1}\boldsymbol{b}\le \delta$ .

Theorem 3.3. Consider the estimator ${\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)$ under the linear regression model with true restrictions Rβ = r. Then ${\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left({\widehat{d}}_{LTS}, \boldsymbol{z}\right)$ is MSEM superior to ${\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)$ if and only if

$\frac{1-d}{1+d}{\boldsymbol{\beta }}^{\mathrm{\top }}\boldsymbol{G}{\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{\beta }\le {\sigma }^{2} ,$

(3.16)

where $\boldsymbol{G}\left(\boldsymbol{z}\right) = \boldsymbol{H}\left(\boldsymbol{z}\right)+\tau \boldsymbol{N}\left(\boldsymbol{z}\right)$ .

Proof. We prove the necessary and sufficient conditions for the MSEM difference $\Delta = \mathrm{MSEM}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right)-\mathrm{MSEM}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right)$ , where

$\mathrm{MSEM}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right) = \mathrm{Cov}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right)+\mathrm{Bias}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right){\left(\mathrm{Bias}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d, \boldsymbol{z}\right)\right)\right)}^{\top }$

$= {\sigma }^{2}{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\boldsymbol{H}\left(\boldsymbol{z}\right){{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }+\left(\boldsymbol{I}-{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\right)\boldsymbol{\beta }{\boldsymbol{\beta }}^{\top }{\left(\boldsymbol{I}-{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\right)}^{\top }+o\left({n}^{-1}\right) ,$

$\mathrm{MSEM}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSR}\left(\boldsymbol{z}\right)\right) = \mathrm{MSEM}\left({\widehat{\boldsymbol{\beta }}}_{FGLTSRL}\left(d = 1, \boldsymbol{z}\right)\right) = {\sigma }^{2}\boldsymbol{H}\left(\boldsymbol{z}\right)+o\left({n}^{-1}\right) .$

According to the proof of Lemma 3.2., the difference $\Delta$ can be expressed as follows:

$\begin{array}{l} \Delta = {\sigma }^{2}{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)\left({{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{H}\left(\boldsymbol{z}\right){\left({{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }\right)}^{-1}-\boldsymbol{H}\left(\boldsymbol{z}\right)\right){{\boldsymbol{F}}_{d}\left(\boldsymbol{z}\right)}^{\top }\\ \;\;\;\;\;\;-{\left(1-d\right)}^{2}{\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1}\boldsymbol{\beta }{\boldsymbol{\beta }}^{\top }{\left({\boldsymbol{C}}_{F}\left(\boldsymbol{z}\right)+\boldsymbol{I}\right)}^{-1} \end{array}$

$= \boldsymbol{U}\left(\boldsymbol{z}\right)\left\{{\sigma }^{2}\left(1-{d}^{2}\right)\left(\boldsymbol{H}\left(\boldsymbol{z}\right)+\tau \boldsymbol{N}\left(\boldsymbol{z}\right)\right)-{\left(1-d\right)}^{2}\boldsymbol{\beta }{\boldsymbol{\beta }}^{\mathrm{\top }}\right\}\boldsymbol{U}\left(\boldsymbol{z}\right)$

$= {\left(1-d\right)}^{2}\boldsymbol{U}\left(\boldsymbol{z}\right)\left\{{\sigma }^{2}\frac{1+d}{1-d}\left(\boldsymbol{H}\left(\boldsymbol{z}\right)+\tau \boldsymbol{N}\left(\boldsymbol{z}\right)\right)-\boldsymbol{\beta }{\boldsymbol{\beta }}^{\mathrm{\top }}\right\}\boldsymbol{U}\left(\boldsymbol{z}\right)$

$= {\left(1-d\right)}^{2}\boldsymbol{U}\left(\boldsymbol{z}\right)\left({\sigma }^{2}\frac{1+d}{1-d}\boldsymbol{G}\left(\boldsymbol{z}\right)-\boldsymbol{\beta }{\boldsymbol{\beta }}^{\mathrm{\top }}\right)\boldsymbol{U}\left(\boldsymbol{z}\right) ,$

where $\boldsymbol{G}\left(\boldsymbol{z}\right) = \boldsymbol{H}\left(\boldsymbol{z}\right)+\tau \boldsymbol{N}\left(\boldsymbol{z}\right)$ . Now, by using Lemma 3.3 and supposing that condition (3.15) is met, it is concluded that $\Delta$ is positive definite if and only if

$\frac{1-d}{1+d}{\boldsymbol{\beta }}^{\mathrm{\top }}\boldsymbol{G}{\left(\boldsymbol{z}\right)}^{-1}\boldsymbol{\beta }\le {\sigma }^{2}, 0 < d < 1.$

■
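The equivalence stated in Theorem 3.3 can be checked numerically in a one-dimensional toy setting. The Python sketch below treats ${\boldsymbol{C}}_{F}(\boldsymbol{z})$ and $\boldsymbol{H}(\boldsymbol{z})$ as positive scalars c and h, which is an assumption made purely for illustration, and compares the sign of the MSE difference with condition (3.16) over a small grid of hypothetical values.

```python
def msem_gap(d, c, h, beta, sigma2):
    """MSE(restricted) - MSE(Liu) when C_F = c and H = h are positive scalars."""
    f = (c + d) / (c + 1)                      # scalar version of F_d
    mse_restricted = sigma2 * h                # d = 1: no shrinkage, no bias
    mse_liu = sigma2 * f ** 2 * h + (1 - f) ** 2 * beta ** 2
    return mse_restricted - mse_liu


def condition_holds(d, c, h, beta, sigma2):
    """Scalar version of (1 - d) / (1 + d) * beta' G^{-1} beta <= sigma^2."""
    tau = 1 / (1 + d)
    g = h + tau * (2 * c * h)                  # G = H + tau * N with N = 2ch
    return (1 - d) / (1 + d) * beta ** 2 / g <= sigma2


# Hypothetical grid; sigma^2 is fixed at 1 for simplicity.
grid = [(d, c, h, b)
        for d in (0.1, 0.5, 0.9)
        for c in (0.05, 1.0, 10.0)
        for h in (0.5, 2.0)
        for b in (0.5, 3.0)]
all_agree = all((msem_gap(d, c, h, b, 1.0) >= 0) == condition_holds(d, c, h, b, 1.0)
                for d, c, h, b in grid)
```

The grid contains cases on both sides of the condition, for example a large |β| combined with a small d, for which the bias of the Liu estimator outweighs its variance reduction.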

Theorem 3.4. Let us be given the estimator ${\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)$ under the linear regression model with true restrictions Rβ = r. ${\widehat{\boldsymbol{\beta }}}_{FGLSRL}\left(d\right)$ is MSEM superior to ${\widehat{\boldsymbol{\beta }}}_{FGLSR}$ if and only if

$\frac{1-d}{1+d}{\boldsymbol{\beta }}^{\mathrm{\top }}{\boldsymbol{G}}^{-1}\boldsymbol{\beta }\le {\sigma }^{2} ,$

(3.17)

where $\boldsymbol{G} = \boldsymbol{H}+\tau \boldsymbol{N}$ and $\boldsymbol{N} = {\boldsymbol{C}}_{F}\boldsymbol{H}+\boldsymbol{H}{\boldsymbol{C}}_{F}$ .

Proof. The desired result is obtained in the same way as in the proof of Theorem 3.3. ■

4. Illustrative experiments

To demonstrate the advantages of the improved techniques that have been proposed for the restricted semiparametric regression model in the presence of the multicollinearity and outlier problems simultaneously, we examine the theoretical findings using some numerical experiments in this section. We evaluate the performance of the proposed techniques in both a real-world data set and Monte Carlo simulation schemes.

4.1. The Monte Carlo simulation studies

We conduct a numerical study to evaluate the precision of our robust estimators for RSRM when the data are contaminated with outliers and affected by multicollinearity. To obtain various levels of multicollinearity, we follow the approach proposed by ^[48,49]: in each of the ${10}^{3}$ replications, the explanatory variables are randomly generated, with n = 150 observations, according to the following model:

${x}_{ij} = {(1-{\gamma }^{2})}^{1/2}{z}_{ij}+\gamma {z}_{ip}, i = 1, \dots , n\;\mathrm{a}\mathrm{n}\mathrm{d}\;j = 1, \dots , p ,$

where ${z}_{ij}$ are independent standard normal pseudo-random variables, and $\gamma$ is chosen such that the correlation between any two explanatory variables is equal to ${\gamma }^{2}$ . These variables are subsequently standardized so that ${\boldsymbol{X}}^{\top }\boldsymbol{X}$ and ${\boldsymbol{X}}^{\top }\boldsymbol{y}$ are in correlation form. Four levels of correlation are investigated, corresponding to $\gamma = 0.25, \; 0.50, \; 0.75$ , and $0.95$ . For the dependent variable, n observations are then calculated by

${y}_{i} = \sum _{j = 1}^{5}{x}_{ij}{\beta }_{j}+f\left({t}_{i}\right)+{\varepsilon }_{i}, i = 1, \dots , n ,$

(4.1)

where

$\boldsymbol{\beta } = {(-1, \;4, \;2, \;-5, \;-3)}^{\mathrm{\top }} ,$

$f\left(t\right) = \exp\left\{\mathrm{sin}\left(t\right)\mathrm{cos}\left(t\right)+\sqrt{t}\right\}, \; t\in \left[0, 3\right] ,$

${\boldsymbol{\varepsilon }}_{(n\times 1)} = {\left({\boldsymbol{\varepsilon }}_{1}^{\mathrm{\top }}, {\boldsymbol{\varepsilon }}_{2}^{\mathrm{\top }}\right)}^{\mathrm{\top }} ,$

in which

${\boldsymbol{\varepsilon }}_{1 (h\times 1)}^{\mathrm{\top }}\sim {N}_{h}\left({\bf{0}}, {\sigma }^{2}\boldsymbol{V}\right), {\sigma }^{2} = 1.64, \left[{v}_{ij}\right] = exp\left\{-9|i-j|\right\}, h = \left[0.25n\right], \left[0.33n\right], \left[0.50n\right]$

and

${\boldsymbol{\varepsilon }}_{2\, \left((n-h)\times 1\right)}^{\top }\overset{i.i.d.}{\sim }{\chi }_{1}^{2}\left(15\right) ,$

where ${\chi }_{m}^{2}\left(\delta \right)$ represents the non-central Chi-squared distribution with m degrees of freedom and non-centrality parameter δ. The primary motivation for selecting this error structure is to contaminate the data set and assess the resistance of the suggested techniques. Specifically, the first h error terms are dependent normal random variables, while the last n−h error terms are independent non-central Chi-squared random variables. The non-centrality parameter causes the outliers to lie on one side of the true regression function and thereby biases the non-robust estimates. For the restriction, we consider the following stochastic linear restrictions:

$\boldsymbol{R} = \left(\begin{array}{ccccc} 1 & 5 & -3 & -1 & -1 \\ -2 & -1 & 0 & -2 & 3 \\ 1 & 2 & 1 & 3 & -2 \\ 4 & -1 & 2 & 2 & 0 \end{array}\right), {\boldsymbol{r}} = \boldsymbol{R} \boldsymbol{\beta}.$
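The data-generating design described above can be sketched in a few lines of Python (the paper's own computations were done in R). Several simplifications are assumptions of this sketch and are not taken from the paper: the correlation-inducing component is drawn as a separate shared column, as in the usual form of this device; the design points $t_i$ are equally spaced on [0, 3]; the normal errors are drawn independently, since the off-diagonal entries $\exp\{-9|i-j|\}$ of V are tiny; h is set so that about 25% of the errors are contaminated; the non-central Chi-squared draws use the convention that $(Z+\sqrt{\delta})^2$ with standard normal Z has non-centrality δ; and the standardization to correlation form is omitted.

```python
import math
import random


def make_regressors(n, p, gamma, rng):
    """x_ij = sqrt(1 - gamma^2) * z_ij + gamma * (shared standard normal)."""
    rows = []
    for _ in range(n):
        shared = rng.gauss(0.0, 1.0)  # common component, gives correlation gamma^2
        rows.append([math.sqrt(1.0 - gamma ** 2) * rng.gauss(0.0, 1.0) + gamma * shared
                     for _ in range(p)])
    return rows


def f_true(t):
    """Nonparametric component f(t) = exp(sin(t) cos(t) + sqrt(t))."""
    return math.exp(math.sin(t) * math.cos(t) + math.sqrt(t))


def make_errors(n, h, sigma2, delta, rng):
    """First h errors normal (independent here), last n - h non-central chi-squared(1)."""
    clean = [rng.gauss(0.0, math.sqrt(sigma2)) for _ in range(h)]
    outliers = [(rng.gauss(0.0, 1.0) + math.sqrt(delta)) ** 2 for _ in range(n - h)]
    return clean + outliers


def sample_correlation(u, v):
    n = len(u)
    ub, vb = sum(u) / n, sum(v) / n
    suv = sum((a - ub) * (b - vb) for a, b in zip(u, v))
    suu = sum((a - ub) ** 2 for a in u)
    svv = sum((b - vb) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)


rng = random.Random(2024)            # arbitrary seed, for reproducibility only
n, p, gamma = 150, 5, 0.95
beta = [-1.0, 4.0, 2.0, -5.0, -3.0]
h = n - int(0.25 * n)                # about 25% contaminated observations
X = make_regressors(n, p, gamma, rng)
t = [3.0 * i / (n - 1) for i in range(n)]
eps = make_errors(n, h, 1.64, 15.0, rng)
y = [sum(x * b for x, b in zip(row, beta)) + f_true(ti) + e
     for row, ti, e in zip(X, t, eps)]
```

Because the contaminated errors are non-negative with a large mean, the corresponding responses sit systematically above the true regression surface, which is what makes them one-sided outliers.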

For estimating the nonparametric part of model (4.1), f (.), the weight proposed by ^[50] with the Gaussian kernel is used as follows:

${W}_{\omega }\left({t}_{j}\right) = \frac{1}{n\omega }K\left(\frac{{t}_{i}-{t}_{j}}{\omega }\right) = \frac{1}{n\omega }\frac{1}{\sqrt{2\pi }}\exp\left\{-\frac{{\left({t}_{i}-{t}_{j}\right)}^{2}}{2{\omega }^{2}}\right\} .$

Also, the cross-validation (C.V.) approach is applied for obtaining the optimum value of bandwidth $\omega$ , which minimizes the C.V. criterion.
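A minimal Python sketch of Gaussian-kernel weights and a leave-one-out cross-validation search over a bandwidth grid is given below. It is only an illustration under stated assumptions: the weights are normalized to sum to one within each row before smoothing, the smoother is applied directly to a response on t rather than to the partial residuals of the semiparametric model, and the grid, sample size and function names are hypothetical.

```python
import math


def gaussian_kernel(u):
    """Standard normal density K(u)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def kernel_weight(t_i, t_j, omega, n):
    """W = K((t_i - t_j) / omega) / (n * omega)."""
    return gaussian_kernel((t_i - t_j) / omega) / (n * omega)


def loo_cv_score(t, y, omega):
    """Leave-one-out CV criterion with row-normalized weights (an assumption)."""
    n = len(t)
    total = 0.0
    for i in range(n):
        w = [kernel_weight(t[i], t[j], omega, n) if j != i else 0.0 for j in range(n)]
        s = sum(w)
        if s == 0.0:                 # bandwidth too small to reach any neighbour
            return math.inf
        pred = sum(wj * yj for wj, yj in zip(w, y)) / s
        total += (y[i] - pred) ** 2
    return total / n


def select_bandwidth(t, y, grid):
    """Return the grid value that minimizes the CV criterion."""
    return min(grid, key=lambda omega: loo_cv_score(t, y, omega))


# Noise-free toy curve on an equally spaced grid (illustrative only).
t = [3.0 * i / 49 for i in range(50)]
y = [math.exp(math.sin(ti) * math.cos(ti) + math.sqrt(ti)) for ti in t]
grid = [0.05, 0.1, 0.2, 0.5, 1.0]
omega_cv = select_bandwidth(t, y, grid)
```

A finer grid or a numerical optimizer could replace the coarse grid used here; the selection logic stays the same.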

The non-parametric component of model (4.1) is presented in Figure 1. This wavy function is challenging to predict and offers a useful example for testing the proposed estimation techniques. All calculations were performed with R 4.3.1, the statistical software program. Tables 1–14 present a summary of the results. After iterating the process for all simulations, the minimum, maximum, mean, median, and standard deviation values of MSEs for the linear and non-linear estimators were reported in Tables 1 & 2, respectively, where

$\widehat{\mathrm{mse}}\left(\hat{\boldsymbol{f}}_{(i)}, \boldsymbol{f}\right)=\frac{1}{n M} \sum\nolimits_{m=1}^M\left\|\hat{\boldsymbol{f}}_{(i)}^{(m)}-\boldsymbol{f}\right\|_2^2, \quad \hat{\boldsymbol{f}}_{(i)}^{(m)}=\boldsymbol{K}\left(\boldsymbol{y}-\boldsymbol{X} \hat{\boldsymbol{\beta}}_{(i)}^{(m)}\right),$

in which $\hat{\boldsymbol{\beta}}_{(i)}^{(m)}$ and $\hat{\boldsymbol{f}}_{(i)}^{(m)}$ are the ith estimators of the linear and non-linear parts (i = 1, ..., 4) obtained in the mth iteration for the four proposed approaches, and $\|\boldsymbol{v}\|_2^2 = \sum_{i = 1}^q v_i^2$ for $\boldsymbol{v} = ({v}_{1}, ..., {v}_{q}{)}^{\top }$ . Also, PCDO is the percentage of the data contaminated with outliers (PCDO = 100 × $\frac{n-h}{n}$ %).
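The Monte Carlo criterion for the nonparametric part is a plain average of squared errors over replications and design points. A small Python sketch follows, with a hypothetical function name and toy numbers chosen only to make the arithmetic easy to follow.

```python
def mse_over_replications(estimates, truth):
    """(1 / (n * M)) * sum over m of ||f_hat^(m) - f||_2^2."""
    M = len(estimates)               # number of replications
    n = len(truth)                   # length of the estimated vector
    total = sum(sum((e - f) ** 2 for e, f in zip(est, truth)) for est in estimates)
    return total / (n * M)


# Two replications of a length-2 vector whose true value is (0, 0):
# squared norms are 1 + 1 = 2 and 4 + 0 = 4, so the criterion is 6 / (2 * 2).
example = mse_over_replications([[1.0, 1.0], [2.0, 0.0]], [0.0, 0.0])
```

The summary statistics reported in the tables (minimum, maximum, mean, median, and standard deviation) would then be taken over the replication-wise squared errors.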

Figure 1. The nonlinear function of the simulated model.

Table 1. Mean squared error estimations of the proposed estimators for the linear part of the simulated data sets with n = 150.

		PCDO = 25%				PCDO = 33%				PCDO = 50%
	$\gamma$	0.25	0.50	0.75	0.95	0.25	0.50	0.75	0.95	0.25	0.50	0.75	0.95
	min	1.03e+00	1.4501	1.9507	8.4008	1.5712	4.9800	4.4346	10.09635	2.7600	5.8985	8.4886	14.5415
	max	3.0344	3.7254	7.1833	38.9501	4.1060	5.6992	6.3436	23.1795	4.8609	12.5527	11.9615	39.6976
FGLSRE	mean	1.2606	2.2249	4.3570	16.5846	1.9013	3.9258	5.4962	19.9989	3.2705	9.3250	10.5542	26.3929
	median	1.1228	1.9902	4.1577	15.6974	1.8818	3.1400	5.2324	18.9837	3.1247	8.8504	10.2531	25.1407
	S.D.	1.3633	2.9277	2.5325	8.5307	2.2494	2.4276	3.7169	9.9713	6.9775	6.4574	5.7781	12.4557
	min	7.00e-02	1.1141	1.1376	5.8009	0.0937	1.6500	2.3008	9.45506	0.0750	1.0046	2.7407	12.9800
	max	2.0108	3.1206	5.8441	17.4920	1.4185	4.0889	4.9014	18.2030	3.8996	4.8417	6.2896	28.6880
FGLTSRE	mean	0.1579	1.9061	3.1640	13.6686	0.6468	3.1952	3.2871	12.0766	0.2389	2.2825	3.4733	15.1121
	median	0.0644	1.8409	3.0672	12.2794	0.4776	3.0808	3.1337	11.8957	0.1002	2.1237	3.2014	14.8865
	S.D.	0.6358	2.9805	2.2827	7.0157	1.1846	3.0649	3.4142	8.6359	1.3573	4.4232	3.6887	8.9685
	min	4.10e-01	0.0491	0.0858	2.5456	1.0680	2.0301	2.7207	4.1117	2.5720	3.1352	3.2247	13.3250
	max	1.8972	2.8717	3.1734	7.6356	2.9482	4.8827	5.5675	12.1979	4.9784	7.1075	7.2553	24.1935
FGLSRLE	mean	0.2437	0.6099	1.3926	3.8501	1.7648	2.9027	3.5436	6.2565	3.1045	4.3330	7.7663	17.4571
	median	0.1147	0.6019	0.1821	3.9388	0.6456	2.7531	3.2622	6.1038	3.0753	4.2515	7.2732	17.1550
	S.D.	0.3457	1.3459	2.5714	4.7190	2.4499	1.4511	2.7632	3.1329	6.8705	5.4671	3.7902	7.3125
	min	4.33e-03	0.0045	0.0037	0.0604	0.0345	0.0141	0.0109	0.6054	0.0017	0.1570	0.0048	1.1370
	max	1.9089	2.2738	1.1269	4.6501	0.9583	1.2217	1.3226	6.8717	1.6645	0.8509	0.8867	12.1068
FGLTSRLE	mean	0.1461	0.0992	0.1808	0.8070	0.9593	0.2282	0.3750	1.3672	0.2407	0.2941	0.4949	6.0419
	median	0.0581	0.0434	0.0745	0.3423	0.3887	0.0830	0.1660	0.9254	0.1183	0.3212	0.2139	5.8904
	S.D.	0.2205	1.1919	2.3091	3.1927	1.3874	1.2861	2.4600	2.9477	1.3493	1.9637	2.7041	3.9317

Table 2. Mean squared error estimations of the proposed estimators for the non-linear part of the simulated data sets with n = 150.

Estimator	Statistic	PCDO = 25%				PCDO = 33%				PCDO = 50%			
	$\gamma$	0.25	0.50	0.75	0.95	0.25	0.50	0.75	0.95	0.25	0.50	0.75	0.95
	min	0.0374	0.3994	3.0404	5.0481	0.0449	0.8437	3.0321	6.0326	0.9591	1.0215	5.5270	10.11928
	max	6.2170	4.8515	5.4275	10.3723	2.7879	5.8039	4.4065	13.1375	9.6363	9.6595	7.6221	28.3078
FGLSRE	mean	0.6183	1.4321	3.4064	7.3234	0.3497	1.7602	3.5049	11.3692	1.6280	2.5842	5.5436	18.4228
	median	0.3508	1.2565	3.2481	7.2132	0.2345	1.3185	3.2925	11.2302	1.3582	2.3344	6.3158	16.2578
	S.D.	0.7293	1.9897	3.4365	3.3594	0.3291	2.6560	2.5909	4.4043	3.9471	5.6909	3.6291	9.4677
	min	0.0287	0.4223	2.0463	4.0252	0.0343	0.6355	2.0375	3.0287	0.0215	0.6205	3.5196	7.0279
	max	4.0802	4.8972	4.5401	9.8426	2.5455	3.7042	2.6508	8.7396	6.3417	5.3711	5.4789	23.3797
FGLTSRE	mean	0.4200	1.2698	2.2545	5.2060	0.2251	1.6775	2.3414	6.2519	0.5627	1.5171	3.4784	12.3705
	median	0.2451	1.1965	2.1852	4.9608	0.1736	1.2286	2.2140	6.1749	0.2998	1.2809	4.2729	11.2342
	S.D.	0.4864	2.7594	3.2296	3.1450	0.1909	2.4193	2.3521	4.2299	0.7026	1.6265	2.5539	7.3965
	min	0.0281	0.0475	0.2469	1.0435	0.03421	0.5787	2.0245	2.9891	0.9282	0.9250	5.1206	8.0220
	max	5.9483	3.0696	3.9557	3.4799	2.5126	6.0783	2.5968	6.2644	9.0662	7.4303	6.9023	21.3871
FGLSRLE	mean	0.5849	0.4537	0.9337	1.3566	0.3184	1.5949	2.5431	4.4030	1.6164	2.2959	5.1532	14.4144
	median	0.3256	0.2654	0.7635	1.1312	0.2174	1.3489	2.3237	4.2665	1.3497	2.0034	5.9194	14.2567
	S.D.	0.6946	0.5153	2.4658	2.3829	0.3070	2.6909	1.9268	4.4213	3.7313	2.7059	1.6403	6.4495
	min	0.0244	0.0450	0.0470	0.0259	0.0325	0.0369	0.0385	0.0340	0.0207	0.0253	0.0270	1.1271
	max	3.8884	3.0961	0.7341	1.0003	2.3036	3.9890	0.9417	1.6739	6.7319	1.9403	1.0525	5.0367
FGLTSRLE	mean	0.3968	0.2801	0.2675	0.7239	0.2173	0.4027	0.3713	0.5898	0.5475	0.5839	0.5652	3.3780
	median	0.2340	0.1993	0.1929	0.5678	0.1742	0.2428	0.2340	0.3023	0.3232	0.3432	0.3836	3.2335
	S.D.	0.4563	0.2756	2.2493	2.1681	0.1671	1.4512	1.3876	3.2692	0.6847	0.6430	1.5663	4.3896

Table 3. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.25 and PCDO = 25%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.0025	-1.0022	-1.0021	-1.0019
${\widehat{\beta }}_{2}$	3.9439	3.9519	3.9547	3.9586
${\widehat{\beta }}_{3}$	1.8317	1.8558	1.8640	1.8757
${\widehat{\beta }}_{4}$	-4.8547	-4.8754	-4.8825	-4.8926
${\widehat{\beta }}_{5}$	-2.9235	-2.9344	-2.9382	-2.9435
$\widehat{d}$	1.0000	1.0000	0.0002	0.4316
${e}^{\mathrm{\top }}z$	150.00	116.00	150.00	116.00
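
The $\widehat{d}$ row can be read against the classical Liu estimator [16], $\widehat{\beta}_{d} = (X^{\top}X + I)^{-1}(X^{\top}X + dI)\widehat{\beta}$, which returns the unshrunken estimator when $d = 1$; this is consistent with the non-Liu methods being reported with $\widehat{d} = 1$. The sketch below is a minimal illustration for an ordinary linear model, not the feasible restricted semiparametric estimator studied in this paper. The function name `liu_estimator`, the simulated collinear design, and the coefficient values are assumptions made for the example.

```python
import numpy as np

def liu_estimator(X, y, d):
    """Classical Liu estimator: (X'X + I)^{-1} (X'X + d I) beta_ols."""
    p = X.shape[1]
    gram = X.T @ X
    beta_ols = np.linalg.solve(gram, X.T @ y)
    return np.linalg.solve(gram + np.eye(p), (gram + d * np.eye(p)) @ beta_ols)

# Simulated collinear design (an assumption for illustration, not the paper's data).
rng = np.random.default_rng(0)
n, p, gamma = 150, 5, 0.95
w = rng.standard_normal((n, p + 1))
X = np.sqrt(1.0 - gamma**2) * w[:, :p] + gamma * w[:, [p]]
beta_true = np.array([-1.0, 4.0, 2.0, -5.0, -3.0])  # illustrative values only
y = X @ beta_true + rng.standard_normal(n)

beta_ols = np.linalg.solve(X.T @ X, X.T @ y)
beta_liu = liu_estimator(X, y, d=0.4)

print("OLS:", np.round(beta_ols, 3))
print("Liu (d = 0.4):", np.round(beta_liu, 3))
print("d = 1 recovers OLS:", np.allclose(liu_estimator(X, y, 1.0), beta_ols))
```

For $0 < d < 1$ each eigen-direction of $X^{\top}X$ with eigenvalue $\lambda$ is scaled by $(\lambda + d)/(\lambda + 1) \le 1$, so the Liu coefficients are never longer in Euclidean norm than the least-squares ones.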

Table 4. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.50 and PCDO = 25%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.1020	-1.0916	-1.0014	-1.0014
${\widehat{\beta }}_{2}$	3.9059	3.9052	3.9693	3.9700
${\widehat{\beta }}_{3}$	1.8176	1.8957	1.9079	1.9101
${\widehat{\beta }}_{4}$	-4.8057	-4.8599	-4.9204	-4.9224
${\widehat{\beta }}_{5}$	-2.8398	-2.8526	-2.9581	-2.9591
$\widehat{d}$	1.000	1.000	0.0078	0.6775
${e}^{\mathrm{\top }}z$	150	131	150	131

Table 5. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.75 and PCDO = 25%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.7530	-1.4123	-1.1520	-1.0019
${\widehat{\beta }}_{2}$	3.1342	3.5500	3.8571	3.9585
${\widehat{\beta }}_{3}$	1.1027	1.3500	1.2712	1.8755
${\widehat{\beta }}_{4}$	-4.2296	-4.6705	-4.7888	-4.8925
${\widehat{\beta }}_{5}$	-2.1103	-2.5318	-2.7415	-2.9434
$\widehat{d}$	1.000	1.000	0.0054	0.6639
${e}^{\mathrm{\top }}z$	150	128	150	128

Table 6. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.95 and PCDO = 25%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-2.5080	-2.0052	-1.0036	-1.0035
${\widehat{\beta }}_{2}$	2.8241	3.0851	3.9207	3.9228
${\widehat{\beta }}_{3}$	0.4724	1.1554	1.7622	1.7684
${\widehat{\beta }}_{4}$	-3.7103	-4.1024	-4.7946	-4.8000
${\widehat{\beta }}_{5}$	-2.0002	-2.0134	-2.8919	-2.8947
$\widehat{d}$	1.000	1.000	0.6488	0.2912
${e}^{\mathrm{\top }}z$	150	122	122	108.392

Table 7. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.25 and PCDO = 33%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.0035	-1.0034	-1.0079	-1.0059
${\widehat{\beta }}_{2}$	3.9222	3.9244	3.8259	3.8707
${\widehat{\beta }}_{3}$	1.7667	1.7732	1.4776	1.6122
${\widehat{\beta }}_{4}$	-4.7985	-4.8042	-4.5488	-4.6651
${\widehat{\beta }}_{5}$	-2.8939	-2.8969	-2.7625	-2.8237
$\widehat{d}$	1.0000	1.0000	0.0067	0.6464
${e}^{\mathrm{\top }}z$	150	129	150	129

Table 8. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.50 and PCDO = 33%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.1029	-1.1025	-1.1023	-1.0021
${\widehat{\beta }}_{2}$	3.9065	3.9150	3.9199	3.9533
${\widehat{\beta }}_{3}$	1.8096	1.8109	1.8298	1.8599
${\widehat{\beta }}_{4}$	-4.6355	-4.7574	-4.7703	-4.8790
${\widehat{\beta }}_{5}$	-2.5134	-2.7250	-2.8017	-2.9363
$\widehat{d}$	1.0000	1.0000	0.0002	0.4261
${e}^{\mathrm{\top }}z$	150	130	150	130

Table 9. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.75 and PCDO = 33%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.9438	-1.4133	-1.6228	-1.0026
${\widehat{\beta }}_{2}$	2.9157	3.6282	3.5385	3.9429
${\widehat{\beta }}_{3}$	1.0472	1.3847	1.3156	1.8287
${\widehat{\beta }}_{4}$	-3.7816	-4.7140	-4.6407	-4.8521
${\widehat{\beta }}_{5}$	-2.0051	-2.6021	-2.5162	-2.9221
$\widehat{d}$	1.000	1.000	0.0001	0.4026
${e}^{\mathrm{\top }}z$	150	130	150	130

Table 10. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.95 and PCDO = 33%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-2.8086	-2.0067	-1.7142	-1.0038
${\widehat{\beta }}_{2}$	2.2198	3.0528	3.5466	3.9168
${\widehat{\beta }}_{3}$	0.4293	1.2685	1.1197	1.7505
${\widehat{\beta }}_{4}$	-3.5071	-4.0187	-4.3579	-4.7845
${\widehat{\beta }}_{5}$	-1.7406	-2.1003	-2.5726	-2.8866
$\widehat{d}$	1.000	1.000	0.0008	0.3654
${e}^{\mathrm{\top }}z$	150	118	150	118

Table 11. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.25 and PCDO = 50%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.8514	-1.0015	-1.8709	-1.0010
${\widehat{\beta }}_{2}$	3.9029	3.9678	3.8807	3.9790
${\widehat{\beta }}_{3}$	1.6097	1.9034	1.7422	1.9371
${\widehat{\beta }}_{4}$	-4.1220	-4.9166	-4.2501	-4.9457
${\widehat{\beta }}_{5}$	-2.4590	-2.9561	-2.4737	-2.9714
$\widehat{d}$	1.000	1.000	0.0013	0.0217
${e}^{\mathrm{\top }}z$	150	127	150	127

Table 12. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.50 and PCDO = 50%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.9411	-1.3117	-1.6436	-1.0011
${\widehat{\beta }}_{2}$	3.1654	3.4623	3.2789	3.9763
${\widehat{\beta }}_{3}$	0.8961	1.5868	1.0367	1.9289
${\widehat{\beta }}_{4}$	-3.7103	-4.7022	-4.0454	-4.9386
${\widehat{\beta }}_{5}$	-2.0528	-2.3485	-2.1512	-2.9677
$\widehat{d}$	1.000	1.000	0.0250	0.0203
${e}^{\mathrm{\top }}z$	150	123	150	123

Table 13. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.75 and PCDO = 50%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-1.0021	-1.0023	-1.0011	-1.0012
${\widehat{\beta }}_{2}$	3.9535	3.9485	3.9767	3.9726
${\widehat{\beta }}_{3}$	1.8606	1.8456	1.9300	1.9178
${\widehat{\beta }}_{4}$	-4.8796	-4.8667	-4.9395	-4.9290
${\widehat{\beta }}_{5}$	-2.9366	-2.9298	-2.9682	-2.9626
$\widehat{d}$	1.000	1.000	0.0018	0.0168
${e}^{\mathrm{\top }}z$	150	125	150	125

Table 14. Evaluation of the parameters for the proposed methods with $\gamma$ = 0.95 and PCDO = 50%.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
${\widehat{\beta }}_{1}$	-3.1056	-2.1161	-2.6811	-1.0016
${\widehat{\beta }}_{2}$	3.0766	3.1668	3.0995	3.9690
${\widehat{\beta }}_{3}$	0.3197	1.1703	1.0297	1.9072
${\widehat{\beta }}_{4}$	-2.6802	-3.6548	-3.1793	-4.9165
${\widehat{\beta }}_{5}$	-1.1117	-2.0683	-1.9681	-2.9567
$\widehat{d}$	1.000	1.000	0.0003	0.0205
${e}^{\mathrm{\top }}z$	150	127	150	127

Figure 2 shows the estimates of the nonparametric part of model (4.1) obtained with the proposed methods. In this figure, the nonparametric function is estimated by the kernel method after the linear part of model (4.1) has been estimated by FGLSRE, FGLTSRE, FGLSRLE, and FGLTSRLE, respectively. To save space, results are reported only for n = 150 and $\gamma = 0.95$, with PCDO = 25%, 33%, and 50%. Figure 2 makes it evident that the non-robust methods are badly corrupted by the outliers, especially for large values of PCDO.

Figure 2. Estimation of the nonparametric function under study by kernel method for n = 150, γ = 0.95, PCDO = 25% (low), PCDO = 33% (moderate), and PCDO = 50% (high).

4.2. Real-world data analysis

We analyze the hedonic pricing of housing attributes to motivate linearly restricted estimation in the semiparametric regression model. Lot size has a substantial effect on housing prices. Ho [51] fitted this data set using semiparametric least squares. The data cover 92 detached houses that were sold in the Ottawa region in 1987. The variables are defined as follows: the sale price (SP) is the dependent variable, while the lot size (lot area, LT), square footage of housing (SFH), average neighborhood income (ANI), distance to the highway (DHW), garage availability (GAR), and fireplace (FP) are the independent variables. First, the purely parametric model is fitted:

${\left(SP\right)}_{i} = {\beta }_{0}+{\beta }_{1}{\left(LT\right)}_{i}+{\beta }_{2}{\left(SFH\right)}_{i}+{\beta }_{3}{\left(FP\right)}_{i}+{\beta }_{4}{\left(DHW\right)}_{i}+{\beta }_{5}{\left(GAR\right)}_{i}+{\beta }_{6}{\left(ANI\right)}_{i}+{\varepsilon }_{i}.$

We use added-variable plots to determine the parametric and nonparametric components of the model (see Sheather [52] for more details). Added-variable plots show each predictor's effect on the response after adjusting for the other explanatory variables. Based on the added-variable plots (Figure 3), we treat ANI as the nonparametric component. Therefore, the SRM is specified as follows:

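${\left(SP\right)}_{i} = {\beta }_{1}{\left(LT\right)}_{i}+{\beta }_{2}{\left(SFH\right)}_{i}+{\beta }_{3}{\left(FP\right)}_{i}+{\beta }_{4}{\left(DHW\right)}_{i}+{\beta }_{5}{\left(GAR\right)}_{i}+f\left({\left(ANI\right)}_{i}\right)+{\varepsilon }_{i}. \qquad (4.2)$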

Figure 3. Added-variable plots of individual explanatory variables vs. dependent variable, linear fit (blue solid line), and kernel fit (red dashed line).
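
The quantities behind an added-variable plot can be sketched as follows: the response and the predictor of interest are each regressed on the remaining predictors, and the two residual vectors are plotted against each other. The housing data are not reproduced here, so the example uses simulated data, and the helper names `residualize` and `added_variable_residuals` are hypothetical.

```python
import numpy as np

def residualize(v, Z):
    """Residuals of v after least-squares projection onto the columns of Z."""
    coef, *_ = np.linalg.lstsq(Z, v, rcond=None)
    return v - Z @ coef

def added_variable_residuals(X, y, j):
    """Residual pairs shown in the added-variable plot for column j of X."""
    n = X.shape[0]
    others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
    return residualize(X[:, j], others), residualize(y, others)

# Simulated data (assumption): three predictors and a linear response.
rng = np.random.default_rng(1)
n = 92
X = rng.standard_normal((n, 3))
y = 1.0 + X @ np.array([2.0, -1.0, 0.5]) + 0.3 * rng.standard_normal(n)

e_x, e_y = added_variable_residuals(X, y, j=0)
slope = (e_x @ e_y) / (e_x @ e_x)  # slope of the line through the plot

# The slope coincides with the coefficient of column 0 in the full regression.
full_fit, *_ = np.linalg.lstsq(np.column_stack([np.ones(n), X]), y, rcond=None)
print(np.isclose(slope, full_fit[1]))
```

A residual cloud that follows the fitted line suggests a linear effect, whereas visible curvature, as reported for ANI, suggests moving that variable to the nonparametric part.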

The "mctest" package in R is used to detect multicollinearity in the design matrix. It reports the Farrar-Glauber test together with other standard multicollinearity diagnostics, with the following results.

Overall Multicollinearity Diagnostics
	MC Results	detection
Determinant |X'X|:	0.005618	1
Farrar Chi-Square:	50.8378	1
Red Indicator:	0.2065	0
Sum of Lambda Inverse:	700.2104	1
Theil's Method:	-0.7320	0
Condition Number:	200.4021	1
1 --> COLLINEARITY is detected by the test
0 --> COLLINEARITY is not detected by the test
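
Several of these diagnostics can be approximated directly from the correlation matrix of the regressors. The sketch below uses the textbook Farrar-Glauber statistic $-\left[n-1-(2p+5)/6\right]\ln|R|$ with $p(p-1)/2$ degrees of freedom; the scaling and thresholds used inside "mctest" may differ, so the function `collinearity_diagnostics` is an assumption-laden approximation rather than a reimplementation, and the data are simulated.

```python
import numpy as np

def collinearity_diagnostics(X):
    """Overall collinearity measures computed from the correlation matrix of X."""
    n, p = X.shape
    corr = np.corrcoef(X, rowvar=False)
    eig = np.linalg.eigvalsh(corr)
    det_corr = np.linalg.det(corr)
    return {
        "determinant": det_corr,
        # Farrar-Glauber chi-square statistic and its degrees of freedom
        "farrar_chi2": -(n - 1 - (2 * p + 5) / 6) * np.log(det_corr),
        "farrar_df": p * (p - 1) // 2,
        # Sum of reciprocal eigenvalues (equals the sum of the VIFs)
        "sum_lambda_inverse": float(np.sum(1.0 / eig)),
        # One common definition: square root of the eigenvalue ratio
        "condition_number": float(np.sqrt(eig.max() / eig.min())),
    }

# Simulated collinear regressors (assumption, not the housing data).
rng = np.random.default_rng(2)
n, p, gamma = 92, 5, 0.95
w = rng.standard_normal((n, p + 1))
X = np.sqrt(1.0 - gamma**2) * w[:, :p] + gamma * w[:, [p]]

diag = collinearity_diagnostics(X)
for name, value in diag.items():
    print(f"{name}: {value:.4f}")
```

A determinant near zero, a large chi-square value relative to its degrees of freedom, and a large condition number all point in the same direction as the output above.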

Figure 4 displays the correlation plots for the real data set. The output above and Figure 4 indicate substantial multicollinearity among the explanatory variables: four of the six diagnostics detect it. The proposed estimation techniques are therefore appropriate for this data set.

Figure 4. Visualization of the correlation plots for the explanatory variables of the real data set.

Based on a preliminary analysis of the semiparametric regression model (4.2) using a robust Liu estimator, the restriction $\boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r}$ is specified as follows:

$\boldsymbol{R} = \left(\begin{array}{ccccc} -1 & 0 & -1 & -1 & 1 \\ 1 & 0 & -1 & 2 & 0 \\ 0 & -1 & 0 & -2 & 8 \end{array}\right), \boldsymbol{r} = \left(\begin{array}{l} 0 \\ 0 \\ 0 \end{array}\right)$

Now, the linear hypothesis $\boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r}$ is examined in the framework of the restricted semiparametric regression model (4.2). The test statistic is computed as follows under $\boldsymbol{R}\boldsymbol{\beta } = \boldsymbol{r}$ :

${\chi }_{rank\left(R\right)}^{2} = {\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{r}\right)}^{{\top }}{\left(\boldsymbol{R}{\widehat{\bf{\Sigma }}}_{\widehat{\boldsymbol{\beta }}}{\boldsymbol{R}}^{{\top }}\right)}^{-1}\left(\boldsymbol{R}{\widehat{\boldsymbol{\beta }}}_{FGLS}-\boldsymbol{r}\right) = 0.4781,$

where ${\widehat{\bf{\Sigma }}}_{\widehat{\boldsymbol{\beta }}} = {\widehat{s}}^{2}{\left({\widetilde {\boldsymbol{X}}}^{{\top }}{\boldsymbol{S}}^{-1}\widetilde {\boldsymbol{X}}\right)}^{-1}$ , in which ${\widehat{s}}^{2} = \frac{1}{n-p}{\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLS}\right)}^{{\top }}{\boldsymbol{S}}^{-1}\left(\boldsymbol{y}-\widetilde {\boldsymbol{X}}{\widehat{\boldsymbol{\beta }}}_{FGLS}\right)$ . Since this value is far below 7.815, the 5% critical value of the ${\chi }^{2}$ distribution with $rank\left(\boldsymbol{R}\right) = 3$ degrees of freedom, the restriction is not rejected. Consequently, the restricted estimators are obtained. Table 15 gives a brief evaluation of the proposed estimators. In this table, the values of $\mathrm{M}\widehat{\mathrm{S}}\mathrm{E}$ and ${\mathrm{R}}^{2}$ are reported, where ${\mathrm{R}}^{2} = 1-\frac{\mathrm{R}\mathrm{S}\mathrm{S}}{{\mathrm{S}}_{\mathrm{Y}\mathrm{Y}}}$ is the coefficient of determination of the model, $\mathrm{R}\mathrm{S}\mathrm{S} = {\sum }_{i = 1}^{n}{({y}_{i}-{\widehat{y}}_{i})}^{2}$ is the residual sum of squares, and ${\widehat{y}}_{i} = {x}_{i}^{\mathrm{\top }}\widehat{\beta }+\widehat{f}({t}_{i})$ . Based on these results, FGLTSRLE performs best among the four procedures, with the smallest $\mathrm{M}\widehat{\mathrm{S}}\mathrm{E}$ (335.17) and the largest ${\mathrm{R}}^{2}$ (0.7325).
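
The quadratic form used in the test can be sketched as follows. The restriction matrix is the one stated above; the coefficient vector and covariance matrix are hypothetical stand-ins because the fitted quantities are not listed in the text, and `chi2_sf_3df` is a helper written for this example that uses the closed-form tail of the chi-square distribution with three degrees of freedom.

```python
import math
import numpy as np

def wald_statistic(beta_hat, cov_beta, R, r):
    """Quadratic form (R b - r)' (R Sigma R')^{-1} (R b - r)."""
    diff = R @ beta_hat - r
    return float(diff @ np.linalg.solve(R @ cov_beta @ R.T, diff))

def chi2_sf_3df(x):
    """Upper-tail probability of the chi-square distribution with 3 d.f."""
    return (1.0 - math.erf(math.sqrt(x / 2.0))
            + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0))

R = np.array([[-1.0,  0.0, -1.0, -1.0, 1.0],
              [ 1.0,  0.0, -1.0,  2.0, 0.0],
              [ 0.0, -1.0,  0.0, -2.0, 8.0]])
r = np.zeros(3)
df = int(np.linalg.matrix_rank(R))

# Hypothetical inputs: a vector in the null space of R, then a perturbed one.
_, _, vt = np.linalg.svd(R)
beta_null = vt[-1]
cov_beta = 0.01 * np.eye(5)
stat_null = wald_statistic(beta_null, cov_beta, R, r)
stat_alt = wald_statistic(beta_null + 0.1, cov_beta, R, r)

# Tail probability of the statistic reported in the text.
p_value_reported = chi2_sf_3df(0.4781)
print(df, stat_null, stat_alt, p_value_reported)
```

A coefficient vector that satisfies the restriction exactly gives a statistic of essentially zero, and the reported value of 0.4781 corresponds to a tail probability of roughly 0.92 under three degrees of freedom.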

Table 15. Evaluation of the parameters of the proposed estimators for the real data set.

Coefficients	FGLSRE	FGLTSRE	FGLSRLE	FGLTSRLE
LT	0.7018	1.0509	0.8514	1.1235
SFH	46.7515	33.5686	38.9154	26.3747
FP	3.9311	2.5740	3.5210	1.9568
DHW	−1.6147	−0.7616	−0.9952	−0.4125
GAR	6.2476	4.3865	5.2015	2.9958
${e}^{\mathrm{\top }}z$	92.0000	86.0000	92.0000	86.0000
$\widehat{d}$	1.0000	1.0000	0.1542	0.6741
$\mathrm{M}\widehat{\mathrm{S}}\mathrm{E}$	926.80	456.41	809.59	335.17
R²	0.2346	0.6156	0.3354	0.7325
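
The ${e}^{\mathrm{\top }}z$ row reads naturally as the number of observations retained by the fit: all 92 for the least-squares estimators and 86 for the trimmed ones. A minimal sketch of least trimmed squares with random elemental starts and concentration steps, in the spirit of Rousseeuw and van Driessen [24], is given below. It is an ordinary linear-model illustration on simulated data, not the feasible restricted semiparametric estimator of this paper; the function name `lts_fit` and its tuning arguments are assumptions.

```python
import numpy as np

def lts_fit(X, y, h, n_starts=50, n_csteps=20, seed=0):
    """Approximate LTS: minimise the sum of the h smallest squared residuals."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    best_obj, best_beta, best_z = np.inf, None, None
    for _ in range(n_starts):
        keep = rng.choice(n, size=p, replace=False)  # random elemental subset
        for _ in range(n_csteps):
            beta, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)
            # Concentration step: keep the h best-fitting observations.
            new_keep = np.sort(np.argsort((y - X @ beta) ** 2)[:h])
            if np.array_equal(new_keep, keep):
                break
            keep = new_keep
        beta, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)
        obj = float(np.sum((y[keep] - X[keep] @ beta) ** 2))
        if obj < best_obj:
            z = np.zeros(n, dtype=int)  # z_i = 1 if observation i is retained
            z[keep] = 1
            best_obj, best_beta, best_z = obj, beta, z
    return best_beta, best_z

# Simulated data; n and h only mirror the sizes in Table 15.
rng = np.random.default_rng(4)
n, h = 92, 86
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
beta_true = np.array([1.0, 2.0, -1.0])
y = X @ beta_true + rng.standard_normal(n)
y[:6] += 50.0  # six gross outliers in the response

beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
beta_lts, z = lts_fit(X, y, h)
print("retained:", z.sum())
print("OLS:", np.round(beta_ols, 2))
print("LTS:", np.round(beta_lts, 2))
```

In this simulated setting the trimmed fit discards the contaminated observations, while the least-squares intercept is pulled toward them.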

Following the estimation of the linear component of model (4.2), the estimates of the nonparametric function obtained by kernel smoothing are shown in Figure 5. To estimate the nonparametric effect, we first estimated the parametric effects by one of the proposed methods and then applied the kernel approach to fit ${SP}_{i}-{x}_{i}^{\mathrm{\top }}\widehat{\beta }$ on ${ANI}_{i}$, $i = 1, \dots, n$, for each of the proposed linear estimators, where ${x}_{i}^{\mathrm{\top }} = ({LT}_{i}, {SFH}_{i}, {FP}_{i}, {DHW}_{i}, {GAR}_{i})$ . Table 15 and Figure 5 show that the Liu-type estimators, both robust and non-robust, outperform their non-Liu counterparts in both parametric and nonparametric estimation, owing to the multicollinearity in the design matrix. Furthermore, the robust estimators outperform the non-robust ones in model prediction because the data set contains some outlying observations.
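
The two-step procedure can be sketched as follows. The smoother used here is a Nadaraya-Watson estimator with a Gaussian kernel, which is an assumption and may differ from the kernel estimator used for Figure 5; the data, the bandwidth, and the stand-in coefficient vector `beta_hat` are likewise hypothetical.

```python
import numpy as np

def nadaraya_watson(t_train, resid, t_eval, bandwidth):
    """Gaussian-kernel weighted average of resid at each point of t_eval."""
    u = (t_eval[:, None] - t_train[None, :]) / bandwidth
    weights = np.exp(-0.5 * u**2)
    return (weights @ resid) / weights.sum(axis=1)

# Simulated partially linear data (assumption, not the housing data).
rng = np.random.default_rng(3)
n = 92
t = np.sort(rng.uniform(0.0, 1.0, n))      # plays the role of ANI
X = rng.standard_normal((n, 2))            # plays the role of the linear covariates
beta_hat = np.array([1.0, -0.5])           # stands in for any of the four linear estimates
f_true = np.sin(2.0 * np.pi * t)
y = X @ beta_hat + f_true + 0.1 * rng.standard_normal(n)

# Step 1: remove the fitted linear part. Step 2: smooth the partial residuals.
partial_resid = y - X @ beta_hat
f_hat = nadaraya_watson(t, partial_resid, t, bandwidth=0.05)
print("mean absolute error:", np.mean(np.abs(f_hat - f_true)))
```

Because the smoother is applied to $y - X\widehat{\beta}$, any error in the linear estimate is carried directly into the estimated curve, which is consistent with the non-robust fits in Figure 5 being distorted.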

Figure 5. Estimations for the nonparametric part of model (4.2).

5. Conclusions

In this research, Liu and non-Liu types of the feasible generalized restricted robust estimator are proposed for a semiparametric regression model in which additional linear constraints hold on the linear parameter space and the covariance matrix of the error terms is unknown. We introduced robust Liu estimators for the case of multicollinearity among the columns of the design matrix of a semiparametric regression model together with outliers in the data set. We also introduced new estimators of d obtained by minimizing the mean squared error of the proposed estimators. After deriving the MSEM superiority condition of the feasible generalized least trimmed squares restricted Liu estimator over its non-Liu counterpart, comprehensive Monte-Carlo simulation experiments and a real data analysis were conducted to evaluate the effectiveness of the suggested estimators. The numerical experiments illustrated that the suggested methods can be used effectively to predict the dependent variable of restricted SRMs without being unduly affected by the corruptive impact of multicollinearity or outliers. As a topic for future research, interested authors may derive the asymptotic distribution of the proposed estimator (see [53,54] for more details).

Author contributions

W. B. Altukhaes: Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing-original draft preparation, Writing-review and editing, Visualization, Funding acquisition; M. Roozbeh: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing-original draft preparation, Writing-review and editing, Visualization, Project administration; N. A. Mohamed: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Visualization, Supervision, Funding acquisition. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

The first author would like to thank the Deanship of Scientific Research at Shaqra University for supporting this work. The second author thanks the Research Council of Semnan University for its support. The second and third authors would like to thank the Ministry of Higher Education Malaysia for funding this research through the Fundamental Research Grant Scheme (Project No.: FP072-2023) awarded to Nur Anisah Mohamed and Mahdi Roozbeh.

The authors would like to thank the three anonymous reviewers and handling editor for their valuable comments and corrections to an earlier version of this paper, which significantly improved the quality of our work.

Conflicts of Interest

The authors declare no conflict of interest.

References

[1]	P. Green, C. Jennison, A. Seheult, Analysis of field experiments by least squares smoothing, J. R. Stat. Soc. Ser. B, 47 (1985), 299–315. https://doi.org/10.1111/j.2517-6161.1985.tb01358.x
[2]	R.-F. Engle, C.-W.-J. Granger, J. Rice, A. Weiss, Semiparametric estimates of the relation between weather and electricity sales, J. Am. Stat. Assoc., 81 (1986), 310–320. https://doi.org/10.2307/2289218
[3]	R.-L. Eubank, E.-L. Kambour, J.-T. Kim, K. Klipple, C.-S. Reese, M. Schimek, Estimation in partially linear models, Comput. Stat. Data Anal., 29 (1998), 27–34. https://doi.org/10.1016/S0167-9473(98)00054-1
[4]	P. Speckman, Kernel smoothing in partial linear models, J. R. Stat. Soc. Ser. B, 50 (1988), 413–436. https://doi.org/10.1111/j.2517-6161.1988.tb01738.x
[5]	R.-L. Eubank, Nonparametric Regression and Spline Smoothing, New York: Marcel Dekker, 1999. https://doi.org/10.1201/9781482273144
[6]	D. Ruppert, M.-P. Wand, R.-J. Carroll, Semiparametric Regression, Cambridge: Cambridge University Press, 2003. https://doi.org/10.1017/CBO9780511755453
[7]	W. Härdle, M. Müller, S. Sperlich, A. Werwatz, Nonparametric and Semiparametric Models, Berlin/Heidelberg: Springer, 2004. https://doi.org/10.1007/978-3-642-17146-8
[8]	A. Yatchew, Semiparametric Regression for the Applied Econometrician, Cambridge: Cambridge University Press, 2003. https://doi.org/10.1017/CBO9780511615887
[9]	F. Akdeniz, G. Tabakan, Restricted ridge estimators of the parameters in semiparametric regression model, Commun. Stat. Theory Methods, 38 (2009), 1852–1869. https://doi.org/10.1080/03610920802470109
[10]	D.-E. Akdeniz, W.-K. Hardle, M. Osipenko, Difference based ridge and Liu type estimators in semiparametric regression models, J. Multivar. Anal., 105 (2012), 164–175. https://doi.org/10.1016/j.jmva.2011.08.018
[11]	M. Arashi, T. Valizadeh, Performance of Kibria's methods in partial linear ridge regression model, Stat. Pap., 56 (2015), 231–246. https://doi.org/10.1007/s00362-014-0578-6
[12]	F. Akdeniz, S. Kaçıranlar, On the almost unbiased generalized Liu estimator and unbiased estimation of the bias and MSE, Commun. Stat. Theory Methods, 24 (1995), 1789–1797. https://doi.org/10.1080/03610929508831585
[13]	M.-N. Akram, B.-M.-G. Kibria, M. Arashi, A.-F. Lukman, A new improved Liu estimator for the QSAR model with inverse Gaussian response, Commun. Stat. Simul. Comput., 53 (2024), 1873–1888. https://doi.org/10.1080/03610918.2022.2059088
[14]	S. Kaçıranlar, N. Ozbay, E. Ozkan, H. Guler, Comparison of Liu and two parameter principal component estimator to combat multicollinearity, Concurr. Comput. Pract. Exp., 34 (2022), e6737. https://doi.org/10.1002/cpe.6737
[15]	B. Kan, O. Alpu, B. Yazici, Robust ridge and robust Liu estimator for regression based on the LTS estimator, J. Appl. Stat., 40 (2013), 644–655. https://doi.org/10.1080/02664763.2012.750285
[16]	K.-J. Liu, A new class of biased estimate in linear regression, Commun. Stat. Theory Methods, 22 (1993), 393–402. https://doi.org/10.1080/03610929308831027
[17]	M. Arashi, A.-F. Lukman, Z.-Y. Algamal, Liu regression after random forest for prediction and modeling in high dimension, J. Chemom., 36 (2022), e3393. https://doi.org/10.1002/cem.3393
[18]	M. Arashi, M. Roozbeh, H.-A. Niroumand, A note on Stein-type shrinkage estimator in partial linear models, Statistics, 46 (2012), 673–685. https://doi.org/10.1080/02331888.2011.553682
[19]	M. Roozbeh, S. Babaie-Kafaki, M. Manavi, A heuristic algorithm to combat outliers and multicollinearity in regression model analysis, Iran. J. Num. Anal. Opt., 12 (2022), 173–186. https://doi.org/10.22067/IJNAO.2021.68160.1008
[20]	M. Roozbeh, A. Rouhi, N.-A. Mohamed, F. Jahadi, Generalized support vector regression and symmetry functional regression approaches to model the high-dimensional data, Symmetry, 15 (2023), 1262. https://doi.org/10.3390/sym15061262
[21]	M. Roozbeh, S. Babaie-Kafaki, Z. Aminifard, A nonlinear mixed–integer programming approach for variable selection in linear regression model, Commun. Stat. Simul. Comput., 11 (2023), 5434–5445. https://doi.org/10.1080/03610918.2021.1990323
[22]	P.-J. Rousseeuw, Least median of squares regression, J. Am. Stat. Assoc., 79 (1984), 871–880. https://doi.org/10.2307/2288718
[23]	P.-J. Rousseeuw, A.-M. Leroy, Robust Regression and Outlier Detection, New York: John Wiley, 1987. https://doi.org/10.1002/0471725382
[24]	P.-J. Rousseeuw, K. van Driessen, Computing LTS regression for large data sets, Data Min. Knowl. Discov., 12 (2006), 29–45. https://doi.org/10.1007/s10618-005-0024-4
[25]	M. Amini, M. Roozbeh, Optimal partial ridge estimation in restricted semiparametric regression models, J. Multivar. Anal., 136 (2015), 26–40. https://doi.org/10.1016/j.jmva.2015.01.005
[26]	A. Zellner, An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias, J. Am. Stat. Assoc., 57 (1962), 348–368. https://doi.org/10.2307/2281644
[27]	M. Arashi, B.-M.-G. Kibria, T. Valizadeh, On ridge parameter estimators under stochastic subspace hypothesis, J. Stat. Comput. Simul., 87 (2017), 966–983. https://doi.org/10.1080/00949655.2016.1239104
[28]	M.-H. Karbalaee, M. Arashi, S.-M.-M. Tabatabaey, Performance analysis of the preliminary test estimator with series of stochastic restrictions, Commun. Stat. Theory Methods, 47 (2018), 1–17. https://doi.org/10.1080/03610926.2017.1300275
[29]	M. Roozbeh, G. Hesamian, M.-G. Akbari, Ridge estimation in semi-parametric regression models under the stochastic restriction and correlated elliptically contoured errors, J. Comput. Appl. Math., 378 (2020), 112940. https://doi.org/10.1016/j.cam.2020.112940
[30]	J. Durbin, A note on regression when there is extraneous information about one of the coefficients, J. Am. Stat. Assoc., 48 (1953), 799–808. https://doi.org/10.2307/2281073
[31]	H. Theil, A.-S. Goldberger, On pure and mixed statistical estimation in economics, Int. Econ. Rev., 2 (1961), 65–78. https://doi.org/10.2307/2525589
[32]	H. Theil, On the use of incomplete prior information in regression analysis, J. Am. Stat. Assoc., 58 (1963), 401–411. https://doi.org/10.2307/2283275
[33]	R. Fallah, M. Arashi, S.-M.-M. Tabatabaey, On the ridge regression estimator with sub-space restriction, Commun. Stat. Theory Methods, 46 (2017), 11854–11865. https://doi.org/10.1080/03610926.2017.1285928
[34]	R. Fallah, M. Arashi, S.-M.-M. Tabatabaey, Shrinkage estimation in restricted elliptical regression model, J. Iran. Stat. Soc., 17 (2018), 49–61. https://doi.org/10.29252/jirss.17.1.49
[35]	H. Toutenburg, Prior Information in Linear Models, New York: John Wiley, 1982. https://doi.org/10.2307/2982032
[36]	A.-E. Hoerl, R.-W. Kennard, Ridge regression: Biased estimation for non-orthogonal problems, Technometrics, 12 (1970), 69–82. https://doi.org/10.2307/1271436
[37]	M. Roozbeh, N.-A. Hamzah, Feasible robust estimator in restricted semiparametric regression models based on the LTS approach, Commun. Stat. Simul. Comput., 46 (2017), 7332–7350. https://doi.org/10.1080/03610918.2016.1236954
[38]	F. Akdeniz, M. Roozbeh, Generalized difference-based weighted mixed almost unbiased ridge estimator in partially linear models, Stat. Pap., 60 (2019), 1717–1739. https://doi.org/10.1007/s00362-017-0893-9
[39]	F. Akdeniz, M. Roozbeh, E. Akdeniz, M.-N. Khan, Generalized difference-based weighted mixed almost unbiased Liu estimator in semiparametric regression models, Commun. Stat. Theory Methods, 51 (2022), 4395–4416. https://doi.org/10.1080/03610926.2020.1814340
[40]	B.-M.-G. Kibria, Some Liu and ridge type estimators and their properties under the ill-conditioned Gaussian linear regression model, J. Stat. Comput. Simul., 82 (2012), 1–17. https://doi.org/10.1080/00949655.2010.519705
[41]	K. Månsson, B.-M.-G. Kibria, G. Shukur, A restricted Liu estimator for binary regression models and its application to an applied demand system, J. Appl. Stat., 43 (2016), 1119–1127. https://doi.org/10.1080/02664763.2015.1092110
[42]	K. Månsson, B.-M.-G. Kibria, Estimating the unrestricted and restricted Liu estimators for the Poisson regression model: Method and application, Comput. Econ., 58 (2021), 311–326. https://doi.org/10.1007/s10614-020-10028-y
[43]	P.-J. Rousseeuw, Multivariate estimation with high breakdown point, Math. Stat. Appl., 8 (1985), 283–297.
[44]	A. Alfons, C. Croux, S. Gelper, Sparse least trimmed squares regression for analyzing high-dimensional large data sets, Ann. Appl. Stat., 7 (2013), 226–248. https://doi.org/10.1214/12-AOAS575
[45]	F.-A. Graybill, Matrices with Applications in Statistics, Belmont: Wadsworth, 1983.
[46]	D. Harville, Matrix Algebra from a Statistician's Perspective, New York: Springer Verlag, 1997. https://doi.org/10.1007/b98818
[47]	R. Farebrother, Further results on the mean square error of ridge regression, J. R. Stat. Soc. Ser. B, 38 (1976), 248–250. https://doi.org/10.1111/j.2517-6161.1976.tb01588.x
[48]	G.-C. McDonald, D.-I. Galarneau, A Monte Carlo evaluation of some ridge-type estimators, J. Am. Stat. Assoc., 70 (1975), 407–416. https://doi.org/10.2307/2285832
[49]	D.-G. Gibbons, A simulation study of some ridge estimators, J. Am. Stat. Assoc., 76 (1981), 131–139. https://doi.org/10.2307/2287058
[50]	M.-B. Priestley, M.-T. Chao, Non-Parametric Function Fitting, J. R. Stat. Soc. Ser. B, 34 (1972), 385–392. https://doi.org/10.1111/j.2517-6161.1972.tb00916.x
[51]	M. Ho, Essays on the Housing Market, PhD thesis, University of Toronto, 1995.
[52]	S.-J. Sheather, A Modern Approach to Regression with R, New York: Springer, 2009. https://doi.org/10.1007/978-0-387-09608-7
[53]	P. Cizek, Least trimmed squares in nonlinear regression under dependence, J. Stat. Plann. Inference, 136 (2005), 3967–3988. https://doi.org/10.1016/j.jspi.2005.05.004
[54]	J.-A. Visek, The least trimmed squares part Ⅲ: Asymptotic normality, Kybernetika, 42 (2006), 203–224.

© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)