Research article Topical Sections

Comparison between direct transesterification of microalgae and hydrochar

  • Received: 24 May 2017 Accepted: 11 July 2017 Published: 18 July 2017
  • Hydrothermal carbonization (HTC) of microalgae is one of processes that can effectively remove moisture from microalgae. In addition, the hydrochar retains most of fatty acids from microalgae feedstock, and the content of fatty acids in hydrochar is doubled. This research concentrates on the comparison between direct transesterification of microalgae and hydrochar. The result shows that the biodiesel yields of hydrochar were higher than those of microalgae at the same reaction conditions due to the higher extraction rate of fatty acids from hydrochar. Finally, the amount of methanol and catalyst which is required for a given amount of microalgae can be reduced to a half through the direct transesterification of hydrochar.

    Citation: Vo Thanh Phuoc, Kunio Yoshikawa. Comparison between direct transesterification of microalgae and hydrochar[J]. AIMS Energy, 2017, 5(4): 652-666. doi: 10.3934/energy.2017.4.652

    Related Papers:

    [1] Wei Zhong, Li Mao, Wei Du . A signal quality assessment method for fetal QRS complexes detection. Mathematical Biosciences and Engineering, 2023, 20(5): 7943-7956. doi: 10.3934/mbe.2023344
    [2] Eleonora Sulas, Monica Urru, Roberto Tumbarello, Luigi Raffo, Danilo Pani . Systematic analysis of single- and multi-reference adaptive filters for non-invasive fetal electrocardiography. Mathematical Biosciences and Engineering, 2020, 17(1): 286-308. doi: 10.3934/mbe.2020016
    [3] Iman Mousavian, Mohammad Bagher Shamsollahi, Emad Fatemizadeh . Noninvasive fetal ECG extraction using doubly constrained block-term decomposition. Mathematical Biosciences and Engineering, 2020, 17(1): 144-159. doi: 10.3934/mbe.2020008
    [4] Philip A. Warrick, Emily F. Hamilton . Information theoretic measures of perinatal cardiotocography synchronization. Mathematical Biosciences and Engineering, 2020, 17(3): 2179-2192. doi: 10.3934/mbe.2020116
    [5] Ahsan H. Khandoker, Haitham M. Al-Angari, Andreas Voss, Steffen Schulz, Yoshitaka Kimura . Quantification of maternal-fetal cardiac couplings in normal and abnormal pregnancies applying high resolution joint symbolic dynamics. Mathematical Biosciences and Engineering, 2020, 17(1): 802-813. doi: 10.3934/mbe.2020042
    [6] Chunkai Zhang, Yingyang Chen, Ao Yin, Xuan Wang . Anomaly detection in ECG based on trend symbolic aggregate approximation. Mathematical Biosciences and Engineering, 2019, 16(4): 2154-2167. doi: 10.3934/mbe.2019105
    [7] Xianfeng Xu, Liping Wang, Xiaohong Cheng, Weilin Ke, Shenqiu Jie, Shen Lin, Manlin Lai, Linlin Zhang, Zhenzhou Li . Machine learning-based evaluation of application value of the USM combined with NIPT in the diagnosis of fetal chromosomal abnormalities. Mathematical Biosciences and Engineering, 2022, 19(4): 4260-4276. doi: 10.3934/mbe.2022197
    [8] Selene Tomassini, Annachiara Strazza, Agnese Sbrollini, Ilaria Marcantoni, Micaela Morettini, Sandro Fioretti, Laura Burattini . Wavelet filtering of fetal phonocardiography: A comparative analysis. Mathematical Biosciences and Engineering, 2019, 16(5): 6034-6046. doi: 10.3934/mbe.2019302
    [9] Yajing Zeng, Siyu Yang, Xiongkai Yu, Wenting Lin, Wei Wang, Jijun Tong, Shudong Xia . A multimodal parallel method for left ventricular dysfunction identification based on phonocardiogram and electrocardiogram signals synchronous analysis. Mathematical Biosciences and Engineering, 2022, 19(9): 9612-9635. doi: 10.3934/mbe.2022447
    [10] Hong Gao, Cuiyun Wu, Dunnian Huang, Dahui Zha, Cuiping Zhou . Prediction of fetal weight based on back propagation neural network optimized by genetic algorithm. Mathematical Biosciences and Engineering, 2021, 18(4): 4402-4410. doi: 10.3934/mbe.2021222
  • Hydrothermal carbonization (HTC) of microalgae is one of processes that can effectively remove moisture from microalgae. In addition, the hydrochar retains most of fatty acids from microalgae feedstock, and the content of fatty acids in hydrochar is doubled. This research concentrates on the comparison between direct transesterification of microalgae and hydrochar. The result shows that the biodiesel yields of hydrochar were higher than those of microalgae at the same reaction conditions due to the higher extraction rate of fatty acids from hydrochar. Finally, the amount of methanol and catalyst which is required for a given amount of microalgae can be reduced to a half through the direct transesterification of hydrochar.


    Since the beginning of 2020, the world has been facing a coronavirus pandemic (COVID-19) at a rapid and alarming rate, representing a significant challenge for humanity and a serious threat to life. Although many countries have taken multiple and sometimes harsh measures to limit the spread of the virus and reduce its spread, the eyes of the public are now turning to scientists, doctors, and researchers of all scientific disciplines in the hope of finding a quick and successful treatment for this virus. Many authors have investigated the association of environmental and meteorological factors on the spread of COVID-19, see for example, Tello-Leal, et al. [1], Kodera, et al. [2], Meo, et al. [3], Casado-Aranda, et al. [4], Dogan, et al. [5], Fu, et al. [6] and Yuan, et al. [7].

    McCullagh and Nelder [8] published a book on GLMs that led to their widespread use and appreciations. They extended the scoring method to maximum-likelihood estimation (MLE) in exponential families. Nelder and Pregibon [9] described methods of jointly estimating parameters of both link and variance functions. The iteratively reweighted least squares (IRLS) algorithm is amenable to statistics and measures that are common to all GLMs. Nelder and Wedderburn [10] used the Newton-Raphson process for regression coefficient estimates. Yuan and Bentler [11] reported that the convergence properties of the Fisher scoring algorithm are affected by many factors. One among the observed variables is multicollinearity. If the sample or model implied covariance matrix is close to singular, the Fisher scoring algorithm may have difficulty reaching a set of converged solutions. Nelder and Wedderburn [10] reported that the Newton-Raphson process with expected second derivatives is equivalent to Fisher's scoring technique. Additionally, de Jong and Heller [12] reported that the Newton-Raphson iteration equation leads to a sequence that often rapidly converges. This includes the D statistic along with specific residuals and influence measures. Liao [13] introduced a systematic way of interpreting commonly used probability models: logit, probit, and other GLMs. For recent works of using these models to the field of epidemiology and to the healthcare, see for example, Richardson and Hartman [14], Song, et al. [15], Mohamadou, et al. [16] and Trunfio, et al. [17].

    The inverted exponential (IE) distribution that was introduced by Keller and Kamath [18]. The role of IE distributions is indispensable in many applications of reliability theory for its memoryless property and its constant failure rate. Dey [20] considered IE distribution as life distribution (see Abdel-Aty et al. [19], and Dey [20]). Singh et al. [21] obtained Bayes estimates for parameters of IE distribution by using informative and noninformative priors. They also compared the classical method with the Bayesian through the simulation study.

    The Bayesian approach of the statistical modeling provides an alternative to standard GLMs. Posterior-mode estimation is an alternative to full posterior analysis or posterior mean estimation, which avoids numerical integrations or simulation methods. It was proposed by many authors (for more details, see Fahrmeir and Tutz [22] and Cepeda and Gamerman [23]). Dey et al. [24] described how to conceptualize, perform, and critique traditional GLMs from a Bayesian perspective, and how to use modern computational methods to summarize inferences using simulations. Olsson [25] gave an overview of GLMs and presented practical examples. The exponential family of distributions are discussed with maximum-likelihood estimation and ways of assessing the fit of the model. For the Bayesian estimation in this context, a useful asymmetric loss function known as the LINEX loss function was introduced by Varian (1975) and has been widely used by several authors. A highly used one is the zero-one loss function (for more details, see Sano et al. [26]). Robbins [27] has provided a more robust estimate and for estimating the parameters of prior distribution (hyperparameters), studies on the empirical Bayes (EB) method, Wei [28] has proposed the EB test of the regression coefficient, and working out the EB test decision rule by using kernel estimation of multivariate density function and its first-order partial derivatives, Singh [29] has proposed an EB approach in a multiple linear-regression model, Houston and Woodruff [30] have derived EB estimates using an m-group regression model to regress within-group estimates toward common values. More studies by many authors discussed EB test problems for parameters in a class of linear-regression model and other topics, e.g., Wind [31], Huang [32], Karunamuni [33], Yuan [34], Chen [35], Efron [36], Shao [37], Kim and Nembhard [38], and Jampachaisri et al. [39].

    In order to reduce the influence of outliers on the estimate, some robust measures were proposed in the literature. The common robust estimation method can be divided into several categories: M, MM, median, L1, Msplit, R, S, least-trimmed squares, and sign-constraint robust least squares estimation. Among these, Huber's M estimation has become one of the main robust estimation methods by virtue of its simple calculation and convenience to implement (see Li et al. [40]). The key aspect is the involvement of a loss function that is applied to data errors that was selected to less rapidly increase than the square loss function that is used in least-squares or maximum-likelihood procedures. There exist several well-known families of loss functions, such as Huber, Hampel, and Tukey's biweight (or bisquare) that can be used for the computation of M estimators (see Sinova and Aelst [41]).

    A major contribution of this paper is to propose similarity to GLMs, except that the distribution of the response is not a member of the exponential family using the Bayesian approach. We interest in IE distribution of which the flexible distribution can describe different lifetimes from medicine, reliability, ecology, biological studies, and other areas. We propose the Bayesian and non Bayesian inverted exponential regression models to model and analyze Covid-19-related data with the aim of explaining the relationships between Covid-19 cases and environmental-related variables. The paper is organized as follows: In Section 2, we present an overview of GLMs and propose the IE-Reg model under a log link function. In Section 3, we perform IE-BReg and IE-EBReg under a gamma prior, log link, and two loss functions. We propose Huber's and Tukey's bisquare (biweight) function to improve Bayesian models. We also adopted the iteratively reweighted least squares (IRLS) algorithm to estimate the Bayesian regression coefficients. In Section 4, we apply IE-Reg, IE-BReg, and IE-EBReg models, including an estimation, and use criteria such as AIC, BIC, MSE, and D to the Saudi COVID-19 dataset collected from March 23 to June 21, 2020. Finally, Section 5 draws a succinct conclusion to the findings.

    Nelder and Wedderburn [10] introduced the class of GLMs, defined according to the assumption that y1,y2,...yn are observations of the response variable, with density function yi as follows:

    f(yi;θi)=eθiyiψ(θi)+c(yi),  i=1,2,...,n, (2.1)

    where ψ(), c() are known functions, with θi being the canonical parameter. Link function g(.), related to the regression coefficients, is given by

    g(μi)=ηi=xiβ,  i=1,2,...,n, (2.2)

    where g(μi)=θi, β=(β1,...,βp) is a vector of p unknown regression parameters, xi=(xi1,xi2,...,xip) is a vector of explanatory variables, and ηi is a linear predictor of vectors xi and β. Here, g(.):(0,)R is a link function, which is a monotonic differentiable invertible function. The model given by (2.1) and (2.2) is called the GLM. The GLM class includes, as special cases, linear-regression and analysis-of-variance models, logit and probit models for quantal responses, log-linear models, and multinomial response models for counts (for more details, see McCaullagh and Nelder [42]).

    Consider that the probability density function of IE distribution is as follows (see Abdel-Aty et al. [19]):

    f(y;γ)=γy2eγy;  y>0,  γ>0, (2.3)

    which has no mean and γ is a scale parameter. The median value of the response variable is given by

    ˜μ=γlog(2). (2.4)

    Since mean does not exist, we use the median ˜μ instead of it in the link function (see Das and Dey [43]). The cumulative function of IE distribution is given by

    F(y;γ)=eγy;  y>0.

    Let yi be a random sample from IE, and γi=˜μilog(2), the log-likelihood function based on yi, is given by

    li=l(˜μi|yi)=log(˜μi)+log(log(2)y2i)log(2)˜μiyi,  i=1,2,...,n. (2.5)

    Regression coefficients are estimated using Fisher's scoring technique (for more details, see Nelder and Wedderburn [10], and McCaullagh and Nelder [42]). In order to develop the GLMs for our models, IE-Reg is similar to GLMs, except that the distribution of the response variable is not a member of the exponential family (Ferrari and Cribari-Neto [44]). We also suggest the logarithm (log) link functions of g(.), in view of (2.2) as in the following lemma.

    Lemma 2.1:

    Let the response variable Y have an IE distribution, i=1,2,...,n, and let the link function of the form be

    g(˜μi)=log(˜μi)=ηi=xiβ,  i=1,2,...,n. (2.6)

    Thus, estimated coefficients ˆβ=(ˆβ0,ˆβ1,,ˆβp) using Fisher's scoring technique at the sth iteration based on the IRLS process are given by

    ˆβ(s)=(XX)1XZ,  s=1,2,3,..., (2.7)

    where X is a covariates matrix, ˆβ(0)j is an initial vector, Z=(z1,z2,,zn), and

    zi=pj=1xijˆβ(s1)j+(1˜μ(s1)ilog(2)yi). (2.8)

    The procedure in (2.7) can be repeated until |ˆβ(s)ˆβ(s1)|ε. IE-Reg model in this case is given by

    ˆ˜μ(s)i=eˆβ(s)0+ˆβ(s)1xi1+....+ˆβ(s)pxip. (2.9)

    Proof: See the appendices

    Diaconis and Ylvisker [43] introduced conjugate prior distribution for the exponential family, which, as in (2.1), can be shown as

    π(θi)=k1emμ0θimψ(θi),  i=1,2,...,n, (3.1)

    where k1 is a normalization constant, and m,μ0 are natural parameters. θi values are connected to the regression coefficients by link function ηi=xiβ as

    g(ηi)=θi. (3.2)

    Posterior distribution of θi is given by

    π(θi|yi)=k2e(yi+mμ0)θi(1+m)ψ(θi). (3.3)

    Das and Dey [43] suggested a Jacobian of transformation and rewrote (3.3) with term ηi, as

    π(ηi|yi)=k2e(yi+mμ0)g(ηi)(1+m)ψ(g(ηi))g(ηi)ηi, (3.4)

    where k2 is a normalization constant, and g(ηi)ηi0. They used a zero-one loss function to attain the posterior mode of (3.4) as ^ηi=h(yi); hence, estimated coefficients ˆβ=(ˆβ0,ˆβ1,...,ˆβp) are given by

    ˆβ=(XX)1Xˆη, (3.5)

    where ˆβ is the least-squares estimates, and ˆη=(ˆη1,ˆη2,...,ˆηn) (for more details, see Das and Dey [43], and Das and Dey [45]).

    In order to develop a Bayesian approach, we suggest Bayesian and empirical Bayesian regression models (IE-BReg and IE-EBReg) that are similar to Bayesian GLMs, except that the distribution of the response variable is not a member of the exponential family. We used the general form of the posterior in (3.4), and since g() is a monotonic differentiable function, we then attain posterior Bayes estimates. In addition, we use log link function with zero-one and LINEX loss functions to be appropriate of Bayes estimates. IE-BReg and IE-EBReg estimators correspond to the log link function using different loss functions, as in the following lemmas.

    Lemma 3.1: (IE-BReg and IE-EBReg models based on zero-one loss function)

    Let the response variable Y have an IE distribution, and the link function of the form be as in (2.6). Consider that ˜μ has a gamma prior G(α,λ) with the following density function

    π(˜μ)=λαΓ(α)eλ˜μ˜μα1,  ˜μ>0,λ,α>0. (3.6)

    Thus, the posterior mode of ηi by using zero-one loss function can be derived by solving the following equation:

    ˆηi=log(α+1λ+log(2)yi),  i=1,2,...,n. (3.7)

    Estimated coefficients ˆβ are given as in (3.5). In this case, IE-BReg and IE-EBReg models are given by

    ˆ˜μi=eˆβ0+ˆβ1xi1+....+ˆβpxip. (3.8)

    In the case of IE-EBReg, empirical Bayes estimates for unknown prior distribution parameter λ are given by MLE from the data. Therefore, IE-EBReg estimates are found by placing these estimated prior distribution parameter into Equation (3.7) by ˆλ.

    Proof: See the appendices

    Lemma 3.2

    Let the response variable Y have an IE distribution, and the link function of the form be as in (2.6). Consider that ˜μ has a gamma prior G(α,λ) with the density function as in (3.6). Using Jacobian transformation from ˜μi to ηi, the posterior function of ηi can be written as in the following equation

    π(ηi|yi)e(1+α)ηieeηi(λ+log(2)yi).

    Thus, the scale parameter of the prior G(α,λ) is less than or equal one (λ1) and its variance (σ2) is greater than or equal the mean (μ).

    Proof: See the appendices

    Lemma 3.3: (IE-BReg and IE-EBReg models based on LINEX loss function)

    Let the response variable Y have an IE distribution, and let the link function of the form be as given in (2.6). Consider that ˜μ has a gamma prior with density function as given in (3.6). As a result, the posterior Bayes estimates of ηi, by using the LINEX loss function, can be derived as follows

    ˆηi=1αlog(λαlog(2)y2iΓ(α)(λ+log(2)yi)),  i=1,2,...,n. (3.9)

    The estimated coefficients ˆβ and the IE-BReg and IE-EBReg models and their estimates in this case are given as in (3.5) and (3.8), respectively. In the caes of IE-EBReg model, the regression coefficients estimates is found by placing these estimated prior distribution parameter into Equation (3.9) by ˆλ.

    Proof: See the appendices

    Lemma 3.4:

    Let the response variable Y have an IE distribution, and the link function of the form be as in (2.6). Consider that ˜μ has a gamma prior G(α,λ) with the density function as in (3.6). Using Jacobian transformation from ˜μi to ηi, the posterior function of ηi can be written as in the following equation

    π(ηi|yi)[g(ηi)]αeg(ηi)(λ+log(2)yi)g(ηi)ηi. (3.10)

    Thus, the scale parameter of the prior G(α,λ) is greater than or equal one (λ1) and its variance (σ2) is less than or equal the mean (μ).

    Proof: See the appendices

    M-estimation is considered to be the most common method of robust regression. It was proposed by Huber [46] in the presence of outliers, and it is more efficient than ordinary least squares (OLS) (Rousseeuw and Leroy [47], and Chang, et al. [48]). The Huber's function takes the following form (Huber [46,49]):

    ρ(r)={r22,|r|k,k(|r|k2),|r|>k, (3.11)

    where k is the tuning constant, r is the residual corresponding to the observation in OLS, and ρ() is the objective function that satisfies certain properties. Often, ρ() can be formed by using a linear combination of the residuals. Defining function rρ(r) and the corresponding weight function in this case is as follows:

    ψ(r)r=w(r)={1,|r|k,k|r|,|r|>k. (3.12)

    Another M-estimation function is the Tukey bisquare's (biweight) function. This is based on Tukey's function, taking the form of that in Sinova and Van Aelst [41]

    ρ(r)={k26(1[1(rk)2]3),|r|k,k26,|r|>k, (3.13)

    where k is the tuning constant and r is the residual corresponding to the observation in OLS. Defining function rρ(r)=ψ(r) and the corresponding weight function in this case is given as follows:

    ψ(r)r=w(r)={[1(rk)2]2,|r|k,0,|r|>k. (3.14)

    To make the IE-BReg and IE-EBReg models are robust, we suggest the Huber's and biweight functions for these models based on an adopted IRLS algorithm. There are also many other versions of the M-estimation function that could be used here.

    Lemma 3.5: (IE-BReg and IE-EBReg models based on M-estimation functions)

    Let the response variable Y have an IE distribution, and let the link function of the form be as given in (2.6). Consider that ˜μ has a gamma prior with density function is given as in (3.6). Using the Jacobian transformation from ˜μi to ηi and using the log link function, we have the posterior distribution of ηi is given as in (3.10). Thus, the estimated coefficients ˆβ=(ˆβ0,ˆβ1,...,ˆβp) are given as

    ˆβ(s)=(XW(ˆβ(s1))X)1XW(ˆβ(s1))ˆη,  s=1,2,3,, (3.15)

    where ˆηi=h(y) and ˆη=(ˆη1,ˆη2,...,ˆηn) are the posterior Bayes estimates of ηi using the zero-one or LINEX loss functions, and W=diag(w1,w2,,wn), wi are the selected weights depending on M-estimation functions. In this case, coefficients are estimated using the adopted IRLS Algorithm.

    An adopted algorithm based on IRLS and M-estimation is employed as follows:

    Equation (3.15) is solved using an adopted algorithm on the basis of the standard IRLS algorithm (for more details, see Maronna et al. [50], Wen and Liu [51], and Kikuchi et al. [52]). This algorithm is proposed for the solution of the IE-BReg and IE-EBReg estimates, that is employed in the following steps:

    (i) Setting the iteration counter at q=0, finding an initial estimates of regression coefficients ˆβ(q)j, j=0,1,2,...,p1 using IE-Reg estimates.

    (ii) The initial residuals r(q)(i)=Yie(Xiˆβ(q)j) are based on the log link function that is given as in (2.6), and calculate an initial scale estimate s(q)=1.4826(median|r(q)i|).

    (iii) An initial standardized residuals u(q)i=r(q)is(q) are calculated and used to calculate initial estimates for the weight function. Preliminary weights are w(q)i=w(u(q)i).

    (iv) Calculate ˆλ as the prior distribution π(˜μ) is G(α,λ) in the case of IE-EBReg using MLE estimates, or the scale parameter λ is known in the case of IE-BReg model.

    (v) Calculate Bayes estimates ˆηi=h(y);i=1,...,n using the prior G(α,λ) and zero-one or LINEX loss functions.

    (vi) Using weights from Steps i-iii and Steps iv and v to find estimators in (3.15).

    (vii) Set q=q+1; then, go to Step ii. Steps (ii) to (vii) are repeated until the estimate of ˆβ(q) is stabilized from the previous iteration, which means: |ˆβ(q+1)ˆβ(q)|ε.

    Under regularity conditions, estimator ˆβ has asymptotically normal distribution ˆβN(β,(XWX)1) (see Houston and Woodruff [30], Tellinghuisen [53]).

    In this section, we apply the IE-Reg, IE-BReg, and IE-EBReg models for the daily confirmed COVID-19 cases in Saudi Arabia. The relevant dataset in this application is COVID-19 data from Saudi Arabia in 2020. These data contain 91 observations from March 23-June 21, 2020 in which the response variable Y is the daily confirmed COVID-19 cases in Saudi Arabia. Explanatory variables are: X1; daily recovered COVID-19 cases, X2, daily critical COVID-19 cases; X3, daily active COVID-19 cases; X4, tests per million (PCR tests); X5, curfew hours per day; X6, maximal temperature in Celsius per day; X7, maximal relative humidity (%); X8, maximal wind speed in miles per hour (mph); and X9, maximal pressure in hectopascal (hPa). In the case of variables Xi,i=6,...,9 are the average for the cities of Riyadh, Jeddah, and Dammam that have had the highest number of confirmed and death cases. This dataset was taken from the Ministry of Health of Saudi Arabia (COVID-19 dashboard) and the Ministry of Health's Twitter account.

    Lemmas in Sections 2 and 3 were applied to these data. IE-Reg, IE-BReg, and IE-EBReg models based on log link and loss functions were used. Bayes coefficients were obtained using a gamma prior G(α,λ) to a known shape α and an unknown scale λ parameter. We used generated data from a gamma prior distribution to estimate λ in the case of IE-BReg model and use our data in the case of IE-EBReg model. We also compared the performance of all these models. Different plots, such as the quantile-quantile (Q-Q) plot, the empirical cumulative distribution function (ECDF), and box plot, were proposed to aid in distributional assessment and identify outliers. In addition, Huber's and biweight functions are suggested in the case of Lemma (3.5) to avoid such distortions due to outliers (for more details, see Sinova and Van Aelst [41]). The backward-selection method is used in the IE-EBReg model to remove the input variable (see Table 3). Modeling performance is measured in terms of some criteria, such as AIC, BIC, D, D/df (divided by its degrees of freedom), and mean square error (MSE) (de Jong and Heller [12]). We also used Thiel's inequality coefficient (TIC) to compare the prediction accuracy of the selected models (Leuthold [54], and Niu et al. [55]). To check the adequacy for the selected models, we consider the deviance residuals (see McCaullagh and Nelder [42]). The predictive results of these models and other numerical results are shown in Tables 1-8. R software was used to carry out calculations. In order to compare with known distributions, the glm() function in "stats" was used to fit the GLMs (Faraway [56]). Functions qqPlot(), ecdf(), boxplot, and ks.test() in R package "stats" were used for the assessment distributions (Fox and Weisberg [57]).

    Table 1.  Efficiency of Gamma, inverse Gaussian, Inverted exponential(IE-Reg), IE-Bayesian regression (IE-BReg), and IE-empirical Bayesian regression (IE-EBReg) models.
    The Model Prior IE-Reg IE-BReg
    G(2.610,1.389)
    IE-EBReg Inverse Gaussian Gamma
    Loss and Weight function LINEX and Huber LINEX and biweight
    AIC 35.339 64.867 65.837 181.256 147.107
    D 4.666 30.194 31.163 25.093 18.392
    D/df 0.058 0.373 0.385 0.302 0.222
    MSE 3.880 0.201 0.188 5935513 1.303

     | Show Table
    DownLoad: CSV
    Table 2.  AIC, BIC, D and MSE of IE-Reg, IE-BReg, and IE-EBReg models.
    Step Model Prior Loss Function AIC BIC D MSE
    Step 1 IE-Reg 39.339 64.448 4.666 3.880
    IE-BReg-Huber G(0.893,0.641) zero–one 65.700 90.808 31.026 0.560
    IE-BReg-biweight 66.054 91.163 31.381 0.486
    IE-BReg 66.059 91.167 31.385 0.486
    IE-BReg-Huber G(2.610,1.393) LINEX 67.559 89.917 30.135 0.201
    IE-BReg-biweight 65.773 90.882 31.100 0.188
    IE-BReg 65.781 90.890 31.108 0.188
    IE-EBReg-Huber G(0.893,0.642) zero–one 65.736 90.845 31.063 0.561
    IE-EBReg-biweight 66.087 91.195 31.413 0.487
    IE-EBReg 66.092 91.200 31.418 0.487
    IE-EBReg-Huber G(2.610,1.389) LINEX 64.867 89.976 30.194 0.201
    IE-EBReg-biweight 65.837 90.945 31.163 0.188
    IE-EBReg 65.845 90.954 31.171 0.188
    Step 2 IE-Reg 37.381 59.979 4.707 3.887
    IE-BReg-Huber G(0.844,0.616) zero–one 62.673 85.271 29.999 0.554
    IE-BReg-biweight 63.082 85.680 30.408 0.473
    IE-BReg 63.093 85.691 30.419 0.472
    IE-BReg-Huber G(2.350,1.672) LINEX 54.783 79.891 20.109 0.262
    IE-BReg-biweight 54.128 76.725 21.454 0.237
    IE-BReg 54.155 76.752 21.481 0.237
    IE-EBReg-Huber G(0.844,0.614) zero–one 62.600 85.198 29.926 0.554
    IE-EBReg-biwight 63.027 85.625 30.353 0.471
    IE-EBReg 63.038 85.636 30.364 0.469
    IE-EBReg-Huber G(2.350,1.621) LINEX 52.522 75.120 19.848 0.261
    IE-EBReg-biweight 54.151 76.749 21.477 0.230
    IE-EBReg 54.186 76.784 21.512 0.230
    Step 3 IE-Reg 35.621 55.708 4.947 4.015
    IE-BReg-Huber G(0.820,0.593) zero–one 59.890 79.977 29.217 0.539
    IE-BReg-biweight 60.450 80.537 29.777 0.445
    IE-BReg 60.453 80.540 29.780 0.445
    IE-BReg-Huber G(2.610,1.451) LINEX 60.320 80.407 29.647 0.205
    IE-BReg-biweight 60.970 81.057 30.296 0.196
    IE-BReg 60.973 81.060 30.299 0.196
    IE-EBReg-Huber G(0.820,0.607) zero–one 60.561 80.648 29.887 0.537
    IE-EBReg-biweight 60.970 81.057 30.297 0.465
    IE-EBReg 60.974 81.061 30.300 0.465
    IE-EBReg-Huber G(2.610,1.420) LINEX 60.493 80.580 29.819 0.200
    IE-EBReg-biweight 61.472 81.559 30.799 0.192
    IE-EBReg 61.476 81.563 30.803 0.192

     | Show Table
    DownLoad: CSV
    Table 3.  IE-EBReg model based on LINEX, Huber's and biweight functions.
    Step Variables ˆβ
    Huber
    z-Statistics P-value ˆβ
    biweight
    z-Statistics P-value
    Step 1 Intercept -31.068 -3.790 0.0002 -27.730 -3.457 0.0005
    X1 0.460 16.211 0.0000 0.461 16.673 0.0000
    X2 0.258 2.107 0.0351 0.292 2.401 0.0163
    X3 0.419 14.949 0.0000 0.420 15.404 0.0000
    X4 0.757 4.329 0.0000 0.648 3.876 0.0001
    X5 23.821 5.981 0.0000 23.409 5.973 0.0000
    X6 32.640 5.503 0.0000 32.170 5.513 0.0000
    X7 14.379 10.368 0.0000 13.551 9.962 0.0000
    X8 0.211 0.109 0.9132 -0.668 -0.349 0.7271
    X9 28.421 3.486 0.0004 25.170 3.154 0.0016
    Step 2 Intercept -36.839 -4.597 0.0000 -32.435 -4.136 0.0000
    X1 0.542 18.750 0.0000 0.523 19.327 0.0000
    X2 0.263 2.147 0.0318 0.336 2.763 0.006
    X3 0.495 17.643 0.0000 0.477 18.048 0.0000
    X4 0.827 4.687 0.0000 0.729 4.430 0.0000
    X5 25.401 6.370 0.0000 26.728 6.819 0.0000
    X6 36.826 6.307 0.0000 37.064 6.467 0.0000
    X7 16.828 12.124 0.0000 15.534 11.456 0.0000
    X9 33.688 4.219 0.0000 29.331 3.752 0.0002
    Step 3 Intercept -30.825 -3.867 0.0001 -27.535 -3.518 0.0004
    X1 0.456 16.450 0.0000 0.457 16.997 0.0000
    X3 0.415 15.304 0.0000 0.415 15.846 0.0000
    X4 0.790 4.599 0.0000 0.675 4.132 0.0000
    X5 21.823 5.600 0.0000 22.012 5.692 0.0000
    X6 33.856 5.845 0.0000 33.815 5.925 0.0000
    X7 14.402 10.453 0.0000 13.687 10.102 0.0000
    X9 28.142 3.543 0.0004 24.888 3.190 0.0014

     | Show Table
    DownLoad: CSV
    Table 4.  Fitting results of IE-EBReg ˆyi based on LINEX, Huber's, and biweight functions (during fitting interval).
    Date yi ˆyi
    Huber
    |yiˆyi| Relative error % ˆyi
    biweight
    |yiˆyi| Relative error %
    4/26/2020 1223 1348 125 10.25 1351 128 10.439
    4/27/2020 1289 1172 117 9.050 1177 112 8.697
    4/28/2020 1266 1.032 234 18.46 1026 240 18.959
    4/29/2020 1325 1257 68 5.11 1249 76 5.724
    4/30/2020 1351 1.223 128 9.470 1222 129 9.542
    5/1/2020 1344 1357 13 0.994 1353 9 0.652
    5/2/2020 1362 1514 152 11.15 1491 129 9.490
    5/3/2020 1552 1582 30 1.94 1566 14 0.890
    5/5/2020 1645 1364 281 17.07 1357 288 17.495

     | Show Table
    DownLoad: CSV
    Table 5.  Predicted results of IE-EBReg ˆyi based on LINEX, Huber's, and biweight functions with seven variables (out of fitting interval).
    Date yi ˆyi
    Huber
    |yiˆyi| R.E. % TIC ˆyi
    biweight
    |yiˆyi| R.E % TIC
    6/22/2020 3393 3340 53 1.55 0.0612 3165 228 6.731 0.0529
    6/23/2020 3139 2789 350 11.16 2693 446 14.205
    6/24/2020 3123 2627 496 15.87 2519 604 19.355
    6/25/2020 3372 4000 628 18.63 3765 393 11.658
    6/26/2020 3938 4272 334 8.48 4012 73.772 1.873
    6/27/2020 3927 4448 521 13.28 4107 180.124 4.587
    R.E.: Relative Error

     | Show Table
    DownLoad: CSV
    Table 6.  AIC, D, MSE and Cox Stuart test for the deviance residuals of IE-EBReg model based on biweight function.
    Model Cases Prior AIC D MSE Cox Stuart test
    p-value
    IE-BReg 1 G(2.6352,1.3613) 63.4138 32.7401 0.1870 0.0001
    2 64.4422 27.8167 0.1727 0.0001
    3 61.1159 28.6864 0.1727 0.0001
    4 58.9767 24.6503 0.1700 0.0001
    5 75.1276 15.6250 0.1617 0.0003
    IE-EBReg 1 G(2.6352,1.3574) 63.5012 32.8276 0.1869 0.0001
    2 64.5095 27.8839 0.1726 0.0001
    3 61.1903 28.7608 0.1725 0.0001
    4 59.0385 24.7120 0.1698 0.0001
    5 75.1385 15.6359 0.1612 0.0003
    IE-BReg 1 G(2.3537,1.4910) 52.99269 22.3190 0.2354 0.0161
    2 55.6927 19.0672 0.2090 0.0008
    3 51.8461 19.4166 0.2086 0.0008
    4 50.6692 16.3428 0.2015 0.0025
    5 70.3465 10.8439 0.1787 0.0079
    IE-EBReg 1 G(2.3537,1.4928) 52.9776 22.3039 0.2352 0.0161
    2 55.6846 19.0591 0.2089 0.0161
    3 51.8356 19.4061 0.2085 0.0066
    4 50.6634 16.3369 0.2014 0.0025
    5 70.3563 10.8536 0.1787 0.0079
    Case 1: using the original data; n=91.
    Case 2: using the data after removing one observation (i=1); n=90.
    Case 3: using the data after replaced one observation by the mean; n=91.
    Case 4: using the data after replaced the observation (i=1, 4, 5, 6, 7, 9) by the mean; n=91.
    Case 5: using the data after removing the observations (i=1, 4, 5, 6, 7, 9); n=85.

     | Show Table
    DownLoad: CSV
    Table 7.  The Relative errors % of predicted and fitting results of the IE-EBReg based on prior G(2.3537,1.4928).
    Date yi Case 1 Case 2 Case 3 Case 4 Case 5
    Fitting results 4/26/2020 1.223 6.1574 4.0055 4.3964 5.5198 9.0944
    4/27/2020 1289 13.6654 13.5397 13.6369 12.5813 7.6865
    4/28/2020 1266 24.5773 22.1173 22.6232 22.1945 18.4163
    4/29/2020 1325 10.1971 9.4581 9.5203 9.2789 7.7580
    4/30/2020 1351 14.0620 13.5364 13.7209 12.7740 8.0207
    5/1/2020 1344 3.2130 3.7411 3.7135 2.7615 1.3526
    5/2/2020 1362 6.5542 6.0133 6.1913 6.3622 6.9241
    5/3/2020 1552 1.2706 1.6513 1.5618 1.1365 0.5708
    5/4/2020 1645 20.6544 19.1781 19.3377 19.3725 18.7971
    5/5/2020 1595 23.7866 22.1910 22.2850 22.4925 22.7538
    Predicted results 6/22/2020 3.393 0.8852 8.1235 8.1629 8.3268 7.4896
    6/23/2020 3139 10.5945 16.4515 16.4037 16.2531 14.3989
    6/24/2020 3123 16.6312 23.2819 23.0829 23.2841 23.6959
    6/25/2020 3372 21.0604 8.9519 9.4537 8.5929 5.4076
    6/26/2020 3938 11.3371 2.3426 2.5350 1.4769 2.2651
    6/27/2020 3927 14.6806 4.6155 4.7346 3.3727 1.3583
    TIC statistics 0.0900 0.0594 0.0594 0.0586 0.0557

     | Show Table
    DownLoad: CSV
    Table 8.  IE-EBReg model based on LINEX, biweight functions (Case 4).
    Variables ˆβ Z-statistics Standard Error SE p-value AIC D MSE
    Intercept -27.6574 -3.5193 7.8588 0.0004 50.6634 16.3369 0.2014
    X1 0.5103 20.0674 0.0254 0.0000
    X3 0.4660 19.3073 0.0241 0.0000
    X4 0.6548 4.6898 0.1396 0.0000
    X5 26.5458 6.9168 3.8379 0.0000
    X6 32.7134 5.9260 5.5203 0.0000
    X7 14.2477 10.5226 1.3540 0.0000
    X9 24.8469 3.1705 7.8369 0.0015

     | Show Table
    DownLoad: CSV

    K-S test is used to recommend the IE distribution as a good fit for the daily confirmed COVID-19 cases in Saudi Arabia comparing with some other distributions as given below:

    Table .  K-S test of daily confirmed COVID-19 cases in Saudi Arabia using different distributions.
    DistributionGaussianExponentialGammaInverted ExponentialInverse Gaussian
    p-value0.00000.00000.1410.1690.016

     | Show Table
    DownLoad: CSV

    Based on the results obtained from the above table, the large p-value for the test indicated that IE distribution fit the response variable in the given data quite well. Figure 1 provides the Q-Q plot, ECDF, and fitted functions of the selected models, and it is clear that IE distribution fits these data well.

    Figure 1.  (a) Q-Q plots of the daily confirmed COVID-19 cases in Saudi Arabia based on IE (b) ECDF plot based on different distributions.

    Figure 2 presents box plots corresponding to each of the Saudi Arabia COVID-19 dataset variables, and the chart also maps outliers that exceeded the values of fences. The plot also displays the maximum, minimum, and median of the data, along with the first and third quantile. The outliers could also be identified as in Figure 2 (shown as unfilled circles) in explanatory variable X2, and we can observe a big difference between the maximal value and the rest of the observation, which is beyond the outer fence. This case could be considered a very extreme outlier. The chart also shows more outliers on variables X3, X5, and X8. For the X2 and X3 box plots, variables having outliers were much higher than the third quartile or much lower than the first quartile of the box was. Possible reasons for this deviation may be when daily active or critical cases exceeded the recovered cases, or daily recovered cases exceeded active or critical cases. There is no outlier in the daily-confirmed-cases variable in the context of box-plot analysis, and this variable was slightly asymmetrically distributed with a relatively heavy tail.

    Figure 2.  Box plots of the variables of Saudi Arabia COVID-19 dataset.

    Table 1 presents that the MSE value of the IE-Reg, gamma models was large, while the MSE values of the inverse Gaussian was very large. Table 2 shows that the MSE value of the IE-Reg model was large, while the MSE values of the IE-BReg and IE-EBReg models were small. In the case of IE-Reg model, ˆβ(s) was stabilized when Fisher's scoring procedure converged at s=15 because of |ˆβ(15)ˆβ(14)|<0. In the case of the IE-BReg and IE-EBReg models, the fitting results based on Huber's function had the largest MSE values compare to the MSE values based on biweight function. The results in this table also show that the MSE values based on biweight function and without use any weight function were almost similar, but fitting results based on biweight function were best in terms of AIC, and D statistics. Comparing between Bayesian fitting results, we observed that the results based on LINEX loss function were better than those of the zero-one loss function. Gamma prior variance was larger than the mean in the case of used the zero-one loss function, while gamma prior variance was less than the mean in the case of used the LINEX loss function.

    Table 2 also shows that the IE-EBReg model based on gamma prior G(2.610,1.420) and biweight function was the best on our dataset, since D and D/df (D/df=0.380),df=83 were acceptable with a low MSE compared to that of the other models. Tables 1 and 2 show that the D/df of this model was very close to 1, indicating that the fitting degree was very good. The value of z statistic was large, so there was also a significant relationship among the variables.

    Through this comparison, we can conclude that the predictive model was IE-EBReg based on biweight and LINEX functions, and it is given as follows:

    ˆY=e27.535+0.457x1+0.415x3+0.675x4+21.012x5+33.815x6+13.687x7+24.888x9. (4.1)

    In the side of the adopted IRLS procedure based on M-estimation, ˆβ(s) is stabilized and it converged at s=15 because of |ˆβ(15)ˆβ(14)|<4.2e06. The performance of the selected model was indicative that the adopted algorithm works well. The actual and predicted for the IE-EBReg model compared to IE-Reg are represented graphically in Figure 3. Figure 4 show that the deviance residuals against the indices of the observations suggest that the residuals for the IE-EBReg model are randomly scattered around zero. From Table 2 and 3, we can conclude that, for the selected model among possible IE-EBReg models based on the smallest AIC = 54.151 and lowest MSE = 0.230 with eight variables, this fitting result was the best when using level of significance α=0.05.

    Figure 3.  Fitting of IE-Reg and IE-EBReg with 7 variables as in Table 2.
    Figure 4.  Plot of deviance residuals for the IE-EBReg based on biweight and LINEX as in Table 2.

    Table 4 presents the fitting results for the IE-EBReg model based on biweight function that is the best based on relative errors. For the prediction results, Table 5 shows that the IE-EBReg model based on Huber's function compared to that based on biweight function had the worst prediction accuracy because the TIC value was closer to 1, while the TIC value for the IE-EBReg model based on biweight function was closer to 0.

    According to the residuals results of the IE-BReg and IE-EBReg models, it can be found that the errors of the first data are too large, which seriously affects of results, see Table 6 (Case 1). To make a precision comparison, we removed one observation form data i=1 that have large error and we reestimated the model after removing from the data (Case 2). Furthermore, we replaced y1 with the mean of observation from i=2 to i=11 and we reestimated the model after modifying the data (Case 3). We also replaced yi,i=1,4,5,6,7,9 with the mean of observation from i=2 to i=11 and we reestimated the model after modifying the data (Case 4). However, we removed the observations i=1,4,5,6,7,9 and we reestimated the model after removing observations (Case 5). The relative changes in the parameter estimates are presented in Table 6. The the deviance residuals against the indices of the observations suggest that the residuals are randomly scattered around zero at level of significance α=0.001 (Cox Stuart test) for the IE-BReg and IE-EBReg models based on the priors G(2.3537,1.410) and G(2.3537,1.493).

    Table 6 and 7 show that, the IE-EBReg model (Case 4) is the best for our data, and it is given as follows:

    ˆY=e27.657+0.510x1+0.466x3+0.655x4+26.546x5+32.713x6+14.248x7+24.847x9. (4.2)

    In the side of the adopted IRLS procedure based on M-estimation, ˆβ(s) is stabilized and it converged at s=15 because of |ˆβ(15)ˆβ(14)|<0. The actual and predicted for the IE-EBReg model compared to IE-Reg model is represented graphically in Figure 5. From Table 8, we can conclude that, this model has smallest AIC = 50.6634 and a low MSE = 0.2014, and there was also a significant relationship among the variables when using level of significance α=0.05. Figure 6 show that the deviance residuals against the indices of the observations suggest that the residuals for the IE-EBReg model (Case 4) are randomly scattered around zero. In comparison between IE-EBReg models that were shown in (4.1) and (4.2), and based on the results on Table 2 and 6, we can conclude that the IE-EBReg model in (4.2) is the best for our data.

    Figure 5.  Fitting of IE-Reg and IE-EBReg models with 7 variables as in Table 6.
    Figure 6.  Plot of deviance residuals for the IE-EBReg model (Case 4) as in Table 6.

    Figures 3 shows that the IE-EBReg model based on biweight and LINEX loss function generally fits the dataset better than other models. This also clear from Figure 4 as the plot of the deviance residuals against the indices of the observations suggest that the residuals are randomly scattered around zero at level of significant 0.001.

    Figure 5, shows that the IE-EBReg model (Case 4) which based on biweight and LINEX function fits the dataset better. Also, Figure 6 shows the deviance residuals against the indices of the observations suggest that the residuals are randomly scattered around zero at level of significant 0.001.

    In this paper the regression models IE-Reg, IE-BReg and IE-EBReg for modeling the daily confirmed Saudi COVID-19 cases with some envionmental-related variables (covariates) are considered. Zero-one and LINEX loss functions were used to attain the Bayesian and empirical Bayesian estimates based on the log-link function. In a non-Bayesian approach, parameter estimation is done by the Fisher scoring technique, and closed-form expressions are provided for the score function, and for Fisher's information matrix and its inverse.In the Bayesian approach, parameter estimation is performed using a gamma prior distribution, Jacobian transformation, and least-squares estimates. The IE-Reg, IE-BReg, and IE-EBReg models were compared to find which model predicted better. To deal with outlier problems, IE-BReg based on Huber's and biweight functions, and the adopted algorithm based on IRLS to find the estimates, were proposed. For distributional assessment, Q-Q, ECDF, box plots, and the KS test were applied. Some criteria, namely, AIC, BIC, D, D/df, and MSE, were also computed for all regression models.

    According to the results of the application, it was concluded that IE-BReg and IE-EBReg with a log link function performed the best in terms of the AIC, BIC, D, D/df, and MSE statistics, so they are recommended for these data. In contrast, IE-Reg showed poor results compared with those of the other models. Results indicated that the IE-EBReg model is highly capable of improving the performance of regression models to a greater extent in the prediction of daily confirmed COVID-19 cases in Saudi Arabia. Finally, it is found the following regressors are significant for the model: Explanatory variables are: X1, daily recovered COVID-19 cases; X3, daily active COVID-19 cases; X4, tests per million (PCR tests); X5, curfew hours per day; X6, maximal temperature in Celsius per day; X7, maximal relative humidity (%); and X9, maximal pressure in hectopascal (hPa).

    The authors would like to thank the editor and referees for their helpful comments, which improved the presentation of the paper. Also, the authors would like to extend their sincere appreciation to the Deanship of Scientific Research at King Saud University for its funding this Research Group (RG -1435-056).

    The authors declares that they have no conflicts of interest.

    The data is freely available at: https://covid15.moh.gov.sa/ https://twitter.com/SaudiMOH?s=20/ https://www.worldometers.info/coronavirus/ https://www.wunderground.com/.

    Suppose that, in yif(yi;γ), as in (2.3), the log-likelihood function based on yi,  i=1,2,...,n is given as in (2.5). Link function log(˜μi) connecting the ˜μi with linear model xiβ in this case is given as in (2.6). Score function Ur for log likelihood is written from one observation as

    Ur(β)=l(β)βr=ni=1li˜μi˜μiηiηiβr,  r=1,2,...,p. (5.1)

    From (2.5) and (2.6), we have

    Ur(β)=ni(1˜μilog(2)yi)˜μixir, (5.2)

    which can be written in matrix notation as

    U(ˆβ)=XtQ(y,˜μ(ˆβ)). (5.3)

    Taking the second derivatives of l(β), we have

    Ur(β)βj=2l(β)βjβr=ni(log(2)yi˜μiβj)xir,  j=1,2,...,p;

    hence,

    E(Ur(β)βj)=ni=1E(1yi)˜μiβjlog(2)xij.

    Since E(1yi)=1˜μilog(2) and ˜μiβj=xij˜μi, then Irj=E(Ur(β)βj)=nixijxir, and Fisher's information matrix is given as

    I(ˆβ)=XW(ˆβ)X, (5.4)

    where W(ˆβ) is the unit matrix. Fisher's scoring process can be applied to obtain

    I(ˆβ(s1))ˆβ(s)=I(ˆβ(s1))ˆβ(s1)+U(ˆβ(s1)),  s=1,2,3,....

    From (5.3) and (5.4), we have

    (XW(ˆβ(s1))X)ˆβ(s)=(XW(ˆβ(s1))X)ˆβ(s1)+XQ(y,˜μ(ˆβ(s1))), (5.5)

    and

    (XX)ˆβ(s)=X[Xˆβ(s1)+Q(y,˜μ(ˆβ(s1)))].

    Thus, the estimated coefficients ˆβ are given by

    ˆβ(s)=(XX)1X[Xˆβ(s1)+Q(y,˜μ(ˆβ(s1)))]=(XX)1XZ,

    as given in (2.7). To derive the MLS of β, IRLS algorithm is used. Under regularity conditions on the likelihood function, the MLE ˆβ(s) is asymptotically normal, unbiased, and efficient, with covariance matrix equal to the inverse of Fisher's information matrix (Houston and Woodruff [30]). Therefore, asymptotically,

    ˆβN[β,(XWX)1],

    where (XWX)1 is the inverse of Fisher's information matrix.

    Suppose that yif(yi;γ) is as in (2.3), γ=˜μlog(2); then, the density function of yi is given by

    f(yi;˜μi)=log(2)˜μiy2ielog(2)˜μiyi. (5.6)

    Consider a gamma prior for ˜μi, which can be written as in (3.6). Posterior distribution of ˜μi is given by

    π(˜μi|yi)=λαlog(2)˜μαiΓ(α)y2ie˜μi(λ+log(2)yi). (5.7)

    Using Jacobian transformation from ˜μi to ηi, and using the log link function that is given as in (2.6), we have

    π(ηi|yi)[g(ηi)]αeg(ηi)(λ+log(2)yi)g(ηi)ηi, (5.8)

    where g(ηi)=eηi=˜μi and g(ηi)ηi=eηi. Then,

    π(ηi|yi)e(1+α)ηieeηi(λ+log(2)yi). (5.9)

    Taking the derivative of the log posterior, we have

    log(π(ηi|yi))ηi(1+α)eˆηi(λ+log(2)yi)=0, (5.10)

    hence, the posterior mode of ηi is given as in (3.7). Thus, the estimated coefficients ˆβ, IE-BReg, and IE-EBReg models in this case, are given as in (3.5) and (3.8), respectively.

    In the case that prior distribution parameter λ is unknown, for this estimation task, ˆλ estimates are obtained via numerical maximization of the following marginal likelihood (Shao [37], Dikici et al. [58], and Coluccia et al. [59])

    f(y|˜μ)=ni=10f(yi;˜μi)π(˜μi|yi)dμi. (5.11)

    Thus, the IE-EBReg estimate is found by placing these estimated prior distribution parameter into Equation (3.7) by ˆλ.

    Suppose that yif(yi;γ) is as in (2.3), γ=˜μlog(2); then, the density function of yi is given as in (5.6). Consider a gamma prior G(α,λ) for ˜μi, which can be written as in (3.6). Posterior distribution of ˜μi is given as in (5.7). Using Jacobian transformation from ˜μi to ηi, zero-one loss function and using the log link function as in (2.6), we have

    (1+α)=eˆηi(λ+log(2)yi). (5.12)

    Because of 0<α<, thus,

    1<eˆηi(λ+log(2)yi)<, (5.13)

    hence, we get

    λ>1eˆηi(log(2)yi). (5.14)

    Since 0<yi< for every i=1,...,n, thus

    <1eˆηi(log(2)yi)<1eˆηi. (5.15)

    From (5.14) and (5.15), we find λ1eˆηi, but eˆηi1, because of equation (5.13) and the fact eˆηi<eˆηi(λ+log(2)yi). Now, by using 1eˆηi1 and λ1eˆηi, then we obtain λ1. Hence, 1λ1 and αλ2αλ, thus the variance of ˜μ is greater than or equal the mean.

    Suppose that yif(yi;γ) is as it is in (2.3), γ=˜μlog(2), and the density function of yi is given as in (2.3). Consider ˜μi has a gamma prior with density function, which can be written as in (3.6). Using the posterior distribution of ηi that is given in (5.9), we have

    E(eαηi)=eαηiπ(ηi|yi)dηi, (5.16)
    =λαlog(2)Γ(α)y2ieηieeηi(λ+log(2)yi)dηi, (5.17)

    and

    E(eαηi)=λαlog(2)Γ(α)y2i(λ+log(2)yi). (5.18)

    Using the LINEX loss function, we have

    λαlog(2)Γ(α)y2i(λ+log(2)yi)=eαˆηi, (5.19)

    Thus, the posterior Bayes estimates of ηi by using the LINEX loss function are given as in (3.9).

    In the case that prior distribution parameter λ is unknown for this estimation task, ˆλ estimates are obtained via numerical maximization of the marginal likelihood of Equation (5.11). As a result, the IE-EBReg estimates are found by placing these estimated prior distribution parameters into Equation (3.9) by ˆλ.

    Suppose that yif(yi;γ) is as in (2.3), γ=˜μlog(2); then, the density function of yi is given as in (5.6). Consider a gamma prior G(α,λ) for ˜μi, which can be written as in (3.6). Posterior distribution of ˜μi is given as in (5.7). Using Jacobian transformation from ˜μi to ηi, LINEX loss function and using the log link function as in (2.6), we have

    λαlog(2)Γ(α)y2i(λ+log(2)yi)=eαˆηi, (5.20)

    Because of 0<α<, thus 0<eαˆηi<1, and

    0<eαˆηiΓ(α)y2ilog(2)(λ+log(2)yi)<Γ(α)y2ilog(2)(λ+log(2)yi). (5.21)

    Using (5.20) and (5.21), we get

    λα<Γ(α)y2ilog(2)(λ+log(2)yi), (5.22)

    hence

    λα<(λ+log(2)yi), (5.23)

    and,

    λ(λα11)<log(2)yi. (5.24)

    Thus, λ<log(2)yi and (λα11)<log(2)yi. Hence, 1λ<1+log(2)yi. Since 0<yi< for every i=1,...,n, thus 1<1+log(2)yi<. Now, by using 1λ<1+log(2)yi and 1<1+log(2)yi, then we obtain 1λ1, λ1 and αλ2αλ. Thus, the variance of ˜μ is smaller than or equal the mean.

    [1] Mata TM, Martins AA, Caetano NS (2010) Microalgae for biodiesel production and other applications: a review. Renew Sust Energ Rev 14: 217–232. doi: 10.1016/j.rser.2009.07.020
    [2] Hidalgo P, Toro C, Navia R (2013) Advances in direct transesterification of microalgal biomass for biodiesel production. Rev Environ Sci Bio 12: 179–199. doi: 10.1007/s11157-013-9308-0
    [3] Griffiths M, Van Hille R, Harrison S (2010) Selection of direct transesterification as the preferred method for assay of fatty acid content of microalgae. Lipids 45: 1053–1060. doi: 10.1007/s11745-010-3468-2
    [4] Ehimen E, Sun Z, Carrington C (2010) Variables affecting the in situ transesterification of microalgae lipids. Fuel 89: 677–684. doi: 10.1016/j.fuel.2009.10.011
    [5] Wahlen BD, Willis RM, Seefeldt LC (2011) Biodiesel production by simultaneous extraction and conversion of total lipids from microalgae, cyanobacteria, and wild mixed-cultures. Bioresource Technol 102: 2724–2730. doi: 10.1016/j.biortech.2010.11.026
    [6] Sathish A, Smith BR, Sims RC (2014) Effect of moisture on in situ transesterification of microalgae for biodiesel production. J Chem Technol Biot 89: 137–142. doi: 10.1002/jctb.4125
    [7] Velasquez-Orta S, Lee J, Harvey A (2013) Evaluation of FAME production from wet marine and freshwater microalgae by in situ transesterification. Biochem Eng J 76: 83–89. doi: 10.1016/j.bej.2013.04.003
    [8] Heilmann SM, Davis HT, Jader LR, et al. (2010) Hydrothermal carbonization of microalgae. Biomass Bioenerg 34: 875–882. doi: 10.1016/j.biombioe.2010.01.032
    [9] Lu Y, Levine RB, Savage PE (2014) Fatty acids for nutraceuticals and biofuels from hydrothermal carbonization of microalgae. Ind Eng Chem Res 54: 4066–4071.
    [10] Du Z, Mohr M, Ma X, et al. (2012) Hydrothermal pretreatment of microalgae for production of pyrolytic bio-oil with a low nitrogen content. Bioresource Technol 120: 13–18. doi: 10.1016/j.biortech.2012.06.007
    [11] Levine RB, Pinnarat T, Savage PE (2010) Biodiesel production from wet algal biomass through in situ lipid hydrolysis and supercritical transesterification. Energ Fuel 24: 5235–5243. doi: 10.1021/ef1008314
    [12] Broch A, Jena U, Hoekman SK, et al. (2013) Analysis of solid and aqueous phase products from hydrothermal carbonization of whole and lipid-extracted algae. Energies 7: 62–79. doi: 10.3390/en7010062
    [13] Halim R, Gladman B, Danquah MK, et al. (2011) Oil extraction from microalgae for biodiesel production. Bioresource Technol 102: 178–185. doi: 10.1016/j.biortech.2010.06.136
    [14] Thenot JP, Horning E, Stafford M, et al. (1972) Fatty acid esterification with N, N-dimethylformamide dialkyl acetals for GC analysis. Anal Lett 5: 217–223. doi: 10.1080/00032717208069552
    [15] Greenspan P, Mayer EP, Fowler SD (1985) Nile red: a selective fluorescent stain for intracellular lipid droplets. J Cell Biol 100: 965–973. doi: 10.1083/jcb.100.3.965
    [16] Iwai M, Ikeda K, Shimojima M, et al. (2014) Enhancement of extraplastidic oil synthesis in Chlamydomonas reinhardtii using a type‐2 diacylglycerol acyltransferase with a phosphorus starvationxtraplible promoter. Plant Biotechnol J 12: 808–819. doi: 10.1111/pbi.12210
    [17] Velasquez-Orta S, Lee J, Harvey A (2012) Alkaline in situ transesterification of Chlorella vulgaris. Fuel 94: 544–550. doi: 10.1016/j.fuel.2011.11.045
    [18] Heilmann SM, Jader LR, Harned LA, et al. (2011) Hydrothermal carbonization of microalgae II. Fatty acid, char, and algal nutrient products. Appl Energ 88: 3286–3290.
    [19] Heilmann SM, Jader LR, Sadowsky MJ, et al. (2011) Hydrothermal carbonization of distiller's grains. Biomass Bioenerg 35: 2526–2533. doi: 10.1016/j.biombioe.2011.02.022
    [20] Valdez PJ, Nelson MC, Wang HY, et al. (2012) Hydrothermal liquefaction of nannochloropsis sp.: systematic study of process variables and analysis of the product fractions. Biomass Bioenerg 46: 317–331.
    [21] Poling BE, Thomson GH, Friend DG, et al. (2007) Physical and Chemical Data, In: Green DW, Perry RH, editors, Perry's Chemical Engineers' Handbook, 8th Edition, United States of America: McGraw-Hill Professional, 144–185.
    [22] Dupont C, Chiriac R, Gauthier G, et al. (2014) Heat capacity measurements of various biomass types and pyrolysis residues. Fuel 115: 644–651. doi: 10.1016/j.fuel.2013.07.086
    [23] Schneider N, Fortin TJ, Span R, et al. (2016) Thermophysical properties of the marine microalgae Nannochloropsis salina. Fuel Process Technol 152: 390–398. doi: 10.1016/j.fuproc.2016.06.039
    [24] Shuit SH, Lee KT, Kamaruddin AH, et al. (2010) Reactive extraction and in situ esterification of Jatropha curcas L. seeds for the production of biodiesel. Fuel 89: 527–530.
    [25] Du Z, Hu B, Shi A, et al. (2012) Cultivation of a microalga Chlorella vulgaris using recycled aqueous phase nutrients from hydrothermal carbonization process. Bioresource Technol 126: 354–357. doi: 10.1016/j.biortech.2012.09.062
  • This article has been cited by:

    1. Mohanad Alkhodari, Namareq Widatalla, Maisam Wahbah, Raghad Al Sakaji, Kiyoe Funamoto, Anita Krishnan, Yoshitaka Kimura, Ahsan H. Khandoker, Deep learning identifies cardiac coupling between mother and fetus during gestation, 2022, 9, 2297-055X, 10.3389/fcvm.2022.926965
    2. Ivan Izonin, Nataliya Shakhovska, Special issue: Informatics & data-driven medicine, 2021, 18, 1551-0018, 6430, 10.3934/mbe.2021319
    3. Prabhjot Kaur, Lillie Dewan, Comparative Assessment of BSS Techniques for Non-Invasive Extraction of Fetal ECG from Abdominal ECG Signal, 2023, 2250-2106, 10.1007/s40031-023-00874-3
    4. Karina Lezama-García, Julio Martínez-Burnes, Uri Baqueiro-Espinosa, Dina Villanueva-García, Adriana Olmos-Hernández, Ismael Hernández-Ávalos, Patricia Mora-Medina, Adriana Domínguez-Oliva, Daniel Mota-Rojas, Uterine dynamics, blood profiles, and electronic fetal monitoring of primiparous and multiparous bitches classified according to their weight, 2023, 10, 2297-1769, 10.3389/fvets.2023.1282389
    5. Vasileios-Rafail Xefteris, Monica Dominguez, Jens Grivolla, Athina Tsanousa, Francesco Zaffanela, Martina Monego, Spyridon Symeonidis, Sotiris Diplaris, Leo Wanner, Stefanos Vrochidis, Ioannis Kompatsiaris, A Multimodal Late Fusion Framework for Physiological Sensor and Audio-Signal-Based Stress Detection: An Experimental Study and Public Dataset, 2023, 12, 2079-9292, 4871, 10.3390/electronics12234871
    6. Anatol y Povoroznyuk, Oksana Povoroznyuk, Anna Filatova, 2024, Heterogeneous Ensemble Classifier in Medical Decision Support Systems, 979-8-3503-8449-9, 1, 10.1109/KhPIWeek61434.2024.10877993
  • Reader Comments
  • © 2017 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(5618) PDF downloads(1143) Cited by(3)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog