Loading [MathJax]/jax/output/SVG/jax.js
Research article

Analysis of a non-Markov transcription model with nuclear RNA export and RNA nuclear retention


  • Received: 07 April 2022 Revised: 23 May 2022 Accepted: 29 May 2022 Published: 09 June 2022
  • Transcription involves gene activation, nuclear RNA export (NRE) and RNA nuclear retention (RNR). All these processes are multistep and biochemical. A multistep reaction process can create memories between reaction events, leading to non-Markovian kinetics. This raises an unsolved issue: how does molecular memory affect stochastic transcription in the case that NRE and RNR are simultaneously considered? To address this issue, we analyze a non-Markov model, which considers multistep activation, multistep NRE and multistep RNR can interpret many experimental phenomena. In order to solve this model, we introduce an effective transition rate for each reaction. These effective transition rates, which explicitly decode the effect of molecular memory, can transform the original non-Markov issue into an equivalent Markov one. Based on this technique, we derive analytical results, showing that molecular memory can significantly affect the nuclear and cytoplasmic mRNA mean and noise. In addition to the results providing insights into the role of molecular memory in gene expression, our modeling and analysis provide a paradigm for studying more complex stochastic transcription processes.

    Citation: Qiqi Deng, Aimin Chen, Huahai Qiu, Tianshou Zhou. Analysis of a non-Markov transcription model with nuclear RNA export and RNA nuclear retention[J]. Mathematical Biosciences and Engineering, 2022, 19(8): 8426-8451. doi: 10.3934/mbe.2022392

    Related Papers:

    [1] Xuan Zhang, Huiqin Jin, Zhuoqin Yang, Jinzhi Lei . Effects of elongation delay in transcription dynamics. Mathematical Biosciences and Engineering, 2014, 11(6): 1431-1448. doi: 10.3934/mbe.2014.11.1431
    [2] S. Hossein Hosseini, Marc R. Roussel . Analytic delay distributions for a family of gene transcription models. Mathematical Biosciences and Engineering, 2024, 21(6): 6225-6262. doi: 10.3934/mbe.2024273
    [3] Qi Wang, Lifang Huang, Kunwen Wen, Jianshe Yu . The mean and noise of stochastic gene transcription with cell division. Mathematical Biosciences and Engineering, 2018, 15(5): 1255-1270. doi: 10.3934/mbe.2018058
    [4] Yue Hu, Xiaoqin Mei, Dong Tang . Long non-coding RNA XIST is down-regulated and correlated to better prognosis in ovarian cancer. Mathematical Biosciences and Engineering, 2020, 17(3): 2070-2081. doi: 10.3934/mbe.2020110
    [5] Gesham Magombedze, Winston Garira, Eddie Mwenje . Modelling the immunopathogenesis of HIV-1 infection and the effect of multidrug therapy: The role of fusion inhibitors in HAART. Mathematical Biosciences and Engineering, 2008, 5(3): 485-504. doi: 10.3934/mbe.2008.5.485
    [6] Jiajun Zhang, Tianshou Zhou . Stationary moments, distribution conjugation and phenotypic regions in stochastic gene transcription. Mathematical Biosciences and Engineering, 2019, 16(5): 6134-6166. doi: 10.3934/mbe.2019307
    [7] Sijie Lu, Juan Xie, Yang Li, Bin Yu, Qin Ma, Bingqiang Liu . Identification of lncRNAs-gene interactions in transcription regulation based on co-expression analysis of RNA-seq data. Mathematical Biosciences and Engineering, 2019, 16(6): 7112-7125. doi: 10.3934/mbe.2019357
    [8] Yanqin Wang, Xin Ni, Jie Yan, Ling Yang . Modeling transcriptional co-regulation of mammalian circadian clock. Mathematical Biosciences and Engineering, 2017, 14(5&6): 1447-1462. doi: 10.3934/mbe.2017075
    [9] Mingshuai Chen, Xin Zhang, Ying Ju, Qing Liu, Yijie Ding . iPseU-TWSVM: Identification of RNA pseudouridine sites based on TWSVM. Mathematical Biosciences and Engineering, 2022, 19(12): 13829-13850. doi: 10.3934/mbe.2022644
    [10] Zhenquan Zhang, Junhao Liang, Zihao Wang, Jiajun Zhang, Tianshou Zhou . Modeling stochastic gene expression: From Markov to non-Markov models. Mathematical Biosciences and Engineering, 2020, 17(5): 5304-5325. doi: 10.3934/mbe.2020287
  • Transcription involves gene activation, nuclear RNA export (NRE) and RNA nuclear retention (RNR). All these processes are multistep and biochemical. A multistep reaction process can create memories between reaction events, leading to non-Markovian kinetics. This raises an unsolved issue: how does molecular memory affect stochastic transcription in the case that NRE and RNR are simultaneously considered? To address this issue, we analyze a non-Markov model, which considers multistep activation, multistep NRE and multistep RNR can interpret many experimental phenomena. In order to solve this model, we introduce an effective transition rate for each reaction. These effective transition rates, which explicitly decode the effect of molecular memory, can transform the original non-Markov issue into an equivalent Markov one. Based on this technique, we derive analytical results, showing that molecular memory can significantly affect the nuclear and cytoplasmic mRNA mean and noise. In addition to the results providing insights into the role of molecular memory in gene expression, our modeling and analysis provide a paradigm for studying more complex stochastic transcription processes.



    Transcription is not only the most important but also the most complex step in gene expression. This double characteristic makes gene transcription receive lasting and extensive attention. With the development of measurement technologies (e.g., single-cell and single-molecule technologies), more molecular details of transcription have been experimentally uncovered. Nevertheless, some intermediate processes would not have been specified due to the complexity of gene transcription. Thus, traditional Markov models of gene transcription such as extensively studied ON-OFF models [1,2,3,4] would nor interpret experimental phenomena nor reveal the stochastic mechanisms of transcription. More biologically reasonable mathematical models need to be developed.

    It is well known that gene transcription involves RNA nuclear retention (RNR) and nuclear RNA export (NRE). However, these two important processes were often ignored in previous studies [1,2,3,4,5,6,7,8]. Main reasons include that 1) NRE was before considered to be a transient process, compared to other processes occurring in transcription. It was reported that the NRE phase lasted about 20min on average and were gene-specific [9,10]; 2) For RNR, less than 30% poly (A+) RNA is nuclear-retained and undetectable in the cytoplasm [11]. Currently, more and more experimental evidences have indicated that RNR play a key role in biological processes, e.g., in S. cerevisiae cells, RNR may play a precautionary role during stress situations [12]; in plants, the RNR process of NLP7 can orchestrate the early response to nitrate [13]; and in the signaling pathway of antiviral innate immunity, the RNA helicase DDX46 acts like a negative regulator to induce nuclear retention of antiviral innate transcripts [14]. These experimental facts suggest that RNR or NRE cannot be neglected when one makes theoretical predictions of gene expression (including expression levels and noise).

    Several works have revealed the respective roles of RNE and RNR in modulating stochastic gene expression [15,16,17,18,19]. One report exhibited that transcriptional burst attributed to promoter state switching could be substantially attenuated by the transport of mRNA from nucleus to cytoplasm [17]. Another report showed that slow pre-mRNA export from the nucleus could be an effective mechanism to attenuate protein variability arising from transcriptional burst [15]. In addition, RNR was also identified to buffer transcriptional burst in tissues and mammalian cells [16,18]. However, it has been experimentally confirmed that NRE and RNR can occur simultaneously in eukaryotes [20]. How these two dynamic processes cooperatively affect gene expression remains elusive and even unexplored.

    As a matter of fact, gene activation, NRE and RNR are multistep processes. In general, transcription begins only when the chromatin template accumulates over time until the promoter becomes active [21,22], where the accumulating process is a multistep biochemical process in which some intermediate steps would not have been specified due to experimental technologies. A representative example is that inactive phases of the promoter involving the prolactin gene in a mammalian cell are differently distributed, showing strong memory [23]. Similarly, both the export of mRNAs generated in the nucleus to the cytoplasm through nuclear pores and the retention of mRNAs among nuclear speckles or paraspeckles are in general also multistep reaction processes [24]. All these multistep processes can create memories between reaction events, leading to non-Markov dynamics. Traditional Markov models are no longer suitable to the modeling of gene transcription with molecular memory, and non-Markov models can well model multistep processes involved in gene transcription [7].

    In this paper, we introduce a non-Markov model of stochastic gene transcription. It considers not only RNR and NRE but also molecular memories created due to the multistep NRE process, due to RNR process, or due to the multistep activation process, thus including previous transcription models [1,2,3,4] as its special case. In order to solve this non-Markov model, we introduce effective transition rates, which explicitly decode the effect of molecular memory and by which we can transform the original non-Markov issue into an equivalent yet mathematical tractable Markov one. Based on this useful technique, we derive analytical results, which extend previous results [3,8,24,25] and provide insights into the role of molecular memory in affecting the nuclear and cytoplasmic mRNA mean and noise. The overall modeling and analysis provide a paradigm for studying more complex stochastic transcription processes.

    Most previous studies [15,20,26] of gene transcription focused on the dynamics of NRE processes, where mature mRNAs were released to the cytoplasm with the help of nuclear pore complex (NPC) (Figure 1) [16,27]. The number of NPCs or the count of those assistant proteins that controlled NPC determined the speed of exporting mRNA. Measuring the exporting rate was often replaced by measuring the retention time in the nucleus, which however may vary with environmental changes [16,17,18,19]. Other previous studies of gene transcription [9,10,11,28,29] focused on the dynamics of transcription initiation and elongation, where elongation time (T) was measured by the length of a gene using the length of bases (L) and the average rate of elongation (v), i.e., T=L/Lvv. These studies assumed that all mature mRNAs were first exported to the cytoplasm and then translated into proteins. However, biological experiments indicated that there were a considerable part of mature mRNAs that stayed in the nucleus in a probabilistic manner and lasted a long period (Figure 1) [24].

    Figure 1.  Schematic diagram for a model of stochastic gene transcription. First, chromation (consisting of nucleosomes) opens in a multistep manner, and then, DNA is transcribed into mRNAs also in a multistep manner. Some of these mRNAs are remained in the nucleus (forming so-called paraspeckles) in a multistep manner, and the others are exported into the cytoplasm through the nuclear pores also in a multistep manner.

    Here, we consider two cases: one case where NRE dominates over RNR and the other case where RNR dominates over NRE. For both cases, the gene is assumed to have one "off" state (corresponding to the inactive form of the promoter) and one "on" state (corresponding to the active form), and the promoter is assumed to switch randomly between these two states. Only in "on" state, can the gene generate pre-mRNA. After an alternative splicing (AS) process or an alternative polyadenylation (APA) process, which occurs frequently at the 3' UTR, a portion of mature mRNAs (one type of transcripts) may be transported to the cytoplasm through NPC wherein they execute translation tasks. The rest mature mRNAs (another type of transcripts) may be remained in the nucleus for a long time, possibly assembling on the sub-cellular region (wherein they form nuclear speckles or paraspeckles [30,31,32]) with the assistance of proteins, some of which would have been unspecified. When the intracellular environment is changed, most of mature mRNAs will release to the cytoplasm in response to this change. In addition, most genes (especially in eukaryotic cells) are expressed in a bursty manner [1,2,3,4].

    As pointed out in the introduction, gene transcription, NRE and RNR are all multistep reaction processes. In order to model these processes, we introduce a non-exponential waiting-time distribution for each intermediate reaction as done in ref. [7,33]. Since non-exponential waiting times lead to non-Markovian dynamics, the existing Markov theory cannot be used directly.

    Assume that burst size in gene transcription follows a distribution described by prob{B=i}=αi, where each αi is a nonnegative constant and i=0,1,2,. Let M1, M2 and M3 represent pre-mRNA, mature mRNA (one type of transcripts) transported to the cytoplasm and mature mRNA (another type of transcripts) retained in the nucleus respectively, and denote by m1, m2 and m3 their molecular numbers respectively. Thus, m=(m1,m2,m3)T represents the micro-state of the underlying system. Let W1(t;m), W2(t;m) and W3(t;m) be waiting-time distributions for the syntheses of pre-mRNAs, mature mRNA transported to the cytoplasm, and mature mRNA retained in the nucleus, respectively. Let W4(t;m) and W5(t;m) be waiting-time distributions for degradations of M2 and M3, respectively. To that end, the gene-expression model to be studied are described by the following five biochemical reactions labelled by Ri (1i5)

    R1:DNAW1(t;m)DNA+B×M1, R2:M1W2(t;m)M2,R3: M1W3(t;m)M3, R4:M2W4(t;m), R5:M3W5(t;m). (1)

    Let B represent the mean burst size. Note that if α1=1 and αk=0 for all k1, this impliesB1. In this case, the promoter is in the ON state all the time, and Eq (1) describes the constitutive gene expression. The other cases correspond to bursty gene expression. This is because B=0 implies that the promoter is in the OFF state (meaning that pre-mRNAs are not generated), whereas B>0 implies that the promoter is at the ON state (meaning that pre-mRNAs are generated).

    For each reaction, there is a memory function [7,33]. Denote by Mi(t;m) memory function for reaction Ri (1i5). These memory functions can be expressed by waiting-time distributions in Eq (1). In fact, if we let ˜Mi(s;m) be the Laplace transform of memory function Mi(t;m), then ˜Mi(s;m) can be expressed as ˜Mi(s;m)=s˜φi(s;m)/s˜φi(s;m)[15i=1˜φi(s;m)][15i=1˜φi(s;m)], where ˜φi(s;m) is the Laplace transform of function φi(t;m) that can be expressed as φi(t;m)=Wi(t;m)ki[1t0Wk(t;m)dt] (1i5) [7]. Let P(m;t) be the probability that the system is in state m at time t and ˜P(m;s) be the Laplace transform of P(m;t). With ˜Mi(s;m), we can show that the chemical master equation in the sense of Laplace transform takes the form

    s˜P(m;s)P(0;s)=(m1i=0αiEi1I)[˜M1(s;m)˜P(m;s)]+(E1E12I)[˜M2(s;m)˜P(m;s)] +(E1E13I)[˜M3(s;m)˜P(m;s)]+5j=4(EjI)[˜Mj(s;m)˜P(m;s)], (2)

    where E is the step operator and E1 is the inverse of E, and I is the unit operator.

    Interestingly, we find that limit lims0˜Mi(s;m) always exists, and if the limit function is denoted by Ki(m), then Ki(m) can be explicitly expressed by given waiting-time distributions Wk(t;m) (1k5), that is,

    Ki(m)= + 0Wi(t;m)[jitWj(t;m)dt]dt + 0[5j=1tWj(t;m)dt]dt,1i5. (3)

    Note that s0 corresponds to t+ according to the definition of Laplace transform. However, t+ corresponds to the steady-state case, which is our interest. We point out that function Ki(m), which will be called effective transition rate for reaction Ri, explicitly decodes the effect of molecular memory, where 1i5. More importantly, using these effective transition rates, we can construct a Markov reaction network with the same topology as the original non-Markov reaction network:

    DNAK1(m)DNA+B×M1, M1K2(m)M2, M1K3(m)M3, M2K4(m), M3K5(m). (4)

    Moreover, two reaction networks have exactly the same chemical master equation at steady state:

    (m1i=0αiEi1I)[K1(s;m)P(m)]+(E1E12I)[K2(m)P(m)] +(E1E13I)[K3(m)P(m)]+j{4,5}(EjI)[Kj(m)P(m)]=0, (5)

    implying that both stationary behaviors are exactly the same (referring to Figure 2), where P(m) is the stationary probability density function corresponding to dynamic density probability function P(m;t).

    Figure 2.  Schematic diagram for two reaction networks with the same topology and the same reactive species, where W-type functions represent reaction-event waiting-time distributions on the left-hand side whereas K-type functions are effective transition rates on the right-hand side (see the main text for details), D represents DNA, and the other symbols are explained in the context. The results in reference [7] imply that the stationary behaviors of two reaction networks are exactly the same although reaction events in the two cases take place in different manners.

    In summary, by introducing an effective transition rate (Ki(m)) for each reaction Ri, given by Eq (3), a mathematically difficult non-Markov issue is transformed into a mathematical tractable Markov issue. This brings us convenience for theoretical analysis. In the following, we will focus on analysis of Eq (5).

    Note that Gamma functions can well model multistep processes [34,35]. This is because the convolution of several exponential distributions is an Erlang distribution (a special case of Gamma distribution). Therefore, in order to model the effect of molecular memory on the mRNA expression, we assume that waiting-time distributions for gene activation, NRE and RNR processes are Gamma distributions: W1(t;m)=[Γ(L0)]1tL01(k0)L0ek0t, W2(t;m)=[Γ(Lc)]1tLc1(m1kc)Lcem1kct andW3(t;m)=[Γ(Lr)]1tLr1(m1kr)Lrem1krt, and that waiting-time distributions for the other processes are exponential ones, W4(t;m)=m2dcem2dct, and W5(t;m)=m3drem3drt. Here Γ() is the common Gamma function, k0 is the mean transcription rate, kc and kr denote the mean synthesis rates for mRNAs in the nucleus and mRNAs in the cytoplasm respectively, dc and dr are the mean degradation rates of mRNA in the nucleus and mRNA in the cytoplasm respectively. Throughout this paper, L0, Lc and Lr are called memory indices since, e.g., L0=1 corresponds to the memoryless case whereas L01 to the memory case.

    Let k1 be the total synthesis rate of mRNAs in the cytoplasm, which is composed of two parts: one part is the rate at which pre-mRNA generates a transcript through the AS process or the APA process, and the other is the rate at which the mature mRNA is finally exported to the cytoplasm with the help of NPC. The rate of generating mature mRNAs, determined by the gene length, is generally fixed. In contrast, the rate of exporting mRNAs to the cytoplasm may change in a large range, depending on cellular environments. This is because some types of mRNAs are exported in a fast manner due to RNA binding proteins or linked splicing factors, and other types of mRNAs are exported in a slow manner and the corresponding genes are most intron-containing ones [19]. Thus, we can use k1 to characterize the export rate indirectly. Similarly, if we let k2 be the synthesis rate of mRNAs in the nucleus, then it also includes two parts: one part is the rate of pre-mRNA produced through an AS or APA process, and the other is the rate at which transcripts are transported to some sub-cellular regions (e.g., nuclear speckles or paraspeckles) under the assistance of some proteins. Here, we assume that k2 changes a little so that the involved processes are simplified. Owing to AS or APA processes, the lengths of mature mRNAs of the two kinds can be significantly different. Usually, the rate k1 is faster than the rate k2. The retention and export of transcripts are random, we introduce another parameter pr, called remaining probability throughout this paper, to characterize this randomness. Then, the practical export rate and the practical retention rate should be kc=k1(1pr) and kr=k2pr respectively, where pr(0,1).

    Based on the experimental data from Halpern's group [26] that measured the whole genome-wide catalogue of nuclear and cytoplasmic mRNA from MIN6 cells, we know that most genes (~70%) has more cytoplasmic transcripts than nuclear transcripts. Thus, we can get an approximate formula for remaining probability pr: pr=Nn/Nn(Nn+Nc)(Nn+Nc) where Nn is the number of transcripts in the nucleus and Nc the number of transcripts in the cytoplasm. By considering gene INS1 for which the value of ratio Nc/NcNnNn is the maximal (13.2±4.6), we can know that the value of pr is about 5%. In that paper, the authors also mentioned that about 30% of the genes in MIN6 cells have more transcripts in the nucleus than in the cytoplasm. By considering gene ChREBP for which the value of ratio cytoplasm/nucleus is about 0.05, we can know that the value of pr is about 95%. Therefore, the range of remaining probability (pr) in the whole genome is about 5~95%. It is reasonable that the 50% value of pr is set as a threshold. For convenience, we categorize models of eukaryotic gene expression into two classes: one class where the RNE process is dominant and the other class where the RNR process is dominant. For the former, pr<0.5 holds, implying that most mRNAs are exported to the cytoplasm through nuclear pores, whereas for the latter, pr>0.5 holds, implying that most mRNAs retain in the nucleus.

    In the following analysis, memory indices Li (i=0,c,r), and remaining probability pr will be taken as key parameters while the other parameter values will be kept fixed. Without loss of generality, we assume that two degradation rates for mRNAs in the nucleus and cytoplasm are equal and denote by d the common degradation rate, i.e., dc=dr=d.

    First, if we let xi represent the concentration of reactive species Mi, i.e., xi=limmiΩmi/miΩΩ (i=1,2,3) where Ω represents the volume of the system, then the rate equations corresponding to the constructed-above Markov reaction network can be expressed as

    dxdt=SK(x), (6)

    where x=(x1,x2,x3)T is a column vector, S=(Sij)3×5=(B11000101000101) is a matrix, and K(x)=(K1(x),K2(x),K3(x),K4(x),K5(x))T is a column vector of effective transition rates. The stead states or equilibriums of the system described by Eq (6), denote by xS, are determined by solving the algebraic equation group, i.e., SK(xS)=0.

    Second, if denoting by X the mean of random variable X and taking approximation MixSi (i=1,2,3), then we can derive the following matrix equation (see Appendix A):

    ASΣS+ΣSATS+ΩDS=0, (7)

    where two square matrices AS=(Aij)3×3 and DS=(Dij)3×3 evaluated at steady state are known, and covariance matrix ΣS=(σij) with σij=(MiMi)(MjMj) is unknown. Note that diagonal elements σ22 and σ33 represent variances for random variables M2 (corresponding to mRNA in the cytoplasm) and M3 (corresponding to mRNA in the nucleus), which are our interest. In addition, we can also derive formulae similar to Eq (3) in the case of continuous variables.

    In order to show the explicit effect of molecular memory on the mRNA expression in different biological processes, we consider two special cases: 1) The process of gene activation with memory and other processes are memoryless. 2) The process of nuclear RNA export with memory and other processes are memoryless. In other cases, there are in general no analytical results.

    Case 1 L01, Lc=1 and Lr=1. In this case, the five effect transition rates become: K1(x)=(kcx1+krx1+dx2+dx3)(k0)L0(kcx1+krx1+dx2+dx3+k0)L0(k0)L0, K2(x)=kcx1, K3(x)=krx1, K4(x)=dx2, and K5(x)=dx3. Note that in the case of continuous variables, the corresponding effect transition rates Ki(x) (1i5) have the same expressions except for variable notations.

    We can show that the means of mRNAs in the nucleus and in the cytoplasm are given respectively by (see Appendix B)

    M2=˜k0˜kc,M3=˜k0˜kr, (8)

    where ˜k0=12[(1+2B)1/1L0L01]k0kc+kr>0 with B being the expectation of burst size B (random variable), ˜kc=kcd and ˜kr=krd. Apparently, Mi (i=2,3) is a monotonically decreasing function of memory index L0, implying that molecular memory always reduces the mRNA expression level in the nucleus and cytoplasm. In addition, by noting kc=k1(1pr) and kr=k2pr, we can know that if k2/k2k1k1 is fixed, then M3 (i.e., the mean of mRNAs in the nucleus) is a monotonically increasing function of remaining probability pr whereas M2 (i.e., the mean of mRNAs in the cytoplasm) is a monotonically decreasing function of pr. In addition, M3 is a monotonically decreasing function of ρ=kc/kckrkr whereas M2 is a monotonically increasing function of ρ. These results are in agreement with intuition.

    Interestingly, we find that σ22 and σ33, the variances for mRNAs in the nucleus and in the cytoplasm resepctively, have the following relationship (see Appendix B)

    σ22=(˜kr˜kc)2σ33+˜k0(˜kc+˜kr)˜kc˜kr, (9)

    indicating that the mRNA variance in the cytoplasm, σ22, is larger than that in the nucleus, σ33.

    From the viewpoint of experiments, the cytoplasmic mRNAs are easy to measure whereas the cytoplasmic mRNAs are difficult to measure. Therefore, we are interested in the cytoplasmic mRNA expression (including the level and noise). By complex calculations, we can further show that the cytoplasmic mRNA variance is given by

    σ22=˜k0˜kc˜k3r˜k3c+˜k3r{2+˜kc˜kr+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}. (10)

    where ˜b=14B[L0+2(L01)BL0(1+2B)(L01)/(L01)L0L0]>0 and γ=B2+BB with B2 being the second-order raw moment of burst size B. Furthermore, if we define the noise intensity as the ratio of the variance over the squared mean, then the noise intensity for the cytoplasmic mRNA, denoted by ηc, can be analytically expressed as

    ηc=1˜k0˜kc˜k3r˜k3c+˜k3r{2+˜kc˜kr+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}. (11)

    Note that if L0=1, which correspond to the Markov case, then ˜k0=k0Bkc+kr and ˜b=0. Thus, the cytoplasmic mRNA noise in the Markov case, denoted by ηc|L0=1, is given by

    ηc|L0=1=1˜k0˜kc˜k3r˜k3c+˜k3r(˜kc+2˜kr˜kr+γ2˜kc+˜kr1+˜kc+˜kr). (12)

    Therefore, the ratio of the noise in non-Markov (L01) case over that in the Markov (L0=1) case

    ηcηc|L0=1={˜kc+2˜kr˜kr+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}(˜kc+2˜kr˜kr+γ2˜kc+˜kr1+˜kc+˜kr)1, (13)

    which may be more than the unit but may also be less than the unit, depending on the size of remaining probability. However, if L0 is large enough (e.g., L0>2), then the ratio in Eq (13) is always larger than the unit, implying that molecular memory amplifies the cytoplasmic mRNA noise.

    Case 2 L0=1, Lc1 and Lr=1. In this case, five effect transition rates reduce to K1(x)=k0, K2(x)=(k0+krx1+dx2+dx3)(kcx1)Lc(k0+kcx1+krx1+dx2+dx3)Lc(kcx1)Lc, K3(x)=kcx1, K4(x)=dx2, and K5(x)=dx3. It seems to us there are no analytical results as in Case 1. However, if pr=0 (i.e., if we do not consider nuclear retention), then we can show that the steady state is given by x1=k0k1ω,x2=k0Bd,x3=0, where ω=(1+B)B1/1LcLc(1+2B)1/1LcLcB1/1LcLc is a factor depending on both transcriptional burst and molecular memory. Moreover, the mRNA noise in the cytoplasm is given by (see Appendix C for derivation)

    ηc=d(1+B)2k0B2dω(1+ω+B)+γLck1Bk1(1+2B)dω(1+ω+B)+LcB(1+2B)[dω+k1(1+B)]. (15)

    In order to see the contribution of molecular memory to the cytoplasmic mRNA noise, we calculate the ratio of the noise in non-Markov (Lc1) case over that in the Markov (Lc=1) case

    ηcηc|Lc=1=d+k12d+γk1(1+B)[2dω(1+ω+B)+γLck1Bk1(1+2B)]dω(1+ω+B)+LcB(1+2B)[dω+k1(1+B)], (16)

    which is in general larger than the unit for large Lc>1 (corresponding to strong memory), indicating that molecular memory in general enlarges the cytoplasmic mRNA noise.

    Here we numerically investigate the effect of molecular memory (Lc) from nuclear RNA export on the cytoplasm mRNA (M2) in the cases that the other model parameter values are fixed. Numerical results are demonstrated in Figure 3. From Figure 3(a), we observe that the mean level of the cytoplasm mRNA is a monotonically decreasing function of Lc, independent of the value choice of remaining probability (pr) (even though we only show the two cases of pr values). This is in agreement with our intuition since a more reaction step for mRNA synthesis inevitably leads to less mRNA molecules. On the other hand, we observe from Figure 3(b) that molecular memory reduces the cytoplasm mRNA noise (ηc) for smaller values of Lc but enlarges ηc for larger values of Lc, implying that there is an optimal Lc such that ηc arrives at the minimum. We emphasize that the dependences shown in Figure 3(a), (b) are qualitative since they are independent of the value choice of remaining probability.

    Figure 3.  Influence of molecular memory (Lc) from multistep RNA export on the cytoplasmic mRNA (M2), where solid lines represent theoretical results obtained by linear noise approximation (Appendix A), and empty circles represent numerical results obtained by a Gillespie algorithm [36]. Parameter values are set as <B>=2, k1=4, k2=0.8, k0=1, dc=dr=1. (a) Impact of molecular memory from the RNA export on the mean cytoplasmic mRNA (M2) for two values of remaining probability (pr), where the blue solid line corresponds to pr=0.2001 and the orange solid line to pr=0.3001. (b) Effect of molecular memory from the RNA export on the cytoplasmic mRNA noise (ηc) for two values of pr.

    Importantly, Figure 3 indicates that the results obtained by the linear noise approximation (solid lines) are well in agreement with the results obtained by the Gillespie algorithm [36]. Therefore, the linear noise approximation can be used in fast evaluation of the expression noise, and in the following, we will focus on results obtained by the linear noise approximation. In addition, we point out that most results obtained here and thereafter are qualitative since they are independent of the choice of parameter values. However, to demonstrate interesting phenomena clearly, we will choose special values of some model parameters.

    Here we focus on numerically investigating joint effects of molecular memory (Lc) and remaining probability (pr) on the cytoplasm mRNA (M2). Figure 4(a) demonstrates effects of pr on the M2 noise for three values of Lc. We observe that with the increase of remaining probability, the cytoplasmic mRNA noise (ηc) first decreases and then increases, implying that there is a critical pr such that ηc arrives at the minimum (referring to empty circles in Figure 4(a)) or that remaining probability can minimize the cytoplasmic mRNA noise. Moreover, this minimum is independent of the values of memory index Lc. In addition, we find that the minimal ηc first increases and then decreases with increasing Lc (the inset of Figure 4(a)). In other words, noise of cRNA can derive a optimal value with the decrease of remaining probability and the increase of memory index. Figure 4(b) shows the dependences of ηc on Lc for three different values of remaining probability. We find that molecular memory can also make cytoplasmic mRNA noise reach the minimum (referring to empty circles in Figure 4(b)), and this minimal noise is monotonically increasing function of pr.

    Figure 4.  Influence of remaining probability and molecular memory on mature mRNA transported to the cytoplasm (M2). Solid lines represent theoretical results obtained by our linear noise approximation (Appendix A). Empty circles represent the minimum of the noise of the cytoplasm mRNA. (a) Parameter values are set as <B>=40, k1=5, k2=0.8, k0=2.5, dc=dr=1. (b) Parameter values are set as <B>=2, k1=2, k2=0.8, k0=1, dc=dr=1.

    Here we focus on numerically analyzing joint effects of memory index L0 and remaining probability pr on the cytoplasm mRNA (M2). Figure 5(a) demonstrates effects of pr on the M2 noise for two representative values of L0 (note: L0=1 corresponds to the memoryless case whereas L0=2 corresponds to the memory case). We observe that with the increase of remaining probability, the cytoplasmic mRNA noise (ηc) first decreases and then increases, implying that there is a critical pr such that ηc reaches the minimum (referring to empty circles in Figure 5(a)) or that remaining probability can minimize the cytoplasmic mRNA noise. Moreover, this minimum (referring to empty circles) is a monotonically increasing function of memory index L0.

    Figure 5.  Influence of remaining probability for nuclear RNA retention and molecular memory from multistep gene activation on the cytoplasmic mRNA (M2), where lines represent the results obtained by linear noise approximation (Appendix A). Empty circles represent the minimum of the noise of the cytoplasm mRNA. (a) The dependence of cytoplasmic mRNA noise ηc on remaining probability pr for two values of memory index L0, where the inset is an enlarged diagram showing the dependence of the minimal ηc on L0. Parameter values are set as <B>=20, k1=10, k2=1, k0=2.5, dc=dr=1. (b) The dependence of cytoplasmic mRNA noise ηc on remaining probability pr for three values of remaining probability pr. Parameter values are set as <B>=2, k1=4, k2=1, k0=2.5, dc=dr=1.

    Figure 5(b) demonstrates that the cytoplasmic mRNA noise (ηc) is always a monotonically increasing function of memory index L0, independent of remaining probability. In addition, we observe that ηc is a monotonically increasing function of remaining probability (this can be seen by comparing three lines).

    Here we consider the case that RNR is a multistep process, i.e., Lr1. Numerical results are demonstrated in Figure 6. We observe from Figure 6(a) that except for the case of Lr=1, which corresponds to the Markov process and for which the cytoplasmic mRNA noise (ηc) is a monotonically increasing function of remaining probability (pr), the dependences of ηc on pr are not monotonic in the cases of Lr1 (corresponding to non-Markov processes) but there is a threshold of pr such that ηc reaches the minimum (referring to empty circles), similarly to the case of Figure 5(a). Moreover, this minimal noise is a monotonically decreasing function of memory index Lr (referring to the inset of Figure 6(a)) but the monotonicity is opposite to that in the case of Figure 5(a).

    Figure 6.  Influence of remaining probability and molecular memory on mature mRNA transported to the cytoplasm (M2), solid lines represent theoretical results obtained by linear noise approximation (Appendix A). Empty circles represent the minimum of the noise of the cytoplasm mRNA. (a) Parameter values are set as <B>=2, k1=2, k2=0.8, k0=2.5, dc=dr=1. (b) Parameter values are set as <B>=2, k1=2, k2=0.8, k0=1, dc=dr=1.

    Figure 6(b) shows how the cytoplasmic mRNA noise (ηc) depends on memory index Lr for two different values of remaining probability. Interestingly, we observe that there is an optimal value of Lr such that the cytoplasmic mRNA noise reaches the minimum. Moreover, the minimal ηc is a monotonically decreasing function of remaining probability (pr), referring to the inset in the bottom right-hand corner.

    Gene transcription in eukaryotes involve many molecular processes, some of which are well known and others are little known and even unknown [37,38]. In this paper, we have introduced a non-Markov model of stochastic transcription, which simultaneously considers RNA nuclear retention and nuclear RNA export processes and in which we have used non-exponential waiting-time distributions (e.g., Gamma distributions) to model some unknown or unspecified molecular processes involved in, e.g., the synthesis of pre-mRNA and the export of mRNAs generated in the nucleus to the cytoplasm and the retention of mRNA in the nucleus. Since non-exponential waiting times can lead to non-Markov kinetics, we have introduced effective transition rates for the reactions underlying transcription to transform a mathematically difficult issue to a mathematically tractable one. As a result, we have derived the analytical expressions of mRNA means and noise in the nucleus and cytoplasm, which revealed the importance of molecular memory in controlling or fine-tuning the expressions of two kinds of mRNA. Our modeling and analysis provided a heuristic framework for studying more complex gene transcription processes.

    Our model considered main events occurring in gene transcription such as bursty expression (burst size follows a general distribution), alternative splicing (by which two kinds of transcripts are generated), RNR (a part of RNA molecules that are kept in the nucleus) and RNE (another part of RNA molecules that are exported to the cytoplasm). Some popular experimental technologies such as single-cell sequence data [39], single-molecule fluorescence in-situ hybridization (FISH) [40] and electron micrographs (EM) of fixed cells [41] have indicated that RNR and NRE are two complex biochemical processes, each involving regulation by a large number of proteins or complexes [42]. In particular, the mRNAs exported to the cytoplasm involve the structure of nuclear pore complex (NPC) [43]. A number of challenging questions still remain unsolved, e.g., how do RNR and NRE cooperatively regulate the expressions of nuclear and cytoplasmic mRNAs? Why are these two dynamical processes necessary for the whole gene-expression processes when the cells survive in complex environments? And what advantages do they have in contrast to a single NRE process?

    Despite simple, our model can not only reproduce results for pre-mRNA (nascent mRNA) means at steady state in previous studies but also give results in agreement with experimental data on the mRNA Fano factors (define as the ratio of variance over mean) of some genes. However, we point out that some results of Fano factors obtained using our model is not always in agreement with the experimental data, e.g., for five genes, RBP3, TAF5, TAF6, TAF12 and KAP104, results obtained by our model seem not in agreement with experimental data but results obtained by a previous theoretical model [44] seems better (data are not shown). In addition, for the PRB8 gene, results on Fano factor, obtained by our model and the previous model, are poorly in agreement with experimental data (data are not shown). This indicates that constructing a theoretical model for the whole transcription process still needs more work.

    In spite of differences, our results are wholly in agreement with some experimental data or observations. First, the qualitative result that RNR always reduces the nuclear pre-mRNA noise and always amplifies the cytoplasmic mRNA noise is in agreement with some experimental observations [28,42,45] and also with intuition since the retention naturally increases the mean number of the nuclear pre-mRNAs but decreases the mean number of the cytoplasmic mRNAs. Second, we compare our theoretical predictions with experimental results [28,45]. Specifically, we use previously published experimental data for two yeast genes, RBP2 and MDN1 [28,45] to calculate the cytoplasmic mRNA Fano factors. Parameter k1 is set as k10.29±0.013/min, which is based on experimental data [28] and the degradation rates of the cytoplasmic mRNAs for RBP2 and MDN1 are set according to dc=ln2/ln2t1/2t1/2, where t1/2 is an experimental mRNA half-life. Then, we can find that the results on the Fano factors of genes RBP2 and MDN1 are well in agreement with the experimental data [45].

    At the whole genome scale, about 70% mRNAs in the nucleus are transported to the cytoplasm whereas about 30% mRNAs are retained in the nucleus [26]. This fact implies that the changing range of remaining probability is moderate or small. In addition, the nuclear export rate of a different gene is in general different. If this rate is not too large, then following the increase of remaining probability, the increase in the cytoplasmic mRNA noise is inevitable. This result indirectly interprets the reason why the noise at the protein level is quite large as shown in previous studies of gene expression [46].

    Finally, for some genes, the relative changing ranges of remaining probability and nuclear export rate may be large at the transcription level. In this case, it is in theory sufficient that adjusting one of nuclear export rate and remaining probability can fine-tune the cytoplasmic mRNA noise if the mean burst size is fixed, but differences would exist between theoretical and experimental results since NRE and RNR occur simultaneously in gene expression and are functionally cooperative. In addition, since biological regulation may be different from the theoretical assumption made here, the nuclear or cytoplasmic mRNA noise predicted in theory may be overestimated.

    This work was partially supported by National Nature Science Foundation of China (11931019), and Key-Area Research and Development Program of Guangzhou (202007030004).

    All authors declare no conflicts of interest in this paper.

    First, the chemical master equation for the constructed Markov reaction network reads

    P(m;t)t=(m1i=0αiEi1I)[K1(m)P(m;t)]+(E1E12I)[K2(m)P(m;t)] +(E1E13I)[K3(m)P(m;t)]+j{4,5}{(EjI)[Kj(m)P(m;t)]}, (A1)

    Second, the stead state or equilibrium of the system described by Eq (6) in the main text, denoted by xS=(xS1,xS2,xS3)T, can be obtained by solving the algebraic equation group

    BK1(xS)K2(xS)K3(xS)=0K2(xS)K4(xS)=0K3(xS)K5(xS)=0 (A2)

    Then, we perform the Ω-expansions [47] to derive a Lyapunov matrix equation for covariance matrix between Mi and Mj with i,j=1,2,3, i.e., for matrix Σ=((MiMi)T(MjMj))(σij). Note that

    Ki(n)=Ki(x+Ω1/122z)=Ki(x)+Ω1/122j=p,c,rzjKi(x)xj+o(Ω1),i=1,2,3 (A3a)
    m1i=0αiEi1I=Ω1/122m1i=0iαiz1+12Ω1m1i=0i2αi2z21+o(Ω3/322), (A3b)
    E1E12I=Ω1/122(z2z1)+12Ω1(2z21+2z2222z1z2)+o(Ω3/322), (A3c)
    E1E13I=Ω1/122(z3z1)+12Ω1(2z21+2z2322z1z3)+o(Ω3/322), (A3d)
    EjI=Ω1/122zj+12Ω12z2j+o(Ω3/322),j=2,3. (A3e)

    Hereafter o(y) represents the infinitestmal quantity of the same order as y0. We denote by Π(z;t) the probability density function for new random variable z. Then, the relationship between variables P(m;t) and Π(z;t) is

    P(m;t)t=Π(z;t)tΩ1/122i=1,2,3dxidtΠ(z;t)zi. (A4)

    By substituting Eqs (A3) and (A4) into Eq (A1) and comparing the coefficients of Ω1/122, we have

    i=1,2,3dxidtΠ(z;t)zi=BK1(x)Π(z;t)z1+K2(x)(Π(z;t)z2Π(z;t)z1) +K3(x)(Π(z;t)z3Π(z;t)z1)+j{2,3}KjΠ(z;t)zj, (A5)

    which naturally holds due to Eq (6) in the main text, where B=m1i=0iαi is the mean burst size. Comparing the coefficients of Ω0, we have

    Π(z;t)t=Bj=1,2,3K1(x)xj[zjΠ(z;t)]z1j=1,2,3K2(x)xj([zjΠ(z;t)]z2[zjΠ(z;t)]z1)j=1,2,3K3(x)xj([zjΠ(z;t)]z3[zjΠ(z;t)]z1)j{2,3}k=1,2,3Kj(x)xk[zkΠ(z;t)]zj+12B2K1(x)2Π(z;t)z21+12K2(x)(2Π(z;t)z21+2Π(z;t)z2222Π(z;t)z1z2)+12K3(x)(2Π(z;t)z21+2Π(z;t)z2322Π(z;t)z1z3)+12j{2,3}Kj2Π(z;t)z2j (A6)

    where B2=m1i=0i2αi is the second moment of burst size. Since Ki(x) is independent of z, Eq (A6) can be rewritten as

    Π(z;t)t=i,j{1,2,3}Aij[zjΠ(z;t)]zi+12i,j{1,2,3}Dij2Π(z;t)zizj, (A7)

    where the elements of matrix A=(Aij) take the form

    A=(Aij)=(BK1x1K2x1K3x1BK1x2K2x2K3x2BK1x3K2x3K3x3K2x1K4x1K2x2K4x2K2x3K4x3K3x1K5x1K3x2K5x2K3x3K5x3) (A8a)

    and matrix D=(Dij) takes the form

    D=(B2K1+K2+K3K2K3K2K2+K40K30K3+K5). (A8b)

    If we consider the stationary equation of Eq (A7), denote by AS and DS the corresponding matrices.

    Third, the steady-state Fokker Planck equation allows a solution of the following form

    Π(z)=1(2π)3det(ΣS)exp(12zTΣ1Sz). (A9)

    Here, matrix ΣS=((MxS)T(MxS))(σij) (covariance matrix) is determined by solving the following Lyapunov matrix equation

    ASΣS+ΣSATS+DS=0. (A10)

    Note that the diagonal elements of matrix ΣS are just the variances of the state variables, and the vector of the mean concentrations of the reactive species is given approximately by MxS. Eq (A10) is an extension of the linear noise approximation in the Markov case [48].

    In this case, we can show that effective transition rates are given by K1(x)=(x1kc+x1kr+x2d+x3d)(k0)L0(x1kc+x1kr+x2d+x3d+k0)L0(k0)L0, K2(x)=x1kc, K3(x)=x1kr, K4(x)=x2d, and K5(x)=x3d, where x=(x1,x2,x3)T. Thus, accoridng to Eq (A2), we know that the steady state is given by

    xs1=ak0kc+kr,xs2=kcdxs1,xs3=krdxs1, (B1)

    where a=(1+2B)1/1L0L012. Note that

    K1(x)x1=(kc+kr)(k0)L0[k0(1+2a)]L0(k0)L02ak0L0(kc+kr)(k0)L0[k0(1+2a)]L01{[k0(1+2a)]L0(k0)L0}2

    Therefore,

    K1(x)x1|x=xS=K1(xS)x1=(kc+kr)2BL0[(1+2B)(1+2B)(L01)/(L01)L0L0]4B2. (B2a)

    Completely similarly, we have

    K1(x)x2=K1(x)x3=d2BL0[(1+2B)(1+2B)(L01)/(L01)L0L0]4B2. (B2b)

    Thus, matrix AS reduces to

    A=(Aij)=((bB1)(kc+kr)bdBbdBkcd0kr0d), (B3)

    where b=14B2[L0+2(L01)BL0(1+2B)(L01)/(L01)L0L0]. Meanwhile, the matrix DS in Eq (A10) becomes

    DS=˜k0(γ(kc+kr)kckrkc2kc0kr02kr), (B4)

    where ˜k0=ak0kc+kr and γ=B2+BB. We can directly derive the following relationships from Eq (A10):

    (bB1)(kc+kr)σ11+bdB(σ12+σ13)=˜k02γ(kc+kr) (B5a)
    (bB1)(kc+kr)σ12+bdBσ22+bdBσ23+kcσ11dσ12=˜k0kc (B5b)
    (bB1)(kc+kr)σ13+bdBσ23+bdBσ33+krσ11dσ13=˜k0kr (B5c)

    and obtain the following relationships

    σ12=dkcσ22˜k0, σ13=dkrσ33˜k0 and σ23=12(krkcσ22+kckrσ33)kc+kr2d˜k0.

    Substituting these relationships into Eq (B4a)–Eq (B4c) yields

    (bB1)(kc+kr)σ11+bdB(dkcσ22+dkrσ33)=2bdB˜k0˜k02γ(kc+kr), (B6a)
    kcdσ11+(2bB1+3bB22krkcdkc)σ22+bB2kckrσ33=kcd˜k0+[3bB22kc+krd1]˜k0, (B6b)
    krdσ11+bB2krkcσ22+(2bB1+3bB22kckrdkr)σ33=krd˜k0+[3bB22kc+krd1]˜k0. (B6c)

    The combination of Eq (B6a), (B6b) gives

    σ22=(krkc)2σ33+kckrkc+krd˜k0, or σ33=(kckr)2(σ22kc+krdkckr˜k0). (B7a)

    The sum of Eq (B6b), (B6c) gives

    kc+krdσ11+[(2bB1)kc+krd1](dkcσ22+dkrσ33)=[3bB12kc+krd1]˜k0. (B7b)

    The combination of Eq (B7b) and (B6a) yields

    dkcσ22+dkrσ33=1bB+12[(bB1)2γ]kc+krd1+(bB1)(2bB1)kc+krd (B7c)

    By substituting this equation into Eq (B7a), we finally obtain

    σ22=˜k0˜kc˜k3r˜k3c+˜k3r{2+˜kc˜kr+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}, (B8a)

    and further

    σ33=˜k0˜kr˜k3c˜k3c+˜k3r{1˜k3c˜k3r˜k4c˜k4r+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}. (B8b)

    where ˜b=bB=14B[L0+2(L01)BL0(1+2B)(L01)/(L01)L0L0]>0 with b<0, ˜kc=kcd, ˜kr=krd, γ=B2+BB.

    Thus, the cytoplasmic mRNA noise is given by

    ηc=σ22M22=1˜k0˜kc˜k3r˜k3c+˜k3r{1+˜kc+˜kr˜kr+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}, (B9a)

    and the nuclear mRNA noise in the nucleus by

    ηr=σ33M32=1˜k0˜kr˜k3c˜k3c+˜k3r{1˜k3c˜k3r˜k4c˜k4r+122˜b+[γ(1+˜b)2](˜kc+˜kr)1+(1+˜b)(1+2˜b)(˜kc+˜kr)}. (B9b)

    In this case, we can show that five effect transition rates take the forms: K1(x)=k0, K2(x)=(k0+x1kr+dx2+dx3)(x1kc)Lc(k0+x1kc+x1kr+dx2+dx3)Lc(x1kc)Lc, K3(x)=x1kr, K4(x)=dx2, and K5(x)=dx3. In order to derive analytical results, we assume that remaining probability is so small that pr0, implying kr=0, kc=k1 and K3(x)=0. By solving the steady-state deterministic equation

    {BK1K2K3=0K2K4=0K3K5=0, (C1)

    we obtain the analytical expression of steady state (xS) given by

    xS1=k0(1+B)B1/1LcLck1[(1+2B)1/1LcLcB1/1LcLc], xS2=k0Bd and xS3=0. (C2)

    Note that the elements of Jacob matrix in the linear noise approximation reduce to

    a11=K2(xS)x1, a12=K2(xS)x2, a13=K2(xS)x3, a21=K2(xS)x1, a22=K2(xS)x2d, a23=K2(xS)x3, a31=0, a32=0, and a33=d. Defferentiating function K2(x) with regard to x1 yields

    K2(x)x1=Lc(k0+x2d)(k1)Lc(x1)Lc1(k0+x1k1+x2d)Lc(x1k1)LcLc(k0+x2d)(x1k1)Lc[k1(k0+x1k1+x2d)Lc1(k1)Lc(x1)Lc1][(k0+x1k1+x2d)Lc(x1k1)Lc]2.

    Therefore,

    K2(x)x1|x=xS=K2(xS)x1=Lck0Bx1[1+2B1+BB1+B(1+2BB)(Lc1)/(Lc1)LcLc]. (C3)

    Completely similarly, we have

    K2(x)x2=K2(x)x3=dB1+BdLcB21+Bk0k1x1(1+2BB)(Lc1)/(Lc1)LcLc. (C4)

    Furthermore, the Jacob matrix becomes

    As=(a11a12a12a11a12da1200d), (C5)

    where a11 = K2(xS)x1 and a12 = K2(xS)x2 are given by Eqs (C3) and (C4).

    Meanwhile, the matrix Ds in the linear noise approximation is given by

    Ds=(k0B2+k0BK20K22K20000). (C6)

    It follows from matrix equation ASΣS+ΣSATS+DS=0 that

    σ11=k0B2+k0B2a11a12a211K2+a12(a12+d)a211σ22, (C7a)
    σ22=k0a11(B2+B)+2dB2d(a12+da11). (C7b)

    Substituting the expressions of a11 and a12 into Eq (C7b) yields

    σ22=k0B2dLck1ω1+2B1+ω+B(B2+B)+2dd1+B+LcB(1+2B)1+ω+B(d1+B+k1ω), (C8)

    where ω=(1+B)B1/1LcLc(1+2B)1/1LcLcB1/1LcLc.



    [1] J. Peccoud, B. Ycart, Markovian modelling of gene product synthesis, Theor. Popul. Biol., 48 (1995), 222-234. https://doi.org/10.1006/tpbi.1995.1027 doi: 10.1006/tpbi.1995.1027
    [2] V. Shahrezaei, P. S. Swain, Analytical distributions for stochastic gene expression, Proc. Natl. Acad. Sci. U.S.A., 105 (2008), 17256-17261. https://doi.org/10.1073/pnas.0803850105 doi: 10.1073/pnas.0803850105
    [3] J. Paulsson, Models of stochastic gene expression, Phys. Life Rev., 2 (2005), 157-175. https://doi.org/10.1016/j.plrev.2005.03.003 doi: 10.1016/j.plrev.2005.03.003
    [4] T. B. Kepler, T. C. Elston, Stochasticity in transcriptional regulation: Origins, consequences, and mathematical representations, Biophys. J., 81 (2001), 3116-3036. https://doi.org/10.1016/S0006-3495(01)75949-8
    [5] N. Friedman, L. Cai, X. S. Xie, Linking stochastic dynamics to population-distribution: An analytical framework of gene expression, Phys. Rev. Lett., 97 (2006), 168302. https://doi.org/10.1103/PhysRevLett.97.168302 doi: 10.1103/PhysRevLett.97.168302
    [6] J. J. Zhang, T. S. Zhou, Promoter architecture-mediated transcriptional dynamics, Biophys. J., 106 (2014), 479-488. https://doi.org/10.1016/j.bpj.2013.12.011 doi: 10.1016/j.bpj.2013.12.011
    [7] J. J. Zhang, T. S. Zhou, Markovian approaches to modeling intracellular reaction processes with molecular memory, Proc. Natl. Acad. Sci. U.S.A., 116 (2019), 23542-23550. https://doi.org/10.1073/pnas.1913926116 doi: 10.1073/pnas.1913926116
    [8] T. Jia, R. V. Kulkarni, Intrinsic noise in stochastic models of gene expression with molecular memory and bursting, Phys. Rev. Lett., 106 (2011), 058102. https://doi.org/10.1103/PhysRevLett.106.058102 doi: 10.1103/PhysRevLett.106.058102
    [9] S. Choubey, Nascent RNA kinetics: Transient and steady state behavior of models of transcription, Phys. Rev. E., 97 (2018), 022402. https://doi.org/10.1103/PhysRevE.97.022402 doi: 10.1103/PhysRevE.97.022402
    [10] B. Zoller, S. C. Little, T. Gregor, Diverse spatial expression patterns emerge from unified kinetics of transcriptional bursting, Cell, 175 (2018), 835-847. https://doi.org/10.1016/j.cell.2018.09.056 doi: 10.1016/j.cell.2018.09.056
    [11] M. Z. Ali1, S. Choubey, Decoding the grammar of transcriptional regulation from RNA polymerase measurements: models and their applications, Phys. Biol., 16 (2019), 061001. https://doi.org/10.1088/1478-3975/ab45bf doi: 10.1088/1478-3975/ab45bf
    [12] T. B. Kallehauge, M. C. Robert, E. Bertrand, T. H. Jensen, Nuclear retention prevents premature cytoplasmic appearance of mRNA, Mol. Cell., 48 (2012), 145-152. https://doi.org/10.1016/j.molcel.2012.07.022 doi: 10.1016/j.molcel.2012.07.022
    [13] C. Marchive, F. Roudier, L. Castaings, V. Brehaut, E. Blondet, V. Colot, et al., Nuclear retention of the transcription factor NLP7 orchestrates the early response to nitrate in plants, Nat. Commun., 4 (2013), 1713. https://doi.org/10.1038/ncomms2650 doi: 10.1038/ncomms2650
    [14] Q. L. Zheng, J. Hou, Y. Zhou, Z. Y. Li, X. T. Cao, The RNA helicase DDX46 inhibits innate immunity by entrapping m6A-demethylated antiviral transcripts in the nucleus, Nat. Immunol., 18 (2017), 1094-1103. https://doi.org/10.1038/ni.3830 doi: 10.1038/ni.3830
    [15] A. Singh, P. Bokes, Consequences of mRNA transport on stochastic variability in protein levels, Biophys. J., 103 (2012), 1087-1096. https://doi.org/10.1016/j.bpj.2012.07.015 doi: 10.1016/j.bpj.2012.07.015
    [16] A. Mor, S. Suliman, R. Ben-Yishay, S. Yunger, Y. Brody, Y. Shav-Tal, Dynamics of single mRNP nucleocytoplasmic transport and export through the nuclear pore in living cells, Nat. Cell. Biol., 12 (2010), 543-552. https://doi.org/10.1038/ncb2056 doi: 10.1038/ncb2056
    [17] M. Oeffinger, D. Zenklusen, To the pore and through the pore: a story of mRNA export kinetics, Biochim. Biophys. Acta., 1819 (2012), 494-506. https://doi.org/10.1016/j.bbagrm.2012.02.011 doi: 10.1016/j.bbagrm.2012.02.011
    [18] V. O. Wickramasinghe, R. A. Laskey, Control of mammalian gene expression by selective mRNA export, Nat. Rev. Mol. Cell. Biol., 16 (2015), 431-442. https://doi.org/10.1038/nrm4010 doi: 10.1038/nrm4010
    [19] T. Chen, B. van Steensel, Comprehensive analysis of nucleocytoplasmic dynamics of mRNA in Drosophila cells, PLoS Genet., 13 (2017), e1006929. https://doi.org/10.1371/journal.pgen.1006929 doi: 10.1371/journal.pgen.1006929
    [20] N. Battich, T. Stoeger, L. Pelkmans, Control of transcript variability in single mammalian cells, Cell, 163 (2015), 1596-1610. https://doi.org/10.1016/j.cell.2015.11.018 doi: 10.1016/j.cell.2015.11.018
    [21] G. L. Hager, C. Elbi, T. A. Johnson, T. Voss, A. K. Nagaich, R. L. Schiltz et al., Chromatin dynamics and the evolution of alternate promoter states. Chromosome Res., 14 (2006), 107-116. https://doi.org/10.1007/s10577-006-1030-0
    [22] R. Metivier, G. Reid, F. Gannon, Transcription in four dimensions: nuclear receptor-directed Transcription in four dimensions, EMBO Rep., 7 (2006), 161-167. https://doi.org/10.1038/sj.embor.7400626 doi: 10.1038/sj.embor.7400626
    [23] C. V. Harper, B. Finkenstädt, M. R. White, Dynamic analysis of stochastic transcription cycles, PLoS Biol., 9 (2011), e1000607. https://doi.org/10.1371/journal.pbio.1000607 doi: 10.1371/journal.pbio.1000607
    [24] Q. L. Wang, T. S. Zhou, Dynamical analysis of mCAT2 gene models with CTN-RNA nuclear retention, Phys. Biol., 12 (2015), 016010. https://doi.org/10.1088/1478-3975/12/1/016010 doi: 10.1088/1478-3975/12/1/016010
    [25] H. H. Qiu, B. G. Zhang, T. S. Zhou, Analytical results for a generalized model of bursty gene expression with molecular memory, Phys. Rev. E, 100 (2019), 012128. https://doi.org/10.1103/PhysRevE.100.012128 doi: 10.1103/PhysRevE.100.012128
    [26] K. B. Halpern, I. Caspi, D. Lemze, M. Levy, S. Landen, E. Elinav, et al., Nuclear retention of mRNA in mammalian tissues, Cell. Rep., 13 (2015), 2653-2662. https://doi.org/10.1016/j.celrep.2015.11.036 doi: 10.1016/j.celrep.2015.11.036
    [27] F. Alber, S. Dokudovskaya, L. M. Veenhoff, W. H. Zhang, J. Kipper, D. Devos, et al., The molecular architecture of the nuclear pore complex, Nature, 450 (2007), 695-701. https://doi.org/10.1038/nature06405 doi: 10.1038/nature06405
    [28] S. Choubey, J. Kondev, A. Sanchez, Deciphering transcriptional dynamics in vivo by counting nascent RNA molecules, PLoS Comput. Biol., 11 (2015), e1004345. https://doi.org/10.1371/journal.pcbi.1004345 doi: 10.1371/journal.pcbi.1004345
    [29] H. Xu, S. O. Skinner, A. M. Sokac, I. Golding, Stochastic kinetics of Nascent RNA, Phys. Rev. Lett, 117 (2016), 128101. https://doi.org/10.1103/PhysRevLett.117.128101 doi: 10.1103/PhysRevLett.117.128101
    [30] D. L. Spector, Nuclear domains, J. Cell. Sci., 114 (2001), 2891-2893. https://doi.org/10.1242/jcs.114.16.2891 doi: 10.1242/jcs.114.16.2891
    [31] A. H. Fox, Y. W. Lam, A. K. L. Leung, C. E. Lyon, J. Andersen, M. Mann, et al., Paraspeckles: A novel nuclear domain, Curr. Biol., 12 (2002), 13-25. https://doi.org/10.1016/S0960-9822(01)00632-7 doi: 10.1016/S0960-9822(01)00632-7
    [32] A. I. Lamond, D. L. Spector, Nuclear speckles: A model for nuclear organelles, Nat. Rev. Mol. Cell. Biol., 4 (2003), 605-612. https://doi.org/10.1038/nrm1172 doi: 10.1038/nrm1172
    [33] T. Aquino, M. Dentz, Chemical continuous time random walks, Phys. Rev. Lett, 119 (2017), 230601. https://doi.org/10.1103/PhysRevLett.119.230601 doi: 10.1103/PhysRevLett.119.230601
    [34] B. C. Arnold, Majorization: Here, there and everywhere, Statist. Sci., 22 (2007), 407-413. https://doi.org/10.1214/0883423060000000097 doi: 10.1214/0883423060000000097
    [35] A. David, S. Larry, The least variable phase type distribution is Erlang, Stoch. Models, 3 (1987), 467-473. https://doi.org/10.1080/15326348708807067 doi: 10.1080/15326348708807067
    [36] D. T. Gillespie, A general method for numerically simulating the stochastic time evolution of coupled chemical reactions, J. Comput. Phys, 22 (1976), 403-434. https://doi.org/10.1016/0021-9991(76)90041-3 doi: 10.1016/0021-9991(76)90041-3
    [37] I. Brouwer, T. L. Lenstra, Visualizing transcription: Key to understanding gene expression dynamics, Curr. Opin. Chem. Biol., 51 (2019), 122-129. https://doi.org/10.1016/j.cbpa.2019.05.031 doi: 10.1016/j.cbpa.2019.05.031
    [38] E. M. Wissink, A. Vihervaara, N. D. Tippens, J. T. Lis, Nascent RNA analyses: Tracking transcription and its regulation, Nat. Rev. Genet., 20 (2019), 705-723. https://doi.org/10.1038/s41576-019-0159-6 doi: 10.1038/s41576-019-0159-6
    [39] D. Grün, L. Kester, A. van Oudenaarden, Validation of noise models for single-cell transcriptomics, Nat. Methods., 11 (2014), 637. https://doi.org/10.1038/nmeth.2930 doi: 10.1038/nmeth.2930
    [40] D. R. Larson, R. H. Singer, D. Zenklusen, A single molecule view of gene expression, Cell. Biol., 19 (2009), 630-637. https://doi.org/10.1016/j.tcb.2009.08.008 doi: 10.1016/j.tcb.2009.08.008
    [41] A. E. Hage, S. L. French, A. L. Beyer, D. Tollervey, Loss of Topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis, Genes. Dev., 24 (2010), 1546-1558. https://doi.org/10.1101/gad.573310 doi: 10.1101/gad.573310
    [42] D. Grunwald, R. H. Singer, M. Rout, Nuclear export dynamics of RNA-protein complexes, Nature, 475 (2011), 333-341. https://doi.org/10.1038/nature10318 doi: 10.1038/nature10318
    [43] R. Benyishay, A. J. Ashkenazy, Y. Shav-Tal, Dynamic encounters of genes and transcripts with the nuclear pore, Trends. Genet., 32 (2016), 419-431. https://doi.org/10.1016/j.tig.2016.04.003 doi: 10.1016/j.tig.2016.04.003
    [44] Q. L. Wang, T. S. Zhou, Alternative-splicing-mediated gene expression, Phys. Rev. E., 89 (2014), 012713. https://doi.org/10.1103/PhysRevE.89.012713 doi: 10.1103/PhysRevE.89.012713
    [45] S. J. Gandhi, D. Zenklusen, T. Lionnet, R. H. Singer, Transcription of functionally related constitutive genes is not coordinated, Nat. Struct. Mol. Biol., 18 (2011), 27-34. https://doi.org/10.1038/nsmb.1934 doi: 10.1038/nsmb.1934
    [46] T. Stoeger, N. Battich, L. Pelkmans, Passive noise filtering by cellular compartmentalization, Cell, 164 (2016), 1151-1161. https://doi.org/10.1016/j.cell.2016.02.005 doi: 10.1016/j.cell.2016.02.005
    [47] N. G. van Kampen, Stochastic processes in physics and chemistry, North-Holland, Amsterdam, 2007. https://doi.org/10.1016/B978-044452965-7/50006-4
    [48] J. Elf, M. Ehrenberg, Fast evaluation of fluctuations in biochemical networks with the linear noise approximation, Gen. Res, 13 (2003), 2475-2484. https://doi.org/10.1101/gr.1196503 doi: 10.1101/gr.1196503
  • This article has been cited by:

    1. Changhong Shi, Xiyan Yang, Jiajun Zhang, Tianshou Zhou, Stochastic modeling of the mRNA life process: A generalized master equation, 2023, 122, 00063495, 4023, 10.1016/j.bpj.2023.08.024
    2. Xiaojun Bai, Sizhe Wang, Xin Zhang, Haohua Wang, Molecular-Memory-Induced Counter-Intuitive Noise Attenuator in Protein Polymerization, 2024, 16, 2073-8994, 315, 10.3390/sym16030315
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2334) PDF downloads(102) Cited by(2)

Figures and Tables

Figures(6)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog