Research article

Thermal-induced unfolding-refolding of a nucleocapsid COVN protein

  • Unfolding of a coarse-grained COVN protein from its native configuration shows a linear response with increasing temperature, followed by non-monotonic double peaks in its radius of gyration. The protein conforms to a random coil of folded segments in its native state, with increasingly tenuous and globular structures in specific temperature regimes where the effective dimensions of the corresponding structures are D ≈ 1.6–2.4. Thermal agitation alone is not sufficient to fully eradicate its segmental folding, as a few local folds are found to persist around such residues as 65L, 110Y, 224L, 374P even at high temperatures.

    Citation: Warin Rangubpit, Pornthep Sompornpisut, R.B. Pandey. Thermal-induced unfolding-refolding of a nucleocapsid COVN protein[J]. AIMS Biophysics, 2021, 8(1): 103-110. doi: 10.3934/biophy.2021007



    The COVID-19 pandemic is attracting unprecedented attention [1]–[5] to the coronavirus and its constituents. The coronavirus involves a number of proteins, RNA and a long list of crowded inter- and intra-cellular constituents in its assembly and replication. In an initial investigation, even with a coarse-grained computer simulation model, it is not feasible to consider all constituents that are involved in its assembly and replication. We examine the structural dynamics of a nucleocapsid (COVN) protein [6] consisting of 422 residues, which plays a critical role in packaging the viral genome RNA into the ribonucleocapsid and in virion assembly [7]–[9]. For the sake of simplicity, and to develop a clear understanding of the basic nature of the conformational evolution, we examine the structural response of a free COVN as a function of temperature before systematically including different types of proteins, solute, solvent etc. of the underlying host space.

    ‘Protein folding’ [10],[11] remains an open problem despite enormous efforts for over half a century. Because of the enormity of the challenges (e.g. the time scale for huge degrees of freedom with all-atom approaches), coarse-graining [12]–[17] remains a viable choice to gain insight into the fundamental mechanism of conformational dynamics. Using a simplified yet efficient and effective coarse-grained model [18],[19], a large-scale Monte Carlo simulation is performed to study the thermal response of COVN. Our coarse-grained model has already been used to investigate the structural dynamics of such proteins as histones critical in the assembly of chromatin [20], lysozyme [21] and alpha-synuclein [22] key in amyloid, the protein VP40 of Ebola virus [23], membrane proteins [18],[19] for selective transport, etc. COVN is represented by a chain of 422 residues in a specific sequence on a cubic lattice [18],[19]. Each residue interacts with surrounding residues within a range (rc) with a generalized Lennard-Jones potential,

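    $$U_{ij} = \left[\,|\epsilon_{ij}|\left(\frac{\sigma}{r_{ij}}\right)^{12} + \epsilon_{ij}\left(\frac{\sigma}{r_{ij}}\right)^{6}\right], \quad r_{ij} < r_c \tag{1}$$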

    where rij is the distance between the residues at sites i and j; rc = 8 and σ = 1 in units of the lattice constant. A knowledge-based [12]–[17] residue-residue contact matrix (based on a large ensemble of protein structures in the PDB) is used as input for the potential strength ϵij [14] in the phenomenological interaction (1). With excluded volume and limits on the covalent bond length imposed as constraints, each residue performs its stochastic movement with the Metropolis algorithm, i.e. a trial move is accepted with the Boltzmann probability exp(−ΔE/T), where ΔE is the change in energy between the new and old positions. An attempt to move each residue once defines the unit Monte Carlo time step. All quantities are measured in arbitrary units (e.g. spatial length in units of the lattice constant), including the temperature T, which is in reduced units of the Boltzmann constant.
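
    The pair interaction (1) and the acceptance step can be illustrated with a minimal sketch. It is not the authors' simulation code: the function names are hypothetical, the cutoff and σ are the values printed in the text, and the acceptance rule is the standard Metropolis criterion min(1, exp(−ΔE/T)). The lattice moves, excluded volume and bond-length constraints are not modeled here.

```python
import math
import random

SIGMA = 1.0  # sigma in units of the lattice constant, as stated in the text
R_C = 8.0    # interaction range r_c as printed in the text


def pair_energy(eps_ij, r_ij, sigma=SIGMA, r_c=R_C):
    """Generalized Lennard-Jones energy of interaction (1).

    eps_ij is the knowledge-based contact strength for the residue pair
    (negative for attractive pairs). The energy is zero at or beyond r_c.
    r_ij must be positive; excluded volume rules out r_ij = 0.
    """
    if r_ij >= r_c:
        return 0.0
    s6 = (sigma / r_ij) ** 6
    return abs(eps_ij) * s6 * s6 + eps_ij * s6


def metropolis_accept(delta_e, temperature, rng=random):
    """Standard Metropolis rule: accept with probability min(1, exp(-dE/T))."""
    if delta_e <= 0.0:
        return True
    return rng.random() < math.exp(-delta_e / temperature)
```

    With an attractive pair (ϵij < 0) the repulsive and attractive terms cancel at rij = σ, and the energy is negative at larger separations up to the cutoff; with ϵij > 0 both terms are repulsive.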

    Simulations are performed on a 550³ lattice for a sufficiently long time (10⁷ time steps) with a number of independent samples (100–1000) over a wide range of temperatures. Different sample sizes are also used to verify the reliability of the qualitative trends in the data presented here. A number of local and global physical quantities such as the radius of gyration, root mean square displacement of the center of mass, structure factor, contact map, etc. are examined as a function of temperature. The conformation of the protein exhibits a monotonic response from a random coil of folded (globular) segments in the native phase to tenuous fibrous conformations on raising the temperature; on further heating it exhibits a non-monotonic response with a re-entrant conformation involving enhanced globularity before reaching a steady-state conformation. While most segmental folds disappear in the denatured phase, some persist even at very high temperatures (see below).

    Before presenting our data, it is worth pointing out the justification of our model in the context of investigating proteins associated with the coronavirus, which has only four structural proteins, of which the envelope protein CoVE is the smallest with 76 residues. The primary and secondary structures of CoVE have been shown to have three domains (see Figure 1 of Schoeman and Fielding [24] and references therein), with the N- and C-terminals separated by the transmembrane segment. These domains are faithfully identified and reproduced from the contact profiles [25] generated by the coarse-grained model used here. COVN is a relatively large protein, as pointed out above. Chang et al. [7] have identified N- (residues 45–181) and C-terminal (residues 248–365) domains of COVN that can bind to nucleic acids, i.e. RNA. Thermal modulation of the contact profiles of COVN generated by the same coarse-grained model exhibits an evolution in segmental assembly that may be consistent with the responsiveness of the two regions (see below). Although it would be difficult to guarantee the results of a model for a quantitative comparison with laboratory observations, it appears that our coarse-grained model does capture some of the basic features of the proteins we have investigated so far.

    Figure 1.  Variation of the average radius of gyration (Rg) with the temperature. Some snapshots (at the time step t = 10⁷) are included at representative temperatures: (i) T = 0.0100, (ii) T = 0.0140, (iii) T = 0.0150, (iv) T = 0.0200, (v) T = 0.0230 (first maximum), (vi) T = 0.0240 (minimum), (vii) T = 0.0268 (second maximum), (viii) T = 0.0320. The size of the self-organized segmental assembly represents the degree of globularization. In the snapshots, gold spheres represent residues in contact, the large black sphere is the first residue 1M and the large grey sphere is the last residue 422A (see Figure S1).

    Figure 1 shows the variation of the average radius of gyration (Rg) with the temperature. At low temperatures (T = 0.010–0.015), the radius of gyration remains almost constant at its lowest magnitude (Rg ~ 22.5) in the native phase. Unlike many proteins (globular in the native phase), COVN appears to be expanded into a random coil (see below), a signature of an intrinsically disordered [7] protein. Raising the temperature (T = 0.015–0.023) leads to a monotonic increase to its maximum Rg ~ 54.64 ± 2.60 at T = 0.0230. On further heating, the radius of gyration decreases sharply in a narrow range of temperature (T = 0.023–0.025) to a minimum value (Rg ~ 38.17 ± 1.72) at T = 0.0246 before it begins to increase with the temperature (T = 0.0250–0.0268) again until it reaches a second maximum (Rg ~ 51.00 ± 2.24) at T = 0.0268. Beyond the second peak, the radius of gyration continues to decay slowly towards its saturation with the temperature in the denatured phase (Rg ~ 41.4 ± 2.17 at T = 0.032, Rg ~ 38.23 ± 1.96 at T = 0.050). Note that this trend is clear despite a relatively large fluctuation in the data. We are not aware of such a non-linear thermal response in such proteins. We believe this is due to the unique structure of COVN.
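
    For reference, the radius of gyration of a single conformation and its average over independent samples can be computed as in the sketch below. The function names are hypothetical, and reporting the spread as a standard error of the mean is an assumption; the text does not state how the ± values were obtained.

```python
import math


def radius_of_gyration(positions):
    """Root-mean-square distance of the residues from their center of mass.

    positions is a sequence of (x, y, z) residue coordinates in units of
    the lattice constant; all residues are given equal weight.
    """
    n = len(positions)
    center = [sum(p[k] for p in positions) / n for k in range(3)]
    mean_sq = sum(
        sum((p[k] - center[k]) ** 2 for k in range(3)) for p in positions
    ) / n
    return math.sqrt(mean_sq)


def mean_and_std_error(values):
    """Sample mean and standard error of the mean (assumed error measure)."""
    n = len(values)
    mean = sum(values) / n
    if n < 2:
        return mean, 0.0
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)
```

    In use, radius_of_gyration would be evaluated for the final conformation of each independent sample at a given temperature, and mean_and_std_error applied to the resulting list.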

    Representative snapshots (Figure 1, see also Figure S1) of the protein at selected temperatures show the variations in the nature of the self-organizing structures over the range of temperature. For example, in the native phase (T = 0.010, 0.014) we see local segmental folding with a chain of folded blobs in a random-coil-like conformation (see below), in contrast to the global folding one generally expects. Local folds begin to disappear at high temperatures but still persist in smaller sizes. Segmental folds appear to be distributed along the entire protein backbone at both maxima and at high temperatures in the denatured phase, while the segmental folds at the minimum and in the native phase are localized.

    Figure 2.  Structure factor S(q) versus wavelength λ comparable to the radius of gyration of COVN on a log-log scale at representative temperatures.

    How to quantify the distribution of residues over length scales? To assess the mass (distribution), we have analyzed the structure factor S(q) defined as,

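    $$S(q) = \left\langle \frac{1}{N}\left|\sum_{j=1}^{N} e^{-i\,\mathbf{q}\cdot\mathbf{r}_j}\right|^{2} \right\rangle_{|q|}$$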

    where rj is the position of each residue and |q| = 2π/λ is the magnitude of the wave vector of wavelength λ. Using a power-law scaling S(q) ∝ q^(−1/γ), one may be able to evaluate the power-law exponent γ and estimate the spread of residues over the length scale λ. The overall size of the protein chain is described by its radius of gyration (Rg). Therefore, the structure factor over the length scale comparable to the protein size (λ ~ Rg) can provide an estimate of the effective dimension D of the protein conformation via the scaling of the number of residues (N), N ∝ λ^D, where D = 1/γ. Variations of S(q) with the wavelength λ comparable to the radius of gyration of the protein over the entire range of representative temperatures are presented in Figure 2.

    In the native phase (T = 0.0150), where the radius of gyration is at its minimum (Rg ~ 22.5), the effective dimension D ~ 2.053 shows that the overall spread is not globular; it is rather a random coil, a chain of segmental globules (see Figure 1). In the unfolding-transition regime (T = 0.020), the effective dimension decreases to D ~ 1.726 while the protein retains its partial folding towards the C-terminal (see below). A continued increase in temperature leads to maximum unfolding (T = 0.0230), where the protein chain stretches to its maximum radius of gyration (Rg ~ 55) with the lowest effective dimension D ~ 1.579 and a couple of unfolded segments (see below). Further heating leads to contraction to a lower radius of gyration (Rg ~ 38, T = 0.0246) with a higher effective dimension D ~ 2.389, which indicates a more compact conformation than that in the native phase, i.e. a thermal-induced folding. The effective dimension begins to decrease on increasing the temperature further as the protein conformation approaches a tenuous structure, i.e. D ~ 1.579 at T = 0.036.

    Figure 3.  Average number (Nr) of residues in contact along the backbone of COVN as a function of temperature. The top figure shows the contacts at representative temperatures in the native phase (T = 0.015), at the first (maximum) peak of the radius of gyration (T = 0.0230), and in a highly denatured phase (T = 0.0320). These regions are marked in the center three-dimensional figure with the scale at the upper right corner. The right figure shows the thermal response of the contact profile of specific centers of folding.

    Let us look closer into the local structures by examining the contact map in depth, as presented in Figure 3 (see also Figure S2). First, we notice that the number of residues (Nr) within the range of interaction of each residue along the backbone is higher at lower temperatures. However, the distribution of Nr is highly heterogeneous and concentrated towards specific segments (65L, 110Y, 224L, 257K, 370K, 374K). The degree of folding appears to be significant at these globularization centers (in particular segment 367T-380A) even at higher temperatures, although it is highest in the native-to-denatured transition region (see also Figure 1). In general, the modulation of the contact profiles shows the evolution in segmental assembly [Figure S2] that may be consistent with the responsiveness of the N- and C-terminal domains [7]. The thermal response of the contact profile of each center of folding appears similar except for 65L, which exhibits a non-linear (somewhat oscillatory) response (see the right section of Figure 3). However, it is worth pointing out that the response of the contact profile of 65L resembles the thermal response of the radius of gyration. Despite the lowest magnitude of contacts (Nr) of 65L with respect to other globularization centers, e.g. 224L, its unusual variation with the temperature (Figure 3) may induce the global response in the radius of gyration (Figure 1).
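
    The contact profile can be sketched as follows, assuming that a residue counts as being in contact when it lies within the interaction range (rij < rc) of another residue. The function names are hypothetical and the default cutoff is the value printed in the text.

```python
import math


def contact_profile(positions, r_c=8.0):
    """Number of residues N_r within the range r_c of each residue.

    positions is a sequence of (x, y, z) residue coordinates ordered along
    the backbone; the returned list has one count per residue.
    """
    n = len(positions)
    counts = [0] * n
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(positions[i], positions[j]) < r_c:
                counts[i] += 1
                counts[j] += 1
    return counts


def average_profile(profiles):
    """Average the per-residue counts over independent samples."""
    n_samples = len(profiles)
    return [sum(column) / n_samples for column in zip(*profiles)]
```

    Peaks in the averaged profile along the residue index would then mark candidate globularization centers at a given temperature.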

    Thus, the thermal response of the COVN protein is non-linear, from a random coil of folded blobs in the native phase to a systematic unfolding, refolding, and unfolding as the protein denatures on increasing the temperature. The radius of gyration increases on raising the temperature, first monotonically from a minimum in its native state to a maximum value. Further heating leads to a sharp decline (the protein contracts) in a narrow temperature range, followed by an increase (the protein expands) again to a second maximum, with a local minimum in between. The radius of gyration at the local minimum is larger than that in the native state, but the segmental globularization is localized towards the second half (C-terminal) while the first half (N-terminal) of the protein acquires a fibrous configuration. Continued heating causes COVN to approach a steady-state value with a small contraction rate.

    Scaling analysis of the structure factor is critical in quantifying the overall spread of COVN by evaluating its effective dimension D: D ~ 2.053 (T = 0.0150, native phase), D ~ 1.716 (T = 0.0200, intermediate denatured phase), D ~ 1.579 (T = 0.0230, first maximum), D ~ 2.389 (T = 0.0246, local minimum), D ~ 1.651 (T = 0.0268, second maximum), D ~ 1.726 (T = 0.0360, denatured). These estimates are consistent with the thermal response of the radius of gyration. Active zones of folded segments are identified from a detailed analysis of the contact map profile, where the degree of folding can be quantified from the average contact measures. Segmental denaturing around residues such as 65W, 110Y, 224L, and 374P by techniques other than thermal agitation may eradicate the specific functionality of COVN.


    Acknowledgments



    Support from the Chulalongkorn University Dusadi Phipat scholarship award to Warin Rangubpit has been instrumental for her visit to University of Southern Mississippi. We thank Brian Olson for helping with computer support. The authors acknowledge HPC at the University of Southern Mississippi supported by the National Science Foundation under the Major Research Instrumentation (MRI) program via Grant # ACI 1626217.

    Conflict of interest



    The authors declare no conflict of interest.

    Author contributions



    All authors participated in discussion and agreed to investigate the protein folding of COVN. RBP proposed to use coarse-grain model in consultation with PS and WR. RBP and WR generated data. WR analyzed data. PS participated in results and discussion of the data.

    [1] Bzówka M, Mitusinska K, Raczynska A, et al. (2020) Molecular dynamics simulations indicate the COVID-19 mpro is not a viable target for small-molecule inhibitors design. bioRxiv doi.org/10.1101/2020.02.27.968008.
    [2] J Alsaadi EA, Jones IM (2019) Membrane binding proteins of coronaviruses. Future Virol 14: 275-286. doi: 10.2217/fvl-2018-0144
    [3] Wan Y, Shang J, Graham R, et al. (2020) Receptor recognition by the novel coronavirus from Wuhan: an analysis based on decade-long structural studies of SARS coronavirus. J Virol 94: e00127.
    [4] Xu J, Zhao S, Teng T, et al. (2020) Systematic comparison of two animal-to-human transmitted human coronaviruses: SARS-CoV-2 and SARS-CoV. Viruses 12: 244. doi: 10.3390/v12020244
    [5] Ahmed SF, Quadeer AA, McKay MR (2020) Preliminary identification of potential vaccine targets for the COVID-19 coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies. Viruses 12: 254. doi: 10.3390/v12030254
    [6] UniProtKB-P59595 (NCAP_SARS) (2020). Available from: https://www.uniprot.org/uniprot/P59595.
    [7] Chang CK, Hsu YL, Chang YH, et al. (2009) Multiple nucleic acid binding sites and intrinsic disorder of severe acute respiratory syndrome coronavirus nucleocapsid protein: implications for ribonucleocapsid protein packaging. J Virol 83: 2255-2264. doi: 10.1128/JVI.02001-08
    [8] Fang HJ, Chen YZ, Li MS, et al. (2009) Thermostability of the N-terminal RNA-binding domain of the SARS-CoV nucleocapsid protein: Experiments and numerical simulations. Biophys J 96: 1892-1901. doi: 10.1016/j.bpj.2008.10.045
    [9] Chang C, Chen CMM, Chiang M, et al. (2013) Transient oligomerization of the SARS-CoV N protein–implication for virus ribonucleoprotein packaging. PloS One 8: e65045. doi: 10.1371/journal.pone.0065045
    [10] Sorokina I, Mushegian A (2018) Modeling protein folding in vivo. Biol Direct 13: 13. doi: 10.1186/s13062-018-0217-6
    [11] Gershenson A, Gosavi S, Faccioli P, et al. (2020) Successes and challenges in simulating the folding of large proteins. J Biol Chem 295: 15-33. doi: 10.1074/jbc.REV119.006794
    [12] Miyazawa S, Jernigan RL (1985) Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules 18: 534-552. doi: 10.1021/ma00145a039
    [13] Miyazawa S, Jernigan RL (1996) Residue–residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 256: 623-644. doi: 10.1006/jmbi.1996.0114
    [14] Betancourt MR, Thirumalai D (1999) Pair potentials for protein folding: choice of reference states and sensitivity of predicted native states to variations in the interaction schemes. Protein Sci 8: 361-369. doi: 10.1110/ps.8.2.361
    [15] Tanaka S, Scheraga HA (1976) Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 9: 945-950. doi: 10.1021/ma60054a013
    [16] Godzik A (1996) Knowledge-based potentials for protein folding: what can we learn from known protein structures? Structure 4: 363-366. doi: 10.1016/S0969-2126(96)00041-X
    [17] Huang SY, Zou X (2011) Statistical mechanics-based method to extract atomic distance-dependent potentials from protein structures. Proteins: Struct, Funct, Bioinf 79: 2648-2661. doi: 10.1002/prot.23086
    [18] Kitjaruwankul S, Khrutto C, Sompornpisut P, et al. (2016) Asymmetry in structural response of inner and outer transmembrane segments of CorA protein by a coarse-grain model. J Chem Phys 145: 135101. doi: 10.1063/1.4963807
    [19] Boonamnaj P, Paudel SS, Jetsadawisut W, et al. (2019) Thermal-response of a protein (hHv1) by a coarse-grained MC and all-atom MD computer simulations. Phys A 527: 121310. doi: 10.1016/j.physa.2019.121310
    [20] Fritsche M, Pandey RB, Farmer BL, et al. (2013) Variation in structure of a protein (H2AX) with knowledge-based interactions. PLoS One 8: e64507. doi: 10.1371/journal.pone.0064507
    [21] Pandey RB, Farmer BL, Gerstman BS (2015) Self-assembly dynamics for the transition of a globular aggregate to a fibril network of lysozyme proteins via a coarse-grained Monte Carlo simulation. AIP Adv 5: 092502. doi: 10.1063/1.4921074
    [22] Mirau P, Farmer BL, Pandey RB (2015) Structural variation of alpha-synuclein with temperature by a coarse-grained approach with knowledge-based interactions. AIP Adv 5: 092504. doi: 10.1063/1.4927544
    [23] Pokhrel R, Sompornpisut P, Chapagain P, et al. (2018) Domain rearrangement and denaturation in Ebola virus protein VP40. AIP Adv 8: 125129. doi: 10.1063/1.5063474
    [24] Schoeman D, Fielding BC (2019) Coronavirus envelope protein: current knowledge. Virol J 16: 69. doi: 10.1186/s12985-019-1182-0
    [25] Pandey RB (2020) Thermal denaturation of a protein (CoVE) by a coarse-grained Monte Carlo simulation. arXiv preprint arXiv:2009.00049.
  • biophy-08-01-007-s001.pdf
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)