Research article Topical Sections

RosettaTMH: a method for membrane protein structure elucidation combining EPR distance restraints with assembly of transmembrane helices

  • Received: 18 October 2015 Accepted: 17 December 2015 Published: 21 December 2015
  • Membrane proteins make up approximately one third of all proteins, and they play key roles in a plethora of physiological processes. However, membrane proteins make up less than 2% of experimentally determined structures, despite significant advances in structure determination methods, such as X-ray crystallography, nuclear magnetic resonance spectroscopy, and cryo-electron microscopy. One potential alternative means of structure elucidation is to combine computational methods with experimental EPR data. In 2011, Hirst and others introduced RosettaEPR and demonstrated that this approach could be successfully applied to fold soluble proteins. Furthermore, few computational methods for de novo folding of integral membrane proteins have been presented. In this work, we present RosettaTMH, a novel algorithm for structure prediction of helical membrane proteins. A benchmark set of 34 proteins, in which the proteins ranged in size from 91 to 565 residues, was used to compare RosettaTMH to Rosetta’s two existing membrane protein folding protocols: the published RosettaMembrane folding protocol (“MembraneAbinitio”) and folding from an extended chain (“ExtendedChain”). When EPR distance restraints are used, RosettaTMH+EPR outperforms ExtendedChain+EPR for 11 proteins, including the largest six proteins tested. RosettaTMH+EPR is capable of achieving native-like folds for 30 of 34 proteins tested, including receptors and transporters. For example, the average RMSD100SSE relative to the crystal structure for rhodopsin was 6.1 ± 0.4 Å and 6.5 ± 0.6 Å for the 449-residue nitric oxide reductase subunit B, where the standard deviation reflects variance in RMSD100SSE values across ten different EPR distance restraint sets. The addition of RosettaTMH and RosettaTMH+EPR to the Rosetta family of de novo folding methods broadens the scope of helical membrane proteins that can be accurately modeled with this software suite.

    Citation: Stephanie H. DeLuca, Samuel L. DeLuca, Andrew Leaver-Fay, Jens Meiler. RosettaTMH: a method for membrane protein structure elucidation combining EPR distance restraints with assembly of transmembrane helices[J]. AIMS Biophysics, 2016, 3(1): 1-26. doi: 10.3934/biophy.2016.1.1

    Related Papers:

    [1] Elias Maciel, Inocencio Ortiz, Christian E. Schaerer . A Herglotz-based integrator for nonholonomic mechanical systems. Journal of Geometric Mechanics, 2023, 15(1): 287-318. doi: 10.3934/jgm.2023012
    [2] Xavier Rivas, Daniel Torres . Lagrangian–Hamiltonian formalism for cocontact systems. Journal of Geometric Mechanics, 2023, 15(1): 1-26. doi: 10.3934/jgm.2023001
    [3] Jordi Gaset, Arnau Mas . A variational derivation of the field equations of an action-dependent Einstein-Hilbert Lagrangian. Journal of Geometric Mechanics, 2023, 15(1): 357-374. doi: 10.3934/jgm.2023014
    [4] Alexander Pigazzini, Cenap Özel, Saeid Jafari, Richard Pincak, Andrew DeBenedictis . A family of special case sequential warped-product manifolds. Journal of Geometric Mechanics, 2023, 15(1): 116-127. doi: 10.3934/jgm.2023006
    [5] Marcin Zając . The dressing field method in gauge theories - geometric approach. Journal of Geometric Mechanics, 2023, 15(1): 128-146. doi: 10.3934/jgm.2023007
    [6] Robert I McLachlan, Christian Offen . Backward error analysis for conjugate symplectic methods. Journal of Geometric Mechanics, 2023, 15(1): 98-115. doi: 10.3934/jgm.2023005
    [7] Álvaro Rodríguez Abella, Melvin Leok . Discrete Dirac reduction of implicit Lagrangian systems with abelian symmetry groups. Journal of Geometric Mechanics, 2023, 15(1): 319-356. doi: 10.3934/jgm.2023013
    [8] Markus Schlarb . A multi-parameter family of metrics on stiefel manifolds and applications. Journal of Geometric Mechanics, 2023, 15(1): 147-187. doi: 10.3934/jgm.2023008
    [9] Valentin Duruisseaux, Melvin Leok . Time-adaptive Lagrangian variational integrators for accelerated optimization. Journal of Geometric Mechanics, 2023, 15(1): 224-255. doi: 10.3934/jgm.2023010
    [10] Francesco Bonechi, Jian Qiu, Marco Tarlini . Generalised Kähler structure on $ \mathbb{C}P^2 $ and elliptic functions. Journal of Geometric Mechanics, 2023, 15(1): 188-223. doi: 10.3934/jgm.2023009
  • Membrane proteins make up approximately one third of all proteins, and they play key roles in a plethora of physiological processes. However, membrane proteins make up less than 2% of experimentally determined structures, despite significant advances in structure determination methods, such as X-ray crystallography, nuclear magnetic resonance spectroscopy, and cryo-electron microscopy. One potential alternative means of structure elucidation is to combine computational methods with experimental EPR data. In 2011, Hirst and others introduced RosettaEPR and demonstrated that this approach could be successfully applied to fold soluble proteins. Furthermore, few computational methods for de novo folding of integral membrane proteins have been presented. In this work, we present RosettaTMH, a novel algorithm for structure prediction of helical membrane proteins. A benchmark set of 34 proteins, in which the proteins ranged in size from 91 to 565 residues, was used to compare RosettaTMH to Rosetta’s two existing membrane protein folding protocols: the published RosettaMembrane folding protocol (“MembraneAbinitio”) and folding from an extended chain (“ExtendedChain”). When EPR distance restraints are used, RosettaTMH+EPR outperforms ExtendedChain+EPR for 11 proteins, including the largest six proteins tested. RosettaTMH+EPR is capable of achieving native-like folds for 30 of 34 proteins tested, including receptors and transporters. For example, the average RMSD100SSE relative to the crystal structure for rhodopsin was 6.1 ± 0.4 Å and 6.5 ± 0.6 Å for the 449-residue nitric oxide reductase subunit B, where the standard deviation reflects variance in RMSD100SSE values across ten different EPR distance restraint sets. The addition of RosettaTMH and RosettaTMH+EPR to the Rosetta family of de novo folding methods broadens the scope of helical membrane proteins that can be accurately modeled with this software suite.


    An invariant volume is a powerful tool for understanding the asymptotic nature of a dynamical system. In particular, it is a well-known fact that unconstrained mechanical systems are volume-preserving. The case of nonholonomic systems is more nuanced as these systems generally fail to preserve the symplectic form (which follows from the fact that nonholonomic systems are not variational) and hence, the induced volume form. This makes the study of invariant volumes in nonholonomic systems nontrivial and the existence of an invariant volume is an exceptional scenario which should not be expected in general [1,2]. A famous example of this is the Chaplygin sleigh; this system, although energy-preserving, experiences volume "dissipation" which results in asymptotic stability (cf. [3] for a general discussion on stability of nonholonomic systems or [4] for an interpretation via impact systems).

    The existence of an invariant volume for a nonholonomic system offers two key insights. The first is the usual case in dynamical systems where an invariant volume allows for the use of the Birkhoff Ergodic Theorem (cf. e.g. 4.1.2 in [5]), for recurrence (with the caveat of the volume being finite although some results still hold for infinite measures [6]), and as an obstruction to asymptotic stability. The other is unique to nonholonomic systems; even though nonholonomic systems are not Hamiltonian, "nonholonomic systems which do preserve volume are in a quantifiable sense closer to Hamiltonian systems than their volume changing counterparts, " [7] (see also [8,9,10,11,12]). Therefore, being able to find an invariant volume for a nonholonomic system allows for understanding its asymptotic behavior and can provide a way to "Hamiltonize" a nonholonomic system (although most systems with an invariant volume still cannot be Hamiltonized, cf. [13]).

    There exists an abundance of research into finding invariant volumes for nonholonomic systems where the constraints are linear and symmetries are present: Chaplygin systems are studied in, e.g. [11,14,15,16,17,18,19], Euler-Poincaré-Suslov systems are studied in, e.g. [20,21], systems with internal degrees of freedom are studied in, e.g. [20,22,23], and [24] studies the case of symmetric kinetic systems where the dimension assumption does not hold. Related work on asympotic dynamics may be found in [25]. There are also results demonstrating that no invariant volumes exist, e.g. [26,27].

    The contribution of this work is to consider conditions for the existence of invariant volume in nonholonomic systems such that

    1. the constraints need not be linear/affine in the velocities,

    2. the analysis can be carried out on the ambient manifold rather than resorting to local coordinates, and

    3. absolutely no symmetry assumptions are used.

    Such an approach seems to be new.

    Our main result is an existence condition for an invariant measure for a hyper-regular Lagrangian system with an admissible set of nonlinear nonholonomic constraints obeying Chetaev's rule. In the linear case the existence of a basic invariant density reduces to checking whether or not a certain 1-form can be made to be exact, [28], and this special case is similar to results in [15,24].

    Preliminaries on nonholonomic systems are presented in Section 2. Section 3 presents the construction of the "extended nonholonomic vector fields" on the entire cotangent space $ T^*Q $ which restricts to the nonholonomic vector field on $ M $. Although there does not exist a canonical choice for the extended vector field, they will agree once restricted. An advantage of this approach is that one is able to design extended vector fields which ensure that the constraint manifold is an exponentially stable invariant manifold. The divergence calculation for a nonholonomic system is performed in Section 4 via extended nonholonomic vector fields. The main results, Theorem 5.2 and Theorem 5.6, are proved in Section 5 (cf. Theorems 5.2 and 5.6). Additionally, it is shown that the invariant volume does not depend on the choice of extended nonholonomic vector field chosen and is unique up to a constant of motion. Section 6 presents an interpretation of the linear constraint case to the torsion of the nonholonomic connection, which seems to be a novel observation. This paper concludes with examples in Section 7.

    This paper is a continuation of the work done in [29] and, as such, many of the results below can be found there.

    We will first briefly cover the case of unconstrained mechanical systems before discussing nonholonomic systems. A smooth (finite-dimensional, $ \mathrm{dim}(Q) = n $) manifold $ Q $ is called the configuration space, the tangent bundle $ TQ $ is called the state space, and the cotangent bundle $ T^*Q $ is called the phase space. The bundle projection maps will be denoted by $ \tau_Q:TQ\to Q $ and $ \pi_Q:T^*Q\to Q $.

    Lagrangian systems are described by a smooth real-valued function on the state space called the Lagrangian, $ L:TQ\to Q $. The dynamics generated from the Lagrangian function arise from Hamilton's principle and are given by the Euler-Lagrange equations,

    $ ddtL˙qLq=0.
    $

    In constrast to Lagrangian systems, Hamiltonian systems evolve on the phase space. For a given Lagrangian, denote $ \mathbb{F}L:TQ\to T^*Q $ as its fiber derivative.

    Definition 2.1. A Lagrangian, $ L:TQ\to \mathbb{R} $, is hyperregular if the fiber derivative is a global diffeomorphism. Furthermore, a Lagrangian is natural if it has the form

    $ L(v)=12g(v,v)τQV(v),
    $

    where $ g $ is a Riemannian metric on $ Q $ and $ V\in C^\infty(Q) $.

    Throughout this work, all Lagrangians will be assumed to be hyperregular. Moreover, when affine constraints are considered, the Lagrangian will be assumed to be natural.

    For a given Lagrangian, we define the Hamiltonian $ H:T^*Q\to\mathbb{R} $ via the Legendre transform:

    $ H(p)=p,vL(v),p=FL(v).
    $

    Let $ \omega = dq^i\wedge dp_i $ be the canonical symplectic form on $ T^*Q $. The resulting Hamiltonian vector field arising from the Hamiltonian $ H $ and the symplectic form $ \omega $ is denoted by $ X_H $ and is given by

    $ iXHω=dH,
    $

    where $ i_X\omega = \omega(X, \cdot) $ is the contraction. An important feature of Hamiltonian vector fields is that they preserve the symplectic form, and thus are volume-preserving.

    Theorem 2.2 (Liouville's Theorem). Hamiltonian dynamics preserve the symplectic form and, additionally, preserve the volume form $ \omega^n $.

    Liouville's theorem does not apply to general nonholonomic systems as they are generally not symplectic. Determining when an analogous version of this theorem applies to constrained systems is the main goal of this work.

    For unconstrained Lagrangian systems, motion is allowed to occur in the entire state space, $ TQ $. For constrained systems, motion is restricted to lie on a constraint manifold $ N\subset TQ $ and we tacitly assume that $ \tau_Q(N) = Q $. For the purposes of this work, we will not differentiate between nonholonomic and holonomic constraints as the machinery for the former will suffice for the latter.

    Let $ N\subset TQ $ be the constraint submanifold which is, locally, described via the zero level-set of a regular function $ G:TQ\to\mathbb{R}^k $, $ k < n = \dim(Q) $. Let us denote each component of this function by $ \Psi^\alpha $, $ \alpha = 1, \ldots, k $, i.e.

    $ N=kα=1(Ψα)1(0).
    $

    To transfer to the Hamiltonian point of view, we assume that the Lagrangian, $ L:TQ\to\mathbb{R} $, is hyper-regular. The constraints on the cotangent bundle become

    $ Φα=(FL1)Ψα,M=FL(N)=kα=1(Φα)1(0).
    $

    These constraints are admissible if the following matrix is non-singular [18].

    $ mαβ=CdΦβ(XΦα),(mαβ)=(mαβ)1,
    $

    where $ X_{\Phi^\alpha} $ is the Hamiltonian vector field generated by $ \Phi^\alpha $ and $ \mathcal{C}:T(T^*Q)\to T(T^*Q) $ is the $ \mathbb{F}L $-related almost-tangent structure, explained below.

    The constrained Euler-Lagrange equations become modified via Chetaev's rule (which is not necessarily the physically correct procedure, cf. [30] for a discussion), which will provide equivalent results to those in [31] where the "almost-tangent" structure of the tangent bundle is utilized. For Lagrangian systems, Chetaev's rule states that if we have the nonlinear constraints $ \Psi^\alpha $, then the constraint forces have the following form

    $ ddt(L˙q)Lq=λαSdΨα,
    $
    (2.1)

    where $ \mathcal{S}:T(TQ)\to T(TQ) $ is a (1, 1)-tensor called the almost-tangent structure and, in local coordinates, is given by

    $ S=˙qidqi.
    $

    If, rather than being general nonlinear, the constraints are affine in the velocities,

    $ Ψα(v)=ηα(v)+τQξα(v),
    $

    where $ \eta^\alpha\in\Omega^1(Q) $ are 1-forms and $ \xi^\alpha\in C^\infty(Q) $ are functions, then (2.1) becomes

    $ ddt(L˙q)Lq=λατQηα.
    $

    However, this work will focus on the Hamiltonian formalism. The constraint manifold, $ M\subset T^*Q $, is locally described by the joint level-set of the functions $ \Phi^\alpha:T^*Q\to\mathbb{R} $ where $ \Phi(p) = \Psi\circ\mathbb{F}L^{-1}(p) $. Moving (2.1) to the cotangent side, the constrained Hamiltonian equations of motion become

    $ iXMHω|M=dH|M+λαCdΦα|M,
    $
    (2.2)

    where $ \mathcal{C}:T(T^*Q)\to T(T^*Q) $ is the $ \mathbb{F}L $-related almost-tangent structure, i.e. if $ L $ is natural, then

    $ C(αidqi+βjdpj)=gijβjdqi.
    $

    Throughout this work, $ L:TQ\to\mathbb{R} $ will be a hyperregular Lagrangian with corresponding Hamiltonian $ H:T^*Q\to\mathbb{R} $. The constraint manifold will be called $ N\subset TQ $ or $ M = \mathbb{F}L(N)\subset T^*Q $. These submanifolds will be locally described by the joint zero level-set of a collection of smooth functions,

    $ N=kα=1(Ψα)1(0),M=kα=1(Φα)1(0).
    $

    If the constraints are affine in the velocities/momentum, the Lagrangian will be assumed to be natural with Riemannian metric $ g $. In this case, the constraining functions will have the form

    $ Ψα(v)=ηα(v)+τQξα(v),Φα(p)=P(Wα)(p)+πQξα(p),
    $

    where $ \eta^\alpha $ are 1-forms, $ \xi^\alpha $ are functions, $ W^\alpha = \mathbb{F}L^{-1}(\eta^\alpha) $ are vector fields, and $ P(W^\alpha) $ is the vector field's momentum

    $ P(Wα)(p)=p,Wα(πQ(p)).
    $

    In this case of affine constraints, $ \mathcal{D}\subset TQ $ will be the distribution

    $ Dq=kα=1kerηαq,
    $

    and $ \mathcal{D}^* = \mathbb{F}L(\mathcal{D})\subset T^*Q $.

    Finally, the nonholonomic vector fields with Hamiltonian $ H $ and constraint manifold $ M $ will be denoted by $ X_H^M $.

    Given a constraint manifold, $ M\subset T^*Q $, we can determine the nonholonomic vector field, $ X_H^M\in\mathfrak{X}(M) $ via (2.2). Commonly local, noncanonical, coordinates are chosen for $ M $ (cf. §5.8 in [20] and [32]). However, we will instead work with a tubular neighborhood (for ease, this will be written as the entire manifold $ T^*Q $) and define an extended vector field $ X_H^{ext}\in\mathfrak{X}(T^*Q) $ such that $ X_H^{ext}|_{M} = X_H^M $. This section outlines an intrinsic (albeit non-unique) way to determine such a vector field.

    Definition 3.1. For a given constraint submanifold $ M\subset T^*Q $, a realization of $ M $ is an ordered collection of functions $ \mathscr{C} \; : = \{\Phi^\alpha:T^*Q\to\mathbb{R}\} $ such that zero is a regular value of $ G = \Phi^1\times\ldots\times \Phi^k $ and

    $ M=α(Φα)1(0).
    $

    If the functions $ \Phi^\alpha $ are affine in momenta, i.e. $ \Phi^\alpha = P(X^\alpha)+\pi_Q^*f^\alpha $, then the realization is called affine.

    Remark 3.2. In the case where the Lagrangian is natural (which provides a Riemannian metric on $ Q $) and the constraint submanifold is affine, we can choose the realization to be affine:

    $ \mathscr{C} = \{P(W^1)+\pi_Q^*\xi^1,\ldots,P(W^k)+\pi_Q^*\xi^k\}, $

    where $ W^i = \mathbb{F}L^{-1}\eta^i = (\eta^i)^\sharp $.

    By replacing $ M $ with a realization $ \mathscr{C} $, we can extend the nonholonomic vector field to a vector field on $ T^*Q $ such that $ \Phi^\alpha $ are first integrals. Recall that the form of the nonholonomic vector field is $ i_{X_H^M}\omega = dH + \lambda_\alpha\mathcal{C}^*d\Phi^\alpha $. We construct the extended nonholonomic vector field, $ \Xi_H^{\mathscr{C}} $, by requiring that:

    (NH.1) $ i_{\Xi_H^\mathscr{C}}\omega = dH + \lambda_\alpha\mathcal{C}^*d\Phi^\alpha $ for smooth functions $ \lambda_\alpha:T^*Q\to\mathbb{R} $, and

    (NH.2) $ \mathcal{L}_{\Xi_H^\mathscr{C}}\Phi^\alpha = 0 $ for all $ \Phi^\alpha\in\mathscr{C} $.

    Under reasonable compatibility assumptions on $ \mathscr{C} $ (cf. §3.4.1 in [18], see Definition 3.4 below), such a vector field exists and is unique. However, given two different realizations, $ \mathscr{C} $ and $ \mathscr{C}' $, of the same constraint manifold $ M $, it is not generally true that $ \Xi_{H}^\mathscr{C} = \Xi_H^{\mathscr{C}'} $, however $ \Xi_{H}^\mathscr{C}|_{M} = \Xi_H^{\mathscr{C}'}|_{M} $.

    Remark 3.3. The constraint manifold is given by the joint zero level-sets of the $ \Phi^\alpha $ while the realization provides additional irrelevant information off of the constraint manifold. This is why $ \Xi_H^\mathscr{C}\ne \Xi_H^{\mathscr{C}'} $ but they agree once restricted as will be proved in Proposition 3.6 below.

    Definition 3.4. For a realization $ \mathscr{C} = \{\Phi^1, \ldots, \Phi^k\} $, the constraint mass matrix, $ (m^{\alpha\beta}) $, is given by "orthogonally" pairing the constraints,

    $ mαβ=CdΦα(XΦβ).
    $

    The realization is admissible if this matrix is non-singular on a tubular neighborhood of $ M $. The inverse matrix will be denoted by $ (m_{\alpha\beta}) = (m^{\alpha\beta})^{-1} $.

    When the constraints are affine (along with a natural Lagrangian), the constraint mass matrix becomes

    $ mαβ=g(Wα,Wβ)=ηα(Wβ),
    $

    and admissibility of the constraints is equivalent to the constraints being linearly independent.

    We can now write down a formula for $ \Xi_H^\mathscr{C} $. Using (NH.1) and (NH.2), we get that (where $ \{\cdot, \cdot\} $ is the standard Poisson bracket)

    $ LΞCHΦβ=iXΦβω(ΞCH)=iΞCHω(XΦβ)=dH(XΦβ)λαCdΦα(XΦβ)={Φβ,H}λαCdΦα(XΦβ)=0.
    $

    This implies that $ \left\{ \Phi^\beta, H \right\} = m^{\alpha\beta}\lambda_\alpha $. Due to the constraint mass matrix being nondegenerate, the multipliers are uniquely determined and the extended nonholonomic vector field is determined by

    $ iΞCHω=dHmαβ{H,Φα}CdΦβ
    $
    (3.1)

    Definition 3.5. The 1-form on $ T^*Q $ given by

    $ νCH:=dHmαβ{H,Φα}CdΦβ,
    $

    is called the nonholonomic 1-form with respect to the realization $ \mathscr{C} $.

    Proposition 3.6. Given two different realization, $ \mathscr{C} $ and $ \tilde{\mathscr{C}} $, the extended nonholonomic vector fields given by (3.1) agree on $ M $.

    Proof. Let $ \mathscr{C} = \{\Phi^\alpha\} $ and $ \tilde{\mathscr{C}} = \{\tilde{\Phi}^\alpha\} $ be two different realizations of the same manifold $ M $ and let $ x\in M $. Their differentials span the annihilator,

    $ span{dΦαx}=span{d˜Φαx}=Ann(TxM).
    $

    As the map $ \mathcal{C}:T(T^*Q)\to T(T^*Q) $ is linear, we have

    $ span{CdΦαx}=span{Cd˜Φαx}.
    $

    As the multipliers are unique, we must have

    $ mαβ{H,Φα}CdΦβ|M=˜mαβ{H,˜Φα}Cd˜Φβ|M,
    $

    and therefore the resulting extended nonholonomic vector fields must agree on $ M $.

    Remark 3.7. A consequence of Proposition 3.6 is that this procedure of extending the nonholonomic vector field is still valid when $ M $ is not globally defined as the level-set of functions. Suppose that $ U, V\subset M $ are two neighborhoods characterized by the zero level-sets of functions $ \Phi, \phi:T^*Q\to\mathbb{R} $. Then $ \Phi $ and $ \phi $ both define different realizations on $ U\cap V $. However, their corresponding vector fields agree on the intersection. This observation will make the constructions required for Theorem 5.2 well-defined even if $ M $ cannot be globally defined as a level set.

    The extended nonholonomic vector field, $ X_H^M $, is a vector field on the ambient space $ T^*Q $ and has $ M $ as an invariant manifold. As such, an integral curve of $ X_H^M $ is only a valid trajectory if its initial condition lies in $ M $. In practice using numerical techniques, round off errors can result in trajectories drifting away from $ M $. To counter this effect, the condition (NH.2) can be replaced to include something in the spirit of a "control Lyapunov function" [33]. This modification will only be explored in this section while the rest of this paper will be concerned with the unmodified approach explained above.

    Let the modified feedback nonholonomic criteria be:

    (NH.1) $ i_{\Xi_H^\mathscr{C}}\omega = dH + \lambda_\alpha \mathcal{C}^*d\Phi^\alpha $, for smooth functions $ \lambda_\alpha:T^*Q\to \mathbb{R} $, and

    (NH.2') $ \mathcal{L}_{\Xi_H^\mathscr{C}}\Phi^\alpha = -\kappa^\alpha \Phi^\alpha $ for some positive constants $ \kappa^\alpha $,

    where there is no summation in (NH.2').

    This approach has the advantage that if $ x_0\not\in M $, its trajectory exponentially approaches $ M $, i.e. suppose that $ \varphi_t $ is the flow of the extended feedback nonholonomic vector field, then

    $ Φα(φt(x0))=eκαΦα(x0)0.
    $

    Mimicking the derivation of (3.1), we arrive at

    $ iΞCHω=dHmαβ({H,Φα}καΦα)CdΦβ.
    $
    (3.2)

    Example 1 (Simple Pendulum). Consider the simple pendulum viewed as a constrained system in $ \mathbb{R}^2 $. The Hamiltonian and constraint are

    $ H=12m(p2x+p2y)+mgy,Φ=xpx+ypy=0,
    $

    where $ m $ is the mass of the pendulum and $ g $ is the acceleration due to gravity. The equations of motion from (3.2) are

    $ ˙x=1mpx,˙y=1mpy,˙px=xx2+y2[mgy1m(p2x+p2y)κ(xpx+ypy)],˙py=yx2+y2[mgy1m(p2x+p2y)κ(xpx+ypy)]mg.
    $
    (3.3)

    A valid initial condition, $ z_0 = (x(0), y(0), p_x(0), p_y(0)) $, for (3.3) needs to satisfy the constraint $ \Phi(z_0) = 0 $. However, due to running numerical errors, the constraint will not be preserved as the state is integrated. By choosing $ \kappa > 0 $, a correcting feedback is introduced to help preserve $ \Phi = 0 $ as shown in Figure 1.

    Figure 1.  The introduction of a positive $ \kappa $ helps to stabilize the numerical trajectory. Numerical integration was performed with Matlab's $\texttt{ode45}$.

    We end this section with a brief consideration of ideal constraints - constraints that perform no work on the system. As discussed in [31], a constraint $ \Psi $ on $ TQ $ is ideal if and only if

    $ ˙qiΨ˙qi|M=0.
    $

    On the cotangent side, this condition becomes

    $ CdΦ(XH)|M=0.
    $

    In particular, if all the constraints are ideal, then energy is conserved. Indeed,

    $ ˙H=mαβ{H,Φα}CdΦβ(XH).
    $

    Constraints will not be assumed to be ideal. Indeed, examples 7.1.2, 7.1.3, and 7.2.1 will all be subject to non-ideal constraints. In particular, these examples will be volume-preserving but not energy-preserving (although a modified energy integral exists for the first two examples [34]).

    For a given nonholonomic vector field $ X_H^M\in\mathfrak{X}(M) $, a volume-form $ \mu\in \Omega^{2n-k}(M) $ is invariant if and only if

    $ divμ(XMH)=0.
    $

    This section offers a way to choose a volume-form $ \mu $ based off a (non-canonical choice) of realization $ \mathscr{C} $ as well as showing how to compute the divergence of the nonholonomic vector fields with respect to this volume.

    The symplectic manifold $ T^*Q $ has a canonical volume form $ \omega^n $. However, the nonholonomic flow takes place on a submanifold $ M\subset T^*Q $ which is $ 2n-k $ dimensional. Therefore, $ \omega^n $ is not a volume form on $ M $. Here, we construct a volume form on $ M $ which is unique up to the choice of realization. The derivation of this will be similar to the construction of the volume form on an energy surface in §3.4 of [35]. For the realization $ \mathscr{C} = \{\Phi^1, \ldots, \Phi^k\} $, define the $ k $-form

    $ σC:=dΦ1dΦk.
    $

    Definition 4.1. If we denote the inclusion map by $ \iota:M\hookrightarrow T^*Q $, then a nonholonomic volume, $ \mu_\mathscr{C} $, is given by

    $ μC=ιε,σCε=ωn.
    $

    Proposition 4.2. Given an ordered collection of constraints, $ \mathscr{C} $, the induced volume form $ \mu_\mathscr{C} $ is unique.

    Proof. Suppose that $ \varepsilon $ and $ \varepsilon' $ are two forms satisfying $ \sigma_\mathscr{C}\wedge\varepsilon = \omega^n $. Then

    $ εε=α,σCα=0.
    $

    Now let $ \iota:M\hookrightarrow T^*Q $ be the inclusion. Then from the above, we see that

    $ ιε=ιε+ια.
    $

    The result will follow so long as $ \iota^*\alpha = 0 $. Suppose that $ \iota^*\alpha\ne0 $ and choose vectors $ v^1, \ldots, v^{2n-k}\in T_xM\subset T_xT^*Q $ such that $ \alpha(v^1, \ldots, v^{2n-k})\ne 0 $. Complete this collection of vectors to a basis of $ T_xT^*Q $: $ v^1, \ldots, v^{2n-k}, v^{2n-k+1}, \ldots, v^{2n} $ such that $ \sigma_{\mathscr{C}}(v^{2n-k+1}, \ldots, v^{2n})\ne 0 $. Then we have

    $ σCα(v1,,v2n)=(1)(2nk)kα(v1,,v2nk)σC(v2nk+1,,v2n)0,
    $

    which is a contradiction.

    Remark 4.3. Notice that for an ordered collection of constraints the volume form is unique. However, changing the order of the constraints changes the sign of the induced volume form and re-scaling constraints re-scales the volume form. In this sense, $ \mathscr{C} $ uniquely determines $ \mu_\mathscr{C} $, but $ M $ only determines $ \mu_\mathscr{C} $ up to a multiple. This is to be expected as all volume-forms are related by a multiplicative factor.

    While examining the failure of Liouville's theorem (Theorem 2.2) for nonholonomic systems, we will see when $ \mu_\mathscr{C} $ is preserved under the flow of $ X_H^M $. More generally, we will consider the existence of a smooth density $ f\in C^\infty(M) $ when $ f\mu_\mathscr{C} $ is preserved.

    Let $ \omega = dq^i\wedge dp_i $ be the standard symplectic form on $ T^*Q $. This in turn induces a volume form $ \omega^n $. A measure of how much a flow fails to preserve a volume form is described by its divergence. We proceed with computing the divergence of a nonholonomic vector field, $ \mathrm{div}_{\mu_{\mathscr{C}}}(X_H^M) $. When this is nonzero, we will be interested in finding a density, $ f $, such that $ \mathrm{div}_{f\mu_{\mathscr{C}}}(X_H^M) = 0 $. This problem will be addressed in §5.

    Before we begin with the divergence calculation, we first present a helpful lemma which allows us to relate the divergence of the extended nonholonomic vector field with the corresponding restricted vector field.

    Lemma 4.4. If $ \mathscr{C} $ is a realization of the constraint manifold $ M\subset T^*Q $, then

    $ divωn(ΞCH)|M=divμC(XMH).
    $

    Proof. Leibniz's rule for the Lie derivative provides

    $ LΞCHωn=LΞCH(σCε)=(LΞCHσC)ε+σC(LΞCHε).
    $

    However, $ \mathcal{L}_{\Xi_H^{\mathscr{C}}}\sigma_\mathscr{C} = 0 $ because the constraints are preserved under the flow. Applying this, we see that

    $ LΞCHωn=σC(LΞCHε),
    $

    which gives

    $ (divωn(ΞCH))σCε=σC(LΞCHε).
    $

    Due to the fact that the Lie derivative commutes with restriction, the result follows.

    This lemma allows for us to calculate the divergence of the extended nonholonomic vector field and to restrict to the constraint distribution afterwards.

    Using Cartan's magic formula along with Lemma 4.4, we see

    $ LΞCH(ωn)=iΞCHdωn+diΞCHωn=nd(iΞCHωωn1)=nd(iΞCHω)ωn1n(iΞCHω)dωn1=ndνCHωn1.
    $

    The divergence of the system is controlled by the failure of $ \nu_H^\mathscr{C} $ to be closed. Its derivative is given by

    $ dνCH=ddHd(mαβ{H,Φα}CdΦβ).
    $

    Let $ \phi_\alpha = m_{\alpha\beta}\Phi^\beta $. As $ \Phi^\beta = 0 $ on $ M $, the derivative becomes (where restriction to $ M $ is implied)

    $ dνCH=d{H,ϕβ}CdΦβ+{H,ϕβ}dCdΦβ.
    $

    The form $ d\nu_H^\mathscr{C} $ is a general 2-form. However, the only terms that survive once it is wedged with $ \omega^{n-1} $ are the diagonal terms, $ dq^i\wedge dp_i $. Using local coordinates, $ a_i^\beta dq^i = \mathcal{C}^*d\Phi^\beta $, we have

    $ (dνCH)diag=({H,ϕβ}piaβi{H,ϕβ}aβipi)dqidpi.
    $

    Therefore, the divergence is given by

    $ divμC(XMH)=n({H,ϕβ}piaβi{H,ϕβ}aβipi).
    $

    The first component of this expression can be written intrinsically. Notice that

    $ dπQXf=fpiqi,
    $

    where $ \pi_Q:T^*Q\to Q $ is the cotangent bundle projection. Therefore, the divergence can now be written as

    $ divμC(XMH)=n(CdΦβ([XH,Xϕβ])+{H,ϕβ}aβipi).
    $
    (4.1)

    Suppose the Lagrangian is natural with Riemannian metric $ g = (g_{ij}) $. Then

    $ CdΦβ=gijΦβpjdxi,aβi=gijΦβpj,aβipi=gij2Φβpipj.
    $

    Let us call this final double sum $ \mathcal{M}^\beta $. Applying this to (4.1), we have

    $ divμC(XMH)=nCdΦβ([XH,Xϕβ])n{H,ϕβ}Mβ.
    $

    Next, assume that (in addition to the Lagrangian being natural), the constraints are affine which makes $ \mathcal{M}^\beta = 0 $. Also the matrix, $ m^{\alpha\beta} $, does not depend on $ p $ and therefore, the divergence can be written as

    $ divμC(XMH)=nmαβπQηβ([XH,XΦα]),
    $
    (4.2)

    as $ \mathcal{C}^*d\Phi^\beta = \pi_Q^*\eta^\beta $ for affine constraints.

    For affine systems with natural Lagrangians, the divergence can be expressed intrinsically via (4.2). To understand the divergence in the general nonlinear case (with potentially non-natural Lagrangians), we need an intrinsic way to interpret the term $ \mathcal{M}^\beta $. This can be accomplished via the following procedure:

    The divergence for a nonholonomic system subject to nonlinear constraints is thus

    $ divμC(XMH)=dim(Q)(CdΦβ([XH,Xϕβ])+{H,ϕβ}ΔC(Φβ)),
    $
    (4.3)

    which is the key to proving Theorem 5.2 below.

    In general, the divergence of a nonholonomic system does not vanish as (4.1) shows. When does there exist a different volume form on $ M $ that is invariant under the flow? i.e. does there exist a density $ \rho > 0 $ such that $ \mathrm{div}_{\rho\mu_{\mathscr{C}}}(X_H^M) = 0 $? Finding such a $ \rho $ requires solving a certain type of partial differential equation which is known as the smooth dynamical cohomology equation. Solving this PDE is generally quite difficult. However, when the constraints are affine and the density is assumed to be basic, the problem becomes considerably more tractable

    What conditions need to be met for $ \rho $ such that $ \rho\mu_\mathscr{C} $ is an invariant volume form? Using the formula for the divergence as well as the fact that the Lie derivative is a derivation yields:

    $ divρμC(XMH)=divμC(XMH)+1ρLXMH(ρ).
    $

    Therefore the density, $ \rho $, yields an invariant measure if and only if

    $ 1ρLXMH(ρ)=divμC(XMH).
    $
    (5.1)

    Notice that the left hand side of (5.1) can be integrated to

    $ 1ρLXMH(ρ)=d(lnρ)(XMH).
    $

    Calling $ \lambda = \ln \rho $, we have the following lemma.

    Lemma 5.1. For a nonholonomic vector field, $ X_H^M $, there exists a smooth invariant volume, $ \rho\mu_\mathscr{C} $, if there exists an exact 1-form $ \alpha = d\lambda $ such that

    $ α(XMH)=divμC(XMH).
    $
    (5.2)

    Then the density is (up to a multiplicative constant) $ \rho = e^\lambda $.

    Therefore the existence of invariant volumes boils down to finding global solutions to the PDE (5.2). Using the divergence calculation, (4.3), we can state the main result.

    Theorem 5.2. The nonholonomic Hamiltonian vector field $ X_H^M $ possesses a smooth invariant volume, $ \rho\mu_{\mathscr{C}} $, if there exists an exact 1-form $ \alpha\in\Omega^1(T^*Q) $ such that

    $ α(XMH)=dim(Q)(CdΦβ([XH,Xϕβ])+{H,ϕβ}ΔC(Φβ)).
    $

    The remainder of this section deals with uniqueness of solutions and a necessary condition for solutions to exist.

    Remark 5.3. PDEs of the form $ dg(X) = f $ for a given smooth function $ f $, vector field $ X $ and with $ g $ as the unknown are called cohomology equations [36,37]. Thus the equation (5.2) is a cohomology equation.

    The problem of existence is quite difficult in general and we postpone that discussion until the next subsection where we assume that the solution has the form $ \rho = \pi_Q^*\tilde{\rho} $. In the meantime, assuming that there exists a function $ \lambda\in C^\infty(M) $ that solves (5.2), do there exist other solutions? Suppose that $ \lambda_1 $ and $ \lambda_2 $ both solve (5.2). Then their difference must be a first integral of the system: $ \mathcal{L}_{X_H^M}(\lambda_1-\lambda_2) = 0 $. Solutions of (5.2) are then unique up to constants of motion. i.e. if $ \lambda $ solves (5.2), then every invariant density has the form (again, up to a multiplicative constant)

    $ ρ=exp(λ+constantofmotion).
    $

    Therefore invariant measures can be thought of as an affine space with dimension being equal to the number of first integrals of the nonholonomic system.

    In general, solving the cohomology equation (5.2) is quite difficult. It turns out, however, that it is relatively easy to determine necessary and sufficient conditions on the solvability when the density is assumed to be basic and the constraints are affine.

    For the remainder of this section, unless otherwise stated, the constraints are assumed to be affine. In particular, if $ N\subset TQ $, then the constraints have the form

    $ Ψα(v)=ηα(v)+τQξα(v),
    $

    for 1-forms $ \eta^\alpha $ and function $ \xi^\alpha $. On the cotangent side, the constraints become

    $ Φα(p)=P(Wα)(p)+πQξα(p),
    $
    (5.3)

    where $ W^\alpha = \mathbb{F}L^{-1}\eta^\alpha $.

    Definition 5.4. A density $ \rho:T^*Q\to\mathbb{R} $ is said to be basic if $ \rho = \pi_Q^*\tilde{\rho} $ for some $ \tilde{\rho}:Q\to\mathbb{R} $.

    Under this assumption, (4.2) can be presented in a surprisingly nice way. In this case, the divergence can be described by an equivalence class of "affine-forms." The density form and density function, defined below, is a representative element from this class.

    Definition 5.5. Let $ \mathscr{C} $ be an affine realization of $ M\subset T^*Q $ of the form (5.3). Then, define the density form and the density function to be

    $ ϑC=mαβLWαηβ,ζC=mαβLWαξβ,
    $

    respectively.

    Studying this pair, $ (\vartheta_{\mathscr{C}}, \zeta_\mathscr{C}) $, provides necessary and sufficient conditions for the existence of basic densities.

    Theorem 5.6. For a natural Hamiltonian system subject to affine constraints, there exists an invariant volume of the form $ (\pi_Q^*\tilde{\rho})\mu_{\mathscr{C}} $ if and only if there exists functions $ \varphi_\gamma $ such that

    $ nϑCφγηγ=dln˜ρ,nζCφγξγ=0.
    $
    (5.4)

    Proof. To prove this, we will show that $ -n\cdot\pi_Q^*\vartheta(X_H^M) - n\cdot \pi_Q^*\zeta_\mathscr{C} = \mathrm{div}_\mathscr{C}(X_H^M) $. Recall that the differential of a 1-form is given by $ d\alpha(X, Y) = X\alpha(Y)-Y\alpha(X) - \alpha([X, Y]) $ and that $ \pi_Q^*\eta^\beta(X_H)|_M = -\xi^\beta $. Returning to (4.2), we have

    $ divμC(XMH)=nmαβπQηβ([XH,XΦα])=nmαβ(XHmαβ+XΦαξβπQdηα(XH,XΦα))=nmαβ(dmαβ(˙q)+dξβ(Wα)dηα(˙q,Wβ))=nmαβ[(diWβηα+iWβdηα)(˙q)+dξβ(Wα)]=nmαβ[LWβηα(˙q)+LWαξβ]=nϑC(˙q)nζC.
    $

    This computation shows that $ \mathrm{div}_{\mu_{\mathscr{C}}}(X_H^M) = -n\cdot\vartheta_{\mathscr{C}}(\dot{q})-n\cdot\zeta_\mathscr{C} $, but $ \dot{q} $ cannot be arbitrary as it must lie within $ M $ which states that $ \eta^\alpha(\dot{q}) + \xi^\alpha = 0 $. Adding multiples of the constraints to the divergence to produce an exact 1-form yields (5.4).

    This theorem allows for a straight-forward algorithm to find invariant volumes in nonholonomic systems subject to affine constraints; one only needs to compute the pair $ (\vartheta_{\mathscr{C}}, \zeta_\mathscr{C}) $ and determine whether or not it can be made exact by appending constraints to it. This procedure will be carried out on multiple examples in §7.

    Remark 5.7. In the pure kinetic energy case with linear constraints discussed in [24], it is proved that if the system admits an (arbitrary) invariant volume, then one can always find another invariant volume form whose density function depends only on the (reduced) configuration variables. Moreover, systems subjected to affine constraints may possess an invariant volume whose density is not basic, cf. [38].

    The above shows that "exactness" of $ (\vartheta_{\mathscr{C}}, \zeta_\mathscr{C}) $ determines the existence of a density depending on configuration. How does this depend on the choice of $ \mathscr{C} $ to realize the constraints? It turns out the answer is independent of the choice of realization.

    Theorem 5.8. Let $ \mathscr{C}_1 $ and $ \mathscr{C}_2 $ both be affine realizations of the constraint manifold $ M $. Suppose that there exist functions $ \varphi_\gamma $ such that

    $ ϑC1φγηγ1=dln˜ρ1,ζC1φγξγ1=0,
    $

    then there exists functions $ \psi_\gamma $ such that

    $ ϑC2ψγηγ2=dln˜ρ2,ζC2ψγξγ2=0.
    $

    Moreover, $ (\pi_Q^*\tilde{\rho}_1)\mu_{\mathscr{C}_1} = (\pi_Q^*\tilde{\rho}_2)\mu_{\mathscr{C}_2} $ modulo a constant of motion.

    Proof. Let $ \eta_2^\gamma = c_\alpha^\gamma\eta_1^\alpha $ and $ \xi_2^\gamma = c_\alpha^\gamma \xi_1^\alpha $ where $ c_\alpha^\gamma $ is the required coordinate change. For ease of notation, we will drop the subscript "1." The matrices are related via

    $ mαβ2=cαγcβδmγδ1.
    $

    The density form is

    $ mαβ2ϑ2=LcαγWγ(cβδηδ)=cαγcβδLWγηδ+cβδmδγ1dcαγ+cαγdcβδ(Wγ)ηδ=mαβ2ϑ1+mαβ2σγαdcαγ+cαγdcβδ(Wγ)ηδ,
    $

    where $ (\sigma_\alpha^\gamma) : = (c_\gamma^\alpha)^{-1} $. By Lemma 5.9 below, we have

    $ mαβ2ϑ2=mαβ2ϑ1+mαβ2d[lndet(cαγ)]+cαγdcβδ(Wγ)ηδ.
    $
    (5.5)

    This shows that $ \vartheta_1 $ and $ \vartheta_2 $ differ by something exact and a multiple of the constraining 1-forms. Using the fact that $ \vartheta_1 - \varphi_\gamma\eta_1^\gamma = -d \ln\tilde{\rho}_1 $, we see that

    $ ϑ2(φγ+Cγ)ηγ=d[ln˜ρ1+lndet(cαγ)],
    $

    where $ m_2^{\alpha\beta}C_\gamma = c_\delta^\alpha dc_\gamma^\beta(W^\delta) $. This provides

    $ ˜ρ2=det(σγα)˜ρ1,ψγ=cδγ[φδ+Cδ].
    $

    These values of $ \psi_\gamma $ make $ \vartheta_2 $ exact. We next check that these values of $ \psi_\gamma $ make $ \zeta_2 $ vanish.

    $ mαβ2(ζ2ψγcγδξδ)=cαγcβδLWγξδ+cαγdcβδ(Wγ)ξδmαβ2ψγcγδξδ=mαβ2ζ1mαβ2φδξδ=0.
    $

    It remains to show that $ (\pi_Q^*\tilde{\rho}_1)\mu_{\mathscr{C}_1} = (\pi_Q^*\tilde{\rho}_2)\mu_{\mathscr{C}_2} $. This is equivalent to $ \mu_{\mathscr{C}_1} = (\pi_Q^*\det(c_\gamma^\alpha))\mu_{\mathscr{C}_2} $. Recalling Definition 4.1, we have $ \sigma_1 = d\Phi^1\wedge\ldots d\Phi^k $ and

    $ σ2=ki=1d(ciγΦγ)=det(cαγ)σ1+ki=1Φγdciγ.
    $

    Upon applying the constraints, the last term disappears and we obtain

    $ σ1ε1=σ2ε2=det(cαγ)σ1ε2.
    $

    This implies the desired result.

    A reason why studying $ (\vartheta_\mathscr{C}, \zeta_\mathscr{C}) $ is insightful is that it immediately demonstrates why holonomic systems systems are measure-preserving, i.e. when $ \eta^\alpha = df^\alpha $ and $ \xi^\alpha = 0 $, (5.4) can always be solved. This short section idemonstrates how our general theory collapses to the known holonomic case. We start with a helpful lemma.

    Lemma 5.9 (*). $ m_{\alpha\beta}\cdot dm^{\alpha\beta} = d\left[ \ln \det \left(m^{\alpha\beta}\right)\right] $.

    *We thank Dr. Alexander Barvinok for help with this proof.

    Proof. It suffices to check along a curve in the manifold. Let $ \gamma:I\to Q $ be a curve and let $ A(t) = \left(m^{\alpha\beta}\right)\circ\gamma(t) $ be the mass matrix along the curve. Note that $ A(t) $ is positive-definite and changes smoothly with $ t $. We have

    $ ddtlndetA(t)=ddtdetA(t)detA(t)=mi=1detAi(t)detA(t),
    $

    where $ A_i(t) $ is obtained from $ A(t) $ by differentiating the $ i $-th row and leaving all other rows intact, i.e.

    $ Ai(t)=(a11(t)a1m(t)a(i1)1(t)a(i1)m(t)ai1(t)aim(t)a(i+1)1(t)a(i+1)m(t)am1(t)amm(t)).
    $

    Expanding $ \det A_i(t) $ along the $ i $-th row:

    $ detAi(t)=mj=1(1)i+j1aij(t)detAij(t),
    $

    where $ A_{ij}(t) $ is the $ (m-1)\times (m-1) $ matrix obtained from $ A_i(t) $ and hence from $ A(t) $ by crossing out the $ i $-th row and $ j $-th column.

    Next, observe that $ (-1)^{i+j-1}\det A_{ij}/\det A(t) $ is the $ (j, i) $-th entry of the inverse matrix $ A^{-1}(t) = (b_{ij})(t) $, and since $ A(t) $ is symmetric, is also the $ (i, j) $-th entry of $ (b_{ij})(t) $. Summarizing,

    $ ddtlndetA(t)=mi,j=1aij(t)bij(t).
    $

    Proposition 5.10. If the constraints are holonomic, then there exist function $ \varphi_\gamma $ such that $ \vartheta_{\mathscr{C}}-\varphi_\gamma\eta^\gamma $ is exact. In particular, if $ \mathscr{C} $ is chosen such that all $ \eta^\alpha $ are closed, $ \vartheta_{\mathscr{C}} $ is exact.

    Proof. When the constraints are holonomic, the 1-forms $ \eta^\alpha $ can be chosen such that they are closed. Then the density form is

    which is exact by Lemma 5.9. If a different realization is chosen, Theorem 5.8 states that the resulting (5.4) is still solvable.

    It turns out that for natural Lagrangian systems subject to linear constraints, the divergence - particularly the density form - is encoded in the nonholonomic connection. This interpretation seems to be new.

    Throughout this section, let $ L:TQ\to\mathbb{R} $ be a natural Lagrangian with Riemannian metric $ g $ subject to the linear constraints $ \eta^\alpha(v) = 0 $. The nonholonomic connection for this system is given by (cf. §5.3 in [20] and [39]):

    $ CXY=XY+Wimij[X(ηj(Y))ηj(XY)].
    $

    The equations of motion can then be described via

    $ C˙q˙q=F,
    $

    where $ F $ contains the potential and external forces (the constraint forces are contained in the connection).

    The nonintegrability of the constraints appears in the torsion of the connection. Computing this, we see

    $ TC(X,Y)=CXYCYX[X,Y]=Wimij[X(ηj(Y))Y(ηj(X))ηj(XYYX)]=Wjmij[X(ηj(Y))Y(ηj(X))ηj([X,Y])]=Wjmijdηj(X,Y).
    $

    The torsion can be written as

    $ TC=mαβWαdηβ.
    $

    Indeed, if the constraining 1-forms $ \eta^j $ are all closed (so holonomic) then the torsion vanishes. It is worth pointing out that the torsion is vertical-valued; if $ X, Y\in M $, then $ T^\mathscr{C}(X, Y)\in M^\perp $, i.e. $ T(X, Y) $ is orthogonal to the constraint distribution.

    Due to the fact that the torsion is a (1, 2)-tensor, its trace will be a (0, 1)-tensor. Therefore, the trace of the nonholonomic torsion will be a 1-form:

    $ trTC=mαβiWαdηβ.
    $

    Returning to the density form, we see that

    $ trTC+dlndet(mαβ)=ϑC,
    $

    i.e. the trace of the torsion differs from the density form by something exact. This leads to the following theorem.

    Theorem 6.1. A natural nonholonomic system subject to linear constraints has an invariant volume of the form $ (\pi_Q^*\rho)\cdot\mu_{\mathscr{C}} $ if and only if there exists functions $ \varphi_\gamma $ such that

    $ trTC+φγηγ
    $

    is exact.

    Remark 6.2. The vanishing of the torsion shows that the constraints are integrable while the integrability of the (trace of the) torsion shows that a volume is preserved.

    In the case of nonholonomic systems, the nonholonomic connection is compatible with the metric but has nonzero torsion. This idea extends to arbitrary, metric-compatible connections as the following theorem states.

    Theorem 6.3. Let $ \tilde{\nabla} $ be an affine connection compatible with the metric with torsion $ \tilde{T} $. There exists an invariant volume with density of the form $ \pi_Q^*\rho $ for the geodesic spray if and only if $ \text{tr}\tilde{T} $ is exact.

    Proof. Consider the volume form on $ TQ $ given by

    $ Ω=detgdx1dxndv1dvn.
    $

    We want to compute $ \mathcal{L}_X\Omega $ where $ X $ is the geodesic spray given by

    $ X=vixiΓijkvjvkvi.
    $

    The Lie derivative is then

    $ LXΩ=diXΩ=(d[detg](v)detg(Γiik+Γiki)vk)1detgΩ,
    $

    and therefore the divergence is given by

    $ divΩ(X)=d[lndetg](v)(Γiik+Γiki)vk.
    $
    (6.1)

    We will now use the fact that the connection is compatible with the metric:

    $ gjkxi=gkΓij+gjΓik.
    $

    This implies that

    $ gjkgjkxi=gjkgkΓij+gjkgjΓik=δjΓij+δkΓij=2Γkik.
    $

    Integrating the left-hand side above gives

    $ d[lndetg](v)=2Γikivk.
    $
    (6.2)

    Substituting (6.2) into (6.1), we get

    $ divΩ(X)=(ΓikiΓiik)vk.
    $

    It remains to show that this is the trace of the torsion. Indeed,

    $ ˜T=(ΓkijΓkji)xkdxidxjtr˜T=(ΓikiΓiik)dxk.
    $

    We conclude that

    $ divΩ(X)=tr˜T(v).
    $

    This shows that a way to interpret the torsion of a connection is by measuring how much the geodesic spray fails to preserve volume.

    We end this work with applying both Theorem 5.2 and Theorem 5.6 to various nonholonomic systems by calculating the general divergence for nonlinear systems or $ \vartheta_{\mathscr{C}} $ and $ \zeta_\mathscr{C} $ for affine/linear systems. Examples are taken from [18,20,31].

    We begin by examining multiple nonholonomic systems subject to affine/linear constraints.

    As an example of Theorem 5.6, we will prove that no invariant basic volumes exist for the Chaplygin sleigh. The Chaplygin sleigh has the configuration space $ Q = \mathrm{SE}_2 $, the special Euclidean group, and the following Lagrangian

    $ L=12(m˙x2+m˙y2+(I+ma2)˙θ22ma˙x˙θsinθ+2ma˙y˙θcosθ),
    $

    where $ (x, y)\in\mathbb{R}^2 $ is the coordinate of the contact point, $ \theta\in\mathrm{SO}_2 $ is its orientation, $ m $ is the sleigh's mass, $ I $ is the moment of inertia about the center of mass, and $ a $ is the distance from the center of mass to the contact point (cf. §1.7 in [20]).

    The nonholonomic constraint is that the sleigh can only slide in the direction it is pointing and is given by

    $ ˙ycosθ˙xsinθ=0,
    $

    which corresponds to the 1-form $ \eta = \left(\cos\theta\right)dy - \left(\sin\theta\right)dx $ and function $ \xi = 0 $.

    To determine the existence of an invariant volume, we only need to compute $ \vartheta_{\mathscr{C}} $ as $ \zeta_\mathscr{C} = 0 $. The constraining vector field and 1-form are:

    $ W=ma2+IIm[cosθysinθx]aIθ,η=(cosθ)dy(sinθ)dx.
    $

    This gives us

    $ ϑC=1η(W)LWη=mama2+I[(sinθ)dy+(cosθ)dx].
    $

    As a consequence of this, the divergence of the Chaplygin sleigh is given by

    $ divμC(XDH)=3mavI+ma2,v=˙xcosθ+˙ysinθ.
    $
    (7.1)

    We want to show that for any $ \tilde{\eta}\in\Gamma(\mathcal{D}^0) $, $ \vartheta_{\mathscr{C}}+\tilde{\eta} $ is not exact. Because there is only one constraint, it suffices to show that there does not exist a smooth $ k $ such that $ \vartheta_{\mathscr{C}}+k\cdot\eta $ is closed, i.e. it requires the following to be zero:

    $ d(ϑC+kη)=mama2+I[(cosθ)dθdy(sinθ)dθdx]+(kxcosθ+kysinθ)dxdy+(kθcosθksinθ)dθdy(kθsinθ+kcosθ)dθdx.
    $

    Separating the above, we need the following three to vanish:

    $ 0=kxcosθ+kysinθ,0=kθcosθksinθ+mama2+Icosθ,0=kθsinθ+kcosθ+mama2+Isinθ.
    $
    (7.2)

    The second two lines of (7.2) are overdetermined for $ k $ in the $ \theta $-direction and are inconsistent (unless $ a = 0 $ and we obtain the trivial solution $ k\equiv 0 $). Therefore, there does not exist a smooth $ k $ such that $ \vartheta_{\mathscr{C}}+k\cdot\eta $ is closed. We note that this is compatable with the known result that when $ a = 0 $, no asymptotically stable dynamics occur.

    The next example is that of the falling rolling disk whose configuration space is $ Q = \mathrm{SE}_2\times S^1 $. Its Lagrangian is

    $ L=m2[(ξR(˙φsinθ+˙ψ))2+η2sin2θ+(ηcosθ+R˙θ)2],
    $

    where

    $ ξ=˙xcosφ+˙ysinφ+R˙ψ,η=˙xsinφ+˙ycosφ.
    $

    We will consider both the case of the rolling coin on a stationary table (linear constraints) and on a rotating table (affine constraints).

    Stationary Table The case of the disk on a stationary table was studied in [40]. The constraints are given by the vanishing of the following 1-forms:

    $ η1=cosφdx+sinφdy+Rdψ,η2=sinφdx+cosφdy.
    $
    (7.3)

    The corresponding dual vector fields are

    $ W1=1mcosφx+1msinφy+RIψ,W2=J+mR2Jm+m2R2sin2θ[sinφx+cosφy]RcosθJm+m2R2sin2θθ.
    $

    Computing $ \vartheta_{\mathscr{C}} $, we obtain

    $ ϑC=mR2sin(2θ)J+mR2sin2θ,
    $

    which is exact. The resulting invariant volume is

    $ 1J+mR2sin2θμC.
    $

    Rotating Table Although the case of the stationary table has been studied, the authors are unaware of any results for the case of a rotating table. If the falling rolling disk is placed on a table with constant angular velocity $ \Omega $, the constraints become affine with

    $ η1(v)+ξ1=0,ξ1=Ω(ycosφxsinφ)η2(v)+ξ2=0,ξ2=Ω(xcosφ+ysinφ)
    $

    where $ \eta^\alpha $ are from (7.3). The volume from the stationary table is still preserved as

    $ LW1ξ1=LW2ξ2=0.
    $

    We next consider the case of a non-homogeneous sphere rolling without slipping on a horizontal plane, both stationary and rotating. The center of mass of the sphere is located at its geometric center while its principal moments of inertia are distinct. This example has been studied by Chaplygin [41] and, e.g. [1,42,43].

    The Lagrangian is the kinetic energy,

    $ L=12I1(˙θcosψ+˙φsinψsinθ)2+12I2(˙θsinψ+˙φcosψsinθ)2+12I3(˙ψ+˙φcosθ)2+12M(˙x2+˙y2),
    $

    where it is assumed that the radius is 1.

    Stationary Table When the table is stationary, the constraints are given by the two 1-forms

    $ η1=dxsinφdθ+cosφsinθdψ,η2=dy+cosφdθ+sinφsinθdψ.
    $

    The density form is given by $ \vartheta_{\mathscr{C}} = A\cdot d\theta + B\cdot d\psi $ where

    $ A=Msin(2θ)[J1+J2sin2ψ]2(J3+J4sin2θ+J5sin2θsin2ψ),B=MJ2sin(2ψ)sin2θ2(J3+J4sin2θ+J5sin2θsin2ψ),
    $

    and the constants $ J_j $ are

    $ J1=I1I2I1I3+I2MI3M,J2=I1I3I2I3+I1MI2M,J3=I3M2+I1I2I3+I1I3M+I2I3M,J4=I2M2I3M2+I1I2MI1I3M,J5=I1M2I2M2+I1I3MI2I3M.
    $

    The density form is exact and the resulting invariant volume is

    $ (1+β1β)12cosθ(J3+(J3+J4)tan2θ)12μC,β=J5sin2θsin2ψ2J3+2J4sin2θ+J5sin2θsin2ψ.
    $

    Rotating Table When the table is rotating, the constraints becomes affine

    $ η1(v)+ξ1=0,ξ1=Ωy,η2(v)+ξ2=0,ξ2=Ωx.
    $

    Notice that,

    $ LW1ξ1=LW2ξ2=0,LW2ξ1=LW1ξ2=ΩM.
    $

    As the matrix $ (m_{\alpha\beta}) $ is symmetric, we have that the product

    $ mαβLWαξβ=0.
    $

    Therefore, the Chaplygin sphere on a rotating table is volume-preserving with the same volume as in the stationary case.

    Theorem 5.2 does not require that $ Q $ be orientable. Consider the Möbius strip immersed in $ \mathbb{R}^3 $ by

    $ x=(1+vcos(u2))cos(u),y=(1+vcos(u2))sin(u),z=vsin(u2),
    $
    (7.4)

    for $ 0\leq u\leq 2\pi $ and $ -1/2 < v < 1/2 $. The Euclidean metric pulled back to the Möbius strip is

    $ g=(4vcos(u2)+2v2cos(u2)+v24+2)dudu+2dvdv.
    $

    Notice that the above metric is for the double cover rather than the Möbius strip itself as a consequence of (7.4) only being an immersion. However, the resulting equations of motion will be well-defined on the strip. In two-dimensional systems, any single constraint is automatically holonomic which will make volume-preservation trivial. Let us "thicken" the strip by $ w $ with resulting metric

    $ gthick=(4vcos(u2)+2v2cos(u2)+v24+2)dudu+2dvdv+dwdw.
    $

    Consider the linear nonholonomic constraint

    $ η=dv+sin(u)dw,W=12v+12sin(u)w.
    $

    The density form is

    $ ϑC=sin(u)cos(u)1+sin2(u)du,
    $

    which is exact. This produces the invariant volume

    $ 1+sin2(u)μC,
    $

    which is shown in Figure 2.

    Figure 2.  A plot of the density corresponding to the invariant volume for the nonholonomic system on the Möbius strip.

    The previous examples consisted of a natural Lagrangian and affine constraints which were describable by Theorem 5.6. The next two examples utilize Theorem 5.2 on systems with natural Lagrangians but nonlinear constraints. These examples can be found in [31,44].

    Let $ L:T\mathbb{R}^3\to\mathbb{R} $ be the Lagrangian given by

    $ L=12m(˙x2+˙y2+˙z2)mgz,
    $

    subject to the nonlinear constraint of constant kinetic energy

    $ Ψ=˙x2+˙y2+˙z2c=0,c>0.
    $

    Transferring to the Hamiltonian side, we have

    $ H=12m(p2x+p2y+p2z)+mgz,Φ=12(p2x+p2y+p2z)c=0,
    $

    where $ \Phi $ is normalized. As the constraints are nonlinear, (4.3) can be used to determine the divergence. The requisite data to compute the divergence is

    $ CdΦ=m(pxdx+pydy+pzdy),CdΦ(XΦ)=m(p2x+p2y+p2z),ΔCΦ=3m,ϕ=12mcm(p2x+p2y+p2z),{H,ϕ}=2gcpz(p2x+p2y+p2z).
    $

    Although $ \mathcal{C}^*d\Phi(X_\Phi)\ne 0 $ everywhere, it does in some tubular neighborhood of $ M $. The divergence of the system is given by

    $ divμC(XMH)=3CdΦ([XH,Xϕ])3{H,ϕ}M=18mgcpz(p2x+p2y+p2z)218mgcpz(p2x+p2y+p2z)2=9mgpzc,
    $

    where we used the fact that $ p_x^2+p_y^2+p_z^2 = 2c $. The divergence does not vanish so $ \mu_{\mathscr{C}} $ is not preserved. However, there exists an exact 1-form which produces the divergence:

    $ 9mgcdz(XMH)=divμC(XMH).
    $

    Therefore, the following volume-form is preserved

    $ exp(9mgzc)μC.
    $

    The other nonlinear constraint example we will examine is Appel's example. The Lagrangian is the same as the constant kinetic energy case,

    $ L=12m(˙x2+˙y2+˙z2)mgz,
    $

    while the nonlinear constraint is now

    $ Ψ=a2(˙x2+˙y2)˙z2=0.
    $

    The data on the Hamiltonian side becomes

    $ H=12m(p2x+p2y+p2z)+mgz,Φ=a22(p2x+p2y)12p2z=0,
    $

    where, again, the constraint is normalized. As before, the constraints are nonlinear so (4.3) can be used. The requisite data is

    $ CdΦ=a2mpxdx+a2mpydympzdz,CdΦ(XΦ)=ma4(p2x+p2y)+mp2z,ΔCΦ=m(2a21),ϕ=a2(p2x+p2y)p2z2ma4(p2x+p2y)+2mp2z,{H,ϕ}=a2(a2+1)gpz(p2x+p2y)(a4p2x+a4p2y+p2z)2.
    $

    The constraint is not admissible as $ \mathcal{C}^*d\Phi(X_\Phi) $ vanishes at $ p_x = p_y = p_z = 0 $ which is in the constraint manifold. As long as $ a\ne 1 $, this is the only place where this degeneracy occurs. The divergence is

    $ divμC(XMH)=3CdΦ([XH,Xϕ])3{H,ϕ}M=12a2(a2+1)mgpz(p2x+p2y)(a6p2x+a6p2yp2z)(a4p2x+a4p2y+p2z)3.
    $

    To simplify the divergence, notice that the constraint makes $ p_z^2 = a^2(p_x^2+p_y^2) $. Substituting this, the divergence becomes

    $ divμC(XMH)=12(a21)mg(a2+1)pz.
    $

    The following exact 1-form solves the cohomology equation

    $ 12(a21)a2pzdpz(XMH)=divμC(XMH),
    $

    as

    $ ˙pz=a2mg1+a2.
    $

    Therefore, the following form is preserved

    $ pKzμC,K=12(a21)a2.
    $

    Unfortunately, this form is not a volume-form as it vanishes when $ p_z = 0 $.

    All of the examples examined so far have been for systems whose Lagrangian is natural. We conclude this work with two examples whose Lagrangian is not given by a Riemannian metric.

    Consider the Lagrangian

    $ L=14(˙x4+˙y4+˙z4),
    $

    subject to the nonintegrable constraint $ \dot{z} = x\dot{y} $. The Hamiltonian is

    $ H=34(p4/3x+p4/3y+p4/3z),
    $

    and the constraint becomes

    $ Φ=p1/3zxp1/3y.
    $

    Notice that singularities appear when the momentum vanishes. Continuing with the computations, we have

    $ CdΦ=dzxdyCdΦ(XΦ)=x23p2/3y+13p2/3zϕ=3pz3x2pz+3xpyx2+ν2/3,ν=pypzΔCΦ=0
    $

    Applying the constraint, the divergence simplifies to

    $ divμC(XMH)=3p1/3xx42x(1+x4)=3x42x(1+x4)dx(XMH)
    $

    The following form is exact

    $ α=3x46x(1+x4)dx,
    $

    and an invariant form is

    $ (1+x4)9/4x6μC.
    $

    Notice the singularity at $ x = 0 $.

    Suppose we have the relativistic Lagrangian

    $ L=m0c21v2c2,v2=˙x2+˙y2+˙z2.
    $

    With the same constraints $ \dot{z} = x\dot{y} $. The Hamiltonian and constraint are

    $ H=cp2+m2c2,Φ=c(pzxpy)p2+m2c2.
    $

    The computation yields

    $ CdΦ=dzxdyCdΦ(XΦ)=c((m2c2+p2x)(1+x2)+(xpz+py)2)(m2c2+p2)3/2ϕ=(p2+m2c2)(pzxpy)(m2c2+p2x)(1+x2)+(xpz+py)2ΔCΦ=0
    $

    Applying the constraints, the divergence simplifies to

    $ divμC(XMH)=3xpx(1+x2)m2c2+p2x+(1+x2)p2y=3x1+x2dx(XMH)
    $

    Therefore, an invariant volume is given by

    $ (1+x2)3/2μC.
    $

    We thank Dr. J.C. Marrero for pointing us to existing work in this field as well as the reviewers for helpful comments.

    [1] Krishnamurthy H, Gouaux E (2012) X-ray structures of LeuT in substrate-free outward-open and apo inward-open states. Nature 481: 469–474. doi: 10.1038/nature10737
    [2] Sanders CR, Sonnichsen F (2006) Solution NMR of membrane proteins: practice and challenges. Magn Reson Chem 44 Spec No: S24–40.
    [3] Horst R, Stanczak P, Stevens RC, et al. (2013) beta2-Adrenergic receptor solutions for structural biology analyzed with microscale NMR diffusion measurements. Angew Chem Int Ed Engl 52: 331–335. doi: 10.1002/anie.201205474
    [4] Chun E, Thompson AA, Liu W, et al. (2012) Fusion partner toolchest for the stabilization and crystallization of G protein-coupled receptors. Structure 20: 967–976. doi: 10.1016/j.str.2012.04.010
    [5] Baker LA, Baldus M (2014) Characterization of membrane protein function by solid-state NMR spectroscopy. Curr Opin Struct Biol 27: 48–55. doi: 10.1016/j.sbi.2014.03.009
    [6] Hopkins AL, Groom CR (2002) The druggable genome. Nat Rev Drug Discov 1: 727–730. doi: 10.1038/nrd892
    [7] Overington JP, Al-Lazikani B, Hopkins AL (2006) How many drug targets are there? Nat Rev Drug Discov 5: 993–996. doi: 10.1038/nrd2199
    [8] Bakheet TM, Doig AJ (2009) Properties and identification of human protein drug targets. Bioinformatics 25: 451–457. doi: 10.1093/bioinformatics/btp002
    [9] Maslennikov I, Choe S (2013) Advances in NMR structures of integral membrane proteins. Curr Opin Struct Biol 23: 555–562. doi: 10.1016/j.sbi.2013.05.002
    [10] Klammt C, Maslennikov I, Bayrhuber M, et al. (2012) Facile backbone structure determination of human membrane proteins by NMR spectroscopy. Nat Methods 9: 834–839. doi: 10.1038/nmeth.2033
    [11] Berman HM, Battistuz T, Bhat TN, et al. (2002) The Protein Data Bank. Acta Crystallogr D Biol Crystallogr 58: 899–907. doi: 10.1107/S0907444902003451
    [12] Berman HM, Westbrook J, Feng Z, et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235–242. doi: 10.1093/nar/28.1.235
    [13] Tang M, Comellas G, Rienstra CM (2013) Advanced solid-state NMR approaches for structure determination of membrane proteins and amyloid fibrils. Acc Chem Res 46: 2080–2088. doi: 10.1021/ar4000168
    [14] Ni QZ, Daviso E, Can TV, et al. (2013) High frequency dynamic nuclear polarization. Acc Chem Res 46: 1933–1941. doi: 10.1021/ar300348n
    [15] Zou P, McHaourab HS (2010) Increased sensitivity and extended range of distance measurements in spin-labeled membrane proteins: Q-band double electron-electron resonance and nanoscale bilayers. Biophys J 98: L18–20. doi: 10.1016/j.bpj.2009.12.4193
    [16] Mchaourab HS, Steed PR, Kazmier K (2011) Toward the fourth dimension of membrane protein structure: insight into dynamics from spin-labeling EPR spectroscopy. Structure 19: 1549–1561. doi: 10.1016/j.str.2011.10.009
    [17] Tusnady GE, Dosztanyi Z, Simon I (2004) Transmembrane proteins in the Protein Data Bank: identification and classification. Bioinformatics 20: 2964–2972. doi: 10.1093/bioinformatics/bth340
    [18] Mchaourab HS, Lietzow MA, Hideg K, et al. (1996) Motion of spin-labeled side chains in T4 lysozyme. Correlation with protein structure and dynamics. Biochemistry 35: 7692–7704.
    [19] Hubbell WL, McHaourab HS, Altenbach C, et al. (1996) Watching proteins move using site-directed spin labeling. Structure 4: 779–783. doi: 10.1016/S0969-2126(96)00085-8
    [20] Fanucci GE, Cafiso DS (2006) Recent advances and applications of site-directed spin labeling. Curr Opin Struct Biol 16: 644–653. doi: 10.1016/j.sbi.2006.08.008
    [21] Weierstall U, James D, Wang C, et al. (2014) Lipidic cubic phase injector facilitates membrane protein serial femtosecond crystallography. Nat Commun 5: 3309.
    [22] Liu W, Wacker D, Gati C, et al. (2013) Serial femtosecond crystallography of G protein-coupled receptors. Science 342: 1521–1524. doi: 10.1126/science.1244142
    [23] Li D, Boland C, Walsh K, et al. (2012) Use of a robot for high-throughput crystallization of membrane proteins in lipidic mesophases. J Vis Exp: e4000.
    [24] Li D, Boland C, Aragao D, et al. (2012) Harvesting and cryo-cooling crystals of membrane proteins grown in lipidic mesophases for structure determination by macromolecular crystallography. J Vis Exp: e4001.
    [25] Alexander N, Al-Mestarihi A, Bortolus M, et al. (2008) De novo high-resolution protein structure determination from sparse spin-labeling EPR data. Structure 16: 181–195. doi: 10.1016/j.str.2007.11.015
    [26] Hirst SJ, Alexander N, Mchaourab HS, et al. (2011) RosettaEPR: an integrated tool for protein structure determination from sparse EPR data. J Struct Biol 173: 506–514. doi: 10.1016/j.jsb.2010.10.013
    [27] Islam SM, Stein RA, McHaourab HS, et al. (2013) Structural refinement from restrained-ensemble simulations based on EPR/DEER data: application to T4 lysozyme. J Phys Chem B 117: 4740–4754. doi: 10.1021/jp311723a
    [28] Fischer AW, Alexander NS, Woetzel N, et al. (2015) BCL::MP-Fold: Membrane protein structure prediction guided by EPR restraints. Proteins 83: 1947–1962. doi: 10.1002/prot.24801
    [29] Jeschke G, Chechik V, Ionita P, et al. (2006) DeerAnalysis2006 - a comprehensive software package for analyzing pulsed ELDOR data. Appl Magn Reson 30: 473–498. doi: 10.1007/BF03166213
    [30] Hagelueken G, Ward R, Naismith JH, et al. (2012) MtsslWizard: In Silico Spin-Labeling and Generation of Distance Distributions in PyMOL. Appl Magn Reson 42: 377–391. doi: 10.1007/s00723-012-0314-0
    [31] Beasley KN, Sutch BT, Hatmal MM, et al. (2015) Computer Modeling of Spin Labels: NASNOX, PRONOX, and ALLNOX. Methods Enzymol 563: 569–593. doi: 10.1016/bs.mie.2015.07.021
    [32] Hatmal MM, Li Y, Hegde BG, et al. (2012) Computer modeling of nitroxide spin labels on proteins. Biopolymers 97: 35–44. doi: 10.1002/bip.21699
    [33] Alexander NS, Stein RA, Koteiche HA, et al. (2013) RosettaEPR: Rotamer Library for Spin Label Structure and Dynamics. PLoS One 8: e72851. doi: 10.1371/journal.pone.0072851
    [34] Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234: 779–815. doi: 10.1006/jmbi.1993.1626
    [35] Fiser A, Sali A (2003) Modeller: generation and refinement of homology-based protein structure models. Methods Enzymol 374: 461–491. doi: 10.1016/S0076-6879(03)74020-8
    [36] Webb B, Sali A (2014) Protein structure modeling with MODELLER. Methods Mol Biol 1137: 1–15. doi: 10.1007/978-1-4939-0366-5_1
    [37] Webb B, Sali A (2014) Comparative protein structure modeling using MODELLER. Curr Protoc Bioinformatics 47: 5.6.1–5.6.32.
    [38] Rohl CA, Strauss CEM, Chivian D, et al. (2004) Modeling structurally variable regions in homologous proteins with rosetta. Proteins 55: 656–677. doi: 10.1002/prot.10629
    [39] Misura KM, Chivian D, Rohl CA, et al. (2006) Physically realistic homology models built with ROSETTA can be more accurate than their templates. Proc Natl Acad Sci U S A 103: 5361–5366. doi: 10.1073/pnas.0509355103
    [40] Schwede T, Kopp J, Guex N, et al. (2003) SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res 31: 3381–3385. doi: 10.1093/nar/gkg520
    [41] Zhang Y (2009) I-TASSER: fully automated protein structure prediction in CASP8. Proteins 77 Suppl 9: 100–113.
    [42] Zhang Y (2008) I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9: 40. doi: 10.1186/1471-2105-9-40
    [43] Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738. doi: 10.1038/nprot.2010.5
    [44] Combs SA, Deluca SL, Deluca SH, et al. (2013) Small-molecule ligand docking into comparative models with Rosetta. Nat Protoc 8: 1277–1298. doi: 10.1038/nprot.2013.074
    [45] Stevens RC, Cherezov V, Katritch V, et al. (2013) The GPCR Network: a large-scale collaboration to determine human GPCR structure and function. Nat Rev Drug Discov 12: 25–34.
    [46] Kroeze WK, Sheffler DJ, Roth BL (2003) G-protein-coupled receptors at a glance. J Cell Sci 116: 4867–4869. doi: 10.1242/jcs.00902
    [47] Yamashita A, Singh SK, Kawate T, et al. (2005) Crystal structure of a bacterial homologue of Na+/Cl--dependent neurotransmitter transporters. Nature 437: 215–223. doi: 10.1038/nature03978
    [48] Faham S, Watanabe A, Besserer GM, et al. (2008) The crystal structure of a sodium galactose transporter reveals mechanistic insights into Na+/sugar symport. Science 321: 810–814. doi: 10.1126/science.1160406
    [49] Perez C, Koshy C, Yildiz O, et al. (2012) Alternating-access mechanism in conformationally asymmetric trimers of the betaine transporter BetP. Nature 490: 126–130. doi: 10.1038/nature11403
    [50] Ma D, Lu P, Yan C, et al. (2012) Structure and mechanism of a glutamate-GABA antiporter. Nature 483: 632–636. doi: 10.1038/nature10917
    [51] Kazmier K, Sharma S, Quick M, et al. (2014) Conformational dynamics of ligand-dependent alternating access in LeuT. Nat Struct Mol Biol 21: 472–479. doi: 10.1038/nsmb.2816
    [52] Gregory KJ, Nguyen ED, Reiff SD, et al. (2013) Probing the metabotropic glutamate receptor 5 (mGlu(5)) positive allosteric modulator (PAM) binding pocket: discovery of point mutations that engender a "molecular switch" in PAM pharmacology. Mol Pharmacol 83: 991–1006. doi: 10.1124/mol.112.083949
    [53] Yarov-Yarovoy V, Schonbrun J, Baker D (2006) Multipass membrane protein structure prediction using Rosetta. Proteins 62: 1010–1025.
    [54] Barth P, Schonbrun J, Baker D (2007) Toward high-resolution prediction and design of transmembrane helical protein structures. Proc Natl Acad Sci U S A 104: 15682–15687. doi: 10.1073/pnas.0702515104
    [55] Barth P, Wallner B, Baker D (2009) Prediction of membrane protein structures with complex topologies using limited constraints. Proc Natl Acad Sci U S A 106: 1409–1414. doi: 10.1073/pnas.0808323106
    [56] Nugent T, Jones DT (2012) Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis. Proc Natl Acad Sci U S A 109: E1540–1547. doi: 10.1073/pnas.1120036109
    [57] Nugent T, Jones DT (2013) Membrane protein orientation and refinement using a knowledge-based statistical potential. BMC Bioinformatics 14: 276. doi: 10.1186/1471-2105-14-276
    [58] Hopf TA, Colwell LJ, Sheridan R, et al. (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell 149: 1607–1621. doi: 10.1016/j.cell.2012.04.012
    [59] Weiner BE, Woetzel N, Karakas M, et al. (2013) BCL::MP-fold: folding membrane proteins through assembly of transmembrane helices. Structure 21: 1107–1117. doi: 10.1016/j.str.2013.04.022
    [60] Simons KT, Kooperberg C, Huang E, et al. (1997) Assembly of Protein Tertiary Structures from Fragments with Similar Local Sequences using Simulated Annealing and Bayesian Scoring Functions. J Mol Biol 268: 209–225. doi: 10.1006/jmbi.1997.0959
    [61] Rohl CA, Strauss CE, Misura KM, et al. (2004) Protein structure prediction using Rosetta. Methods Enzymol 383: 66–93. doi: 10.1016/S0076-6879(04)83004-0
    [62] Kazmier K, Alexander NS, Meiler J, et al. (2011) Algorithm for selection of optimized EPR distance restraints for de novo protein structure determination. J Struct Biol 173: 549–557. doi: 10.1016/j.jsb.2010.11.003
    [63] Dimaio F, Leaver-Fay A, Bradley P, et al. (2011) Modeling symmetric macromolecular structures in rosetta3. PLoS One 6: e20450. doi: 10.1371/journal.pone.0020450
    [64] Leaver-Fay A, Tyka M, Lewis SM, et al. (2011) ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 487: 545–574. doi: 10.1016/B978-0-12-381270-4.00019-6
    [65] Metropolis N, Rosenbluth A, Rosenbluth M, et al. (1953) Equations of state calculations by fast computing machines. J Chem Phys 21: 1087–1091. doi: 10.1063/1.1699114
    [66] Metropolis NU, Ulam S (1949) The Monte Carlo Method. J Am Stat Assoc 44: 335–341. doi: 10.1080/01621459.1949.10483310
    [67] Perozo E, Cortes DM, Cuello LG (1999) Structural rearrangements underlying K+-channel activation gating. Science 285: 73–78. doi: 10.1126/science.285.5424.73
    [68] Liu YS, Sompornpisut P, Perozo E (2001) Structure of the KcsA channel intracellular gate in the open state. Nat Struct Biol 8: 883–887. doi: 10.1038/nsb1001-883
    [69] Zou P, Mchaourab HS (2009) Alternating Access of the Putative Substrate-Binding Chamber in the ABC Transporter MsbA. J Mol Biol 393: 574–585. doi: 10.1016/j.jmb.2009.08.051
    [70] Altenbach C, Cai K, Klein-Seetharaman J, et al. (2001) Structure and function in rhodopsin: mapping light-dependent changes in distance between residue 65 in helix TM1 and residues in the sequence 306-319 at the cytoplasmic end of helix TM7 and in helix H8. Biochemistry 40: 15483–15492. doi: 10.1021/bi011546g
    [71] Viklund H, Elofsson A (2008) OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar. Bioinformatics 24: 1662–1668. doi: 10.1093/bioinformatics/btn221
    [72] Adamian L, Liang J (2006) Prediction of transmembrane helix orientation in polytopic membrane proteins. BMC Struct Biol 6: 13. doi: 10.1186/1472-6807-6-13
    [73] Okada T, Sugihara M, Bondar AN, et al. (2004) The retinal conformation and its environment in rhodopsin in light of a new 2.2 A crystal structure. J Mol Biol 342: 571–583.
    [74] Misura KM, Baker D (2005) Progress and challenges in high-resolution refinement of protein structure models. Proteins 59: 15–29. doi: 10.1002/prot.20376
    [75] Tyka MD, Jung K, Baker D (2012) Efficient sampling of protein conformational space using fast loop building and batch minimization on highly parallel computers. J Comput Chem 33: 2483–2491. doi: 10.1002/jcc.23069
    [76] Woetzel N, Karakas M, Staritzbichler R, et al. (2012) BCL::Score--knowledge based energy potentials for ranking protein models represented by idealized secondary structure elements. PLoS One 7: e49242. doi: 10.1371/journal.pone.0049242
    [77] Karakas M, Woetzel N, Staritzbichler R, et al. (2012) BCL::Fold--de novo prediction of complex and large protein topologies by assembly of secondary structure elements. PLoS One 7: e49240. doi: 10.1371/journal.pone.0049240
  • This article has been cited by:

    1. Luis C. García-Naranjo, Rafael Ortega, Antonio J. Ureña, Invariant Measures as Obstructions to Attractors in Dynamical Systems and Their Role in Nonholonomic Mechanics, 2024, 29, 1560-3547, 751, 10.1134/S156035472456003X
  • Reader Comments
  • © 2016 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(7222) PDF downloads(1315) Cited by(1)

Figures and Tables

Figures(6)  /  Tables(2)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog