
Much of the focus of applied dynamical systems is on asymptotic dynamics such as equilibria and periodic solutions. However, in many systems there are transient phenomena, such as temporary population collapses and the honeymoon period after the start of mass vaccination, that can last for a very long time and play an important role in ecological and epidemiological applications. In previous work we defined transient centers which are points in state space that give rise to arbitrarily long and arbitrarily slow transient dynamics. Here we present the mathematical properties of transient centers and provide further insight into these special points. We show that under certain conditions, the entire forward and backward trajectory of a transient center, as well as all its limit points must also be transient centers. We also derive conditions that can be used to verify which points are transient centers and whether those are reachable transient centers. Finally we present examples to demonstrate the utility of the theory, including applications to predatory-prey systems and disease transmission models, and show that the long transience noted in these models are generated by transient centers.
Citation: Ankai Liu, Felicia Maria G. Magpantay, Kenzu Abdella. A framework for long-lasting, slowly varying transient dynamics[J]. Mathematical Biosciences and Engineering, 2023, 20(7): 12130-12153. doi: 10.3934/mbe.2023540
[1] | N. Akhavan Kharazian, F. M. G. Magpantay . The honeymoon period after mass vaccination. Mathematical Biosciences and Engineering, 2021, 18(1): 354-372. doi: 10.3934/mbe.2021019 |
[2] | Yicang Zhou, Yiming Shao, Yuhua Ruan, Jianqing Xu, Zhien Ma, Changlin Mei, Jianhong Wu . Modeling and prediction of HIV in China: transmission rates structured by infection ages. Mathematical Biosciences and Engineering, 2008, 5(2): 403-418. doi: 10.3934/mbe.2008.5.403 |
[3] | Xue Liu, Xin You Meng . Dynamics of Bacterial white spot disease spreads in Litopenaeus Vannamei with time-varying delay. Mathematical Biosciences and Engineering, 2023, 20(12): 20748-20769. doi: 10.3934/mbe.2023918 |
[4] | Asma Alshehri, John Ford, Rachel Leander . The impact of maturation time distributions on the structure and growth of cellular populations. Mathematical Biosciences and Engineering, 2020, 17(2): 1855-1888. doi: 10.3934/mbe.2020098 |
[5] | Tongqian Zhang, Ning Gao, Tengfei Wang, Hongxia Liu, Zhichao Jiang . Global dynamics of a model for treating microorganisms in sewage by periodically adding microbial flocculants. Mathematical Biosciences and Engineering, 2020, 17(1): 179-201. doi: 10.3934/mbe.2020010 |
[6] | Frédéric Mazenc, Gonzalo Robledo, Daniel Sepúlveda . A stability analysis of a time-varying chemostat with pointwise delay. Mathematical Biosciences and Engineering, 2024, 21(2): 2691-2728. doi: 10.3934/mbe.2024119 |
[7] | Yan-Xia Dang, Zhi-Peng Qiu, Xue-Zhi Li, Maia Martcheva . Global dynamics of a vector-host epidemic model with age of infection. Mathematical Biosciences and Engineering, 2017, 14(5&6): 1159-1186. doi: 10.3934/mbe.2017060 |
[8] | Jonathan E. Forde, Bailey Meeker . A model of varicella-zoster reactivation. Mathematical Biosciences and Engineering, 2010, 7(4): 765-777. doi: 10.3934/mbe.2010.7.765 |
[9] | Tailei Zhang, Hui Li, Na Xie, Wenhui Fu, Kai Wang, Xiongjie Ding . Mathematical analysis and simulation of a Hepatitis B model with time delay: A case study for Xinjiang, China. Mathematical Biosciences and Engineering, 2020, 17(2): 1757-1775. doi: 10.3934/mbe.2020092 |
[10] | Ram P. Sigdel, C. Connell McCluskey . Disease dynamics for the hometown of migrant workers. Mathematical Biosciences and Engineering, 2014, 11(5): 1175-1180. doi: 10.3934/mbe.2014.11.1175 |
Much of the focus of applied dynamical systems is on asymptotic dynamics such as equilibria and periodic solutions. However, in many systems there are transient phenomena, such as temporary population collapses and the honeymoon period after the start of mass vaccination, that can last for a very long time and play an important role in ecological and epidemiological applications. In previous work we defined transient centers which are points in state space that give rise to arbitrarily long and arbitrarily slow transient dynamics. Here we present the mathematical properties of transient centers and provide further insight into these special points. We show that under certain conditions, the entire forward and backward trajectory of a transient center, as well as all its limit points must also be transient centers. We also derive conditions that can be used to verify which points are transient centers and whether those are reachable transient centers. Finally we present examples to demonstrate the utility of the theory, including applications to predatory-prey systems and disease transmission models, and show that the long transience noted in these models are generated by transient centers.
Transient phenomena occur everywhere and are a growing subject of interest in modeling, especially in ecology [1,2]. A well-known example of transience is the "honeymoon period" of a disease, a period after the onset of mass vaccination wherein the number of infected individuals temporarily stays at a low level for a very long period of time [3]. This has been observed in both model simulations and real epidemiological systems [4,5]. Honeymoon periods may end with a resurgence of the disease that may be mistakenly attributed to a change in the system (e.g., pathogen evolution) when it is actually only the natural (and possibly volatile) end of a temporary but long-lasting respite from epidemics. Another type of transient phenomena is a temporary "population collapse" that occurs in many relatively simple models of interacting populations. In particular, fisheries exhibit these types of dynamics that can also last for decades despite the implementation of strict management practices [6,7]. A careful understanding of transient dynamics can have important ramifications in the control and prediction of the behaviour of complex systems for which asymptotic dynamics have already been established, including many examples arising from eco-epidemiological applications [8,9,10].
A series of pioneering papers by Hastings et al. [11,12], Morozov et al. [13,14] and Francis et al. [15] showed that transient behaviours can be organized into a classification system, related to the presence of invariant sets (such as saddle points and other saddle-type structures) or the disappearance of one as a parameter value is changed. In [13] long transience were characterized as either (1) having fast transitions between different dynamical regimes relative to other timescales of the dynamics, or (2) evolving "very slowly" for a long time compared to other relevant timescales. Motivated by the second characterization, we proposed definitions of mathematical quantities called "transient points" and "transient centers" in [16] to give technical and model-independent definitions of dynamics for which a certain observable of the system evolves "very slowly" for a "long time". In the case of the honeymoon period of the disease, we can set the observable to be the number of infectious individuals and show that this can be made to vary as slowly as desired for as long a period of time as desired if the trajectory goes through points in phase space with small enough initial numbers of infectious individuals [16]. These points cluster about states with zero infectious individuals and these states form an invariant set near which arbitrarily long and arbitrarily slow honeymoon periods can be generated. We note that only the observable is changing slowly at this point, the other states of the system (such as the number susceptible or recovered individuals), which are usually not observed in application, are typically not changing slowly during the honeymoon period [4].
In this paper we expand upon the previous work in [16] to formally define behaviours like honeymoon periods and temporary population collapses. We also look further into the properties of transient centers and their applications to ecological models. We emphasize here that we use the term "transient centers" for brevity, but what we mean are points that give rise to nearby trajectories where a fixed observable of the system can be made to vary arbitrarily slowly for a temporary period of time that can be made arbitrarily long. The concept of other types of transient dynamics including those characterized by rapid transitions between different regimes is beyond the scope of this study.
In Section 2 we establish our notation and give a quick overview of the results from [16], including our technical definitions of transience and transient centers. In Section 3 we present some extensions of our basic theory of transient centers, and in Section 4 we further extend this theory to apply to reachable transient centers. In Section 5 we derive sufficient and easily verifiable conditions to that which equilibria and associated trajectories are comprised of transient centers. In Section 6 we present applications of the new results to simple systems and in Section 7 we present applications to models from ecology and epidemiology. In Section 8 we review some of the implications of the new results and discuss future directions.
Let f∈C(Rn,Rn). We consider the following autonomous ordinary differential equation system,
dxdt=f(x). | (2.1) |
We use the notation ϕtξ to denote the value at time t of the solution to the initial value problem (2.1) with initialization given by x(0)=ξ. We also use the notation |⋅| to denote the ℓ2 norm. Throughout this manuscript, we make the following two assumptions,
(H1) For any x(0)=ξ∈Rn, the system has a unique solution that is differentiable for all t∈R.
(H2) For any x(0)=ξ∈Rn, the system has a solution that is continuous in ξ for all t∈R.
We begin by reviewing some definitions and theorems from [16]. First, we define transient points as points whose forward trajectory exhibits long transient behaviour.
Definition 2.1 (Transient points, based on Definition 5.1 from [16]). Let ξ∈Rn, v∈C1(Rn,R) and s>0. Let Dv(x)=∇v(x)⋅f(x), this is the derivative of v along the trajectories of solutions to (2.1). We define Ts(ξ) to be,
Ts(ξ)={inf{t≥0:|Dv(ϕtξ)|>s},ifthesetisnon−empty,∞,otherwise. | (2.2) |
Let T>0. We say that ξ is a (v,s,T)−transient point if T<Ts(ξ)<∞.
Thus a (v,s,T)-transient point ξ is a point such that if a trajectory is initialized at ξ, the value of the function v along that solution changes slower than s for longer than a given time scale T. While v is changing slower than s we say that the dynamics are "slow". The requirement that Ts(ξ)<∞ guarantees that the value of v does eventually stop being slow, distinguishing this slow transient dynamics from Lyapunov stable dynamics.
The value of Ts(ξ) clearly depends on the choice of v and the right-hand-side function f in (2.1). When we need to specify v and f clearly we use the notation Tv,fs(x) instead of Ts(x) to denote the expression in (2.2). Next, we define transient centers to be points whose neighborhoods contain transient points of arbitrary slowness s over arbitrary time scales TD.
Definition 2.2 (Transient centers, Definition 6.1 from [16]). Let v∈C1(Rn,R). We call ξ a v-transient center if there exists S>0 such that for all 0<s<S and all T>0, in every neighborhood of ξ there exists a (v,s,T)-transient point.
We also include the following theorem which provides necessary conditions for a point to be a transient center.
Theorem 2.3 (Theorem 6.2 from [16]). Let v∈C1(Rn,R). If ξ is a v-transient center then Dv(ξ)=0.
Proof. If |Dv(ξ)|=c≠0 then from the continuity of Dv we can find a neighbourhood with no (v,s,T)-transient points if s=c2, which contradicts ξ being a v-transient center. A similar proof is provided in [16].
We showed in [16] (and for completeness we will also prove here in Theorem 3.1 parts (a)–(b)) that not only is Dv=0 at the transient center, but it remains zero throughout the entire forward trajectory of ξ. This led us to define Ξ, a collection of all points that satisfies the condition above in [16]. The set Ξ is a set of candidates for transient centers.
Ξ={ξ:∇v(ϕtξ)⋅f(ϕtξ)=0forallt∈R}. | (2.3) |
Clearly this set Ξ depends on the choice of v and the right-hand-side function f in (2.1). When it is important to denote what v and f are being used, we use the notation Ξv,f instead of Ξ to denote the set in (2.3). Clearly any equilibria of (2.1) is in Ξ. For any ξ∈Ξ, it is clear from the definition that along trajectories initialized near ξ the value of v can be made to vary slowly. However, for ξ to be a transient center, we require trajectories to eventually exit the slow region.
In this section we strengthen some of the results we proved in [16] and derive additional results that demonstrate the basic properties of transient centers.
We begin with some of the basic properties of transient centers in Theorem 3.1. Some of these properties have already been presented in [16], but we include those parts here for completeness. The notation Dmv(ϕtξ) refers to the mth derivative with respect to t of v along the trajectory ϕtξ. For any set B⊂Rn we use the notation ϕtB={ϕtx:x∈B}.
Theorem 3.1. Let v∈C1(Rn,R). If ξ is a v-transient center then the following statements hold,
(a) ϕtξ is a v-transient center for all t≥0.
(b) If v∈Cm(Rn,R) and f∈Cm−1(Rn,Rn) then Dkv(ϕtξ)=0 for t≥0, k=1,2,…,m and m∈Z>0.
(c) If Dv(ϕ−tξ)=0 for all t>0, ϕtξ is a v-transient center for all t∈R.
(d) ξ is an (αv+β)-transient center for all α≠0 and β∈R.
(e) If ξ is also a u-transient center and |Du|≤|Dw|≤|Dv|, where u and w∈C1(Rn,R), then ξ is also w-transient center.
(f) Let ξ∈Ξ and ζ∈Rn. Suppose that for any neighborhood B of ζ there exist τ,t∈R such that ϕτξ∈ϕtB then ζ is also a v-transient center.
Proof of (a). Let S>0 be an S that satisfies the requirements of ξ being a transient center. Let t>0 and C be an arbitrary neighborhood of ϕtξ. By (H1)–(H2) we can find a neighborhood B of ξ such that C⊂ϕtB. Let s∈(0,S) and T>0 be arbitrary. By choice of S, we can find a (v,s,T+t)-transient point x in B. This maps to ϕtx∈C which is a (v,s,T)-transient point. This proves that ϕtξ for arbitrary t≥0 is also a v-transient center with the same S. Another similar proof of this result is also presented in [16].
Proof of (b). For any fixed m∈Z>0 the base case k=1 follows from part (a). The result follows from induction.
Proof of (c). Let S>0 be an S that satisfies the requirements of ξ being a transient center. Consider g(x,τ)=|Dv(ϕ−τx)| for x∈Rn and τ>0. From v∈C1(Rn,R), f∈C(Rn,Rn), (H1)–(H2), g(x,τ) is continuous in both of its arguments. From parts (a)–(b) and the requirements of part (c), we must have g(ξ,τ)=0 for all τ. Let t>0, s∈(0,S), T>0 and B be an arbitrary neighborhood of ϕ−tξ. Let R>0 be such that BR(ϕ−tξ)⊂B. Consider the collection of values rτ for τ∈[0,t] defined as rτ=max{r∈[0,R]:|Dv(ϕ−τx)|<s∀x∈Brτ}. By the continuity of g, rτ>0 for all τ∈[0,t] and thus, by compactness, r=minτ∈[0,t]rτ>0. Since ξ is a v-transient center, there exists x∈Br(ξ) that is a (v,s,T)-transient point. This maps to ϕ−tx∈B. By choice of r, x must be a (v,s,T+t)-transient point, which means it is also a (v,s,T)-transient point. Thus ϕ−tξ must be a v-transient center for all t>0. Putting this together with part (a) completes the proof of this part of the theorem.
Proof of (d). This easily follows from Tαv+β,fs(ξ)=Tv,fs/|α|(ξ) if α≠0.
Proof of (e). Let u, v and w∈C1(Rn,R). Suppose that ξ is both a u- and v-transient center and |Du|≤|Dw|≤|Dv(x)| for all x∈Rn. Since ξ is a u and v-transient center, then there is an S>0 such that for every s∈(0,S) and T>0, in every neighborhood of ξ there is an x such that T<Tu,fs(x)<∞ and T<Tv,fs(x)<∞. It follows that, T<Tv,fs(x)≤Tw,fs(x)≤Tu,fs(x)<∞. It follows that ξ is also a w-transient center.
Proof of (f). By parts (a) and (c), since ξ∈Ξ it follows that ϕτ−tξ is a v-transient center for any choice of τ and t. Let S>0 be an S that satisfies the requirements of ϕτ−tξ being a transient center. Let B be a neighborhood of ζ and find τ,t such that ϕτξ∈ϕtB. This means ϕτ−tξ∈B and we can find an x∈B that is a (v,s,T)-transient point for any s∈(0,S) and T>0. Thus ζ is also a v−transient center.
Remark 3.2. Here are some remarks on the statements in Theorem 3.1.
(i) The condition that Dv(ϕtξ)=0 for all t<0 cannot be omitted in part (c). An example of why is presented in Example 6.4.
(ii) Theorem 6.11 from [16], which asserts that if there exists positive constants a and b such that a|Dv|≤|Du|≤b|Dv| then a v-transient center ξ is also a u-transient center, can also be proven from Theorem 3.1 parts (d) and (e).
(iii) Part (f) generalizes part (a) to include limit points. An example of when this applies is when the trajectory of ξ will enter any neighborhood of ζ infinitely often forward or backward in time. This causes the transient center property of ξ to translate to ζ as well. An application of this is given in Example 6.5.
In [16] we proved that if ξ∈Ξ we do not need to find (v,s,T)-transient points for all T>0 to show the transient center property. Instead it is enough to find points that eventually leave the slow region. For completeness we present this theorem here as Theorem 3.3 and give a stronger result in Theorem 3.4.
Theorem 3.3 (Theorem 6.8 from [16]). Let v∈C1(Rn,R) and ξ∈Ξ. If there exists S>0 such that for all 0<s<S, in any neighborhood of ξ there is an x such that Ts(x)<∞ then ξ is a v-transient center.
Proof. Let S be as given in the conditions of the theorem. Let s∈(0,S), B be a neighborhood of ξ and T>0. Recall from Theorem 3.1 parts (a)–(b) we know that Dv(ϕtξ)=0 for all t≥0. From v∈C1(Rn,R), f∈C(Rn,Rn) and (H1)–(H2), there exists r>0 small enough such that Br(ξ)⊂B and Dv(ϕTz)<s for all z∈Br(ξ). Since Br(ξ) is a neighborhood of ξ then by the requirements of the theorem there must be a point x∈Br(ξ) such that Ts(ξ)<∞. By our choice of r this must also satisfy Ts(ξ)>T. Thus x∈B is a (v,s,T)-transient point. Since it is possible to find such a point for all neighborhoods B, s∈(0,S) and T>0, ξ must be a v-transient center. A similar proof of this theorem is also presented in Theorem 6.8 in [16].
The next result changes the previous "if" statement to an "if and only if" statement with a simpler requirement that Ts(x)<∞ for just one fixed slowness value S∗>0 instead of requiring the property for all arbitrarily small slowness values s∈(0,S).
Theorem 3.4. Let v∈C1(Rn,R) and ξ∈Ξ. The point ξ is a v-transient center if and only if there exists S∗>0 such that in every neighborhood of ξ there is an x such that TS∗(x)<∞.
Proof. → Suppose that ξ is a transient center. From the definition of a transient center, there exists an S>0 such that for all given s∈(0,S) and T>0, in every neighborhood of ξ there exists a (v,s,T)-transient point x. Set S∗∈(0,s] and choose any T>0. Recall that a (v,s,T) transient point must have T<TS∗(x)<Ts(x)<∞. Thus in every neighborhood of ξ we can always find an x such that TS∗(x)<∞.
← Suppose that ξ∈Ξ and there exists S∗>0 such that in every neighborhood of ξ there is an x with TS∗(x)<∞. From the definition of Ts(x) and the continuity of Dv(ϕtx), we must have Ts(x)<TS∗(x)<∞ for all s∈(0,S∗). It follows from Theorem 3.3 that ξ is a transient center.
The following corollary is a useful restatement of Theorem 3.4 that makes it clear what conditions are equivalent to a point ξ∈Ξ not being a transient center.
Corollary 3.5. Let v∈C1(Rn,R) and ξ∈Ξ. The point ξ is not a transient center if and only if for all s>0 and there exists a neighborhood N of ξ such that Ts(x)=∞ for all x∈N.
In this section we further develop the ideas behind reachable transient points and reachable transient centers that we introduced in [16]. We begin by recalling the definitions of such points.
Definition 4.1. Consider the system (2.1) with v∈C1(Rn,R). Let T>0, s>0 and ξ be a (v,s,T)-transient point of this system. We call ξ a reachable (v,s,T)-transient point if ξ is also a (v,s,T)-transient point for the system with reversed time,
dˆxdt=−f(ˆx), | (4.1) |
We denote the solution to (4.1) with initial condition x(0)=ξ by ˆϕtξ and note that ˆϕtξ=ϕ−tξ.
Lemma 4.2 (Adapted from Theorem 8.2 from [16]). Let ξ∈Rn, τ>0 and ˆξ=ϕτξ. There exists a unique solution to (4.1) such that ˆϕ0ˆξ=ˆξ and for all t∈[0,τ] we have ϕτ−tξ=ˆϕtˆξ.
Proof. Refer to the proof of Theorem 8.2 in [16].
In this section we also use the notation ˆTs(ξ) denote the hitting time (2.2) for the time-reversed system. In terms of our earlier notation where we specify v and f, we have that ˆTs(ξ)=Tv,−fs(ξ). From Lemma 4.2 it is clear that if ξ is a reachable (v,s,TD)-transient point of (2.1), there exists ˆξ with |Dv(ˆξ)|=s such that,
ˆϕˆTs(ξ)ˆξ=ξ. | (4.2) |
Definition 4.3. Let v∈C1(Rn,R). We call ξ a reachable v-transient center if there exists S>0 such that for all 0<s<S and all T>0, in every open neighborhood of ξ there exists a reachable (v,s,T)-transient point.
Theorem 4.4. Let v∈C1(Rn,R). Suppose that a point ξ∈Rn is a reachable v-transient center of (2.1). Then,
(a) ξ is a v-transient center for both the forward equation (2.1) and the time-reversed equation (4.1).
(b) ξ∈Ξ.
Proof. Part (a) follows from the symmetry in Definitions 2.2, 4.1 and 4.3. Part (b) follows from part (a) and Theorem 3.1 part (a).
The converse of Theorem 4.4 part (a) does not hold. This is is easily illustrated by Example 4.5.
Example 4.5. Consider the system (2.1) in one-dimension with f(x)=|x|. The general solution to this system is,
ϕtx(0)={x(0)e−t,ifx(0)≤0,x(0)et,ifx(0)>0. | (4.3) |
Set v(x)=x and ξ=0. Then, Dv(ϕtx(0))=|ϕtx(0)|. Let s>0 and TD>0. We derive that,
Ts(x(0))=Tv,fs(x(0))={0,ifx(0)<−s,∞,if−s≤x(0)≤0,ln(sx(0)),if0<x(0)≤s,0,ifx(0)>s. | (4.4) |
Clearly, x(0)=ϵse−T for ϵ∈(0,1) are the only (v,s,T)-transient points of (2.1).
Similarly we can solve for the same quantity for the time-reversed system (4.1),
ˆTs(x(0))=Tv,−fs(x(0))={0,ifx(0)<−s,ln(−sx(0)),if−s≤x(0)≤0,∞,if0<x(0)≤s,0,ifx(0)>s. | (4.5) |
It follows that x(0)=−ϵse−T<0 are the only (v,s,T)-transient points of (4.1) for any ϵ∈(0,1).
The calculations above illustrate several things. First, the original (forward time) system (2.1) with f(x)=|x| has (v,s,T)-transient point in every neighborhood of ξ=0. Second, the reversed-time system (4.1) also has a (v,s,T)-transient point in every neighborhood of ξ=0. Third, the two systems have different transient points and these transient points are not reachable. Thus, ξ=0 is a v-transient center of both the forward time system and the reversed-time system, but it is not a reachable v-transient center.
We are able to obtain similar properties as in Theorem 3.1 for reachable transient centers.
Theorem 4.6. Let v∈C1(Rn,R). If ξ is a reachable v-transient center then the following statements hold,
(a) ϕtξ is a reachable v-transient center for all t∈R.
(b) If v∈Cm(Rn,R) and f∈Cm−1(Rn,Rn) then Dkv(ϕtξ)=0 for t∈R, k=1,2,…,m and m∈Z>0.
(c) ξ is a reachable v-transient center for the reversed flow.
(d) ξ is a reachable (αv+β)-transient center for all α≠0 and β∈R.
(e) If ξ is also a reachable u-transient center and |Du|≤|Dw|≤|Dv|, where u and w∈C1(Rn,R), then ξ is also a w-transient center.
(f) If ζ∈Rn is such that for any open neighborhood B of ζ there exist τ,t∈R such that ϕτξ∈ϕtB then ζ is also a v-transient center.
Proof of (a). Let S>0 be an S that satisfies the requirements of ξ being a reachable v-transient point.
Consider g(x,τ)=|Dv(ϕτx)| for x∈Rn and τ∈R. From v∈C1(Rn,R), f∈C(Rn,Rn), (H1)–(H2), g(x,τ) is continuous in both of its arguments. From Theorem 4.4 part (b) we must have g(ξ,τ)=0 for all τ. Let t∈R, s∈(0,S), T>0 and B an arbitrary neighborhood of ϕtξ. Let I=[0,t] if t≥0 and [t,0] otherwise. Let R>0 be such that BR(ϕtξ)⊂B. Consider the collection of values rτ for τ∈I defined as rτ=max{r∈[0,R]:|Dv(ϕτx)|<s∀x∈Brτ}. By the continuity of g, rτ>0 for all τ∈[0,t] and thus, by compactness, r=minτ∈Irτ>0. Since ξ is a reachable v-transient center, there exists x∈Br(ξ) that is a reachable (v,s,T+|t|)-transient point. This maps to ϕtx∈B. By choice of r we get that:
● If t≥0 then ϕtx must be a (v,s,T)-transient point for the forward system and a (v,s,T+2|t|)-transient point for the time-reversed system.
● If t<0 then ϕtx must be a (v,s,T+2|t|)-transient point for the forward system and a (v,s,T)-transient point for the time-reversed system.
In either case, ϕtx is a reachable (v,s,T)-transient point. This proves that ϕtξ is a reachable v-transient center.
Proof of (b). For any fixed m∈Z>0 the base case k=1 follows from Theorem 4.4 part (c). The result follows from induction. The other cases follow using induction.
Proof of (c). This follows from the Definition 4.3.
Proof of (d). This result follows from observing that Tαv+β,fs(ξ)=Tv,fs/|α|(ξ) and Tαv+β,−fs(ξ)=Tv,−fs/|α|(ξ) if α≠0.
Proof of (e). This is very similar to the proof of Theorem 3.1 part (e).
Proof of (f). This uses part (a) and is similar to the proof of Theorem 3.1 part (f).
Remark 4.7. The main differences between the results for reachable transience centers (Theorem 4.6) and the original results (Theorem 3.1) is that we do not require an extra condition for t<0 for part (a), and ξ∈Ξ in part (f) is already guaranteed by Theorem 4.4.
The next result shows that if ξ∈Ξ then we can simplify the conditions to prove that ξ is a reachable transient center. This follows the style of Theorem 3.4 for simple transient centers.
Theorem 4.8. Let v∈C1(Rn,R) and ξ∈Ξ. The point ξ is a reachable transient center if and only if there exists S∗>0 such that in every neighborhood of ξ there is an x such that TS∗(x)<∞ and ˆTS∗(x)<∞
Proof. This proof is similar to the proof of Theorem 3.4.
We now move on to finding conditions for equilibria to be transient centers. Recall that an equilibrium ξ of (2.1) is Lyapunov stable if for every ϵ>0 there exists δ>0 such that |ξ−x|<δ implies |ξ−ϕtx|<ϵ. Using Corollary 3.5, we easily prove that Lyapunov stable equilibria are in Ξ but they cannot be transient centers. This means stable nodes, stable spirals and centers cannot be transient centers for any choice of v.
Theorem 5.1. Let v∈C1(Rn,R).
(a) If ξ is an equilibrium of (2.1) then ξ∈Ξ.
(b) If ξ is a Lyapunov stable equilibrium of (2.1) then ξ is not a v-transient center.
Proof of (a). Let ξ be an equilibrium. Then ϕtξ=ξ, f(ξ)=0 and Dv(ϕtξ)=Dv(ξ)=∇v(ξ)⋅f(ξ)=∇v(ξ)⋅0=0 for all t∈R. This shows that ξ∈Ξ.
Proof of (b). Let ξ be a Lyapunov stable equilibrium. From part (a), ξ∈Ξ. Let s>0 and N1 be any bounded neighborhood of ξ. Since v is continuously differentiable, |∇v(x)| is continuous in x and we can find an M>0 such that |∇v(x)|<M for all x∈N1. Since f is continuous and f(ξ)=0 we can find a neighborhood N2 of ξ such that N2⊂N1 and |f(x)|<sM for all x∈N2. Finally, since ξ is Lyapunov stable, we can find a neighborhood N3 of ξ such that N3⊂N2 and ϕtx∈N2 for all x∈N3 and t≥0. Thus, for any x∈N3, |Dv(ϕtx)|=|∇v(ϕtx)⋅f(ϕtx)|≤|∇v(ϕtx)||f(ϕtx)|<(M)(sM)=s for all x∈N. Since this is true for arbitrary s>0, it follows from Corollary 3.5 that ξ is not a transient center.
Next we recall the definitions of stable/unstable subspaces, sets and manifolds associated with (2.1) following [17]. Here we use the notation ϕ−t to denote the flow in backward time, with ϕ−tˆξ=ˆϕtˆξ. We also use the notation Hv(x) to represent the Hessian matrix of v and Jf(x) to represent the Jacobian of any function f∈C1(Rn,Rn) evaluated at x∈Rn. A superscript T denotes the transpose of matrix or vector.
Definition 5.2 (Stable/unstable subspaces, sets and manifolds [17]). Let f∈C1(Rn,Rn), ξ be an equilibrium of (2.1) and A=Jf(ξ). The subspace E and F defined below are respectively the stable and unstable spaces of the linearization of (2.1) system about ξ.
E=⨁ℜ(λi)<0ker(A−λiI)ai,F=⨁−ℜ(λi)<0ker(A−λiI)ai, | (5.1) |
where λi is an eigenvalue of A with multiplicity ai, for i=1,…,n. We define the stable set WE(ξ) and unstable set WF(ξ) to be,
WE(ξ)={x:limt→∞|ϕtx−ξ|=0},WF(ξ)={x:limt→−∞|ϕtx−ξ|=0}. | (5.2) |
On a fixed neighborhood U(ξ) of ξ, we can also define the stable manifold ME(ξ) and unstable manifold MF(ξ) to be,
ME(ξ)=⋃α>0ME,α,MF(ξ)=⋃α>0MF,α. | (5.3) |
where
ME,α(ξ)={x:ϕtx∈U(ξ)∀t≥0,supt≥0eαt|ϕtx−ξ|<∞},MF,α(ξ)={x:ϕtx∈U(ξ)∀t≤0,supt≤0e−αt|ϕtx−ξ|<∞}. | (5.4) |
If ξ is a hyperbolic fixed point, Teschl [17] has shown that there is a neighborhood U of ξ such that,
G(E∩U)=WE(ξ)∩U=ME(ξ)∩U,G(F∩U)=WF(ξ)∩U=MF(ξ)∩U, |
where G is the topological conjugacy obtained from Hartman-Grobman Theorem in [17] (also presented in the appendix for convenience as Lemma 8.2).
Theorem 5.3. Let v∈C2(Rn,R) and f∈C1(Rn,Rn). Suppose that 0 is a hyperbolic equilibrium of the system (2.1) and A=Jf(0). Let E and F respectively be the stable and unstable eigenspaces associated with 0. If F={0} then 0 cannot be a v-transient center. If E={0} then 0 cannot be a reachable v-transient center.
Proof. This follows from Theorems 5.1 and 4.4.
Now we consider unstable equilibria and when they are transient centers. We first look at unstable equilibria of linear systems.
Theorem 5.4. Let f(x)=A(x−ξ) for some A∈Rn×n with at least one positive and real eigenvalue. Let F be the unstable eigenspace of ξ. If {ξ+z:z∈F}⊄{x:∇v(x)⋅f(x)=0} then ξ is a v-transient center.
Proof. For ease of notation we first set ξ=0. Let A and F be as required in the theorem. We can find a point x∗∈F such that x∗∉{x:∇v(x)⋅f(x)=0}. Then S∗=|∇v(x∗)⋅f(x∗)|>0. Since f(ξ)=0 clearly x∗≠0.
By considering the system in reversed time initialized at x∗ we derive the unique trajectory e−Atx∗ and we know that e−Atx∗∈F for all t (Lemma 8.1). Since this subspace is spanned by the generalized eigenvectors of A corresponding to eigenvalues with positive real parts, we must have e−Atx∗→0 as t→∞. For every neighborhood of 0 there exists a t>0 such that the point e−Atx∗ is in that neighborhood. Going back to the linear flow in forward time, this point has the property that TS∗(e−Atx∗)=t<∞. By Theorem 3.4, 0 is a v-transient center.
The proof of the case for general ξ∈Rn follows easily using the translation y=x−ξ and the system dydt=Ay.
This last theorem on unstable equilibria of linear systems being transient centers can be extended to hyperbolic unstable equilibria of nonlinear systems. Later we present and prove a more general theorem that works for nonlinear systems as Theorem 5.7 which includes a condition to show reachability. The ideas behind this theorem is very similar to the last proof but with more technical detail to deal with the nonlinearity. This requires Lemmas 5.5 and 5.6.
Lemma 5.5. Let v∈C2(Rn,R) and f∈C1(Rn,Rn). The gradient of Dv(ϕtξ) is given by,
∇Dv|ϕtξ=(HTvf+(Jf)T∇v)|ϕtξ. | (5.5) |
Proof. This follows from using the product rule on ∇Dv|ϕtξ=∇(∇v⋅f)|ϕtξ.
Next we present a λ-lemma which we will use to prove our main results in this section. We recall first that λ-lemmas are used to describe the trajectories near a hyperbolic manifold. Here we only present a version for hyperbolic fixed points. For the case of general hyperbolic manifolds refer to [18] and [19].
Lemma 5.6 (λ-lemma). Let f∈C1(Rn,Rn). Suppose that 0 is a hyperbolic equilibrium of the system (2.1) with associated nontrivial stable manifold ME and nontrivial unstable manifold MF. Let x∗∗∈ME and x∗∈MF. For any neighbourhoods B0 of 0, B1 of x∗ and B2 of x∗∗, there exists x∈B0 and t1,t2>0 such that ϕt1x∈B1 and ϕ−t2x∈B2.
Proof. Since 0 is a hyperbolic fixed point then Rn=E⊕F. If E or F is {0} the proof is trivial. So we consider the case when both are non-trivial. By the Hartman-Grobman Theorem (as stated in the appendix as Lemma 8.2) there exists a homeomorphism G with open domain U containing 0 such that for all x∈U,
G∘ϕtx=eAtG(x). | (5.6) |
Let r>0 be sufficiently small so that there exists R>0 such that,
G(Br(0))⊂BR2(0)⊂¯BR(0)⊂G(U). | (5.7) |
Since x∗∗∈ME and x∗∈MF we must have ϕtx∗∗→0 and ϕ−tx∗→0 as t→∞. For simplicity, we first assume that ϕtx∗∗∈Br(0) and ϕ−tx∗∈Br(0) for all t>0. In this case we can just work with intersections of the given neighborhoods with Br(0) instead of the full neighborhoods. So without loss of generality we assume that B1,B2,B⊂Br(0).
Since both E and F have nonzero dimension, for any x we can write G(x)=GF(x)⊕GE(x) where GF(x)∈F and GE(x)∈E. Since x∗∈MF, we know that G(x∗)∈F so GF(x∗)=G(x∗).
We also set,
λ+<=minℜ(λi)>0{ℜ(λi)},λ−>=maxℜ(λi)<0{ℜ(λi)}. |
Since 0 is a hyperbolic fixed point neither of these are zero and we can find positive constants b+<, b−>, and |ℓ|<min{|λ+<|,|λ−<|} such that,
‖e−At‖≤b+<e(ℓ−λ+<)t,‖eAt‖≤b−>e(ℓ+λ−>)t | (5.8) |
where ‖⋅‖ is the ℓ2 matrix norm.
For any t1,t2>0, we can define
x=G−1(G(ϕ−t1x∗)+G(ϕt2x∗∗)). | (5.9) |
Then for all t∈[−t2,t1],
|G∘ϕtx|=|eAtG(x)|,=|eAt[G(ϕ−t1x∗)+G(ϕt2x∗∗)]|,≤|eAtG(ϕ−t1x∗)|+|eAtG(ϕt2x∗∗)|,=|G(ϕt−t1x∗)|+|G(ϕt+t2x∗∗)|,≤R. |
Thus for all t∈[−t2,t1], ϕtx∈U. Since G∘ϕ−t2x=e−At2G(x)=e−A(t1+t2)G(x∗)+G(x∗∗) then,
|G∘ϕ−t2x−G(x∗∗)|≤|e−A(t1+t2)GF(x∗)|,≤b+<e(ℓ−λ+<)(t1+t2)|GF(x∗)|,≤b+<e(ℓ−λ+<)(t1+t2)R2. | (5.10) |
Furthermore, since G∘ϕt1x=eAt1G(x)=eAt1(e−At1G(x∗)+G(ϕt2x∗∗))=G(x∗)+eA(t1+t2)G(x∗∗) then,
|G∘ϕt1x−G(x∗)|≤|eA(t1+t2)GE(x∗∗)|,≤b−>e(ℓ+λ−>)(t1+t2)|GE(x∗∗)|.≤b−>e(ℓ+λ−>)(t1+t2)R2. | (5.11) |
Finally,
|G(x)|≤|G(ϕ−t1x∗)|+|G(ϕt2x∗∗)|,=|e−At1GF(x∗)|+|eAt2GE(x∗∗)|,≤b+<e(ℓ−λ+<)t1|GF(x∗)|+b−>e(ℓ+λ−>)t2|GE(x∗∗)|,≤b+<e(ℓ−λ+<)t1R2+b−>e(ℓ+λ−>)t2R2. | (5.12) |
Since ℓ−λ+< and ℓ+λ−> are both negative numbers, we can take t1,t2 large enough so that G∘ϕ−t2x∈G(B1), G(x)∈G(B0) and G∘ϕt1x∈G(B2). Since G is a homeomorphism, these choices of t1, t2 and the choice of x as defined in (5.9) have the required properties of the theorem.
We now consider the general case (no longer assuming that ϕtx∗∗∈Br(0) and ϕ−tx∗∈Br(0) for all t>0). Since x∗∗∈ME and x∗∈MF, there exists s>0 such that ϕtx∗∗∈Br(0) and ϕ−tx∗∈Br(0) for all t>s. Let ˆx∗∗=ϕsx∗∗ and ˆx∗=ϕ−sx∗∗. The results above can be applied to ˆx∗∗ and ˆx∗. This means for every neighborhood B of 0, ˆB2 of ˆx∗∗ and ˆB1 of ˆx∗ we can derive ˆt1,ˆt2>0 and x such that x∈B, ϕˆt1x∈ˆB1 and ϕ−ˆt2x∈ˆB2. Now, for any neighborhood B2 of x∗∗ and B1 of x∗, we can set ˆB2=ϕsB2={ϕsy:y∈B2} and ˆB1=ϕsB1={ϕsy:y∈B1} which are neighborhoods of ˆx∗∗ and ˆx∗ respectively. Applying the previous result and setting t1=s+ˆt1, t2=s+ˆt2 completes the proof of the theorem.
We are now in the position to provide conditions that guarantee that a hyperbolic equilibrium point is a reachable transient center. To prove this we use the λ-lemma and the idea that one way to reach the slow region is by staying close to the stable manifold.
Theorem 5.7. Let v∈C2(Rn,R) and f∈C1(Rn,Rn). Suppose that 0 is a hyperbolic equilibrium of the system (2.1) and A=Jf(0). Let E and F respectively be the stable and unstable eigenspaces associated with 0. If AT∇v(0)⊥̸F then 0 is a v-transient center. If in addition, AT∇v(0)⊥̸E, then 0 is a reachable v-transient center.
Proof. Since 0 is a hyperbolic fixed point then none of the eigenvalues of A have zero real part. If we were only interested in transient centers without reachability, a proof of the first part of this theorem can be similar to the proof of Theorem 5.4 where we selected a point in the unstable subspace and mapped it backwards in time closer to the unstable equilibrium. In the nonlinear case we would instead choose a point in the unstable manifold and map it backwards in time towards the equilibrium.
We now give a proof of the theorem for the case when both E and F have nonzero dimension and the second part of the theorem, which is concerned with reachability, can apply. To prove both parts, we cannot just select a point on the unstable manifold since those are not reachable transient points. Instead we select a point that that has components in both the unstable and stable manifold.
For any v∈C2(Rn,R), it is clear that 0∈Ξ. Since 0 is an equilibrium we must have ϕt0=0 for all t. In addition, f(ϕt0)=f(0)=0 and Jf(ϕt0)=A. Using Lemma 5.5 at ξ=0 yields ∇(Dv)(0)=AT∇v(0). Suppose AT∇v(0)⊥̸F, since the level curve Dv=0 is perpendicular to ∇(Dv) at x=0, we thus know that F is transversal to the level curve Dv=0. By Theorem 9.6 in [17] we know there exists an invariant unstable manifold MF that is tangent to F at x=0. We apply the same argument on unstable manifold. Therefore for all small enough r>0, there exists a k>0 and x∗∈MF∩Br(0), x∗∗∈ME∩Br(0) such that |Dv(x∗)|>k and |Dv(x∗∗)|>k. An example illustration is provided in Figure 1.
By Lemma 5.6 we know that for any neighborhood Br2(x∗∗), Br0(0) and Br1(x∗), there exists x∈Br0(0) and t1,t2 such that ϕt1x∈Br1(x∗) and ϕ−t2x∈Br2(x∗∗). By taking r1 and r2 to be small enough we know that |D(ϕt1x)|>k2 and |D(ϕt2x)|>k2. Thus Tk2(x)<∞ and ˆTk2(x)<∞. Since r0 is arbitrary, it follows from Theorem 4.8 that 0 is a reachable v−transient center.
Recall that in Theorem 3.1 the transient center property can translate along the trajectory as well as to an ω-limit point. The next theorem shows that such property can also translate from the limit point back to any trajectories that converges to it if η is hyperbolic fixed point and the trajectory is in Ξ.
Theorem 5.8. Let v∈C2(Rn,R) and f∈C1(Rn,Rn). Suppose that 0 is a hyperbolic equilibrium of the system (2.1) and A=Jf(0). Let ξ∈Ξ. If AT∇v(0)⊥̸F and ϕtξ→0 then 0 and ξ are both v-transient centers.
Proof. From the first part of Theorem 5.7, 0 is a v-transient center. The proof that ξ is also a v-transient center is similar to the proof of Theorem 5.7. Again let U and G be as given by the Hartman-Grobman theorem so that U is neighborhood of 0 such that for all x∈U, whenever ϕtx∈U, we have (5.6) holds. We also again choose r>0 and R>0 sufficiently small so that (5.7) holds.
Let x∗∈MF be chosen so that for some k>0, |Dv(x∗)|>k (it is possible to do this following the same reasoning as in Theorem 5.7). Since ϕtξ→0, we know that ξ∈ME. By Lemma 5.6, for any r1>0, r2>0, and r>0, there exists x∈Br(0), t1>0 and t2>0 such that ϕ−t2x∈Br2(ξ) and ϕt1x∈Br1(x∗). Thus by taking r1 small enough we can get ϕt1x close enough to x∗ so that |D(ϕt1x)|>k2. Thus Tk2(x∗∗)≤t1+t2<∞. Since ξ∈Ξ and r2 is arbitrary, by Theorem 3.4 it follows that ξ is a v−transient center.
In this section we present some simple applications to help demonstrate the results of the previous three sections. For brevity we use the notation (a,b) for a vector, we mean a column vector [a,b]T. Additionally, instead of using x to denote a vector representing the state of the system as in previous sections, here we use x to denote the first component of two-dimensional states denoted by (x,y).
Example 6.1 (Application of Theorems 3.4, 4.8, 5.4, and 5.7). Consider the following system,
ddt[xy]=[101−1]⏟A[xy]. | (6.1) |
Let v(x,y)=x. We note that this simple, linear system was discussed in Example 6.13 from [16] where we proved that the origin is a v-transient center using a preliminary theorem on transient centers. Here we prove the same result using Theorem 5.4, and furthermore show that the origin is not reachable. In addition, we show that the origin is a reachable v-transient center if we change the choice of observable to v(x,y)=y.
We first set v(x,y)=x. The eigenvalues of A are 1 and -1. This implies that (0,0) is an unstable hyperbolic equilibrium (specifically a saddle point). The eigenvector of A corresponding to the positive eigenvalue λ=1 is (2,1) so F=span((2,1)). We also easily compute that {(x,y):∇v(x,y)⋅f(x,y)=0}={(0,y):y∈R}=span((0,1)). Clearly, F=span((2,1))⊄span((0,1))={(x,y):∇v(x,y)⋅f(x,y)=0}. By Theorem 5.4, the origin is a v-transient center.
To show that the origin is not a reachable v-transient center, we use the explicit solution of the linear system. Starting from an initial point (x(0),y(0)), the solution for all t∈R is given by,
(x(t),y(t))=(x(0)et,y(0)e−t+x(0)(et−e−t)2). | (6.2) |
Thus, v(t)=x(0)et. Let S∗>0. For any open neighborhood of the origin we can find (x(0),y(0)) such that TS∗(x(0),y(0))<∞. From the exact solution, such a point needs to satisfy 0<|x(0)|<S∗ and TS∗(x(0),y(0))=ln(S∗x(0)). However, any such point will also have |v(t)|=|x(0)|et<S∗ for all t<0 implying that ˆTS∗(x(0),y(0))=∞. By Theorem 3.4 the origin is a v-transient center and by Theorem 4.8 it is not a reachable v-transient center.
Now consider the case v(x,y)=y. We can use Theorem 5.7 to prove that ξ is a reachable v-transient center. We already found F=span((2,1)). It is also easy to show that E=span((0,1)). Since
AT∇v(ξ)=[110−1][01]=[1−1]. | (6.3) |
Clearly, AT∇v(ξ)⊥̸E and AT∇v(ξ)⊥̸F. Thus ξ is a reachable v-transient center. This result is maintained even if we add small nonlinear perturbations to the system.
Example 6.2 (Application of Theorems 3.1 and 5.1). Consider the system given by,
ddt[xy]=[−xyx−1]. | (6.4) |
Let v(x,y)=x. This is Example 5.5 from [16] where we gave a direct proof that ξ=(0,0) is a v-transient center. Here we further show that all points in the candidate set Ξ (given by (2.3)) are v-transient centers except for the equilibrium at (1,0).
From part (a) in Theorem 3.1, all points on the trajectory of ξ=(0,0) in forward and backward time, which is the invariant set L={(0,y):y∈R}, must be v-transient centers. Consider the set Ξ corresponding to this system,
Ξ={(x0,y0):(1,0)⋅(−x(t)y(t),x(t)−1)=0}={(x0,y0):x(t)y(t)=0}=L∪{(1,0)}. | (6.5) |
The point (1,0) is the only point in Ξ that is not in the invariant set L. This point is the only equilibrium of the system. The Jacobian evaluated at (1,0) has eigenvalues ±i, which means that (1,0) is a non-hyperbolic equilibrium. Observe that while trajectories have xy≠0 we can write dydx=x−1−xy, which allows us to find a constant C along such portions of the trajectory by solving this separable differential equation and deriving
x−ln(|x|)+y22=C. | (6.6) |
We can now show that (1,0) is a Lyapunov stable equilibrium by considering the function,
V(x,y)=x−1−ln(|x|)+y22. | (6.7) |
Outside the invariant set L, this function is positive except it is zero at the point (1,0). Furthermore, along trajectories of (6.4) outside of L we have dVdt=(1x−1,−y)⋅(−xy,x−1)=0. Thus V is a Lyapunov function that shows the point (1,0) is Lyapunov stable. From Theorem 5.1, (1,0) cannot be a v-transient center for any choice of v. This gives an example of a point in Ξ that is not a v-transient center.
Example 6.3 (Application of Theorems 5.3 and 5.7). Consider the system given by,
ddt[xy]=[y2y−4x−x3]. | (6.8) |
The origin is clearly an equilibrium of this system. The Jacobian of the right-hand-side function evaluated at (0,0) is given by,
A=[01−42], | (6.9) |
The eigenvalues of the Jacobian evaluated at the origin are 1±√3i. Thus the point (0,0) is an unstable hyperbolic fixed point of this system. The set F from (5.1) is all of R2. Thus Theorem 5.7 guarantees that any choice of v such that AT∇v(0,0)≠(0,0), the origin would be a v-transient center. Since det(A)≠0 then any choice of v such that ∇v(0,0)≠(0,0), such as v=x or v=y, is guaranteed to make the origin a v-transient center. We note that it is evident from the phase-plane diagram of this system that there are also non-constant choices of v such that ∇v(0,0)≠(0,0) that would make the origin a v-transient center.
We also note that for this system, E={(0,0)}. So for any choice of v, we have AT∇v(0,0)⊥E. and we cannot apply the second part of Theorem 5.7. From Theorem 5.3, the origin cannot be a reachable v-transient center for any choice of v.
Example 6.4 (Illustrating the necessity of the condition in Theorem 3.1 part (c)). Consider the following system,
ddt[xy]=[10], | (6.10) |
v={13x3−xy,ifx<0,y−yex,ifx≥0. | (6.11) |
Here we show that the condition that Dv(ϕtξ)=0 for all t<0 in Theorem 3.1 part (c) cannot be omitted.
We note that v∈C1(R2,R2) and,
Dv={x2−y,ifx<0,−yex,ifx≥0. | (6.12) |
Since x(t)=x(0)+t and y(t)=y(0) for all t∈R we find that Ξ={(x,0):x≥0}. Let ϵ∈(0,1). Starting from initial point (x,ϵ) with x≥0 the forward trajectory is (x+t,ϵ), and |Dv(x+t,ϵ)|=ϵex+t for t≥0. Thus every point (x,0) for x≥0 is clearly a v−transient center. In particular that means ϕt(0,0)=(t,0) for all t≥0. However if we consider t<0, ϕt(0,0)=(t,0) is not a reachable transient center since Dv(t,0)=t2≠0 for t<0.
Example 6.5 (Application of Theorems 3.1 part (f), 3.4 and 5.1). Consider the following system,
ddt[xy]=[−xy(x−1)(y+1)]. | (6.13) |
Let v=x. We restrict our focus to the dynamics on the invariant subset {(x,y):x≥0,y≥−1}.
We first prove that (0,0) is a v-transient center and use this to prove that (0,−1) is also a v-transient center. Both are equilibria of (6.13) and are therefore, by Theorem 5.1, in Ξ. We note that (0,0) is a non-hyperbolic equilibrium of (6.13) so our results for hyperbolic equilibria do not apply.
Observe that in the set K:={(x,y):0≤x≤1,−1≤y≤0} we have x′≥0 and y′≤0. So for any small ϵ∈(0,1) such that the initial point (x(0),y(0))=(ϵ,−ϵ)∈K, the resulting forward trajectory cannot exit K via the boundaries x=0 and y=0. In addition, y′=(x−1)(y+1)≥(ϵ−1)(y+1) then by Gronwall's inequality, y(t)≥−1+(1−ϵ)e(ϵ−1)t which implies that the trajectory also cannot exit K via the boundary y=−1. Finally, y(t)≤y(0)=−ϵ for t≥0 so x′=−xy≥ϵx, by Gronwall inequality we have x(t)≥eϵtx(0)=ϵeϵt. Thus this trajectory will exit K in finite time t∗>0 via the boundary x=1. Let (x(t∗),y(t∗))=(1,y∗) be the point of exit where we must have y∗∈(−1,−ϵ).
Let L(x,y)=(x−ln(x))+(y+1−ln(y+1)). Then,
DL(x,y)=(1−1x)(−xy)+(1−1y+1)(x−1)(y+1)=0, |
which means L is invariant along trajectories of (6.13). Thus for small ϵ,
y∗+2−ln(y∗+1)=L(1,y∗)=L(ϵ,−ϵ)=ϵ−ln(ϵ)+ϵ+1−ln(ϵ+1)∼−ln(ϵ). |
This shows us that as ϵ→0+ we take y∗→−1. Thus we can choose ϵ small enough so that,
|Dv(1,y∗)|=|−y∗|≥12. |
Since this is true for all small enough ϵ>0 we conclude that (0,0) is a v−transient center by using S=12 in Theorem 3.4. Now since {(0,−1)} is the omega limit set of (0,0), by Theorem 3.1 part (f), we can also conclude that (0,−1) is a v-transient center.
Example 7.1. Consider the specific form of the predator-prey model studied in Hastings et al. [11],
dxdt=ax(1−xK)−rxyx+h,dydt=ϵ(brxyx+h−my). | (7.1) |
Here x denotes the prey population and y denotes the predator population. We consider the dynamics in the invariant and biologically relevant set {(x,y):x≥0,y≥0}. All model parameters are assumed to be positive. The parameter a is the prey reproduction rate, K is the prey carrying capacity and m is the predator death rate. The interaction between predator and prey is described using a type Ⅱ functional response with parameters r and b. The parameter ϵ is called the slow-fast parameter since taking small values of ϵ allows the timescale of the dynamics of y to be longer than x.
We previously showed numerically in [16] that if solutions are initialized close to the invariant set {x=0}, we get trajectories that remain close to that set for a long time at the beginning of the trajectory (but when they move away they do not necessarily return to be close to x=0 later on as happens in the case when ϵ is small). This showed that the parameter ϵ is not the direct cause of long transience about x=0, rather that is caused by the invariant set itself. However, taking small values of ϵ does tend to push trajectories of this system close to the invariant set which makes the transient behaviour more "reachable".
Let v=x. We now formally prove that points in the invariant set {(0,y):y≥0} are v-transient centers and therefore cause arbitrarily slow dynamics for arbitrarily long periods of times (for any fixed ϵ>0). We begin by noting that the origin is an equilibrium for the system with the Jacobian matrix evaluated at the origin given by,
A=Jf(0,0)=[a00−ϵm]. | (7.2) |
Thus E=span{(0,1)} and F=span{(1,0)}. Since (0,0) is a hyperbolic equilibrium, it follows from [17] that dim(ME)=dim(MF)=1. Moreover, it is easy to check that WE={(0,y):y∈R} is part of the stable set as defined in Definition 5.2.
The set {Dv=0}={(0,y):y≥0}∪{(x,y):x≠0,y=a(x+h)r(1−xK)}. Since {(0,y):y≥0} is invariant, we have {(0,y):y≥0}⊆Ξ. Finally, since ∇v=(1,0) we compute that,
AT∇v((0,0))=(a,0). |
From (a,0)⋅(1,0)=a≠0 we know AT∇v(0,0)⊥̸F. By Theorem 5.8, (0,0) is a v-transient center and so are all points on {(0,y):y≥0}. This shows that this invariant set causes arbitrarily slow changes in the prey for arbitrarily long periods.
Since AT∇v(0,0)⊥E we cannot use Theorem 5.7 to show that (0,0) is a reachable transient center and have this reachability property translate to all of {(0,y):y≥0}. However, we can prove this for the case when m<br. The proof is provided below and divided into three parts.
Theorem 7.2. Consider the system given in Example 7.1 with v=x. The set {(0,y):y≥0} is comprised of reachable v-transient centers.
Let m<br. It follows that 0<br<2br−m and,
0<hm2br−m<hmbr<h. | (7.3) |
Set δ=min{hm2br−m,K2}, ˆη=(δ+h)(a+1)r, η=ah(1−δK)2r and S∗=12min{δ(1+aδK),aδ(1−δK)(1−h2(δ+h))}. We also use the notation R2+={(x,y):x>0,y>0} and define the following sets in R2+:
L={(x,y):0<x<δ,0<y<η},ˆL={(x,y):0<x<δ,y>ˆη} | (7.4) |
Part 1. Proof that TS∗(x0,y0)<∞ for all (x0,y0)∈L. We already know that the trajectories of this system initiated in L cannot exit L via the x=0 boundary or the y=0 boundary. We will show that an initial point (x0,y0)∈L with trajectory ϕt(x0,y0)=(x(t),y(t)) exits L for the first time via the x=δ boundary in finite time. Since xx+h is monotone increasing for x>0 it follows from (7.3) and m<br that on L,
m−brxx+h>m−brhmbrhmbr+h=m−mmbr+1>m2. | (7.5) |
It follows from this and (7.1) that dydt=−ϵy(m−brxx+h)<−ϵym2 while the trajectory is in L. By Gronwall's inequality, y(t)≤y0e−ϵm2t∈(0,η) and thus the trajectory cannot exit L via the y=η boundary either.
Next, we note that in L we have,
dxdt=ax(1−xK)−rxyx+h>(a(1−δK)−ryh)x>(a(1−δK)−rηh)x=a(1−δK)x2≥ax4. | (7.6) |
Thus by Gronwall's inequality we must have x(t)≥x0ea4t for all t≥0 such that the trajectory remains in L. Thus, ϕt(x0,y0)=(x(t),y(t)) will exit L via the x=δ boundary in finite time. We denote this time by t∗ and so we can write x(t∗)=δ and y(t∗)∈(0,η). Clearly t∗<4aln(δx0).
Finally, since y(t∗)<η=ah(1−δK)2r, we obtain,
|Dv(ϕt∗(x0,y0)|=|dx(t∗)dt|=aδ(1−δK)−rδy(t∗)δ+h≥aδ(1−δK)(1−h2(δ+h))>S∗. | (7.7) |
This proves that TS∗(x0,y0)<t∗<∞. This holds for all (x0,y0)∈L.
Part 2. Proof that ˆTS(ˆx0,ˆy0)<∞ for all (ˆx0,ˆy0)∈ˆL. Consider the time-reversed system (4.1) where f is given in (7.1). We already know that the trajectories of this system initiated in ˆL cannot exit ˆL via the ˆx=0 boundary. We will show that an initial point of the form (ˆx0,ˆy0)∈ˆL with trajectory ˆϕt(ˆx0,ˆy0)=(ˆx(t),ˆy(t)) exits ˆL for the first time via the ˆx=δ boundary in finite time. In this case since the bounds on ˆx are the same as the bounds on x in Part 1, we can apply (7.5) and derive that for as long as the trajectory remains in ˆL we have dˆydt=ϵˆy(m−brˆxˆx+h)>ϵˆym2. By Gronwall's inequality it follows that ˆy(t)≥ˆy0eϵm2t>ˆη and the trajectory cannot leave ˆL via the ˆy=ˆη boundary.
Next we note that in ˆL we have,
dˆxdt=rˆxˆyˆx+h−aˆx(1−ˆxK)>(rˆyδ+h−a)ˆx>(rˆηδ+h−a)ˆx=ˆx. | (7.8) |
By Gronwall's inequality, we must have ˆx(t)≥ˆx0et. Thus, for (ˆx0,ˆy0)∈ˆL the trajectory ˆϕt(ˆx0,ˆy0) will exit ˆL via the ˆx=δ boundary in finite time. We denote this time by ˆt∗ so we can write ˆx(ˆt∗)=δ and ˆy(t∗)>ˆη. Clearly ˆt∗<ln(δˆx0).
Finally, since ˆy(t∗)>ˆη=(δ+h)(a+1)r then,
|Dˆv(ˆϕˆt∗(ˆx0,ˆη))|=|dˆx(ˆt∗)dt|=rδˆy(ˆt∗)δ+h−a(1−δK)δ>δ(1+aδK)>S∗. | (7.9) |
This implies that ˆTS∗(ˆx0,ˆy0)<t∗<∞. This argument holds for all (ˆx0,ˆy0)∈ˆL.
Part 3. Proof that {(y,0):y≥0} is comprised of reachable transient centers. We first note that η=ah(1−δK)2r≤ah2r<(δ+h)(a+1)r=ˆη. Also, on the invariant set x=0, we have dydt=−ϵy(m−brxx+h)=−ϵmy so ϕt(0,y0)=(0,y0e−ϵmt). Thus, ϕτξ=ϕτ(0,2ˆη)=(0,η2) if τ=1ϵmln(4ˆηη).
Let ξ=(0,2ˆη) which is clearly in Ξ. By continuity of the solutions with respect to initial conditions, we can guarantee that after time τ our solution is close to (0,η2) by initializing close to (0,2η) at time 0. In particular, we can find an r>0 small enough such that for ˆx0∈(0,r) it follows that ϕτ(ˆx0,2η) is close enough to (0,η2) to guarantee that it is in L. Thus by our choice of r, ˆx0∈(0,r) implies (ˆx0,2η)∈ˆL and ϕτ(ˆx0,2η)∈L. From parts 1–2, it follows that TS∗(ˆx0,2η)≤τ+TS∗(ϕτ(ˆx0,2η))<∞ and ˆTS∗(ˆx0,2η)<∞. Since we can find such a point (ˆx0,2η) in every neighborhood of (0,2η), it follows from Theorem 4.8 that (0,2η) is a reachable v-transient center. Since the forward and reversed time trajectory of (0,2η) is {(0,y):y>0} and the forward trajectory has limit point (0,0), it follows from Theorem 4.6 that the entire set {(0,y):y≥0} is comprised of reachable v-transient centers.
Example 7.3. Consider susceptible-infectious-recovered (SIR) model with vaccination. This has been studied in detail in the literature including [20] and their transient dynamics have been explored in [4,16].
dSdt=(1−p)μ−βSI−μS,dIdt=βSI−γI−μI. |
We assume all parameters are positive and that the basic reproduction number Rp=β(1−p)γ+μ>1 with R0=βγ+μ being the basic reproduction number in the absence of vaccination. The dynamics are invariant and biologically feasible in the region where S≥0 and I≥0. This has a unique disease free equilibrium (DFE) at (1−p,0) and an endemic equilibrium at (1R0,μβ(Rp−1)). For this system we set v=I, the fraction of infectious individuals.
The Jacobian at the DFE is
A=Jf|DFE=[−μ−β(1−p)0β(1−p)−γ−μ] |
The eigenvalues λ1 and λ2 with corresponding eigenvectors v1 and v2 are given by,
λ1=−μ,λ2=β(1−p)−γ−μ,v1=(1,0),v2=(β(1−p),γ−β(1−p)). |
Since Rp>1, we have that E=span(v1), F=span(v2). For v=I, we have Dv=dIdt=(βS−γ−μ)I. Thus, {(S,I):S≥0,I=0}⊂Ξ and
AT∇v=(0,β(1−p)−γ−μ). |
Clearly AT∇v⊥̸F since Rp>1. It is also easy to show that DFE attracts every point from {(S,I):S≥0,I=0}=Ξ. Thus, by Theorem 5.8 the set {(S,I):S≥0,I=0} is comprised of v-transient centers.
Since AT∇v⊥E, we cannot use Theorem 5.7 to show reachability of (0,0) and thus we cannot easily say anything about the reachability of {(S,I):S≥0,I=0}. In this case it may be possible to make arguments similar to what we did in Example 7.1 to show that under certain parameter conditions the set is comprised of reachable transient centers. In particular, if initial conditions are given by the prevaccine era endemic equilibrium (obtained by setting p=0 in the expression for the endemic equilibrium), under certain parameter regimes we obtain numerical trajectories that get very close to {(S,I):S≥0,I=0} leading to prolonged honeymoon periods after the initiation of mass vaccination. Deriving these parameter regimes is a subject of future study but some numerical results are availabline in [4].
Example 7.4. We consider another type of SIR model. This extension of the SIR model allows for a changing total population size and is from Shan et al. [21].
dSdt=A−δS−βSIS+I+R,dIdt=−(δ+ν)I−μ(b,I)I+βSIS+I+R,dRdt=μ(b,I)I−δR. |
We assume all parameters are positive and limI→0μ(b,I)=μ1 for all b>0. The dynamics are invariant and biologically feasible in the region where S≥0, I≥0 and R≥0. There is a unique disease free equilibrium (DFE) at ξ=(Aδ,0,0). We further assume that the basic reproduction number of this system R0=βd+ν+μ1>1. We again set the observable v=I for this system.
The Jacobian at the DFE is,
A=Jf|DFE=[−δ−β00−(δ+ν)−μ1+β00μ1−δ] |
The eigenvalues and corresponding eigenvectors are given by,
λ1=λ2=−δ,λ3=β−δ−ν−μ1v1=(1,0,0),v2=(0,0,1),v3=(−βμ1,β−ν−μ1μ1,1). | (7.10) |
Since R0>1, we have that E=span(v1,v3) and F=span(v3). Thus, {(S,I,R):S≥0,I=0,R≥0}⊂Ξ,
AT∇v=(−β,β−δ−ν−μ1,μ1), |
and AT∇v⊥̸F. It is also easy to show that DFE attracts every point from {(S,I,R):S≥0,I=0,R≥0}=Ξ. Thus by Theorem 5.8 this set is comprised of v-transient centers.
We presented a more comprehensive quantification of long transient dynamics that was initially developed in [16]. Here we focus on transient centers, points in state space that give rise to long transience in its vicinity. One interesting aspect of transient centers is that it can easily translate from point to point. If an initial point is a transient center, this property translates along the trajectory forward in time. This property can also translate in reversed time along the trajectory if the initial point is in Ξ, a set of candidates for reachable transient centers. Moreover, if the trajectory converges to a limit point and it belongs to Ξ, the limit point will also be transient center.
We also further developed the concept of reachability, an important property that makes the transient dynamics generated by transient centers attainable from other points in state space. Many of our results for basic transient centers were extended to reachable transient centers. In addition, a new result (Theorem 5.7) provides easily verifiable conditions to show when a hyperbolic fixed point with some generic assumptions is a reachable transient center. We also presented applications of our results to both simple systems and more complex models from eco-epidemiology.
There are many directions we are interested in continuing on in our study of long transience and transient centers. In particular, we are interested in non-equilibrium transient centers. One possible direction is an extension of Theorem 5.7 to foliations with hyperbolic leaves. Another is an application to slow-fast systems with multiple turning points on the slow manifold.
The research of AL and FMGM is part of the Mathematics for Public Health program funded by the Natural Sciences and Engineering Research Council of Canada. Part of this work was conducted while FMGM was visiting the Mathematical Institute at the University of Oxford.
The authors declare there is no conflict of interest.
In this Appendix we continue to assume that (H1) and (H2) hold. Recall that the solution to (2.1) with initial condition x(0)=ξ evaluated at time t is denoted by ϕtξ. The solution to (4.1) with initial condition ˆx(0)=ξ evaluated at time t is denoted by ˆϕtˆξ.
Lemma 8.1. Let f(x)=Ax where A∈Rn×n.
(i) The solution to (2.1) with x(0)=ζ∈Rn is ϕtζ=eAtζ for t∈R.
(ii) The solution to (4.1) with x(0)=ζ∈Rn is given by ˆϕtζ=e−Atζ for t∈R.
(iii) Suppose that A has at least one real and positive eigenvalue. Let F be the unstable eigenspace associated with A. If ζ∈F then ˆϕtζ∈F for all t∈R and |ˆϕtζ|→0 as t→∞.
Proof. This follows from the basic theory of linear ordinary differential equations with constant coefficients [17,22].
Lemma 8.2 (Hartman-Grobman Theorem from [23]). Suppose that ξ is a hyperbolic fixed point of the system (2.1). Let ψtζ be the solution at time t of the linearized system,
dydt=Jf(ξ)(y−ξ),y(0)=ζ. | (8.1) |
Then there exists an open set U containing ξ and homeomorphism G with domain U such that,
G∘ϕtx=ψtG(x). | (8.2) |
whenever x is in U and both sides of the equation are defined.
Proof. See proofs of the Hartman-Grobman Theorem [23,page 354].
[1] |
G. J. van Geest, H. Coops, M. Scheffer, E. H. van Nes, Long transients near the ghost of a stable state in eutrophic shallow lakes with fluctuating water levels, Ecosystems, 10 (2007), 37–47. https://doi.org/10.1007/s10021-006-9000-0 doi: 10.1007/s10021-006-9000-0
![]() |
[2] |
D. T. Iles, R. Salguero-Gomez, P. B. Adler, D. N. Koons, Linking transient dynamics and life history to biological invasion success, J. Ecol., 104 (2015), 399–408. https://doi.org/10.1111/1365-2745.12516 doi: 10.1111/1365-2745.12516
![]() |
[3] |
A. R. McLean, R. M. Anderson, Measles in developing countries. Part Ⅱ. The predicted impact of mass vaccination, Epidemiol. Infect., 100 (1988), 419–442. https://doi.org/10.1017/s0950268800067170 doi: 10.1017/s0950268800067170
![]() |
[4] |
N. A. Kharazian, F. M. G. Magpantay, The honeymoon period after mass vaccination, Math. Biosci. Eng., 18 (2021), 354–372. https://doi.org/10.3934/mbe.2021019 doi: 10.3934/mbe.2021019
![]() |
[5] |
C. J. E. Metcalf, A. Wesolowski, A. K. Winter, J. Lessler, S. Cauchemez, W. J. Moss, et al., Using serology to anticipate measles post-honeymoon period outbreaks, Trends Microbiol., 28 (2020), 597–600. https://doi.org/10.1016/j.tim.2020.04.009 doi: 10.1016/j.tim.2020.04.009
![]() |
[6] |
K. T. Frank, B. Petrie, J. A. D. Fisher, W. C. Leggett, Transient dynamics of an altered large marine ecosystem, Nature, 477 (2011), 86–89. https://doi.org/10.1038/nature10285 doi: 10.1038/nature10285
![]() |
[7] |
J. M. Maroto, M. Morán, Transient dynamics: equilibrium, collapse, and extinction in age-structured models. The case of the northern cod stock, Ecol. Modell., 398 (2019), 35–43. https://doi.org/10.1016/j.ecolmodel.2019.02.006 doi: 10.1016/j.ecolmodel.2019.02.006
![]() |
[8] |
K. Abbott, K. Cuddington, A. Hastings, Transients in ecology: stochasticity, management, and understanding, Theor. Ecol., 14 (2021), 623–624. https://doi.org/10.1007/s12080-021-00524-8 doi: 10.1007/s12080-021-00524-8
![]() |
[9] |
S. M. C. Abo, S. R. Smith, Is a covid-19 vaccine likely to make things worse, Vaccines, 8 (2020), 761. https://doi.org/10.3390/vaccines8040761 doi: 10.3390/vaccines8040761
![]() |
[10] |
Z. Ahmad, G. Bonanomi, D. di Serafino, F. Giannino, Transmission dynamics and sensitivity analysis of pine wilt disease with asymptomatic carriers via fractal-fractional differential operator of mittag-leffler kernel, Appl. Numer. Math., 185 (2023), 446–465. https://doi.org/10.1016/j.apnum.2022.12.004 doi: 10.1016/j.apnum.2022.12.004
![]() |
[11] |
A. Hastings, K. C. Abbott, K. Cuddington, T. Francis, G. Gellner, Y. C. Lai, et al., Transient phenomena in ecology, Science, 361 (2018). https://doi.org/10.1126/science.aat6412 doi: 10.1126/science.aat6412
![]() |
[12] |
A. Hastings, K. C. Abbott, K. Cuddington, T. Francis, Y. C. Lai, A. Morozov, et al., Effects of stochasticity on the length and behaviour of ecological transients, J. R. Soc. Interface, 18 (2021), 20210257. https://doi.org/10.1098/rsif.2021.0257 doi: 10.1098/rsif.2021.0257
![]() |
[13] |
A. Morozov, K. Abbott, K. Cuddington, T. Francis, G. Gellner, A. Hastings, et al., Long transients in ecology: theory & applications, Phys. Life Rev., 32 (2020), 1–40. https://doi.org/10.1016/j.plrev.2019.09.004 doi: 10.1016/j.plrev.2019.09.004
![]() |
[14] |
A. Morozov, K. Abbott, K. Cuddington, T. Francis, G. Gellner, A. Hastings, et al., Long living transients: enfant terrible of ecological theory, Phys. Life Rev., 32 (2020), 55–58. https://doi.org/10.1016/j.plrev.2020.03.002 doi: 10.1016/j.plrev.2020.03.002
![]() |
[15] |
T. Francis, K. C. Abbott, K. Cuddington, G. Gellner, A. Hastings, Y. C. Lai, et al., Management implications of long transients in ecological systems, Nat. Ecol. Evol., 5 (2021), 285–294. https://doi.org/10.1038/s41559-020-01365-0 doi: 10.1038/s41559-020-01365-0
![]() |
[16] |
A. Liu, F. M. G. Magpantay, A quantification of long transient dynamics, SIAM J. Appl. Math., 82 (2022), 381–407. https://doi.org/10.1137/20m1367131 doi: 10.1137/20m1367131
![]() |
[17] | G. Teschl, Ordinary Differential Equations and Dynamical Systems, American Mathematical Society, 140 (2012). Available from: https://www.mat.univie.ac.at/gerald/ftp/book-ode/ode.pdf. |
[18] | L. Barreira, Y. Pesin, Nonuniform Hyperbolicity: Dynamics of Systems with Nonzero Lyapunov Exponents, Encyclopedia of Mathematics and its Applications, Cambridge University Press, 2007. https://doi.org/10.1017/CBO9781107326026 |
[19] |
J. Cresson, S. Wiggins, A λ-lemma for normally hyperbolic invariant manifolds, Regul. Chaotic Dyn., 20 (2015), 94–108. https://doi.org/10.1134/S1560354715010074 doi: 10.1134/S1560354715010074
![]() |
[20] | M. J. Keeling, P. Rohani, Modeling Infectious Diseases In Humans and Animals, Princeton University Press, 2008. https://doi.org/10.1515/9781400841035 |
[21] |
C. Shan, Y. Yi, H. Zhu, Nilpotent singularities and dynamics in an sir type of compartmental model with hospital resources, J. Differ. Equations, 260 (2016), 4339–4365. https://doi.org/10.1016/j.jde.2015.11.009 doi: 10.1016/j.jde.2015.11.009
![]() |
[22] | R. K. Miller, A. N. Michel, Ordinary Differential Equations, Dover Publications, 2007. |
[23] | C. Chicone, Ordinary Differential Equations with Applications, Springer New York, NY, 2006. https://doi.org/10.1007/0-387-35794-7 |
1. | Grant R. Poulsen, Claire E. Plunkett, Jody R. Reimer, First Passage Times of Long Transient Dynamics in Ecology, 2024, 86, 0092-8240, 10.1007/s11538-024-01259-3 |