In this paper we define an infinite-dimensional controlled piecewise deterministic Markov process (PDMP) and we study an optimal control problem with finite time horizon and unbounded cost. This process is a coupling between a continuous-time Markov chain and a set of semilinear parabolic partial differential equations, both processes depending on the control. We apply dynamic programming to the embedded Markov decision process to obtain the existence of optimal relaxed controls, and we give sufficient conditions ensuring the existence of an optimal ordinary control. This study, which extends controlled PDMPs to infinite dimension, is motivated by the control that Optogenetics provides over neuron models such as the Hodgkin-Huxley model. We define an infinite-dimensional controlled Hodgkin-Huxley model as an infinite-dimensional controlled PDMP and apply the previous results to prove the existence of optimal ordinary controls for a tracking problem.
Citation: Vincent Renault, Michèle Thieullen, Emmanuel Trélat. Optimal control of infinite-dimensional piecewise deterministic Markov processes and application to the control of neuronal dynamics via Optogenetics[J]. Networks and Heterogeneous Media, 2017, 12(3): 417-459. doi: 10.3934/nhm.2017019