Quasi-Stationary Distributions for Continuous-Time Markov Chains David Sirl [email protected] http://www.maths.uq.edu.au/˜dsirl/ AUSTRALIAN RESEARCH COUNCIL Centre of Excellence for Mathematics and Statistics of Complex Systems Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 1 Quasi-Stationary Behaviour Sample Path of B−D Chemical Process with α=100, k1=1, k−1=1, k2=1 250 200 State 150 100 50 0 0 0.05 0.1 0.15 Time 0.2 0.25 0.3 Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 2 The Chemical Reaction 2X A+X k1 B k−1 k2 X Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 3 The Chemical Reaction 2X A+X k1 B k−1 k2 X Model the number of molecules of X with a CTMC — a birth-death process on S = {0} ∪ C , where zero is absorbing and C is an irreducible transient class. The system can be either closed or open with respect to A & B. C = {1, 2, . . . , N } or {1, 2, . . .}, respectively. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 3 An Outline: I will mostly talk about establishing the existence of QSDs. We will: Review the theory of quasi-stationarity, looking at the chemical reaction example. See that explicit evaluation of QSDs is not usually possible. Conjecture an extremely general, somewhat unlikely existence result. Briefly introduce some other topics of research in the area. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 4 Recall . . . A time-homogeneous CTMC (X(t), t ≥ 0) taking values in a countable set S (Z+ ) is completely described by its transition function P (t) = (pij (t), i, j ∈ S, t ≥ 0). In practice we know only the transition rates (p0ij (0+) = qij , i, j ∈ S). If we know P , we can in principle answer any question about the behaviour of the chain. The challenge is to try and answer these questions in terms of Q. We will assume that the process is absorbed with probability one, and is therefore regular (non-explosive). Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 5 Recall . . . A Birth-Death Process is a CTMC with state space S = {0, 1, 2, . . .} such that if the chain is in state i, transitions can only be made to state i − 1 or i + 1. The q-matrix has non-zero entries qi,i+1 = λi qi,i−1 = µi (birth rates), (death rates), and qii = −(λi + µi ), (put µ0 = 0) where λi , µi > 0 ∀i ∈ C , and λ0 ≥ 0. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 6 Definitions A distribution a = (ai , i ∈ C) is a QSD over C if when the initial Pdistribution is a, the state probabilities pj (t) = i∈C ai pij (t) conditioned on non-absorption are time-invariant (and given by a): pj (t) = aj , 1 − p0 (t) j ∈ C. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 7 Definitions A distribution a = (ai , i ∈ C) is a QSD over C if when the initial Pdistribution is a, the state probabilities pj (t) = i∈C ai pij (t) conditioned on non-absorption are time-invariant (and given by a): pj (t) = aj , 1 − p0 (t) j ∈ C. A distribution b = (bi , i ∈ C) is a LCD over C if bj gives the limiting probability of the process being in state j , conditional on non-absorption, independently of the starting state i: pij (t) lim = bj , t→∞ 1 − pi0 (t) i, j ∈ C. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 7 Definitions A µ-invariant measure (over C ) for P is a collection of numbers m = (mi , i ∈ C) which, for some µ > 0, satisfy X mi pij (t) = e−µt mj , j ∈ C, t ≥ 0. i∈C Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 8 Definitions A µ-invariant measure (over C ) for P is a collection of numbers m = (mi , i ∈ C) which, for some µ > 0, satisfy X mi pij (t) = e−µt mj , j ∈ C, t ≥ 0. i∈C A µ-invariant measure (over C ) for Q is a collection of numbers m = (mi , i ∈ C) which, for some µ > 0, satisfy X mi qij = −µmj , j ∈ C. i∈C Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 8 Finite State Space Easy because of spectral decomposition (of P ) and Perron-Frobenius theory. The LCD and unique QSD is given by the probability measure m such that mPC (t) = e−νt m. This is equivalent to mQC = −νm, where −ν is the eigenvalue with maximal real part (it is real and negative). Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 9 The Decay Parameter The quantity − log(pij (t)) λC := lim t→∞ t exists and is independent of i, j ∈ C . Called the decay parameter because pij (t) ≤ Mij e−λC t , 0 < Mij < ∞. Can show that for a µ-invariant measure for P over C to exist, it is necessary that (0 <)µ ≤ λC . Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 10 Infinite State Space P is now an infinite matrix, so Perron-Frobenius theory no longer applies. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 11 Infinite State Space P is now an infinite matrix, so Perron-Frobenius theory no longer applies. QSDs (if they exist) are given by probability measures m such that mPC (t) = e−νt m. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 11 Infinite State Space P is now an infinite matrix, so Perron-Frobenius theory no longer applies. QSDs (if they exist) are given by probability measures m such that mPC (t) = e−νt m. The LCD (if it exists) is given by m such that mPC (t) = e−λC t m. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 11 Infinite State Space P is now an infinite matrix, so Perron-Frobenius theory no longer applies. QSDs (if they exist) are given by probability measures m such that mPC (t) = e−νt m. The LCD (if it exists) is given by m such that mPC (t) = e−λC t m. These expressions involve P and λC , which are not known and impossible (or at best very difficult) to find analytically — we need conditions in terms of the q-matrix. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 11 Infinite State Space Theorem: A µ-invariant measure for Q is also µ-invariant for P (and is therefore a QSD) iff X yi qij = κyj , 0 ≤ yi ≤ mi , j ∈ C, i∈C has only the trivial solution for some (all) κ > −µ. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 12 Infinite State Space Theorem: A µ-invariant measure for Q is also µ-invariant for P (and is therefore a QSD) iff X yi qij = κyj , 0 ≤ yi ≤ mi , j ∈ C, i∈C has only the trivial solution for some (all) κ > −µ. This condition does not depend explicitly on P or λC . Does depend on having a particular µ, m to check. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 12 Infinite State Space Theorem: If m is a µ-invariant probability measure for Q, then X µ= mi qi0 i∈C is neccesary and sufficient for m to be a QSD. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 13 Infinite State Space Theorem: If m is a µ-invariant probability measure for Q, then X µ= mi qi0 i∈C is neccesary and sufficient for m to be a QSD. This allows us to find all µ-invariant probability measures for Q. We can then check which of these are QSDs using the previous result. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 13 Infinite State Space In order to find these µ-invariant measures for Q we must solve the system X mi qij = −µmj , j ∈ C. i∈C Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 14 Infinite State Space In order to find these µ-invariant measures for Q we must solve the system ! X X mi qij = − mi qi0 mj , j ∈ C. i∈C i∈C We can eliminate µ explicitly from the system we need to solve, however this renders the system non-linear in m. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 14 Infinite State Space In order to find these µ-invariant measures for Q we must solve the system ! X X mi qij = − mi qi0 mj , j ∈ C. i∈C i∈C We can eliminate µ explicitly from the system we need to solve, however this renders the system non-linear in m. Finding explicit expressions for QSDs is rarely possible. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 14 Infinite State Space Once we find a µ-invariant probability measure m for Q, we need to check if X yi qij = κyj , 0 ≤ yi ≤ mi , j ∈ C, i∈C has only the trivial solution for some (all) κ > −µ. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 15 Infinite State Space Once we find a µ-invariant probability measure m for Q, we need to check if X yi qij = κyj , 0 ≤ yi ≤ mi , j ∈ C, i∈C has only the trivial solution for some (all) κ > −µ. Suppose QC has no summable left eigenvectors for some (all) eigenvalues κ > 0. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 15 Infinite State Space Once we find a µ-invariant probability measure m for Q, we need to check if X yi qij = κyj , 0 ≤ yi ≤ mi , j ∈ C, i∈C has only the trivial solution for some (all) κ > −µ. Suppose QC has no summable left eigenvectors for some (all) eigenvalues κ > 0. Then it cannot possibly have any left eigenvectors bounded above by m. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 15 Infinite State Space Theorem: If the equations X yi qij = κyj , yi ≥ 0, j ∈ C, i∈C X yi < ∞ i∈C have only the trivial solution for some (all) κ > 0, then all µ-invariant probability measures for Q are also µ-invariant for P and are therefore QSDs. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 16 Infinite State Space Theorem: If the equations X yi qij = κyj , yi ≥ 0, j ∈ C, i∈C X yi < ∞ i∈C have only the trivial solution for some (all) κ > 0, then all µ-invariant probability measures for Q are also µ-invariant for P and are therefore QSDs. Call this condition the “Reuter FE Condition” Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 16 Infinite State Space Theorem: If the equations X yi qij = κyj , yi ≥ 0, j ∈ C, i∈C X yi < ∞ i∈C have only the trivial solution for some (all) κ > 0, then all µ-invariant probability measures for Q are also µ-invariant for P and are therefore QSDs. Call this condition the “Reuter FE Condition” If this condition holds, all we have to do is find a µ-invariant measure for Q and this is a QSD. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 16 Infinite State Space Theorem: If the equations X yi qij = κyj , yi ≥ 0, j ∈ C, i∈C X yi < ∞ i∈C have only the trivial solution for some (all) κ > 0, then all µ-invariant probability measures for Q are also µ-invariant for P and are therefore QSDs. Call this condition the “Reuter FE Condition” If this condition holds, all we have to do is find a µ-invariant measure for Q and this is a QSD. However we want it to be λC -invariant, so that we have the LCD; but we don’t know λC . Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 16 Birth-Death Processes Theorem: For a B-D process with which is absorbed with probability one, suppose that the initial distribution has compact support. Then If D < ∞ then there is a unique QSD which is the LCD. If D = ∞ then either λC = 0 and there are no QSDs, or λC > 0 and there is a one-parameter family of QSDs, one of which is the LCD. Here D= ∞ X n=1 1 µn πn ∞ X m=n πm , λ1 · · · λn−1 πn = . µ2 · · · µn Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 17 The Chemical Reaction 2X A+X k1 B k−1 k2 X The birth and death rates are, respectively, λi = αk1 i, and i(i − 1) µi = k2 i + k−1 . 2 Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 18 The Chemical Reaction One can show that this process is absorbed with probability one (and is therefore regular). Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 19 The Chemical Reaction One can show that this process is absorbed with probability one (and is therefore regular). We can also show that D= ∞ X nΓ(n + r) k−1 n−1 [nk + n(n − 1) ](αs) 2 n=1 2 ∞ X (αs)m−1 , mΓ(m + r) m=n Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 19 The Chemical Reaction One can show that this process is absorbed with probability one (and is therefore regular). We can also show that D= ∞ X nΓ(n + r) k−1 n−1 [nk + n(n − 1) ](αs) 2 n=1 2 ∞ X (αs)m−1 , mΓ(m + r) m=n and that this is finite. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 19 The Chemical Reaction One can show that this process is absorbed with probability one (and is therefore regular). We can also show that D= ∞ X nΓ(n + r) k−1 n−1 [nk + n(n − 1) ](αs) 2 n=1 2 ∞ X (αs)m−1 , mΓ(m + r) m=n and that this is finite. So there is a unique quasistationary distribution, which is limiting conditional. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 19 A Connection For a Birth-Death process, the Reuter FE conditions hold iff D = ∞. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 20 A Connection For a Birth-Death process, the Reuter FE conditions hold iff D = ∞. So, let’s replace D diverges (converges) in van Doorns’ result with the Reuter FE condition holds (fails). Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 20 Infinite State Space Conjecture: Suppose a process is absorbed with probability one, and that the initial distribution has compact support. Then If the Reuter FE conditions fail then there is only one finite µ-invariant measure; it is in fact λC -invariant and is therefore the LCD. If the Reuter FE conditions hold, either λC = 0 and there are no QSDs (and no LCD), or λC > 0 and there is a one-parameter family of finite µ-invariant measures (QSDs), 0 < µ ≤ λC , of which the λC -invariant measure is the LCD. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 21 Approximating QSDs Closed form expressions for QSDs are very rarely available — we only have expressions of the form mQC = −µ(m)m. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 22 Approximating QSDs Closed form expressions for QSDs are very rarely available — we only have expressions of the form mQC = −µ(m)m. We therefore need approximation methods. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 22 Approximating QSDs Closed form expressions for QSDs are very rarely available — we only have expressions of the form mQC = −µ(m)m. We therefore need approximation methods. The most common method is truncation: solve n X (n) mi qij = −λ (n) (n) mj , j ∈ C (n) i∈C (n) for m(n) , where λ(n) is the P-F maximal e-value of QC (n) , for succesively larger n. Here {C (n) } is a collection of increasing, finite subsets of C . Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 22 Approximating QSDs Closed form expressions for QSDs are very rarely available — we only have expressions of the form mQC = −µ(m)m. We therefore need approximation methods. The most common method is truncation: solve n X (n) mi qij = −λ (n) (n) mj , j = 1, . . . , n i=1 for m(n) , where λ(n) is the P-F maximal e-value of QC (n) , for succesively larger n. Here {C (n) } is a collection of increasing, finite subsets of C . e.g. C (n) = {1, . . . , n}. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 22 Approximating QSDs X (n) (n) mi qij = −λ (n) (n) mj , j ∈ C (n) . i∈C (n) What happens to these solutions as n becomes large? We know that λ(n) ↓ λC as n → ∞. But under what conditions does m(n) ‘converge’, in some sense? If m(n) converges, does it converge to m? Breyer & Hart (2000) give some sufficient conditions. We know that northwest corner truncation works for Birth-Death Processes, and Subcritical Markov Branching Process. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 23 The Quasi-Stationary Distribution Sample Path of B-D Chemical Process with α=100, k =1, k =1, k =1 1 -1 2 300 250 State 200 150 100 50 0 0 0.05 0.1 0.15 Time 0.2 0.25 0.3 Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 24 Another Chemical Reaction 2X 2X A+X k1 B k2 This is not a Birth-Death process: it has jumps up of size 1, but jumps down of size 2: qi,i+1 = αik1 , i(i − 1) . qi,i−2 = k2 2 Hopefully my conjecture can deal with this!! Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 25 Further Work The ‘renewal dynamical’ approach of Ferrari, Kesten, Martínez and Picco: When the process is absorbed, instantaneously restart it in C according to some law ν : ν qij = qij + qi0 νj , i 6= j ∈ C. If the original chain is absorbed in finite mean time, this modified chain P ν is positive recurrent and therefore has a stationary distribution, say ρ. Define a function Φ by Φ(ν) = ρ. If Φ(m) = m then m is a QSD. Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 26 Further Work The domain of attraction problem: Our definition of a limiting conditional distribution looked at the behaviour of pij (t) 1 − pi0 (t) We would like to be able to say what happens to pj (t) , 1 − pj (t) pj (t) = P i ai pij (t) with an arbitrary initial distribution a (over C ). Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 27 Further Work Convergence to quasistationarity (LC-ness) - another ‘decay parameter’ β = sup{γ > 0 : |mij (t) − mj | = O(e−γt ) as t → ∞} pij (t) where mij (t) = 1−p gives the conditional transition i0 (t) probabilities and mj = limt→∞ mij (t) is the LCD. This provides a measure of how quickly the process settles to a quasistationary regime. For the chemical process we have looked at, β ≈ 102 , λC ≈ 10−6 . Quasi-Stationary Distributions for Continuous-Time Markov Chains – p. 28
© Copyright 2025 Paperzz