Document not found! Please try again

Using MCMC for Maximum Likelihood Estimation in ...

17 downloads 0 Views 524KB Size Report
The nonlinear mixed effects model. The EM-type algorithms. Some pharmacokinetics examples. Estimation of the likelihood. Using MCMC for Maximum ...
The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

Using MCMC for Maximum Likelihood Estimation in Non-Linear Mixed Effects Models

Marc Lavielle

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

A pharmacokinetics example : theophylline

Each individual curve is described by the same parametric model, with its own individual parameters. Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

A pharmacokinetics example : theophylline

Each individual curve is described by the same parametric model, with its own individual parameters. Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

The model

yij = f (xij , φi ) + εij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni

yij ∈ R is the jth observation of subject i, N is the number of subjects ni is the number of observations of subject i. The regression variables, or design variables, (xij ) are known, The individual parameters (φi ) are unknown,

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

The model

yij = f (xij , φi ) + εij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni

yij ∈ R is the jth observation of subject i, N is the number of subjects ni is the number of observations of subject i. The regression variables, or design variables, (xij ) are known, The individual parameters (φi ) are unknown,

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

The model

yij = f (xij , φi ) + εij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni

yij ∈ R is the jth observation of subject i, N is the number of subjects ni is the number of observations of subject i. The regression variables, or design variables, (xij ) are known, The individual parameters (φi ) are unknown,

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

The model yij = f (xij , φi ) + εij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni The vector φi of individual parameters is assumed to be Gaussian: φi = µ + ηi

with

ηi ∼i.i.d. N (0, Γ)

µ: unknown vector of population parameters (the fixed effects), (ηi ): unknown random vectors (the random effects). The sequence (εij ) is assumed to be Gaussian: εij ∼i.i.d. N (0, σ 2 ) Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

The incomplete data model yij = f (xij , φi ) + εij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni We are in a classical framework of “incomplete data”: the measurements y = (yij , 1 ≤ i ≤ N , 1 ≤ j ≤ ni ) are the “observed data” the individual random parameters φ = (φi , 1 ≤ i ≤ N), are the “non observed data”, the “complete data” of the model is (y , φ). (y , φ) ∼ p(y , φ; θ) θ = (µ, Γ, σ 2 ) is an unknown set of parameters Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

Our objectives Estimate θ Estimate p(φ |y ; θ) Proposed approach: Define a sequence (θk ) that converges to the Maximum Likelihood Estimate of θ (a.s.) Run a non homogenous MCMC algorithm with stationary distributions sequence (p(φ |y ; θk )) that converges to p(φ |y ; θMLE ).

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The basic model The incomplete data model

Our objectives Estimate θ Estimate p(φ |y ; θ) Proposed approach: Define a sequence (θk ) that converges to the Maximum Likelihood Estimate of θ (a.s.) Run a non homogenous MCMC algorithm with stationary distributions sequence (p(φ |y ; θk )) that converges to p(φ |y ; θMLE ).

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

(Expectation-Maximization)

(Dempster, Laird et Rubin, JRSSB, 1977)

Iteration k of the algorithm:

step E : evaluate the quantity Qk (θ) = E[log p(y , φ; θ)|y ; θk−1 ] step M : update the estimation of θ: θk = Argmax Qk (θ)

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The SAEM algorithm

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

(Stochastic Approximation of EM)

Delyon, Lavielle and Moulines (the Annals of Statistics, 1999)

Iteration k of the algorithm: step E : Simulation: draw the non observed data φ(k) with the conditional distribution p(φ |y ; θk−1 ) Stochastic approximation: h i Qk (θ) = Qk−1 (θ) + γk log p(y , φ(k) ; θ) − Qk−1 (θ) P P 2 (γk ) is a decreasing sequence: γk = +∞, γk < +∞.

step M: update the estimation of θ: θk = Argmax Qk (θ) Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The SAEM algorithm

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

(Stochastic Approximation of EM)

Delyon, Lavielle and Moulines (the Annals of Statistics, 1999)

Iteration k of the algorithm: step E : Simulation: draw the non observed data φ(k) with the conditional distribution p(φ |y ; θk−1 ) Stochastic approximation: h i Qk (θ) = Qk−1 (θ) + γk log p(y , φ(k) ; θ) − Qk−1 (θ) P P 2 (γk ) is a decreasing sequence: γk = +∞, γk < +∞.

step M: update the estimation of θ: θk = Argmax Qk (θ) Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The SAEM algorithm

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

(Stochastic Approximation of EM)

Delyon, Lavielle and Moulines (the Annals of Statistics, 1999)

Iteration k of the algorithm: step E : Simulation: draw the non observed data φ(k) with the conditional distribution p(φ |y ; θk−1 ) Stochastic approximation: h i Qk (θ) = Qk−1 (θ) + γk log p(y , φ(k) ; θ) − Qk−1 (θ) P P 2 (γk ) is a decreasing sequence: γk = +∞, γk < +∞.

step M: update the estimation of θ: θk = Argmax Qk (θ) Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

Coupling SAEM with MCMC Kuhn and Lavielle, ESAIM P&S, 2004

Let Πθ be the transition probability of an ergodic Markov Chain with limiting distribution pΦ|Y (·|y ; θ). Iteration k of the algorithm: Simulation : draw φ(k) according to the transition probability Πθk−1 (φ(k−1) , ·).

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

The main convergence Theorem Theorem Under very general technical conditions, the SAEM sequence (θk ) converges a.s. to some (local) maximum of the observed likelihood g (y ; θ). Proof. See 1. Delyon, Lavielle & Moulines The Annals of Statistics (1999) 2. Kuhn & Lavielle ESAIM P&S (2004)

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

MCMC (Markov Chain Monte Carlo) an iterative procedure for the simulation of p(φ|y ; θ)

At iteration ` 1 2

draw a new value φc with a proposal distribution q, accept this new value, that is set φ` = φc with probability α(φc ) =

q(φc , φ(`−1)) p(φc |y ; θ) q(φc , φ(`−1)) p(φ(`−1) |y ; θ)

In the model y = f (x; φ) + g (x; φ)ε, computing α(φc ) only requires to compute f (x, φc ) and g (x, φc ) but not the derivatives of f and g Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

Some proposals used in Monolix Three following proposal kernels for 1 ≤ i ≤ N: (1)

1

qθk is the prior distribution of φi at iteration k, that is the Gaussian distribution N (Ai µk , Γk )

2

qθk is the multidimensional random walk N (φi , τk Γk ).

(2)

τk = τk−1 (1 + a(ρk−1 − ρ? )) 0 < a < 1; ρ? ≈ 0.4 ρk−1 : proportion of acceptation at iteration k − 1. 3

(3)

qθk is a succession of d unidimensional Gaussian random walks: each component of φi are successively updated.

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

Some proposals used in Monolix

Then, the simulation-step at iteration k consists in running (1)

1

m1 iterations of the Hasting-Metropolis with proposal qθk ,

2

m2 iterations with proposal qθk

3

m3 iterations with proposal qθk .

(2) (3)

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

Running several chains

Convergence of the algorithm can be improved by running L independent Markov Chain instead of only one. The simulation step requires to draw L sequences φ(k,1) , . . . , φ(k,L) at iteration k and to combine stochastic approximation and Monte Carlo in the approximation step: ! L 1X (k,`) log p(y , φ ; θ) − Qk−1 (θ) Qk (θ) = Qk−1 (θ) + γk L `=1

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

The EM algorithm The SAEM algorithm The MCMC algorithm Simulated Annealing

A Simulated Annealing version of SAEM Conditional distribution of φ: p( φ |y ; θ) = C (y ; θ)e −U(φ,y ;θ) Temperature parameter T : pT ( φ |y ; θ) = CT (y ; θ)e −

U(φ,y ;θ) T

Choose a decreasing Temperature sequence (Tk ) converging to 1. Then, at iteration k of SAEM, E-step: draw the non observed data φ(k) with the conditional distribution pTk ( · |y ; θk−1 ) M-step remains unchanged Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

A pharmacokinetics example : theophylline

Each concentration curve is described by the same pharmacokinetic model, with its own individual pharmacokinetics parameters. Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

oral administration, first-order absorbtion and elimination dose D at time t=0 absorption (rate ka ) → DRUG AMOUNT Q(t) → elimination (rate ke ) dQ (t) = ka Qa (t) − ke Q(t) dt dQa (t) = −ka Qa (t) dt Qa (t): amount at absorption site.   ka Q(t) =D e −ke t − e −ka t C (t) = V V (ka − ke ) C (t) : concentration of the drug, V : volume of the compartment Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

intravenous administration and first-order elimination

dose D (t=0) → DRUG AMOUNT Q(t) → elimination (rate ke )

dQ (t) = −kQ(t) dt Q(t) = De −kt C (t) =

Q(t) D = e −ke t V V

C (t) : concentration of the drug, V : volume of the compartment Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

intravenous administration and nonlinear elimination

dose D (t=0) → DRUG AMOUNT Q(t) → nonlinear elimination dC Vm C =− dt V (Km + C ) C (t) : concentration of the drug, (Vm , Km ) : Michaelis-Menten elimination parameters, V : volume of the compartment.

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

Importance Sampling The Importance Sampling algorithm computes an estimate `M (y ) of the observed likelihood. Z `(y , θ) = p(y , φ)d φ Z = h(y |φ)π(φ)d φ  Z  π(φ) = h(y |φ) π ˜ (φ)d φ π ˜ (φ)

1 2

Draw φ(1) , φ(2) , . . . , φ(M) with the distribution π ˜, Let M 1 X π(φ(j) ) `M (y ) = h(y |φ(j) ) M π ˜ (φ(j) ) j=1

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

Importance Sampling

M 1 X π(φ(j) ) `M (y ) = h(y |φ(j) ) M π ˜ (φ(j) ) j=1

E`M (y ) = `(y ) and Var`M (y ) = O(1/M) The instrumental distribution used in Monolix : 1) Estimate the conditional mean and variance of φ, (using MCMC), 2) Use for π ˜ a Gaussian distribution with these parameters.

Marc Lavielle

The MONOLIX software

The nonlinear mixed effects model The EM-type algorithms Some pharmacokinetics examples Estimation of the likelihood

Marc Lavielle

The MONOLIX software

Suggest Documents