Longitudinal Penalized Functional Regression

15 downloads 384 Views 2MB Size Report
Sco re. Subject 2; Visit 1. Subject 1; Visit 1. 15. 20. 25. 30. 35. 0 10. 30. 50. 1.0 ..... Software model = lpfr(Y, subject, covariates, funcs, ...) ▻ Available in refund ...
Longitudinal Penalized Functional Regression Jeff Goldsmith Johns Hopkins Bloomberg School of Public Health

Joint work with Ciprian Crainiceanu, Brian Caffo, and Daniel Reich

Outline

I

Longitudinal Functional Data

I

Real Data Example

I

Longitudinal Penalized Functional Regression

I

Application Results

microns2 ms 1.0 1.5 2.0 2.5 3.0

20

25

30

35

Cognitive Function Score 15 0.5

1.5

2.5

microns2 ms

Data Structure

0 10

0 10 30

30 50

Subject 1; Visit 1

Subject 2; Visit 1

50

microns2 ms

0 10 30 50 0 10 30 50

0 10

0 10

30

30

50

50 microns2 ms

50 0 10

0 10

30

30 50 1.0 1.5 2.0 2.5 3.0

30

microns2 ms

35

0 10

1.0 1.5 2.0 2.5 3.0

30

50

microns2 ms

25

30

1.0 1.5 2.0 2.5 3.0

20

0 10

1.0 1.5 2.0 2.5 3.0

microns2 ms

1.0 1.5 2.0 2.5 3.0

Cognitive Function Score 15

0.5

0.5

0.5

0.5

0.5

2.5

2.5

2.5

2.5

1.5

2.5

microns2 ms

1.5

microns2 ms

1.5

microns2 ms

1.5

microns2 ms

1.5

microns2 ms

Data Structure

50 0 10

0 10

30

30

50

Subject 1; Visit 1 Subject 1; Visit 2 Subject 1; Visit 3 Subject 1; Visit 4 Subject 1; Visit 5

Subject 2; Visit 1 Subject 2; Visit 2 Subject 2; Visit 3 Subject 2; Visit 4 Subject 2; Visit 5

50

Neuroimaging Application I I I

White vs. Grey Matter Multiple Sclerosis Diffusion Tensor Imaging [4]

Outcome and Predictors I

Corpus callosum and corticospinal tract profiles

I

PASAT-3

0.5 0.4 0.3

FA

0.6

0.7

FA Profile

0.0

0.2

0.4

0.6

Distance Along Tract

0.8

1.0

Study Population

I

100 subjects (66 women; 34 men)

I

Age between 21 and 70 years at first visit

I

Between 2 and 8 visits each (median = 3)

I

340 total visits

Longitudinal Penalized Functional Regression

Observe data [Yij , Wij1 (s), . . . , WijL (s), Xij ]. We propose the longitudinal functional regression model [2] Yij

∼ EF(µij , η)

g(µij ) = Xij β + Zij bi +

L Z X l=1

1

Wijl (s)γl (s)ds

0

I

Adds random effects to functional regression

I

Adds functional effects to GLMMs

(1)

Functional Contribution

For the functional contributions

R1 0

Wijl (s)γl (s)ds

I

Express (or estimate) the functional predictors using a principal components decomposition.

I

Express the functional coefficient in terms of a spline basis and induce smoothness via a mixed-effects model [6].

Functional Contribution: Principal Components Decomposition Let Wl

Σ (s, t) = Cov[Wijl (s), Wijl (s)] =

∞ X

λkl ψkl (s)ψkl (s),

k =1

with λ1l ≥ λ2l ≥ . . . non-increasing eigenvalues and ψ l (·) = {ψkl (·) : k ∈ Z+ } corresponding eigenfunctions. Approximate the functional predictors using Wijl (s) = µl (s) +

Kw X k =1

cijkl ψkl (s).

30

50

0 10

30

50

0 10

30

50

2.5 0.5

1.5

microns2 ms

2.5 0.5

1.5

microns2 ms

2.5 0.5

1.5

microns2 ms

2.5 0.5

0 10

0 10

30

50

0 10

30

50

Subject 1; Visit 2

Subject 1; Visit 3

Subject 1; Visit 4

Subject 1; Visit 5

Subject 2; Visit 1

Subject 2; Visit 2

Subject 2; Visit 3

Subject 2; Visit 4

Subject 2; Visit 5

20

25

30

35

Subject 1; Visit 1

30

50

30

50

30

50

microns2 ms 0 10

30

50

1.0 1.5 2.0 2.5 3.0

microns2 ms 0 10

1.0 1.5 2.0 2.5 3.0

microns2 ms 0 10

1.0 1.5 2.0 2.5 3.0

microns2 ms 0 10

1.0 1.5 2.0 2.5 3.0

microns2 ms

1.0 1.5 2.0 2.5 3.0

15

Cognitive Function Score

1.5

microns2 ms

2.5 1.5 0.5

microns2 ms

Functional Contribution: Predictor Estimation

0 10

30

50

Functional Contribution: γl (s)

Let φ(s) = {φ1 (s), φ2 (s), . . . , φKg (s)} be a truncated power series spline basis, so that γl (s) = φ(s)g l = g1l + g2l t +

Kg X

gkl (t − κk )+

k =3 K

g where g l = {g1l , . . . , gKg l }T and {κk }k =3 are knots.

Functional Contribution

The integral Z

R1 0

Wijl (s)γl (s)ds can now be written

1

Z Wijl (s)γl (s)ds = al +

0

0

Z 0

1

c 0ijl ψ T (s)φ(s)g l ds = al + c 0ijl M l g l

1

W l (s)γl (s)ds = al + C l M l g l

Longitudinal Penalized Functional Regression

Let I

X = [1 X (C 1 M 1 )[,1:2] . . . (C L M L )[,1:2] ];

I

Z = [Z1 (C 1 M 1 )[,3:Kg ] . . . (C L M L )[,3:Kg ] ];

I

β = [α, β, g11 , g21 , . . . , g1L , g2L ];

I

g g u = [{bi }Ii=1 , {gk 1 }k =3 , . . . , {gkL }k =3 ]

K

K

Longitudinal Penalized Functional Regression Using all the above, the longitudinal generalized functional regression model can be posed: Y

∼ EF (µ, γ)

g(µ) = X β + Z b +

L Z X l=1

1

Wijl (s)γl (s)ds

0

= Xβ + Zu    σb2 I I 0     0   0    u ∼ N  ..  ,  ..  .   . 0 0

0 ... 0 2 σg 1 I Kg −2 0 .. .. . . 0 . . . σg2 L I Kg −2

    .  

Joint Modeling

! !2g

g C ! !2X !2Y

Y

!2b b

W

Joint Modeling

h i ∼ N X ij β + Z ij b + c 0ij Mg, σY2 ; σY2 ∼ IG [AY , BY ] h i Wij (t) ∼ N µ(t) + c 0ij ψ(t)T , σX2 I ; σX2 ∼ IG [AX , BX ]

Yij c 0ij g b β

∼ N [0, Λ] ; λk ∼ IG [Aλ , Bλ ] for 1 ≤ k ≤ Kw     ∼ N 0, σg2 D ; σg2 ∼ IG Ag , Bg   ∼ N h0, σb2 I i; σb2 ∼ IG [Ab , Bb ] ∼ N 0, σβ2 I

Application

Recall our application: I

100 MS patients with 340 total visits

I

Regress PASAT-3 on corpus callosum and corticospinal tract profiles

Application Results

Multivariate Analysis

Univariate Analyses

1 γ(t)

0

0 0

20

40

60

Distance Along Tract

80

-2 0

10

20

30

40

Distance Along Tract

50

-3

-3

-3

-2

-2

-1

-1

γ(t)

γ(t) -1

-1 -2

Likelihood-Based 95% Confidence Interval Posterior Mean 95% Credible Interval

-3

γ(t)

0

0

1

2

PD of Right Corticospinal

1

MD of Corpus Callosum

2

PD of Right Corticospinal

1

MD of Corpus Callosum

0

20

40

60

Distance Along Tract

80

0

10

20

30

40

Distance Along Tract

50

Software

model = lpfr(Y, subject, covariates, funcs, ...) I

Available in refund package on CRAN [5]

I

Returns γˆ (s), confidence intervals, random effects, spline and PC bases.

Future Work

I

Tailored PC decompositions [1, 3]

I

Images as predictors

I

Expanded software capabilities

Acknowledgements and Contact

This work was supported in part by Award Number R01NS060910 from the National Institute Of Neurological Disorders And Stroke and by the Intramural Research Program of the National Institute of Neurological Disorders and Stroke.

Contact : [email protected] Website : www.biostat.jhsph.edu/~jgoldsmi

References C.Z. Di, C.M. Crainiceanu, B.S. Caffo, and N.M. Punjabi. Multilevel functional principal component analysis. Annals of Applied Statistics, 4:458–288, 2009. J. Goldsmith, C. Crainiceanu, B. Caffo, and D. Reich. A case study of longitudinal association between disability and neuronal tract measurements. Under Review, 2011. S. Greven, C. Crainiceanu, B. Caffo, and D. Reich. Longitudinal functional principal component analysis. Electronic Journal of Statistics, 2010. S. Mori and P. Barker. Diffusion magnetic resonance imaging: its principle and applications. The Anatomical Record, 257:102–109, 1999. P. Reiss, L. Huang, and J. Goldsmith. refund: Regression with Functional Data, 2010. R package version 0.1-2. D. Ruppert, M.P. Wand, and R.J. Carroll. Semiparametric Regression, volume 66. Cambridge: Cambridge University Press, 2003.

Thank You