Download Interval-censored observations from Multistate Models

Document related concepts
no text concepts found
Transcript
Interval-censored observations from Multistate
Models
Daniel Commenges
INSERM, Centre de Recherche Epidémiologie et Biostatistique, Equipe
Biostatistique, Bordeaux
http://sites.google.com/site/danielcommenges/
April 10, 2011
Short Course THIRD CHANNEL NETWORK CONFERENCE
Organization of the talk
1
Motivation
2
Multistate processes
3
Multistate models
4
Patterns of observation
5
Likelihood
6
Maximum likelihood estimators
7
The Illness-death model for dementia
8
HIV infection in haemophiliacs
Motivation
The survival model
State 0
α01 (t)
State 1
-
(alive)
(dead)
An “Illness model”
State 0
α01 (t)
State 1
-
(Health)
(Illness)
The illness-death model
α01 (t)
0: Health
α02 (t)
J
J
J
J
J
J
^
-
2: Death
1: Illness
α12 (t)
Interesting quantities
Transition intensities:
α01 (t) : incidence of the disease
α02 (t) : mortality rate for healthy
α12 (t) : mortality rate for diseased
Other quantities
probability of being healthy at time t
probability of being diseased at time t
time passed in diseased state
Interesting quantities
Transition intensities:
α01 (t) : incidence of the disease
α02 (t) : mortality rate for healthy
α12 (t) : mortality rate for diseased
Other quantities
probability of being healthy at time t
probability of being diseased at time t
time passed in diseased state
Multistate Processes
Stochastic processes and filtrations
Filtration in discrete time
A random variable X generates a σ-field (or σ-algebra): all the
events of the type {X ∈ B}. A filtration (or history is an
increasing sequence of σ-fields.
Consider a sequence of random variables X1 , X2 , . . .
Call F1 the σ-field generated by X1 ; F2 the σ-field generated by
X1 and X2 and so on. The sequence {F1 , F2 , . . . , Fn , } is a
filtration.
We will think of a stochastic process in discrete time as a
sequence of random variables {X1 , X2 , . . . , Xn , . . .}.
Continuous time
Filtration in continuous time
In continuous time a stochastic process is a family of random
variables {Xt ; t ∈ <+ }.
Such a continuous time stochastic process generates a
continuous time filtration {Ft ; t ∈ <+ }.
Definition of a multistate process
Consider a process X = (Xt , t ≥ 0).
A multi-state process is a process which can take a finite
number of states, that is, for any t, the variable Xt has values in
{0, 1, . . . , K }.
The law of a multistate process is defined by its finite
dimensional distribution:
P(Xt1 = j1 , Xt2 = j2 , . . . , Xtn = jn ), n ∈ N
Definition of a multistate process
Consider a process X = (Xt , t ≥ 0).
A multi-state process is a process which can take a finite
number of states, that is, for any t, the variable Xt has values in
{0, 1, . . . , K }.
The law of a multistate process is defined by its finite
dimensional distribution:
P(Xt1 = j1 , Xt2 = j2 , . . . , Xtn = jn ), n ∈ N
Markov processes
Markov property for a multistate process
P(Xt = j|Fs ) = P(Xt = j|Xs ), s < t
This defines a Markov chain.
Transition probabilities:
phj (s, t) = P(Xt = j|Xs = h), s < t
For a Markov multistate process, the transition probabilities
specify the law.
Transition probability matrices: The matrices P(s, t) with
elements phj (s, t)
Markov processes
Markov property for a multistate process
P(Xt = j|Fs ) = P(Xt = j|Xs ), s < t
This defines a Markov chain.
Transition probabilities:
phj (s, t) = P(Xt = j|Xs = h), s < t
For a Markov multistate process, the transition probabilities
specify the law.
Transition probability matrices: The matrices P(s, t) with
elements phj (s, t)
Chapman-Kolmogorov equation
P(s, t) = P(s, u)P(u, t), s < u < t
Transition intensities
αhj (t) = lim∆t→0 phj (t, t + ∆t)/∆t.
(1)
P
P
We define αhh (t) = − j6=h αhj (t), and j6=h αhj (t) is the
hazard function associated with the distribution of the sojourn
time in state h.
Rt
Integrated transition intensities: Ahj (t) = 0 αhj (u)du.
Matrix of transition intensities: A0 (t)
Relationship between transition probabilities and
intensities
Definition of the αhj (t) as limits of phj (t, t + ∆t)/∆t
Kolmogorov forward differential equation:
∂
0
∂t P(s, t) = P(s, t)A (t); P(s, s) = I
The Kolmogorov forward equation has an integral form and a
product-integral solution. The solution allows to compute P(s, t)
from A0 (t).
The law of X can be specified by A0 (t), t ≥ 0.
Relationship between transition probabilities and
intensities
Definition of the αhj (t) as limits of phj (t, t + ∆t)/∆t
Kolmogorov forward differential equation:
∂
0
∂t P(s, t) = P(s, t)A (t); P(s, s) = I
The Kolmogorov forward equation has an integral form and a
product-integral solution. The solution allows to compute P(s, t)
from A0 (t).
The law of X can be specified by A0 (t), t ≥ 0.
Homogeneous Markov chain
A0 (t) is constant
P(s, t) only depends on t − s.
Wrting A0 (t) = A0 and P(0, t) = P(t) the solution of the
Kolmogorov equation is
0
P(t) = etA
When A0 can be diagonalized P(t) can be computed easily.
This makes homogeneous Markov models numerically easy.
Semi-Markov process
Semi-Markov processes are not Markov processes: only the
sequence of states is Markovian;
The most used semi-Markov process:
P(Xt = j|Fs ) = P(Xt = j|Xs , TXs ), s < t
where TXs is the time of last entrance in state Xs . We may
replace TXs by s − TXs .
Transition probabilities and intensities in semi-Markov
processes
We can define a transition probabilities:
phj (s, t, Th ) = P(Xt = j|Xs = h, Th ), s < t
and transition intensities
αhj (t, Th ) = lim∆t→0 phj (t, t + ∆t, Th )/∆t.
(2)
However these quantities are random and we do not have the
Kolmogorov equations. In general we may still be able to derive
transition probabilities from transition intensities but the theory
is more complex.
Multistate models for a population
Multistate models for a population: states and time
We associate to each subject i of a population a multistate
process X i = (Xti )t≥0 with law specified by A0i (.).
The states of the X i s and the time t have same meaning for all
the subjects. However several choices of t are possible: for
instance t may be the calendar time or time since a
subject-specific event. If the event is birth, t is age.
Multistate models for a population: heterogeneity of
intensities
The population may be homogeneous or heterogeneous.
The A0i (.) in general depend on i.
The heterogenity may be modelled as depending of observed
covariates Zi (t).
i
αhj
(t) = φhj (αhj0 (t), Zi (t)),
Proportional intensity assumption:
αhj (t, Zi (t)) = αhj0 (t)rhj (Zi (t)); a common choice is
rhj (Zi (t)) = eβhj Zi (t) .
Additive intensity αhj (t, Zi (t)) = αhj0 (t) + βhj (t)Zi (t).
Patterns of observation
Selection of the sample
For estimation purpose we have to draw a sample of size n
from a population.
It often happens that subjects can enter the sample only if they
are in a given state.
For instance in an illness-death model the study sample may be
a random sample of healthy subjects. The condition is:
Xtii0 = 0
This is a kind of left-truncation which must be taken into
account for inference.
Left- and right-censoring
Subject i may be first observed at time Li until time Ci . We
observe:
(Xti , Li ≤ t ≤ Ci )
If Li = 0 this the extension of right-censoring to multistate
processes.
Discrete-time observations
We may consider that the state of the process i is observed at
only a finite number of times V0i , V1i , . . . , Vmi .
This typically happens in cohort studies where fixed visit times
have been planned. In such cases the exact times of transitions
are not known; it is only known that they occurred during a
particular interval; these observations are said to be
interval-censored.
It is also possible that the state of the process is not exactly
observed but it is known that it belongs to a subset of
{0, 1, . . . , K }
Mixed continuous and discrete-time observations
The most common pattern of observation is in fact a mixing of
discrete and continuous time observations. This is because
most multi-state models include states which represent clinical
status and one state which represents death: most often clinical
status is observed at discrete times (visits) while the (nearly)
exact time of death can be retrieved. It may also happen that
some events other than death are observed exactly.
Ignorability of the observation process
For writing reasonably simple likelihoods, there must be some
kind of independence of the mechanisms leading to incomplete
observations relatively to the process itself. A simple likelihood
can be written if the observation times are fixed. More
realistically, the observation process should be considered as
random and intervene in the likelihood. The mechanism leading
to incomplete data will be said to be ignorable if the likelihood
treating the observation process as non-random leads to the
same inference as the correct likelihood. An instance where
this works is the case of observation processes completely
independent of the processes of interest Xi . In Andersen et al.
(1993) a more general condition of independent censoring is
rigorously defined.
Likelihood
The likelihood or Bayesian approaches appear safer than using
marginal models because of selection and censoring.
Likelihood given with the following assumptions:
ignorability and sample size ancillary
conditional on the state at the first observation time V0
the processes X i s are independent
the processes are Markov
The likelihood can be written easily in terms of the phj (., .) and
the αhj (.).
For the usual semi-Markov model the same formulas can be
used but since these quantities are random it may be
necessary to take an expectation given the observations.
Likelihood
The likelihood or Bayesian approaches appear safer than using
marginal models because of selection and censoring.
Likelihood given with the following assumptions:
ignorability and sample size ancillary
conditional on the state at the first observation time V0
the processes X i s are independent
the processes are Markov
The likelihood can be written easily in terms of the phj (., .) and
the αhj (.).
For the usual semi-Markov model the same formulas can be
used but since these quantities are random it may be
necessary to take an expectation given the observations.
Likelihood: continuous-time observations
The process X is continuously observed from V0 to C. We
observe that transitions have occured at (exactly) times
T1 , T2 , . . . , Tm . With the convention T0 = V0 , the likelihood
(conditional on X (V0 )) is:
m−1
Y
L=[
r =0
pX (Tr ),X (Tr ) (Tr ,Tr +1 −)αX (Tr ),X (Tr +1 ) (Tr +1 )]pX (Tm ),X (Tm ) (Tm ,C).
Likelihood: discrete-time observations
Observations of X are taken at V0 , V1 , . . . , Vm . We write the
likelihood as if the Vj were fixed (ignorability):
L=
m−1
Y
r =0
pX (Vr ),X (Vr +1 ) (Vr ,Vr +1 ).
Likelihood: Mixed discrete-continuous time
observations
For simplicity we give the likelihood when the process is
observed at discrete times but time of transition towards one
absorbing state, representing generally death, is exactly
observed or right-censored. Denote by K this absorbing state.
Observations of X are taken at V0 , V1 , . . . , Vm and the vital
status is observed until C ( C ≥ Vm ). Let us call T̃ the follow-up
time that is T̃ = min(T , C), where T is the time of death; we
observe T̃ and δ = I{T ≤ C}.
Likelihood: Mixed discrete-continuous time
observations
The likelihood is:
L=
m−1
Y
r =0
pX (Vr ),X (Vr +1 ) (Vr ,Vr +1 )
X
j6=K
pX (Vm ),j (Vm ,T̃ −)αj,K (T̃ )δ .
Example: likelihood in the illness-death model
If the subject starts in state “health”, has never been observed
in the “illness” state and was last seen at visit L (at time VL ) the
likelihood is:
L = p00 (V0 , VL )[p00 (VL , T̃ )α02 (T̃ )δ + p01 (VL , T̃ )α12 (T̃ )δ ];
Example: likelihood in the illness-death model
If the subject has been observed in the illness state for the first
time at VJ then the likelihood is:
L = p00 (V0 , VJ−1 )p01 (VJ−1 , VJ )p11 (VJ , T̃ )α12 (T̃ )δ .
See Commenges and Gégout-Petit (Scand J Stat, 2007) for a
rigorous justification of these formulas.
Likelihood inference
Once we have a likelihood L(A0 (.), X ) I distinguish 6 ways of
getting estimators
Non-parametric: no restriction on A0 (.)
Parametric: A0 (.) ∈ M: M = (A0,θ (.))θ∈Θ
Flexible Parametric: A0 (.) ∈ Mλ (a family of models)
Smooth non-parametric by smoothing a non-parametric
estimate
Smooth non-parametric by penalized likelihood
Bayesian approach
The non-parametric methods may be combined with a
parametric form for effects of covariates leading to
semi-parametric models.
Case with continuous time observations
In this particular case, the problem can be split in as many
survival problems as there are transitions. For instance if
subject i enters state h at Thi and makes a h− > j transition at
Tji his likelihood contribution to estimation of αhj (.) is
−
e
R Tji
Ti
h
αhj (u)
αhj (Tji )
If a subject makes a transition to another state it is considered
as censored. So, estimation of αhj (.) can be done by survival
methods allowing left-truncation (or scattered entry). Estimation
may be parametric or semi-parametric: for instance a Cox
model with Breslow estimator for αhj (.) could be used.
Continuous time observations: inference by splitting
into survival problems
See Andersen and Keiding (2002), Multi-state models for event
history analysis, Stat Meth Med Res
Section 5.6
However this is only for estimating the parameters. For
inference about outcomes (probability of being in a state,
expectation of sojourn time) we must revert to the multi-state
model.
See also Andersen and Perme (2008) , Inference for outcome
probabilities in multistate models, Lifetime Data Analysis,
405-431.
The computational problem with interval censored
observations
With discrete time observations the likelihood involves
ph,j (Vr , Vr +1 ), h 6= j, where h = X (Vr ) and j = X (Vr +1 ).
The transition probabilities are linked to the transition
intensities. For the irreversible Markov illness-death model:
p00 (s, t) = e−A01 (s,t)−A02 (s,t)
p11 (s, t) = e−A12 (s,t)
Z t
p01 (s, t) =
p00 (s, u)α01 (u)p11 (u, t)du,
s
Integral computed numerically
Multiplicity = number of interval-censored transitions.
Homogeneous Markov Model
Because of the relationship:
0
P(t) = etA
the likelihood is numerically easy to compute even with
discrete-time observations,
because for a diagonal matrix the exponential is the diagonal
matrix with exponential of the original terms.
The model can be extended to piece-wise homogeneous
models.
Non-Homogeneous Markov Model: non-parametric
approach
A non-parametric approach in the spirit of Turnbull (1976) has
been proposed by Frydman (1992) for a three-state model.
Recently Halina Frydman has made an extension of that:
Frydman and Szarek (2009), Nonparametric estimation in a
Markov Illness-death process from interval censored
opbservations with missing intermediate transition status,
Biometrics.
However there are analytical and computational difficulties.
Non-Homogeneous Markov Model: penalized
likelihood approach
In the semi-parametric case where one baseline function is
estimated for each transition, the penalized likelihood is:
Z ∞
X
0
0
00
pl(A (.), β) = l(A (.), β) −
κhj
[αhj0
(u)]2 du,
h,j>h
0
where l is the loglikelihood, A0 (.) is the matrix of baseline
k.
hazard functions, β is the array of regression parameters βhj
Non-Homogeneous Markov Model: penalized
likelihood approach
The penalty term excludes that discrete or unsmooth functions
maximize pl(A0 (.), β). The solution of the penalized likelihood is
approximated using a basis of splines.
Choice of the smoothing parameter: crossvalidation.
Applications
1
HIV infection in haemophiliacs
2
The Dementia-death model
3
Dementia-Institution-Death model
0: HIV-
α01 (t)-
1: HIV+
α12 (τ )-
2: AIDS
Figure: Progressive three-state Model: if τ = t this is a Markov model;
if τ is the time since the 0 → 1 transition this is a semi-Markov model.
The Illness-death model for dementia
α01 (t)
0: Health
α02 (t)
J
J
J
J
J
J
^
-
2: Death
1: Dementia
α12 (t)
Objectives
Estimate the age-specific incidence of dementia
Estimate the age-specific mortality for demented
risk factors of dementia
Characteristics:
Time will be age
We have interval-censored data for onset of dementia
Death observed in continuous time
The Paquid Study: design
Paquid
3777 subjects were randomly taken from the population of
persons aged 65 or more living at home in a region around
Bordeaux, France. The study began in 1988 and the subjects
were diagnosed at the initial visit and then one, three, five and
eight years, ten years there after.
Observations
Subjects demented at the initial visits were excluded: left
truncation.
The situation was that of the mixed discrete-continuous time
pattern of observation since dementia status was observed in
discrete time, but the date of death could be retrieved exactly
The Paquid Study: number of observed cases
The sample consisted of 3675 subjects. During the 10 years of
follow-up, 437 incident cases of dementia were observed of
whom 161 died; 1299 subjects were observed in the healthy
state at the last visit before death (due to the interval censoring
part of them may have developed dementia before death
without being observed demented).
Age-specific incidence: survival and illness-death
Age-specific incidence and mortality rates: women
Age-specific incidence and mortality rates: men
Dementia-Institution-Death
1: Demented
α01 (t)
α04 (t) -
0: Healthy
@
@
@ α13 (t)
@
α14 (t)
@
@
@
?
R
@
4: Dead
6
α34 (t)
α24 (t)
@
@
α02 (t)
@
α23 (t)
@
@
R
@
2: Instit
3: Dem+Inst
Inference in Multi-state Models from Interval
Censored Data
A. Alioum
Biostatistics Team INSERM U897
Bordeaux School of Public Health, Bordeaux Segalen University
April 19, 2011
3rd Channel Network Conference Bordeaux, April 11, 2011
Multistate Models with Interval Censored Data Short Course
Part II: How to deal in practice with interval censored
data ?
This part of the course focuses on methods for analysis of
interval-censored data that have been implemented on
software, packages or programs
1
Two-state survival model
2
Illness-Death Model
3
More general multi-state models
Two-state model
Alive
Dead
Death
α01 (t)
State 0
- State 1
Onset of disease
Healthy
Diseased
Notations
T = time to failure or death = sojourn time in state 0
Hazard function (transition intensity)
λ(t) = α01 (t) = lim∆t→0 p01 (t, t + ∆t)/∆t
Survival function/CDF (transition probabilities)
Rt
S(t) = p00 (0, t) = exp − 0 α01 (u)du ;
F (t) = p01 (0, t) = 1 − p00 (0, t)
Objectives
Estimate the survival (hazard) function
Compare the distributions of failure time in two or more
populations
Set up a regression model to describe the distribution of
failure time for populations with different values of
covariates.
Review papers
Recent review papers on survival methods for
interval-censored data : Lesaffre et al. (2005), Gomez et al.
(2009), Zhang and Sun (2010),...
Observation schemes
Case II interval censored data
Instead of observing the actual value of the random
variable T , we only observe a window (L, R] such that
T ∈ (L, R]. Here L can be 0 (left-censored observation), R
can be infinite (right-censored observation), and
L = R = T (exact observation).
In cohort studies with periodic follow-up and m monitoring
times V1 , V2 , ..., Vm , the event of interest is only observed
to be occurred between two consecutive visit times Vl ,
Vl+1 and the observed data reduce to the interval
(L, R] = (Vl , Vl+1 ].
Others types of interval censoring
Current status, doubly-censored data, panel data
Assumptions
Non-informative censoring
Non-informative interval censoring assumption:
P(T ≤ t|L = l, R = r , L < T ≤ R) = P(T ≤ t|l < T ≤ r )
Except for the fact that T lies between l and r which are
the realisations of L and R, the interval (L, R] does not
provide any extra information for T (e.g. pre-specified
examination times)
However, if the survival process and the follow-up process
are not independent, the follow-up process may contain
the information about the survival process, hence the
processes must be modeled simultaneously to ensure the
valid inferences.
Simplified likelihood
In a study with n subjects, their potential times to event
T1 , T2 , ..., Tn are not observed and the observable data set
is then D = {(Li , Ri ], 1 ≤ i ≤ n}
Under the assumption of independent censoring, the
"partial likelihood" is given by
L(S(.)|D) =
n
Y
i=1
[S(Li ) − S(Ri )]
Example 1
Breast cancer data (Finkelstein and Wolfe, 1985)
94 breast cancer patients who were given radiation (RT) or
radiation therapy plus chemotherapy (RCT).
Planned visits every 4 to 6 months, but actual visits vary
from patients to patients, times between visits also vary
Event time to breast retraction (cosmetic appearance)
Non-parametric estimation of the survival function
The equivalent of the Kaplan-Meier estimation for
interval-censored data was first proposed by Peto (1973),
and Turnbull (1976) then improved the numerical algorithm
to estimate the survival function (EM algorithm)
From the data {(Li , Ri ], i = 1, 2, ..., n} a set of
non-overlapping intervals {(q1 , p1 ], (q2 , p2 ], ..., (qm , pm ]} is
generated over which S(t) is estimated : the NPMLE of
S(t) can decrease only on intervals (qj , pj ].
Other more efficient algorithm have been proposed : ICM
(Groeneboom and Wellner, 1992), EM-ICM (Wellner and
Zahn (1997)
NPMLE of the survival curve
Non-parametric comparison survival curves
Comparing survival curves under interval-censoring :
extension of the existing test statistics for right-censored
data (Sun, 2006)
Two classes of test statistics : rank-based (weighted
log-rank tests), survival-based (weighted differences of
estimated survival functions)
SAS macros for analyzing interval-censored data (ICE,
EMICM, ICSTEST ; http://support.sas.com/kb/24980)
R package Icens (Gentleman and Vandal, 2008) : compute
the NPMLE of the survival function under interval
censoring using different algorithms.
PH models for interval-censored data
Simplified likelihood
L(S0 (.), β|D) =
n
Y
0
0
[S0 (Li )exp (β Zi ) − S0 (Ri )exp (β Zi ) ]
i=1
Estimation (non-parametric) of the baseline survival
function S0 (t) and of the regression coefficients
(Finkelstein, 1986; Alioum and Commenges, 1996;
Goetghebeur and Ryan, 2000).
No available user-friendly program that implements such
models.
Fully parametric approaches
Main advantage : computationally straightforward via
maximum likelihood methods
Common choice for the baseline distribution of T :
exponential, Weibull, log-logistic, log-normal,...
Log-linear representation (AFT model)
Disadvantage : the parametric assumptions can cause
bias in inference if incorrectly specified and are difficult to
assess with interval-censored data
R package Intcox developed by Henschel et al. (2007) based
on a method proposed by Pan (1999) which assumes that S0 (t)
is piecewise constant.
Log-linear representation (AFT model)
Available packages
SAS (lifereg), R (survreg) and other standard statistical
packages are able to fit log-linear models in the presence of
interval-censored data
AFT model : Y = log(T ) = µ + β 0 Z + σ
If a Weibull distribution is chosen for T (extreme value
distribution for ), then AFT model is equivalent to PH
model (pay attention to the parametrization !!!)
S(t) = exp (αt γ ), where σ = γ −1 and α = exp (−µ/σ)
The regression coefficients of the PH model are given by:
βPH = −βAFT /σ
SAS macro PARM ICE: three-parameters family of parametric
regression models for interval-censored survival data with
time-dependent covariates (Sparling et al., 2006)
Penalized likelihood approach
Smooth estimation of hazard functions
Kernel-based approaches
Splines to approximate hazard functions (Rosenberg, 1995)
Penalized likelihood approach (Joly et al., 1998)
Penalized likelihood
Z
pl(λ(.), β|D) = L(λ(.), β|D) − κ
0
+∞
[λ000 (u)]2 du
- PH regression model : λ(t; Z ) = λ0 (t) exp (β 0 Z )
- Choice of the smoothing parameter κ via crossvalidation
- NPMLE of λ(t) approximated using M-splines
Fortran program PHMPL
Program PHMPL (Joly et al., 1999) : available on
http://etudes.isped.u-bordeaux2.fr/BIOSTATISTIQUE/PHMPL/US-Biostats-PHMPL.htm
estimates smooth hazard/survival functions for
interval-censored data, and possibly left-truncated data
estimates baseline hazard function et regression
coefficients in a PH model
The user must fill a file phmpl.inf with information about the
model he wishes to estimate
Bcancer.txt
94
1
Tgroup 1
12
0
1000.
1
surv.gr
risq.gr
Fortran program PHMPL
Regression results
PHMPL
Variable : Tgroup
beta : 9.429119E-01 SE(beta) 2.797310E-01
Wald : 3.370781
RR : 2.567447 95% IC 1.483849 4.442355
SAS lifereg
Paramètre
DF Estimation
Intercept
1
3.9309
Tgroup
1
-0.5716
Scale
1
0.5201
Weibull Shape 1
1.9228
βPH = −βAFT /σ = 1.10
RR=3.00
standard
0.1283
0.1581
0.0619
0.2289
Multi-state models with interval-censored data
The illness-death model
More general multi-state models
Markov models
Case II interval censoring
Non informative censoring
Proportional intensities regression models
The illness-death model
α01 (t)
0: Health
α02 (t)
J
J
J
J
J
J
^
-
2: Death
1: Illness
α12 (t)
In the absence of interval censoring
One can consider the transitions separately and use
standard methods for analyzing survival data with right
censoring and left truncation to estimate the transition
intensities and the regression parameters
Alternative to the PH model: additive model for estimation
of covariate-dependent transition intensities (Aalen et al.,
2001)
R package mstate (de Wreede, Fiocco and Putter, 2011)
Non-parametric and semi-parametric models : cumulative
transition intensities, transition probabilities, prediction,
graphical representation of the results
Illness-death model with interval-censored data
Observations pattern : times of transition from 0 to 1 are
interval-censored, but times to the absorbing state are
assumed to be known exactly or to be right-censored
If you’re just interested in α01 , you can use survival methods
for interval censored data, but α01 will be underestimated
(e.g. incidence of dementia ; Joly et al., 2002)
Hence the advantage of using multi-state models for
interval-censored data
Non-parametric inference :
Frydman (1995) ; Frydman and Szarek (2008)
Computationally difficult and instable
Parametric/Smooth non-parametric
Parametric
Easy to implement using maximum likelihood methods
A R package is under construction to fit a Weibull
illness-death model for interval-censored, and possibly left
truncated, data (Biostatistics Team Bordeaux; MOBIDYCK
project)
Smooth non-parametric via penalized likelihood (Joly et al.,
2002)
0 Z ), hj ∈ {01, 02, 12}
αhj (t; Z ) = αhj,0 (t) exp (βhj
pl(α01 (.), α02 (.), α12 (.); β01 , β02 , β12 ) =
Z +∞
00
L(αhj (.); βhj ) − κ01
[α01,0
(u)]2 du
0
Z +∞
Z +∞
00
2
00
−κ02
[α02,0 (u)] du − κ12
[α12,0
(u)]2 du
0
0
Parametric/Smooth non-parametric
NPMLE α̂01 (.), α̂02 (.), α̂12 (.) are approximated using
M-splines
α̃hj (t) =
X
Λ̃hj (t) =
X
l
θhj
Ml (t)
l
l
θhj
Il (t)
l
κ01 , κ02 , κ12 are estimated simultaneously by maximizing
the standard crossvalidation score
A package R is currently under implementation and will be
submitted soon to fit this model for interval-censored, and
possibly left truncated, data (Biostatistics Team Bordeaux;
MOBIDYCK project)
More complex models : Dementia-Institution-Death
More complex models : Dementia-Institution-Death
Smooth non-parametric by penalized likelihood
Need some additional assumptions (Joly et al., 2009)
α14 (t) = α04 (t)eθ14
α24 (t) = α04 (t)eθ24
α34 (t) = α04 (t)eθ34
α23 (t) = α01 (t)eθ23
α13 (t) = α02 (t)eθ13
pl(α01 (.), α02 (.), α04 (.); θ) =
Z
+∞
00
L(α01 , α02 , α04 ; θ) − κ01
[α01
(u)]2 du
0
Z +∞
Z +∞
00
2
00
−κ02
[α02 (u)] du − κ04
[α04
(u)]2 du
0
0
Panel data
States X i (t) of individual i(i = 1, 2, ..., n) is only known at
discrete consecutive follow-up times t = (ti,0 , ti,1 , ...., ti,mi ), and
the exact time of transitions from one state to another is not
observed
Likelihood conditionally on X (ti,0 )
mi
n Y
Y
pX (ti,j−1 ),X (ti,j ) (ti,j−1 , ti,j ; Zi (ti,j−1 ))
i=1 j=1
If X (ti,mi ) = K (single absorbing state) and ti,mi is the exact
time of transition to K , then the expression of the last term
of the product is given by
X
pX (ti,m −1 ),l (ti,mi −1 , ti,mi ; Zi (ti,mi −1 ))αl,K (ti,mi )
i
l6=K
Example of more complex multi-state models
Disablement process in the elderly (Paquid study)
HMM/NHMM with PCI
Time-homogeneous Makov models are often used to
analyze such data : αhj (t) = αhj and phj (s, t) depends only
on t − s, that is phj (s, t) = phj (0, t − s) = phj (t − s)
Example: transition probabilities in the time-homogeneous
illness-death model
p00 (t) = exp (−(α01 + α02 )t)
p01 (t) =
α01 [exp (−α12 t) − exp (−(α01 + α02 )t)]
α01 + α02 − α12
p11 (t) = exp (−α12 t)
PIM : αhj (t; Z (t)) = αhj0 exp (β 0 Z (t))
HMM/NHMM with PCI
NHMM with piecewise constant intensities: partitioning
time into 2 or more intervals and assume constant intensity
in each interval
PIM : αhj (t; Z (t)) = αhj0 (t) exp (β 0 Z (t))
l
where αhj0 (t) = αhj0
for τl−1 < t ≤ τl , (l = 1, 2, ..., r )
The expressions of transition probabilities phj (s, t) become
more complicated (even if we keep the idea of fitting of
homogeneous models to a series of r intervals)
This method needs prior specification of the cutpoints; the
choice of cutpoints should make sure that an appropriate
number of observations fall in all intervals
Programs and Packages for fitting HMM/NHMM with
PCI
MKVPCI
Fortran program (Alioum and Commenges, 2001) available
on http://etudes.isped.u-bordeaux2.fr/BIOSTATISTIQUE/MKVPCI/US-Biostats-MKVPCI.htm
Fitting general k-state Markov models (k − 1 transient
states and the k th state can be optionally chosen as an
absorbing state) with PCI and covariates
The output includes estimates of the baseline transition
intensities, regression coefficients, transition intensities in
time intervals (τl−1 , τl ], l = 1, 2, ..., r , together with their
estimated standard errors, and the results of the
multivariate hypotheses tests.
Example 2: disablement process in the elderly
File of input parameter for MKVPCI
disab.dat
15439
5
10
01001
10101
01011
00101
00000
1
80.0
1 2 3 4 5 6 7 8 9 10
1
2
gender 0 1 2 3 4 5 6 7 8 9 10
educ 0 1 2 3 4 5 6 7 8 9 10
1
1
0
Example 2: disablement process in the elderly
Multi-state Markov model with piecewise constant intensities
Program developed by Ahmadou ALIOUM
ISPED - University Victor Segalen Bordeaux 2
Markov model with piecewise constant intensities : two periods
Data file : disab.dat
Number of records : 15439
Number of subjects : 3461
Number of states : 5
Number of transitions : 10
Number of artificial covariates : 1
Cut points : 80.000000
Number of observed covariates : 0
Number of parameters : 20
Exact transition times to the absorbing state : 1
Baseline transition intensities
Time intervals
Tau(0) - Tau(1)
Tau(1) - Tau(2)
TRANSITION ESTIMATES STD ERROR
ESTIMATES STD ERROR
1 –> 2
.28432 .01180
.57785 .04278
1 –> 5
.01099 .00274
.02167 .01165
2 –> 1
.17916 .00870
.09423 .01089
2 –> 3
.13516 .00661
.29438 .01282
2 –> 5
.02707 .00339
.04016 .00665
3 –> 2
.17757 .01238
.05033 .00475
3 –> 4
.07652 .00834
.14481 .00827
3 –> 5
.05382 .00775
.08822 .00711
4 –> 3
.10510 .02144
.04460 .00794
4 –> 5
.21518 .02759
.35568 .01823
R package msm
Package developped by Christopher H. Jackson
http://www.jstatsoft.org/v38/i08/ available on http://CRAN.R-project.org/package=msm
Markov model with any number of states and any pattern
of transitions to panel data : HMM, NHMM with PCI
Proportional intensities models : piecewise constant
time-dependant covariates
Exact death times, censored states (alive but in an
unknown disease state at the end of the study)
Hidden Markov models in which the states of the Markov
chain are not observed : observed data Xij are governed
by some probability distribution conditionally on the
unobserved state Sij
Tools for model assessment
R package tdc.msm
Meira-Machado et al. (2008)
R package tdc.msm
Package available on http://www.mct.uminho.pt/lmachado/Rlibrary
In the absence of interval censoring : Time-dependant Cox
regression model, Cox Markov model , Cox semi-Markov
model, Non-parametric Makov model (fitting separate
intensities to all permitted transitions using standard
survival methods)
Panel data : HMM/NHMM with PCI
Summary
Two-state survival model with interval-censored data:
Most of the standard methods for right-censored data were
extended to interval-censored data
SAS: macros ICE, EMICM, ICSTEST ; lifereg
R packages: Intcox, survreg,
Fortran program : PHMPL
Illness death model with interval-censored data
Non-parametric, smooth non-parametric via penalized
likelihood, parametric
R package under construction by the Biostatistics Team
Bordeaux (Weibull, PL)
More general multi-state models (HMM/NHMM with PCI)
R packages: msm, tdc.msm
Fortran program: MKVPCI
References
1
General books on survival data analysis and multistate models are by Andersen et al. (1993),
Hougaard (2000). A good book for learning stochastic processes in discrete time is Williams
(1991). Review papers for survival models with interval-censored data are Lesaffre et al. (2005),
Gómez et al. (2009), Zhang and Sun (2010). Reviews on multi-state models are by Putter et al.
(2007) and for interval-censored observations from multistate models by Commenges (1999),
Commenges (2002). Survival with interval censored data have been studied Peto (1973), Turnbull
(1976), Pan (1999) and extended to regression modeling by Finkelstein (1986), Alioum and Commenges (1996), Joly et al. (1998). Interesting studies of multistate models are by Keiding (1991),
Andersen (1988), Gentleman et al. (1994), Hougaard (1999), Keiding et al. (2001), Schumacher
et al. (2007). Interval-censored data in multistate models have been studied non-parametrically
by Frydman (1995) and Frydman and Szarek (2009) and using penalized likelihood by Joly and
Commenges (1999), Joly et al. (2002), Commenges et al. (2004), Commenges et al. (2007), Joly
et al. (2009). A rigorous derivation of the likelihood for interval censored observations in multistate
models has been given by Commenges and Gégout-Petit (2007).
Software for survival with interval-censored data are by Joly et al. (1999), Henschel et al. (2009),
Sparling et al. (2006), Gentleman and Vandal (2008). Andersen and Keiding (2002) expalins how
to estimate multistate models observed in continuous time using standard software for survival
data analysis. Software for multistate models observed in continuous time is mstate proposed in
De Wreede et al. (2010). Aalen et al. (2001) has proposed a R program for the additive model.
Software for multistate model for interval-censored data are described in Alioum and Commenges
(2001), Jackson (2011), Meira-Machado et al. (2007).
References
Aalen, O., Borgan, Ø., and Fekjaer, H. (2001). Covariate adjustment of event histories estimated
from Markov chains: the additive approach. Biometrics 57, 993–1001.
2
Shortcourse
Alioum, A. and Commenges, D. (1996). A proportional hazards model for arbitrarily censored and
truncated data. Biometrics 52, 512–524.
Alioum, A. and Commenges, D. (2001). MKVPCI: a computer program for Markov models with
piecewise constant intensities and covariates. Computer methods and programs in biomedicine
64, 109–119.
Andersen, P. (1988). Multistate models in survival analysis: a study of nephropathy and mortality
in diabetes. Statistics in Medicine 7, 661–670.
Andersen, P., Borgan, O., Gill, R., and Keiding, N. (1993). Statistical Models Based on Counting
Processes. New-York: Springer-Verlag.
Andersen, P. and Keiding, N. (2002). Multi-state models for event history analysis. Statistical
Methods in Medical Research 11, 91.
Commenges, D. (1999). Multi-state models in epidemiology. Lifetime data analysis 5, 315–327.
Commenges, D. (2002). Inference for multi-state models from interval-censored data. Statistical
Methods in Medical Research 11, 167.
Commenges, D. and Gégout-Petit, A. (2007). Likelihood for generally coarsened observations
from multistate or counting process models. Scandinavian Journal of Statistics Theory and
Applications 34, 432.
Commenges, D., Joly, P., Gégout-Petit, A., and Liquet, B. (2007). Choice between semi-parametric
estimators of Markov and non-Markov multi-state models from coarsened observations. Scandinavian journal of statistics 34, 33–52.
Commenges, D., Joly, P., Letenneur, L., and Dartigues, J. (2004). Incidence and mortality of
Alzheimer’s disease or dementia using an illness-death model. Statistics in medicine 23, 199–
210.
De Wreede, L., Fiocco, M., and Putter, H. (2010). The mstate package for estimation and prediction
in non-and semi-parametric multi-state and competing risks models. Computer methods and
References
3
programs in biomedicine .
Finkelstein, D. (1986). A proportional hazards model for interval-censored failure time data.
Biometrics 42, 845–854.
Frydman, H. (1995). Nonparametric estimation of a Markov illness-deathprocess from intervalcensored observations, with application to diabetes survival data. Biometrika 82, 773.
Frydman, H. and Szarek, M. (2009). Nonparametric Estimation in a Markov Illness–Death Process
from Interval Censored Observations with Missing Intermediate Transition Status. Biometrics
65, 143–151.
Gentleman, R., Lawless, J., Lindsey, J., and Yan, P. (1994). Multi-state Markov models for
analysing incomplete disease history data with illustrations for hiv disease. Statistics in
Medicine 13, 805–821.
Gentleman, R. and Vandal, A. (2008). Icens: NPMLE for Censored and Truncated Data. R package
version 1,.
Gómez, G., Calle, M., Oller, R., and Langohr, K. (2009). Tutorial on methods for interval-censored
data and their implementation in R. Statistical Modelling 9, 259.
Henschel, V., Heiß, C., and Mansmann, U. (2009). intcox: Compendium to apply the iterative
convex minorant algorithm to interval censored event data.
Hougaard, P. (1999). Multi-state models: a review. Lifetime Data Analysis 5, 239–264.
Hougaard, P. (2000). Analysis of multivariate survival data. Springer Verlag.
Jackson, C. (2011). Multi-State Models for Panel Data: The msm Package for R. Journal of
Statistical Software 38, 1–29.
Joly, P. and Commenges, D. (1999). A Penalized Likelihood Approach for a Progressive ThreeState Model with Censored and Truncated Data: Application to AIDS. Biometrics 55, 887–
890.
Joly, P., Commenges, D., Helmer, C., and Letenneur, L. (2002). A penalized likelihood approach
4
Shortcourse
for an illness-death model with interval-censored data: application to age-specific incidence of
dementia. Biostatistics 3, 433.
Joly, P., Commenges, D., and Letenneur, L. (1998). A penalized likelihood approach for arbitrarily
censored and truncated data: application to age-specific incidence of dementia. Biometrics 54,
185–194.
Joly, P., Durand, C., Helmer, C., and Commenges, D. (2009). Estimating life expectancy of
demented and institutionalized subjects from interval-censored observations of a multi-state
model. Statistical Modelling 9, 345.
Joly, P., Letenneur, L., Alioum, A., and Commenges, D. (1999). PHMPL: a computer program
for hazard estimation using a penalized likelihood method with interval-censored and lefttruncated data. Computer methods and programs in biomedicine 60, 225–31.
Keiding, N. (1991). Age-specific incidence and prevalence: a statistical perspective. Journal of the
Royal Statistical Society. Series A (Statistics in Society) 154, 371–412.
Keiding, N., Klein, J., and Horowitz, M. (2001). Multi-state models and outcome prediction in
bone marrow transplantation. Statistics in Medicine 20, 1871–1885.
Lesaffre, E., Komrek, A., and Declerck, D. (2005). An overview of methods for interval-censored
data with an emphasis on applications in dentistry. Statistical Methods in Medical Research
14, 539–552.
Meira-Machado, L., Cadarso-Suárez, C., and de Uña-Álvarez, J. (2007). tdc. msm: an R library
for the analysis of multi-state survival data. Computer methods and programs in biomedicine
86, 131–140.
Pan, W. (1999). Extending the iterative convex minorant algorithm to the cox model for intervalcensored data. Journal of Computational and Graphical Statistics 8, 109–120.
Peto, R. (1973). Experimental survival curves for interval-censored data. Applied Statistics 22,
86–91.
References
5
Putter, H., Fiocco, M., and Geskus, R. (2007). Tutorial in biostatistics: competing risks and multistate models. Statistics in Medicine 26, 2389–2430.
Schumacher, M., Wangler, M., Wolkewitz, M., and Beyersmann, J. (2007). A Simple and Useful
Application of Multistate Models. Methods Inf Med 46, 595–600.
Sparling, Y., Younes, N., Lachin, J., and Bautista, O. (2006). Parametric survival models for
interval-censored data with time-dependent covariates. Biostatistics 7, 599.
Turnbull, B. (1976). The empirical distribution function with arbitrarily grouped, censored and
truncated data. Journal of the Royal Statistical Society. Series B (Methodological) 38, 290–
295.
Williams, D. (1991). Probability with martingales. Cambridge.
Zhang, Z. and Sun, J. (2010). Interval censoring. Statistical Methods in Medical Research 19, 53.
Related documents