Cover image for Unified methods for censored longitudinal data and causality
Title:
Unified methods for censored longitudinal data and causality
Personal Author:
Series:
Springer series in statistics
Publication Information:
New York, NY : Springer, 2003
ISBN:
9780387955568
Added Author:

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010119442 QA278.8 L32 2003 Open Access Book Book
Searching...

On Order

Summary

Summary

During the last decades, there has been an explosion in computation and information technology. This development comes with an expansion of complex observational studies and clinical trials in a variety of fields such as medicine, biology, epidemiology, sociology, and economics among many others, which involve collection of large amounts of data on subjects or organisms over time. The goal of such studies can be formulated as estimation of a finite dimensional parameter of the population distribution corresponding to the observed time- dependent process. Such estimation problems arise in survival analysis, causal inference and regression analysis. This book provides a fundamental statistical framework for the analysis of complex longitudinal data. It provides the first comprehensive description of optimal estimation techniques based on time-dependent data structures subject to informative censoring and treatment assignment in so called semiparametric models. Semiparametric models are particularly attractive since they allow the presence of large unmodeled nuisance parameters. These techniques include estimation of regression parameters in the familiar (multivariate) generalized linear regression and multiplicative intensity models. They go beyond standard statistical approaches by incorporating all the observed data to allow for informative censoring, to obtain maximal efficiency, and by developing estimators of causal effects. It can be used to teach masters and Ph.D. students in biostatistics and statistics and is suitable for researchers in statistics with a strong interest in the analysis of complex longitudinal data.


Author Notes

Mark J. van der Laan is Professor of Biostatistics and Statistics at the University of California, Berkeley, and is a prominent researcher in the area of censored data and causality. His methodological research is inspired by collaborations with biologists, epidemiologists, and medical researchers. He has designed courses in survival analysis, censored data, causal inference, and statistical methods in computational biology, and advises a group of Ph.D. students in these fields. He is currently Associate Editor of the statistical journals Biometrics, Journal of Statistical Planning and Infernece, and Statistical Applications in Genetics and Molecular Biology
James M. Robins is the Mitchell L. and Robin LaFoley Dong Professor of Epidemiology and Professor of Biostatistics at the Harvard School of Public Health. Over the past two decades, Professor Robins has developed novel statistical methods for inferring the causal effects of time-varying treatments or exposures from both observational and experimental data, and for appropriately adjusting for missing and censored data in very high-dimensional statistical models


Table of Contents

Prefacep. v
Notationp. 1
1 Introductionp. 8
1.1 Motivation, Bibliographic History, and an Overview of the bookp. 8
1.2 Tour through the General Estimation Problemp. 16
1.2.1 Estimation in a high-dimensional full data modelp. 17
1.2.2 The curse of dimensionality in the full data modelp. 21
1.2.3 Coarsening at randomp. 23
1.2.4 The curse of dimensionality revisitedp. 27
1.2.5 The observed data modelp. 40
1.2.6 General method for construction of locally efficient estimatorsp. 40
1.2.7 Comparison with maximum likelihood estimationp. 45
1.3 Example: Causal Effect of Air Pollution on Short-Term Asthma Responsep. 48
1.4 Estimating Functionsp. 55
1.4.1 Orthogonal complement of a nuisance tangent spacep. 55
1.4.2 Review of efficiency theoryp. 61
1.4.3 Estimating functionsp. 62
1.4.4 Orthogonal complement of a nuisance tangent space in an observed data modelp. 64
1.4.5 Basic useful results to compute projectionsp. 68
1.5 Robustness of Estimating Functionsp. 69
1.5.1 Robustness of estimating functions against misspecification of linear convex nuisance parametersp. 69
1.5.2 Double robustness of observed data estimating functionsp. 77
1.5.3 Understanding double robustness for a general semiparametric modelp. 79
1.6 Doubly robust estimation in censored data modelsp. 81
1.7 Using Cross-Validation to Select Nuisance Parameter Modelsp. 93
1.7.1 A semiparametric model selection criterianp. 94
1.7.2 Forward/backward selection of a nuisance parameter model based on cross-validation with respect to the parameter of interestp. 97
1.7.3 Data analysis example: Estimating the causal relationship between boiled water use and diarrhea in HIV-positive menp. 99
2 General Methodologyp. 102
2.1 The General Model and Overviewp. 102
2.2 Full Data Estimating Functionsp. 103
2.2.1 Orthogonal complement of the nuisance tangent space in the multivariate generalized linear regression model (MGLM)p. 105
2.2.2 Orthogonal complement of the nuisance tangent space in the multiplicative intensity modelp. 107
2.2.3 Linking the orthogonal complement of the nuisance tangent space to estimating functionsp. 111
2.3 Mapping into Observed Data Estimating Functionsp. 114
2.3.1 Initial mappings and reparametrizing the full data estimating functionsp. 114
2.3.2 Initial mapping indexed by censoring and protected nuisance parameterp. 124
2.3.3 Extending a mapping for a restricted censoring model to a complete censoring modelp. 125
2.3.4 Inverse weighting a mapping developed for a restricted censoring modelp. 126
2.3.5 Beating a given RAL estimatorp. 128
2.3.6 Orthogonalizing an initial mapping w.r.t. G: Double robustnessp. 131
2.3.7 Ignoring information on the censoring mechanism improves efficiencyp. 135
2.4 Optimal Mapping into Observed Data Estimating Functionsp. 137
2.4.1 The corresponding estimating equationp. 139
2.4.2 Discussion of ingredients of a one-step estimatorp. 141
2.5 Guaranteed Improvement Relative to an Initial Estimating Functionp. 142
2.6 Construction of Confidence Intervalsp. 144
2.7 Asymptotics of the One-Step Estimatorp. 145
2.7.1 Asymptotics assuming consistent estimation of the censoring mechanismp. 146
2.7.2 Proof of Theorem 2.4p. 150
2.7.3 Asymptotics assuming that either the censoring mechanism or the full data distribution is estimated consistentlyp. 151
2.7.4 Proof of Theorem 2.5p. 152
2.8 The Optimal Indexp. 153
2.8.1 Finding the optimal estimating function among a given class of estimating functionsp. 159
2.9 Estimation of the Optimal Indexp. 166
2.9.1 Reparametrizing the representations of the optimal full data functionp. 167
2.9.2 Estimation of the optimal full data structure estimating functionp. 169
2.10 Locally Efficient Estimation with Score-Operator Representationp. 170
3 Monotone Censored Datap. 172
3.1 Data Structure and Modelp. 172
3.1.1 Cause-specific censoringp. 175
3.2 Examplesp. 176
3.2.1 Right-censored data on a survival timep. 176
3.2.2 Right-censored data on quality-adjusted survival timep. 177
3.2.3 Right-censored data on a survival time with reporting delayp. 179
3.2.4 Univariately right-censored multivariate failure time datap. 181
3.3 Inverse Probability Censoring Weighted (IPCW) Estimatorsp. 183
3.3.1 Identifiability conditionp. 183
3.3.2 Estimation of a marginal multiplicative intensity modelp. 184
3.3.3 Extension to proportional rate modelsp. 191
3.3.4 Projecting on the tangent space of the Cox proportional hazards model of the censoring mechanismp. 192
3.4 Optimal Mapping into Estimating Functionsp. 195
3.5 Estimation of Qp. 196
3.5.1 Regression approach: Assuming that the censoring mechanism is correctly specifiedp. 197
3.5.2 Maximum likelihood estimation according to a multiplicative intensity model: Doubly robustp. 198
3.5.3 Maximum likelihood estimation for discrete models: Doubly robustp. 200
3.5.4 Regression approach: Doubly robustp. 201
3.6 Estimation of the Optimal Indexp. 204
3.6.1 The multivariate generalized regression modelp. 205
3.6.2 The multivariate generalized regression model when covariates are always observedp. 206
3.7 Multivariate failure time regression modelp. 208
3.8 Simulation and data analysis for the nonparametric full data modelp. 211
3.9 Rigorous Analysis of a Bivariate Survival Estimatep. 217
3.9.1 Proof of Theorem 3.2p. 221
3.10 Prediction of Survivalp. 224
3.10.1 General methodologyp. 225
3.10.2 Prediction of survival with Regression Treesp. 230
4 Cross-Sectional Data and Right-Censored Data Combinedp. 232
4.1 Model and General Data Structurep. 232
4.2 Cause Specific Monitoring Schemesp. 234
4.2.1 Overviewp. 235
4.3 The Optimal Mapping into Observed Data Estimating Functionsp. 236
4.3.1 Identifiability conditionp. 239
4.3.2 Estimation of a parameter on which we have current status datap. 241
4.3.3 Estimation of a parameter on which we have right-censored datap. 243
4.3.4 Estimation of a joint-distribution parameter on which we have current status data and right-censored datap. 244
4.4 Estimation of the Optimal Index in the MGLMp. 245
4.5 Example: Current Status Data with Time-Dependent Covariatesp. 246
4.5.1 Regression with current status datap. 248
4.5.2 Previous work and comparison with our resultsp. 250
4.5.3 An initial estimatorp. 251
4.5.4 The locally efficient one-step estimatorp. 252
4.5.5 Implementation issuesp. 253
4.5.6 Construction of confidence intervalsp. 255
4.5.7 A doubly robust estimatorp. 256
4.5.8 Data-adaptive selection of the location parameterp. 257
4.5.9 Simulationsp. 257
4.5.10 Example 1: No unmodeled covariatep. 258
4.5.11 Example 2: Unmodeled covariatep. 258
4.5.12 Data Analysis: California Partners' Studyp. 260
4.6 Example: Current Status Data on a Process Until Deathp. 262
5 Multivariate Right-Censored Multivariate Datap. 266
5.1 General Data Structurep. 266
5.1.1 Modeling the censoring mechanismp. 268
5.1.2 Overviewp. 270
5.2 Mapping into Observed Data Estimating Functionsp. 271
5.2.1 The initial mapping into observed estimating data functionsp. 271
5.2.2 Generalized Dabrowska estimator of the survival function in the nonparametric full data modelp. 273
5.2.3 Simulation study of the generalized Dabrowka estimatorp. 275
5.2.4 The proposed mapping into observed data estimating functionsp. 276
5.2.5 Choosing the full data estimating function in MGLMp. 282
5.3 Bivariate Right-Censored Failure Time Datap. 282
5.3.1 Introductionp. 282
5.3.2 Locally efficient estimation with bivariate right-censored datap. 286
5.3.3 Implementation of the locally efficient estimatorp. 290
5.3.4 Inversion of the information operatorp. 292
5.3.5 Asymptotic performance and confidence intervalsp. 293
5.3.6 Asymptoticsp. 294
5.3.7 Simulation methods and results for the nonparametric full data modelp. 299
5.3.8 Data analysis: Twin age at appendectomyp. 302
6 Unified Approach for Causal Inference and Censored Datap. 311
6.1 General Model and Method of Estimationp. 311
6.2 Causal Inference with Marginal Structural Modelsp. 318
6.2.1 Closed Form Formula for the Inverse of the Nonparametric Information Operator in Causal Inference Modelsp. 324
6.3 Double Robustness in Point Treatment MSMp. 326
6.4 Marginal Structural Model with Right-Censoringp. 329
6.4.1 Doubly robust estimators in marginal structural models with right-censoringp. 334
6.4.2 Data Analysis: SPARCSp. 338
6.4.3 A simulation for estimators of a treatment-specific survival functionp. 343
6.5 Structural Nested Model with Right-Censoringp. 347
6.5.1 The orthogonal complement of a nuisance tangent space in a structural nested model without censoringp. 353
6.5.2 A class of estimating functions for the marginal structural nested modelp. 357
6.5.3 Analyzing dynamic treatment regimesp. 359
6.5.4 Simulation for dynamic regimes in point treatment studiesp. 360
6.6 Right-Censoring with Missingnessp. 362
6.7 Interval Censored Datap. 366
6.7.1 Interval censoring and right-censoring combinedp. 368
Referencesp. 371
Author indexp. 388
Subject indexp. 394
Example indexp. 397