Journal of the Royal Statistical Society: Series C (Applied Statistics)

A partially linear regression model for data from an outcome‐dependent sampling design

Journal Article

Summary.  The outcome‐dependent sampling scheme has been gaining attention in both the statistical literature and applied fields. Epidemiological and environmental researchers have been using it to select the observations for more powerful and cost‐effective studies. Motivated by a study of the effect of in utero exposure to poly‐chlorinated biphenyls on children's intelligence quotient at age 7 years, in which the effect of an important confounding variable is non‐linear, we consider a semiparametric regression model for data from an outcome‐dependent sampling scheme where the relationship between the response and covariates is only partially parameterized. We propose a penalized spline maximum likelihood estimation for inference on both the parametric and the non‐parametric components and develop their asymptotic properties. Through simulation studies and an analysis of the intelligence study, we compare the proposed estimator with several competing estimators. Practical considerations of implementing those estimators are discussed.

Related Topics

Related Publications

Related Content

Site Footer


This website is provided by John Wiley & Sons Limited, The Atrium, Southern Gate, Chichester, West Sussex PO19 8SQ (Company No: 00641132, VAT No: 376766987)

Published features on are checked for statistical accuracy by a panel from the European Network for Business and Industrial Statistics (ENBIS)   to whom Wiley and express their gratitude. This panel are: Ron Kenett, David Steinberg, Shirley Coleman, Irena Ograjenšek, Fabrizio Ruggeri, Rainer Göb, Philippe Castagliola, Xavier Tort-Martorell, Bart De Ketelaere, Antonio Pievatolo, Martina Vandebroek, Lance Mitchell, Gilbert Saporta, Helmut Waldl and Stelios Psarakis.