figshare
Browse
uasa_a_1498346_sm7564.zip (235.77 kB)

On the Use of the Lasso for Instrumental Variables Estimation with Some Invalid Instruments

Download (235.77 kB)
Version 5 2021-09-29, 13:54
Version 4 2020-08-24, 09:06
Version 3 2019-10-25, 13:12
Version 2 2018-11-13, 16:22
Version 1 2018-07-19, 20:43
dataset
posted on 2021-09-29, 13:54 authored by Frank Windmeijer, Helmut Farbmacher, Neil Davies, George Davey Smith

We investigate the behavior of the Lasso for selecting invalid instruments in linear instrumental variables models for estimating causal effects of exposures on outcomes, as proposed recently by Kang et al. Invalid instruments are such that they fail the exclusion restriction and enter the model as explanatory variables. We show that for this setup, the Lasso may not consistently select the invalid instruments if these are relatively strong. We propose a median estimator that is consistent when less than 50% of the instruments are invalid, and its consistency does not depend on the relative strength of the instruments, or their correlation structure. We show that this estimator can be used for adaptive Lasso estimation, with the resulting estimator having oracle properties. The methods are applied to a Mendelian randomization study to estimate the causal effect of body mass index (BMI) on diastolic blood pressure, using data on individuals from the UK Biobank, with 96 single nucleotide polymorphisms as potential instruments for BMI. Supplementary materials for this article are available online.

Funding

This research was partly funded by the Medical Research Council, MC_UU_12013/1, MC_UU_12013/9, and MC_UU_00011/1. Neil Davies further acknowledges support from the Economics and Social Research Council via a Future Leaders Grant, ES/N000757/1. Helmut Farbmacher acknowledges funding from the Fritz Thyssen Stiftung.

History