Working paper

Linear Regressions with Combined Data

Xavier D'Haultfoeuille, Christophe Gaillac, and Arnaud Maurel

Abstract

We study best linear predictions in a context where the outcome of interest and some of the covariates are observed in two different datasets that can-not be matched. Traditional approaches obtain point identification by relying, often implicitly, on exclusion restrictions. We show that without such restric-tions, coefficients of interest can still be partially identified and we derive a constructive characterization of the sharp identified set. We then build on this characterization to develop computationally simple and asymptotically normal estimators of the corresponding bounds. We show that these estimators exhibit good finite sample performances.

Keywords

Best linear prediction; data combination; partial identification; inference.;

Reference

Xavier D'Haultfoeuille, Christophe Gaillac, and Arnaud Maurel, Linear Regressions with Combined Data, TSE Working Paper, n. 24-1602, December 2024.

See also

Published in

TSE Working Paper, n. 24-1602, December 2024