DP14352 Imputing Missing Values in the US Census Bureau's County Business Patterns
|Author(s):||Fabian Eckert, Teresa C Fort, Peter K. Schott, Natalie Yang|
|Publication Date:||January 2020|
|Programme Areas:||Labour Economics, International Trade and Regional Economics, Macroeconomics and Growth|
|Link to this Page:||cepr.org/active/publications/discussion_papers/dp.php?dpno=14352|
The County Business Patterns data published by the US Census Bureau track employment by county and industry from 1946 to the present. Two features of the data limit their usefulness to researchers in practice: (1) employment for the majority of county-industry cells is suppressed to protect confidentiality, and (2) industry classifications change over time. We address both issues. First, we develop a linear programming method that exploits the large set of adding-up constraints implicit in the hierarchical arrangement of the data to impute missing employment. Second, we provide concordances to map all data to a consistent set of industry codes.