Sanko, N, Hess, S, Dumont, J et al. (1 more author) (2014) Contrasting imputation with a latent variable approach to dealing with missing income in choice models. Journal of Choice Modelling, 12. 47 - 57. ISSN 1755-5345
Abstract
Income is a key variable in many choice models. It is also one of the most salient examples of a variable affected by data problems. Issues with income arise as measurement errors in categorically captured income, correlation between stated income and unobserved variables, systematic over- or under-statement of income and missing income values for those who refuse to answer or do not know their (household) income. A common approach for dealing especially with missing income is to use imputation based on the relationship among those who report income between their stated income for reporters and their socio-demographic characteristics. A number of authors have also recently put forward a latent variable treatment of the issue, which has theoretical advantages over imputation, not least by drawing not just on data on stated income for reporters, but also choice behaviour of all respondents. We contrast this approach empirically with imputation as well as simpler approaches in two case studies, one with stated preference data and one with revealed preference data. Our findings suggest that, at least with the data at hand, the latent variable approach produces similar results to imputation, possibly an indication of non-reporters of income having similar income distributions from those who report it. But in other data sets the efficiency advantage over imputation could help in revealing issues in the complete and accurate reporting of income.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | (c) 2014 Elsevier Ltd. All rights reserved. NOTICE: this is the author’s version of a work that was accepted for publication in Journal of Choice Modelling. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Journal of Choice Modelling, 12, (2014) DOI 10.1016/j.jocm.2014.10.001 |
Keywords: | Latent variables; Missing income; Discrete choice; Random heterogeneity |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 30 Mar 2015 12:19 |
Last Modified: | 21 Jan 2018 22:33 |
Published Version: | http://dx.doi.org/10.1016/j.jocm.2014.10.001 |
Status: | Published |
Publisher: | Elsevier Ltd |
Identification Number: | 10.1016/j.jocm.2014.10.001 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:84331 |