Smith, M.T., Álvarez, M.A. and Lawrence, N.D. orcid.org/0000-0001-9258-1030 (Submitted: 2019) Gaussian process regression for binned data. arXiv. (Submitted)
Abstract
Many datasets are in the form of tables of binned data. Performing regression on these data usually involves either reading off bin heights, ignoring data from neighbouring bins or interpolating between bins thus over or underestimating the true bin integrals. In this paper we propose an elegant method for performing Gaussian Process (GP) regression given such binned data, allowing one to make probabilistic predictions of the latent function which produced the binned data. We look at several applications. First, for differentially private regression; second, to make predictions over other integrals; and third when the input regions are irregularly shaped collections of polytopes. In summary, our method provides an effective way of analysing binned data such that one can use more information from the histogram representation, and thus reconstruct a more useful and precise density for making predictions.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2018 The Author(s). For reuse permissions, please contact the Author(s). |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Funding Information: | Funder Grant number Engineering and Physical Science Research Council EP/N014162/1 |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 17 Jan 2020 12:59 |
Last Modified: | 17 Jan 2020 12:59 |
Published Version: | https://arxiv.org/abs/1809.02010v2 |
Status: | Submitted |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:155248 |