Last data update: 2014.03.03

R: Hot Deck Imputation Methods for Missing Data
HotDeckImputation-packageR Documentation

Hot Deck Imputation Methods for Missing Data

Description

This package provides hot deck imputation methods to resolve missing data. Methods provided are popular in survey methodology, mostly used in the context of large national statistics, but are also finding their way to data mining due to their computational simplicity. A key aspect of this package is the implementation of the commonly advocated donor-limit.

Details

Package: HotDeckImputation
Type: Package
Version: 1.1.0
Date: 2014-10-21
License: GPL-3

HotDeckImputation is the ever expanding implementation of hot deck imputation methods, such as the nearest neighbor, the CPS-sequential and random hot deck. The package aims to be comprehensive in the functionality provided, covering key aspects of hot deck imputation not found elsewhere.
Currently implemented functions include:
Nearest neighbor hot deck imputation.
Sequential hot deck imputation.
CPS sequential hot deck imputation.
Development requests are welcome.

Author(s)

Dieter William Joenssen

Maintainer: Dieter William Joenssen <Dieter.Joenssen@googlemail.com>

References

Andridge, R.R. and Little, R.J.A. (2010) A Review of Hot Deck Imputation for Survey Non-response. International Statistical Review. 78, 40–64.

Bankhofer, U. and Joenssen, D.W. (2014) On Limiting Donor Usage for Imputation of Missing Data via Hot Deck Methods. In: M. Spiliopoulou, L. Schmidt-Thieme, and R. Jannings (Eds.): Data Analysis, Machine Learning and Knowledge Discovery. Studies in Classification, Data Analysis and Knowledge Organization, 3–11. Berlin/Heidelberg: Springer.

Domschke, W. (1995) Logistik: Transport. Munich: Oldenbourg. [in German]

Ford, B. (1983) An Overview of Hot Deck Procedures. In: W. Madow, H. Nisselson and I. Olkin (Eds.): Incomplete Data in Sample Surveys. New York: Academic Press, 185–207.

Joenssen, D.W. (2015) Donor Limited Hot Deck Imputation: A Constrained Optimization Problem. In: B. Lausen, S. Krolak-Schwerdt, and M. B"ohmer (Eds.): Data Science, Learning by Latent Structures, and Knowledge Discovery. Studies in Classification, Data Analysis and Knowledge Organization, pages tba. Berlin/Heidelberg: Springer. (in press)

Joenssen, D.W. (2015) Hot-Deck-Verfahren zur Imputation fehlender Daten – Auswirkungen des Donor-Limits. Ilmenau: Ilmedia. [in German, Dissertation]

Joenssen, D.W. and Bankhofer, U. (2012) Donor Limited Hot Deck Imputation: Effects on Parameter Estimation. Journal of Theoretical and Applied Computer Science. 6, 58–70.

Joenssen, D.W. and Muellerleile, T. (2014) Fehlende Daten bei Data-Mining. HMD Praxis der Wirtschaftsinformatik. 51, 458–468, 2014. doi: 10.1365/s40702-014-0038-8 [in German]

Kalton, G. and Kasprzyk, D. (1986) The Treatment of Missing Survey Data. Survey Methodology. 12, 1–16.

Sande, I. (1983) Hot-Deck Imputation Procedures. In: W. Madow, H. Nisselson and I. Olkin (Eds.): Incomplete Data in Sample Surveys. New York: Academic Press, 339–349.

Results