This package provides hot deck imputation methods to resolve missing data. Methods provided are popular in survey methodology, mostly used in the context of large national statistics, but are also finding their way to data mining due to their computational simplicity. A key aspect of this package is the implementation of the commonly advocated donor-limit.
A comprehensive function that performs nearest neighbor hot deck imputation. Aspects such as variable weighting, distance types, and donor limiting are implemented. New concepts such as the optimal distribution of donors are also available.
This function imputes the column mean of the complete cases for the missing cases. Utilized by impute.NN_HD as a method for dealing with missing values in distance calculation.