United States Department of Agriculture, Forest Service, National Agroforestry Center

United States Department of Agriculture, Forest Service / University of Nebraska-Lincoln: Faculty Publications

The roles of nearest neighbor methods in imputing missing data in forest inventory and monitoring databases

Bianca N. I. Eskelson, Oregon State University
Hailemariam Temesgen, Oregon State University
Valerie Lemay, University of British Columbia
Tara M. Barrett, Pacific Northwest Research Station
Nicholas L. Crookston, Rocky Mountain Research Station
Andrew T. Hudak, Rocky Mountain Research Station

Document Type

Article

Date of this Version

2009

Citation

Scandinavian Journal of Forest Research, 2009; 24: 235-246; DOI: 10.1080/02827580902870490

Abstract

Almost universally, forest inventory and monitoring databases are incomplete, ranging from missing data for only a few records and a few variables, common for small land areas, to missing data for many observations and many variables, common for large land areas. For a wide variety of applications, nearest neighbor (NN) imputation methods have been developed to fill in observations of variables that are missing on some records (Y-variables), using related variables that are available for all records (X-variables). This review attempts to summarize the advantages and weaknesses of NN imputation methods and to give an overview of the NN approaches that have most commonly been used. It also discusses some of the challenges of NN imputation methods. The inclusion of NN imputation methods into standard software packages and the use of consistent notation may improve further development of NN imputation methods. Using X-variables from different data sources provides promising results, but raises the issue of spatial and temporal registration errors. Quantitative measures of the contribution of individual X-variables to the accuracy of imputing the Y-variables are needed. In addition, further research is warranted to verify statistical properties, modify methods to improve statistical properties, and provide variance estimators.
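
As a rough illustration of the basic idea described in the abstract, the sketch below fills in a missing Y-variable for target records by borrowing values from the reference records that are closest in X-variable space. It is a minimal example under stated assumptions, not the method of any particular study covered by the review: the function name `impute_nearest_neighbor`, the unweighted Euclidean distance, the averaging over k neighbors, and the toy numbers are all hypothetical choices made for illustration.

```python
import math


def impute_nearest_neighbor(reference, targets, k=1):
    """Impute a Y-value for each target record from its k nearest references.

    reference: list of (x_vector, y_value) pairs, records with observed Y.
    targets:   list of x_vector tuples, records where Y is missing.
    Distance is plain Euclidean distance on the X-variables (an assumption;
    other distance metrics and weightings are possible).
    """
    imputed = []
    for x_target in targets:
        # Rank reference records by their distance to the target in X-space.
        ranked = sorted(reference, key=lambda rec: math.dist(rec[0], x_target))
        neighbors = ranked[:k]
        # With k=1 this copies the single nearest Y; with k>1 it averages.
        imputed.append(sum(y for _, y in neighbors) / len(neighbors))
    return imputed


# Toy data (hypothetical): two X-variables per record and one Y-variable.
reference = [
    ((10.0, 200.0), 150.0),
    ((12.0, 220.0), 180.0),
    ((30.0, 800.0), 420.0),
]
targets = [(11.0, 205.0), (28.0, 790.0)]

imputed_k1 = impute_nearest_neighbor(reference, targets, k=1)
imputed_k2 = impute_nearest_neighbor(reference, targets, k=2)
print(imputed_k1)  # [150.0, 420.0]
print(imputed_k2)  # [165.0, 300.0]
```

In this sketch the X-variables are used on their raw scales, so the variable with the larger numeric range dominates the distance. Standardizing or weighting the X-variables before computing distances would change which neighbors are selected.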
