Computer Science and Engineering, Department of

Computer Science, Computer Engineering, and Bioinformatics: Faculty Publications

Comparative evaluation of machine learning models for groundwater quality assessment

Shine Bedi, University of Nebraska-LincolnFollow
Ashok Samal, University of Nebraska - LincolnFollow
Chittaranjan Ray, University of Nebraska-LincolnFollow
Daniel D. Snow, University of Nebraska-LincolnFollow

Document Type

Article

Date of this Version

2020

Citation

Published in Environmental Monitoring and Assessment 192 (2020), 776.

doi:10.1007/s10661-020-08695-3

Comments

Abstract

Contamination from pesticides and nitrate in groundwater is a significant threat to water quality in general and agriculturally intensive regions in particular. Three widely used machine learning models, namely, artificial neural networks (ANN), support vector machines (SVM), and extreme gradient boosting (XGB), were evaluated for their efficacy in predicting contamination levels using sparse data with non-linear relationships. The predictive ability of the models was assessed using a dataset consisting of 303 wells across 12 Midwestern states in the USA. Multiple hydrogeologic, water quality, and land use features were chosen as the independent variables, and classes were based on measured concentration ranges of nitrate and pesticide. This study evaluates the classification performance of the models for two, three, and four class scenarios and compares them with the corresponding regression models. The study also examines the issue of class imbalance and tests the efficacy of three class imbalance mitigation techniques: oversampling, weighting, and oversampling and weighting, for all the scenarios. The models’ performance is reported using multiple metrics, both insensitive to class imbalance (accuracy) and sensitive to class imbalance (F1 score and MCC). Finally, the study assesses the importance of features using game-theoretic Shapley values to rank features consistently and offer model interpretability.

Download

Included in

Computer Sciences Commons, Environmental Indicators and Impact Assessment Commons, Environmental Monitoring Commons, Water Resource Management Commons

COinS

DigitalCommons@University of Nebraska - Lincoln

Computer Science and Engineering, Department of

Computer Science, Computer Engineering, and Bioinformatics: Faculty Publications

Comparative evaluation of machine learning models for groundwater quality assessment

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Search

Browse

Author Corner

Links

DigitalCommons@University of Nebraska - Lincoln

Computer Science and Engineering, Department of

Computer Science, Computer Engineering, and Bioinformatics: Faculty Publications

Comparative evaluation of machine learning models for groundwater quality assessment

Authors

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Share

Search

Browse

Author Corner

Links