Biological Systems Engineering, Department of


Document Type


Date of this Version





In this study, we explored the use of statistical machine learning and long-term groundwater nitrate monitoring data to estimate vadose zone and saturated zone lag times in an irrigated alluvial agricultural setting. Unlike most previous statistical machine learning studies that sought to predict groundwater nitrate concentrations within aquifers, the focus of this study was to leverage available groundwater nitrate concentrations and other environmental variables to determine mean regional vertical velocities (transport rates) of water and solutes in the vadose zone and saturated zone (3.50 and 3.75 m yr−1 , respectively). The statistical machine learning results are consistent with two primary recharge processes in this western Nebraska aquifer, namely (1) diffuse recharge from irrigation and precipitation across the landscape and (2) focused recharge from leaking irrigation conveyance canals. The vadose zone mean velocity yielded a mean recharge rate (0.46 m yr−1 ) consistent with previous estimates from groundwater age dating in shallow wells (0.38 m yr−1 ). The saturated zone mean velocity yielded a recharge rate (1.31 m yr−1 ) that was more consistent with focused recharge from leaky irrigation canals, as indicated by previous results of groundwater age dating in intermediate-depth wells (1.22 m yr−1 ). Collectively, the statistical machine learning model results are consistent with previous observations of relatively high water fluxes and short transit times for water and nitrate in the primarily oxic aquifer. Partial dependence plots from the model indicate a sharp threshold in which high groundwater nitrate concentrations are mostly associated with total travel times of 7 years or less, possibly reflecting some combination of recent management practices and a tendency for nitrate concentrations to be higher in diffuse infiltration recharge than in canal leakage water. Limitations to the machine learning approach include the non-uniqueness of different transport rate combinations when comparing model performance and highlight the need to corroborate statistical model results with a robust conceptual model and complementary information such as groundwater age.