Agronomy and Horticulture, Department of

Department of Agronomy and Horticulture: Faculty Publications

Yield prediction through integration of genetic, environment, and management data through deep learning

Daniel R. Kick, USDA, Agricultural Research Service, University of Missouri
Jason G. Wallace, University of Georgia
James C. Schnable, University of Nebraska-LincolnFollow
Judith M. Kolkman, Cornell University
Barış Alaca, University of Goettingen
Timothy M. Beissinger, University of Goettingen
Jode Edwards, USDA, Agricultural Research Service
David Ertl, Iowa Corn Promotion Board
Sherry Flint-Garcia, USDA, Agricultural Research Service
Joseph L. Gage, North Carolina State University
Candice N. Hirsch, University of Minnesota
Joseph E. Knoll, USDA, Agricultural Research Service
Natalia de Leon, University of Wisconsin
Dayane C. Lima, University of Wisconsin
Danilo E. Moreta, Cornell University
Maninder P. Singh, Michigan State University
Addie Thompson, Michigan State University
Teclemariam Weldekidan, University of Delaware
Jacob D. Washburn, USDA, Agricultural Research Service, University of Missouri

Document Type

Article

Date of this Version

12-23-2022

Citation

G3, 2023, 13(4), jkad006. https://doi.org/10.1093/g3journal/jkad006

Comments

This work is written by (a) US Government employee(s) and is in the public domain in the US.

Abstract

Accurate prediction of the phenotypic outcomes produced by different combinations of genotypes, environments, and management interventions remains a key goal in biology with direct applications to agriculture, research, and conservation. The past decades have seen an expansion of new methods applied toward this goal. Here we predict maize yield using deep neural networks, compare the efficacy of 2 model development methods, and contextualize model performance using conventional linear and machine learning models. We examine the usefulness of incorporating interactions between disparate data types. We find deep learning and best linear unbiased predictor (BLUP) models with interactions had the best overall performance. BLUP models achieved the lowest average error, but deep learning models performed more consistently with similar average error. Optimizing deep neural network submodules for each data type improved model performance relative to optimizing the whole model for all data types at once. Examining the effect of interactions in the best-performing model revealed that including interactions altered the model’s sensitivity to weather and management features, including a reduction of the importance scores for timepoints expected to have a limited physiological basis for influencing yield—those at the extreme end of the season, nearly 200 days post planting. Based on these results, deep learning provides a promising avenue for the phenotypic prediction of complex traits in complex environments and a potential mechanism to better understand the influence of environmental and genetic factors.

Download

Included in

Agricultural Science Commons, Agriculture Commons, Agronomy and Crop Sciences Commons, Botany Commons, Horticulture Commons, Other Plant Sciences Commons, Plant Biology Commons

COinS

Agronomy and Horticulture, Department of

Department of Agronomy and Horticulture: Faculty Publications

Yield prediction through integration of genetic, environment, and management data through deep learning

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Search

Browse

Author Corner

Links

Agronomy and Horticulture, Department of

Department of Agronomy and Horticulture: Faculty Publications

Yield prediction through integration of genetic, environment, and management data through deep learning

Authors

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Share

Search

Browse

Author Corner

Links