Agronomy and Horticulture, Department of

Department of Agronomy and Horticulture: Faculty Publications

Genotyping by sequencing for genomic prediction in a soybean breeding population

Diego Jarquin, University of Nebraska-Lincoln
Kyle Kocak, University of Nebraska-Lincoln
Luis Posadas, University of Nebraska-Lincoln
Katie Hyma, Cornell University
Joseph Jedlicka, University of Nebraska-Lincoln
George L. Graef, University of Nebraska-Lincoln
Aaron Lorenz, University of Nebraska-Lincoln

ORCID IDs

George L. Graef

Document Type

Article

Date of this Version

2014

Citation

Jarquín et al.: Genotyping by sequencing for genomic prediction in a soybean breeding population. BMC Genomics 2014 15:740.

Abstract

Background: Advances in genotyping technology, such as genotyping by sequencing (GBS), are making genomic prediction more attractive as a way to reduce breeding cycle times and the costs associated with phenotyping. Genomic prediction and selection have been studied in several crop species, but no reports exist in soybean. The objectives of this study were to (i) evaluate prospects for genomic selection using GBS in a typical soybean breeding program and (ii) evaluate the effect of GBS marker selection and imputation on genomic prediction accuracy. To achieve these objectives, a set of soybean lines sampled from the University of Nebraska Soybean Breeding Program was genotyped using GBS and evaluated for yield and other agronomic traits at multiple Nebraska locations.

Results: Genotyping by sequencing scored 16,502 single nucleotide polymorphisms (SNPs) with minor-allele frequency (MAF) > 0.05 and percentage of missing values ≤ 5% on 301 elite soybean breeding lines. When SNPs with up to 80% missing values were included, 52,349 SNPs were scored. Prediction accuracy for grain yield, assessed using cross validation, was estimated to be 0.64, indicating good potential for using genomic selection for grain yield in soybean. Filtering SNPs based on missing data percentage had little to no effect on prediction accuracy, especially when random forest imputation was used to impute missing values. The highest accuracies were observed when random forest imputation was used on all SNPs, but the differences were not significant. A standard additive G-BLUP model was robust; modeling additive-by-additive epistasis did not improve prediction accuracy. The effect of training population size on accuracy began to plateau at a size of around 100, but accuracy continued to climb steadily up to the largest size possible in this analysis. Including only SNPs with MAF > 0.30 provided higher accuracies when training populations were smaller.
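The two SNP filters named in the Results (MAF > 0.05 and missing values ≤ 5%, or up to 80% in the relaxed set) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the 0/1/2 allele-count coding, and the use of `None` for a missing call are all assumptions made for illustration.

```python
def snp_passes(calls, min_maf=0.05, max_missing=0.05):
    """Check one SNP against the MAF and missing-data thresholds.

    calls: one entry per line; 0, 1 or 2 copies of a reference allele
    (assumed coding), or None when the genotype call is missing.
    """
    observed = [c for c in calls if c is not None]
    missing_rate = 1.0 - len(observed) / len(calls)
    if not observed or missing_rate > max_missing:
        return False
    # Allele frequency among observed calls; each line carries two alleles.
    freq = sum(observed) / (2.0 * len(observed))
    maf = min(freq, 1.0 - freq)
    return maf > min_maf


def filter_snps(snp_calls, **thresholds):
    """Return the names of SNPs that pass, keeping the input order."""
    return [name for name, calls in snp_calls.items()
            if snp_passes(calls, **thresholds)]


# Hypothetical toy data: three SNPs scored on four lines.
toy = {
    "snp_a": [0, 1, 2, 1],           # polymorphic, fully called
    "snp_b": [0, 0, 0, 0],           # monomorphic, so MAF is 0
    "snp_c": [None, None, 1, None],  # 75% missing
}
strict = filter_snps(toy)                     # missing <= 5%
relaxed = filter_snps(toy, max_missing=0.80)  # missing <= 80%
```

Under the strict threshold only `snp_a` survives; relaxing the missing-data threshold to 80% also admits `snp_c`, which mirrors how the SNP count in the abstract grows when sparser markers are allowed.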

Conclusions: Using GBS for genomic prediction in soybean holds good potential to expedite genetic gain. Our results suggest that standard additive G-BLUP models can be used on unfiltered, imputed GBS data without loss in accuracy.
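For readers unfamiliar with G-BLUP, the following is a minimal sketch of an additive model of that kind, not the implementation used in the paper. It assumes a fully imputed marker matrix coded 0/1/2, builds the genomic relationship matrix with the commonly used VanRaden scaling, estimates the mean as the simple training average, and treats the variance ratio `lam` (residual over genetic variance) as a known input rather than estimating it. All names are hypothetical.

```python
import numpy as np


def gblup_predict(markers, y_train, train_idx, test_idx, lam=1.0):
    """Predict values for test lines from training phenotypes.

    markers: (n_lines x n_snps) array of 0/1/2 allele counts, no missing values.
    y_train: phenotypes of the training lines, in the order of train_idx.
    lam:     assumed ratio of residual to additive genetic variance.
    """
    M = np.asarray(markers, dtype=float)
    p = M.mean(axis=0) / 2.0                 # allele frequency per SNP
    Z = M - 2.0 * p                          # centered marker matrix
    G = Z @ Z.T / (2.0 * np.sum(p * (1.0 - p)))  # genomic relationships

    y = np.asarray(y_train, dtype=float)
    mu = y.mean()                            # simplification: mean as fixed effect
    G_tt = G[np.ix_(train_idx, train_idx)]
    G_st = G[np.ix_(test_idx, train_idx)]
    # BLUP of genetic values: G_st (G_tt + lam * I)^-1 (y - mu)
    alpha = np.linalg.solve(G_tt + lam * np.eye(len(train_idx)), y - mu)
    return mu + G_st @ alpha


# Hypothetical usage on simulated data: 20 lines, 50 SNPs.
rng = np.random.default_rng(0)
M = rng.integers(0, 3, size=(20, 50))
train = list(range(15))
test = list(range(15, 20))
y = rng.normal(size=15)
pred = gblup_predict(M, y, train, test)
```

In a cross-validation setting, a function like this would be called once per fold, and predictions for the held-out lines compared with their observed phenotypes.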

Included in

Agricultural Science Commons, Agriculture Commons, Agronomy and Crop Sciences Commons, Botany Commons, Horticulture Commons, Other Plant Sciences Commons, Plant Biology Commons
