Agronomy and Horticulture, Department of

Department of Agronomy and Horticulture: Faculty Publications

Generating High Density, Low Cost Genotype Data in Soybean [Glycine max (L.) Merr.]

Mary M. Happ, University of Nebraska-Lincoln
Haichuan Wang, University of Nebraska-Lincoln
George L. Graef, University of Nebraska-Lincoln
David L. Hyten, University of Nebraska-Lincoln

ORCID IDs

0000-0002-5897-2617

0000-0001-6324-9389

Document Type

Article

Date of this Version

2019

Citation

G3: Genes | Genomes | Genetics, Volume 9

Comments

Open access

https://doi.org/10.1534/g3.119.400093

Abstract

Obtaining genome-wide genotype information for millions of SNPs in soybean [Glycine max (L.) Merr.] often involves completely resequencing a line at 5X or greater coverage. Currently, hundreds of soybean lines have been resequenced at high depth, and their data have been deposited in the NCBI Short Read Archive. This publicly available dataset may be leveraged as an imputation reference panel in combination with skim (low-coverage) sequencing of new soybean genotypes to economically obtain high-density SNP information. Ninety-nine soybean lines resequenced at an average of 17.1X were used to generate a reference panel, with over 10 million SNPs called using GATK's HaplotypeCaller tool. Whole-genome resequencing at approximately 1X depth was performed on 114 previously ungenotyped experimental soybean lines. Coverages down to 0.1X were analyzed by randomly subsetting raw reads from the original 1X sequence data. SNPs discovered in the reference panel were genotyped in the experimental lines after aligning to the soybean reference genome, and missing markers were imputed using Beagle 4.1. Sequencing depth of the experimental lines could be reduced to 0.3X while still retaining an accuracy of 97.8%. Accuracy was inversely related to minor allele frequency and highly correlated with marker linkage disequilibrium. The high accuracy of skim sequencing combined with imputation provides a low-cost method for obtaining dense genotypic information that can be used for various genomics applications in soybean.
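The two computational ideas in the abstract, randomly subsetting reads to simulate a lower coverage and scoring imputed genotypes against known ones, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names are hypothetical, reads are plain list items rather than FASTQ records, and accuracy is taken to be simple genotype concordance, which may differ from the measure the authors used.

```python
import random


def subsample_fraction(current_depth, target_depth):
    """Fraction of reads to keep to go from current_depth to target_depth."""
    if not 0 < target_depth <= current_depth:
        raise ValueError("target depth must be positive and no greater than current depth")
    return target_depth / current_depth


def subsample_reads(reads, fraction, seed=0):
    """Keep each read independently with probability `fraction` (hypothetical helper)."""
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    return [read for read in reads if rng.random() < fraction]


def concordance(imputed, truth):
    """Share of sites where the imputed genotype equals the known genotype.

    Sites whose known genotype is None are skipped. This is one common
    definition of imputation accuracy, assumed here for illustration.
    """
    compared = [(i, t) for i, t in zip(imputed, truth) if t is not None]
    if not compared:
        raise ValueError("no sites with a known genotype to compare")
    return sum(i == t for i, t in compared) / len(compared)


if __name__ == "__main__":
    # Simulate reducing ~1X data to ~0.3X on placeholder read names.
    reads = [f"read_{n}" for n in range(10_000)]
    kept = subsample_reads(reads, subsample_fraction(1.0, 0.3))
    print(len(kept), "of", len(reads), "reads kept")

    # Toy genotype vectors in VCF-style notation.
    imputed = ["0/0", "0/1", "1/1", "0/0"]
    truth = ["0/0", "0/1", "0/1", None]
    print("concordance:", concordance(imputed, truth))
```

In practice, subsetting would be applied to the raw sequence files and concordance would be computed per marker or per line from VCF records; the sketch only shows the arithmetic involved.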

Included in

Agricultural Science Commons, Agriculture Commons, Agronomy and Crop Sciences Commons, Botany Commons, Horticulture Commons, Other Plant Sciences Commons, Plant Biology Commons
