Agronomy and Horticulture, Department of

 

Department of Agronomy and Horticulture: Faculty Publications

Accessibility Remediation

If you are unable to use this item in its current form due to accessibility barriers, you may request remediation through our remediation request form.

Authors

ORCID IDs

David L. Hyten

Document Type

Article

Date of this Version

1-2010

Comments

Published in Nature Vol 463, 14 January 2010, doi:10.1038/nature08670.

Abstract

Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

All supplemental information is included in the downloadable PDF file, except for the data file for Supplementary Table S5, which is attached (below) as an "Additional file."

Specht NATURE 2010 Suppl Table S5.txt (928 kB)
Data file for Supplementary Table S5

Share

COinS