Plant Science Innovation, Center for

Center for Plant Science Innovation: Faculty Publications

Non-homology-based prediction of gene functions in maize (Zea mays ssp. mays)

Xiuru Dai, Shandong Agricultural University & University of Nebraska-Lincoln
Zheng Xu, Wright State University
Zhikai Liang, University of Nebraska - LincolnFollow
Xiaoyu Tu, Chinese University of Hong Kong
Silin Zhong, Chinese University of Hong Kong
James C. Schnable, University of Nebraska-LincolnFollow
Pinghua Li, Shandong Agricultural UniversityFollow

ORCID IDs

https://orcid.org/0000-0002-2516-1068

https://orcid.org/0000-0002-9963-8631

https://orcid.org/0000-0001-6739-5527

Document Type

Article

Date of this Version

2020

Citation

Plant Genome. 2020;e20015. wileyonlinelibrary.com/journal/tpg2 1 of 13 https://doi.org/10.1002/tpg2.20015

Comments

2020 The Author

Abstract

Advances in genome sequencing and annotation have eased the difficulty of identifying new gene sequences. Predicting the functions of these newly identified genes remains challenging. Genes descended from a common ancestral sequence are likely to have common functions. As a result, homology is widely used for gene function pre- diction. This means functional annotation errors also propagate from one species to another. Several approaches based on machine learning classification algorithms were evaluated for their ability to accurately predict gene function from non-homology gene features. Among the eight supervised classification algorithms evaluated, random- forest-based prediction consistently provided the most accurate gene function predic- tion. Non-homology-based functional annotation provides complementary strengths to homology-based annotation, with higher average performance in Biological Process GO terms, the domain where homology-based functional annotation performs the worst, and weaker performance in Molecular Function GO terms, the domain where the accuracy of homology-based functional annotation is highest. GO prediction models trained with homology-based annotations were able to successfully predict annotations from a manually curated “gold standard” GO annotation set. Non-homology-based functional annotation based on machine learning may ultimately prove useful both as a method to assign predicted functions to orphan genes which lack functionally characterized homologs, and to identify and correct functional annotation errors which were propagated through homology-based functional annotations.

Download

Included in

Plant Biology Commons, Plant Breeding and Genetics Commons, Plant Pathology Commons

COinS

Plant Science Innovation, Center for

Center for Plant Science Innovation: Faculty Publications

Non-homology-based prediction of gene functions in maize (Zea mays ssp. mays)

ORCID IDs

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Search

Browse

Author Corner

Links

Plant Science Innovation, Center for

Center for Plant Science Innovation: Faculty Publications

Non-homology-based prediction of gene functions in maize (Zea mays ssp. mays)

Authors

ORCID IDs

Document Type

Date of this Version

Citation

Comments

Abstract

Included in

Share

Search

Browse

Author Corner

Links