Ted Kalbfleisch https://orcid.org/0000-0002-2370-8189
Jessica L. Petersen https://orcid.org/0000-0001-5438-8555
R. G. Tait Jr. https://orcid.org/0000-0002-2379-7740
Peter H. Hackett https://orcid.org/0000-0002-6882-4853
Michael P. Heaton https://orcid.org/0000-0003-1386-1208
Date of this Version
Kalbfleisch T, Petersen JL, Tait Jr. RG et al. Using triallelic SNPs for determining parentage in North American yak ( Bos grunniens) and estimating cattle ( B. taurus) introgression [version 2; peer review: 2 approved] F1000Research 2020, 9:1096 https://doi.org/10.12688/f1000research.25803.2
Background: Genetic testing for pedigree accuracy is critical for managing genetic diversity in North American (NA) yak (Bos grunniens ), a population expanded mostly from imported zoological park specimens. DNA testing also enhances species conservation by identifying recent B. taurus F1 hybrid ancestors (within three generations). Biallelic single nucleotide polymorphisms (SNPs) can accomplish either task, but increases the marker count and costs necessary to achieve both. Our aim was to identify novel, multifunctional, triallelic yak SNPs (tySNPs), with each having two alleles for yak parentage testing, and a third allele for identifying recent cattle introgression.
Methods: Genome sequences were aligned to the cattle UMD3.1 assembly and SNPs were screened for 1) heterozygosity in a NA and a Chinese yak, 2) a third allele at high frequency in cattle, and 3) flanking sequences conserved in both species. Subsequently, tySNPs were filtered for unique alignment to the haplotype-resolved F1 yak assembly. Allele frequencies were estimated in a subset of 87 tySNPs by genotyping 170 NA yak.
Results: We identified 610 autosomal tySNPs, distributed in 441 clusters with 5 Mb average genome spacing. The average NA yak minor allele frequency was high (0.296), while average introgressed cattle alleles were low (0.004). In simulations with tySNPs, 28 were sufficient for globally-unique animal identification (PI=5.81x10-12), 87 were able to exclude 19 random bulls from parentage at the 99% level without using the dam’s genotype (PE=5.3x10-4), and 87 were able to detect F1 hybridization events after three generations of yak backcrosses (1/16th B. taurus germplasm).
Conclusions: Identifying animals, determining parentage and detecting recent hybridization events was efficient with as few as 87 tySNPs. A similar triallelic approach could be used with other bottlenecked Bos species that hybridize with cattle, such as NA plains bison (B. bison).