1
GENOME-WIDE ASSOCIATION STUDY FOR MILK PRODUCTION IN IRANIAN BUFFALO
Mahdi Mokhbera,*
a: Department of animal science, Faculty of Agriculture, Urmia University, Urmia, Iran
*Corresponding author: [email protected] (Mahdi Mokhber)
ABSTRACT
A genome wide association study (GWAS) was conducted to identify regions of the genome associated with milk production trait in Iranian water buffaloes. In order to do this study, 303 water buffalo from Azeri (N=207), Khuzestani (N=96) breeds were genotyped with Axiom® Buffalo Genotyping 90K Array. Average of monthly recording of one lactation period were used as phenotypic data for each individual. After adjustments of records for fixed effects, phenotypes were regressed on each single nucleotide polymorphisms (SNP) using a linear regression model by GenABEL package in R. In total, 10 SNPs were identified on BTAs 4, 5, 7, 13, 14, 15 and X as probably associated regions with milk production. These selected SNPs with 400 Kbp neighboring genomic regions were surveyed to find probable candidate genes by annotation. In total, 11 genes identified from the annotation of selected SNPs on the cattle genome (UMD3.1 Bos Taurus). Among the candidate genes identified in these regions, the most important gene that associated with milk production was SCOS2 gene on chromosome BTA5. Further research with additional individual and records is essential to detect associated regions with milk production and to suggestion for commercial application to the genetic improvement.
Keywords: Genome wide association study, single nucleotide polymorphisms, Azeri and Khuzestani buffalo breeds.
INTRODUCTION
Identifying the genes that contributing in milk production and functional traits is considered in dairy cattle since their high economic values (Raven et al., 2014). Genome-wide association studies (GWAS) have led to finding of high number of associated genes with milk production traits in dairy cattle (Grisart et al., 2002). Therefore, kind of studies are required to obtaining genetic progress in production traits. In recent years, research in genomics area has been increased with development of high-density SNP arrays in many livestock species (Cole et al., 2009). Recently, availability of special commercial SNP chip array for water buffalo, genomic base studies is possible for water buffalo (Iamartino et al., 2013). To date, several studies were used GWAS for some economically important traits in domestic animals including cattle (Hayes et al., 2009; Pryce et al., 2010; Alama et al., 2011; Lu et al., 2013), poultry (Yuan et al., 2015; Gu et al., 2011 ), pig (Bergfelder-Drüing et al., 2015; Le et al., 2017) and water buffalo by Bovine 50K and 777K SNP Beadchips (Wu et al., 2013; Borquis et al., 2014) and specific buffalo 90K SNP array (Iamartino et al., 2013; Camargo et al., 2015; El-Halawany et al., 2017). The aim of the current study was to perform a GWAS for milk production trait in
2
Iranian buffalo using with Axiom® Buffalo Genotyping 90K Array, to detect genomic regions associated with milk production trait
MATERIALS & METHODS
A set of 303 water buffalo from Azeri (N=207), Khuzestani (N=96) breeds were genotyped with Axiom® Buffalo Genotyping Array which contains 89,988 single nucleotide polymorphism (SNP) markers. Average of monthly recording of one lactation period was used as phenotypic data for each individual. After adjustments of records for fixed effects, phenotypes were regressed on each single nucleotide polymorphisms (SNP) using a linear regression model by GenABEL package in R. Genotype quality control was accomplished using Plink software (Purcell et al., 2007). Genotypes were controlled for Individual call rate, SNP call rate and minor allele frequency (MAF). Then, Individual and SNPs with more than 5% missing data and MAF less than 10% were excluded. After quality control (QC) 303 Individual with a total of 65,185 SNPs remained for future analysis. Before to carrying out the GWAS, genetic homogeneity and population sub-structure of the dataset was tested by Multi Dimension Scaling (MDS) using GenABEL package in R (The R project website, http://www.r-project.org/). Also, Phenotypes were preadjusted using SAS GLM procedure of SAS version 9.4 (SAS 2014, SAS Institute Inc., Cary, NC, USA). Then significant factors and principal component variants were fitted as fixed effects in the GWAS model. Detecting genomic regions that associated with milk production was performed by GenABEL package in R (The R project website, http://www.r-project.org/). The final model was:
LactRecijk= μ+ Farmi+ Parj+ PC1+PC2+PC3+Polygenick+ eijk
where LactRecijk is phenotypic value of each individual; µ is the overall population mean for milk production; Farmi is the fixed effect of the ith farm or herd; Parj is the fixed effect of the jth parturition in recording time; PC1-3 are principal components; Polygenick is the effect of the lth genotype; and eijk is the residual random effect.
Associated genomic regions were used to extract genes from the corresponding areas in UMD3.1 Bos Taurus Genome Assembly using Biomart (www.ensembl.org/biomart/martview). Finally, DAVID (Huang et al., 2009) Enrichment Map Cytoscape plug-in (Merico et al., 2010) were used to perform a gene ontology analysis and construct networks, respectively
RESULTS AND DISCUSSION
The MDS results was revealed stratification between and within each breed, Therefore, structured association analysis was preferred for association model with identified fixed effected that have significant effect on milk production trait. A total of 10 SNPs were detected on BTAs 4(53,514 - 53,914 & 56,047 - 56,447 Kb), 5 (23,396 – 23,796 Kb), 7 (98,075 – 98,475 & 95,760 – 96,160 Kb), 13 (32,530 – 32,930 Kb), 14 (15,856 – 16,256 & 54,200 -54,600 Kb), 15 (25,227 – 25,627 Kb) and X (48,548 – 48,948 Kb) that could be associated with milk production trait.
These SNPs with 400Kbp neighboring genomic regions were surveyed to find probably candidate genes by annotation. In total, 11 gene including C7orf60, MRPL42- SOCS2- CRADD
3
, CAST, MRC1, SLC39A12, CSMD3, RBM7, REXO2 and NXPE2 identified from the annotation of selected SNPs on the cattle genome (UMD3.1 Bos Taurus).
Among the candidate genes identified in these regions, the most important gene that associated with milk production was SCOS2 gene on chromosome BTA5. The SOCS family is contain of 8 members that require for the attenuation of cytokine signals in mammary epithelial cells, and acts to limit proliferation through a negative feedback mechanism (Jegalian and Wu, 2002). The SOCS1 and SOCS2 genes are required for the attenuation of the prolactin receptor signaling pathway during pregnancy and lactogenesis (Wei et al., 2013). Arun et al. (2015) suggests that the JAK-STAT-SOCS genes have a significant effect on milk and milk component performance.
Finally, gene ontology analysis was performed using DAVID to detecting biological network among provided genes and the Enrichment Map Cytoscape plug-in to construct networks.
However, there no significant biological pathway was found in gene ontology analysis.
CONCLUSIONS (FACULTATIVE)
Present study is the opening GWAS in Iranian water buffalo. Despite of small sample size, some of detected SNPs were associated with milk production. The most important gene associated with milk production was SCOS2 gene on chromosome BTA5.
Further research with additional individual and records is essential to detect associated regions with milk production to suggestion for commercial application in genomic selection for genetic improvement.
REFERENCES CITED
Alama , M., Lee, Y. M., Park, B. L., Kim, J. H., Lee, S. S., Shin, H. D., Kim, K. S., Kim, N. S., and Kim, J. J. (2011). “A Whole Genome Association Study to Detect Single Nucleotide Polymorphisms for Body Conformation Traits in a Hanwoo Population,” Asian-Aust. J.
Anim. Sci 24(3), 322 – 329.
Bergfelder-Drüing, S., Grosse-Brinkhaus, C., Lind, B., Erbe, M., Schellander, K., Simianer, H., Tholen, E. (2015). “A genome-wide association study in large white and landrace pig populations for number piglets born alive”. PloS one 10(3), e0117468.
Borquis, R. R., Baldi, F., de Camargo, G. M., Cardoso, D. F., and et al. (2014). “Water buffalo genome characterization by the Illumina BovineHD BeadChip,” Genet. Mol. Res 13, 4202–
4215.
Cole, J., Van-Raden, P., O’Connell, J., Van-Tassell, C., Sonstegard, T., Schnabel, R., Taylor, J., Wiggans, G. (2009). “Distribution and location of genetic effects for dairy traits,” J. dairy sci 92(6), 2931-2946.
De Camargo, G. M., Aspilcueta-Borquis, R. R., Fortes, M. R., and et al. (2015). “Prospecting major genes in dairy buffaloes,” BMC Genomics 16, 872.
El-Halawany, N., Abdel-Shafy, H., Abd-El-Monsif, A. S., Abdel-Latif, M. A., Al-Tohamy, A.
F., and El-Moneim, O. M. A. (2017). “Genome-wide association study for milk production in Egyptian buffalo,” Livestock Science 198, 10-16.
Ensembl BioMart: Ensembl online genome data base BioMart Tool.
FAO. 2014. FAO statistics website. http://www.fao.org/statistics/en/
4
Grisart, B., Coppieters, W., Farnir, F., Karim, L., and et al. (2002). “Ford C, Berzi P, Cambisano N, Mni M, Reid S, Simon P: Positional candidate cloning of a QTL in dairy cattle:
identification of a missense mutation in the bovine DGAT1 gene with major effect on milk yield and composition,” Genome research 12(2), 222-231.
Gu, X., Feng, C., Ma, L., Song, C., Wang, Y., Da, Y., Li, H., Chen, K., Ye, S., and Ge, C.
(2011). “Genome-wide association study of body weight in chicken F2 resource population,”
PLoS One 6(7), e21872.
Hayes, B. J., Bowman, P. J., Chamberlain, A. J., Savin, K., van Tassell, C .P., Sonstegard, T. S., and Goddard, M. E. (2009). “A validated genome wide association study to breed cattle adapted to an environment altered by climate change,” PLoS One 4, e6676.
Huang, D. W., Sherman, B. T., and Lempicki, R. A. (2009a). “Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources,” Nature Prot 4(1): 44–
57.
Iamartino, D., Williams, J. L., Sonstegard, T., Reecy, J., et al. (2013). “The buffalo genome and the application of genomics in animal management and Improvement,” Buffalo Bulletin 32(1), 151-158.
Le, T. H., Christensen, O. F., Nielsen, B., Sahana, G. (2014). “Genome-wide association study for conformation traits in three Danish pig breeds,” GSE 49(1), 12.
Lu, D., Sargolzaei, M., Kelly, M., Vander-Verf, G., Wang, Z., Mandell, I., Moore, S., Plastow, G., and Miller, S. P. (2013). “Genome-wide association analyses for carcass quality in crossbred beef cattle,” BMC Genet 14, 80.
Merico, D., Isserlin, R., Stueker, O., Emili, A., and Bader, G. D. (2010). “Enrichment Map: A Network Based Method for Gene-Set Enrichment Visualization and Interpretation,” Plos One 5(11), e13984. doi:10.1371/journal.pone.0013984
Pryce, J. E., Bolormaa, S., Chamberlain, A. J., Bowman, P. J., Savin, K., Goddard, M. E., and Hayes, B. J. (2010). “A validated genome-wide association study in 2 dairy cattle breeds for milk production and fertility traits using variable length haplotypes,” J. Dairy Sci 93, 3331–
3345.
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., and et al. (2007). “PLINK: a tool set for whole-genome association and population-based linkage analyses,” American j. human genet 81(3):559-575.
Raven, L-A, Cocks, B. G., Goddard, M. E., Pryce, J. E., and Hayes, B. J. (20014). “Genetic variants in mammary development, prolactin signalling and involution pathways explain considerable variation in bovine milk production and milk composition,” GSE 46(1):29.
The R project website, http://www.r-project.org/
Vallée, A., Daures, J., Van Arendonk, J., and Bovenhuis, H. (2016). “Genome-wide association study for behavior, type traits, and muscular development in Charolais beef cattle”. J. Anim sci 94(6), 2307-2316.
Wei, J., Ramanathan, P., Martin, I. C., Moran, C., Taylor, R. M., and Williamson, P. (2013).
“Identification of gene sets and pathways associated with lactation performance in mice,”
Physiol. Genomics 45, 171–181. doi: 10.1152/physiolgenomics.00139.2011
Wu, J. J., Song, L. J., Wu, F. J., Liang, X. W., and et al. (2013). Investigation of transferability of BovineSNP50 BeadChip from cattle to water buffalo for genome wide association study.
Mol. Biol. Rep 40, 743–750.
5
Yuan, J., Wang, K., Yi, G., Ma, M., Dou, T., Sun, C., Qu, L., Shen, M., Qu, L., and Yang, N.
(2015). “Genome-wide association studies for feed intake and efficiency in two laying periods of chickens,” GSE 47(1), 82.