INTRODUCTION
As travel and trade among countries increases, more substances that threaten national security, such as drugs and illegal agricultural products, are imported. Therefore, dogs that can detect and identify these threats are becoming more important. These dogs are trained for special purposes, such as detection dogs to detect explosives, quarantine-detection dogs to detect illegal agricultural products, drug-detection dogs to detect drug trafficking, and life-saving dogs. Training regular dogs as special-purpose dogs is costly due to their low efficiency. Therefore, special-purpose dogs that have greater ability are being developed using advanced somatic cell technology. Although we produce dogs that perform special tasks by replicating excellent breeds, musculoskeletal disorders such as hip dysplasia (HD) can be a major obstacle to producing these dogs. Therefore, a technology capable of diagnosing these musculoskeletal diseases in advance will be very useful for the production of dogs that perform special tasks.
Hip dysplasia is a very common genetic disorder with very negative effects on animal welfare. HD is a musculoskeletal disorder involving an unstable connection between the femoral head and acetabulum and is accompanied by severe pain. HD is very common, affecting more than 75% of dogs in American households [1]. Currently, treatment for HD is limited to relieving symptoms with mass therapy. HD, which was first reported to result from inbreeding 70 years ago, is a polygenic disease [2]. Therefore, the most effective way to eliminate HD is to exclude individuals with a genetic predisposition to HD through artificial selection [3]. To breed dogs selectively to eliminate HD, it is necessary to record information on the disease phenotype and lineage. However, disease records and lineage information is very limited. Bartolomé, Segarra [4] reported a predictive model that can detect genetic loci associated with HD using a genome-wide association study (GWAS). Lavrijsen [5] explored the susceptibility loci associated with HD in Labrador Retrievers and proposed a predictive model. The present study performed a GWAS analysis through a case–control study of HD and explored various genetic loci associated with HD.
MATERIALS AND METHODS
Blood samples were collected from dogs in accordance with animal welfare guidelines in cooperation with the National Institute of Animal Science (NIAS) and Department of Defense Military Observer Corps, Korea, to search for susceptibility loci associated with HD. Blood samples were obtained from Shepherd (n = 57) and Malinois (n = 12) breeds that were being trained for special purposes at the Department of Defense Military Observer Corps. Collected all samples followed The Institutional Animal Care and Use Committee (IACUC) of National Institute of Animal Science (NIAS, RDA) protocol approval by ‘Development of early diagnosis technology for degenerative muscular skeleton system in special purpose dog’ project (No. 2016-177). Using the Norberg angle, the phenotypes associated with HD were classified into seven intervals, as shown in Table 1, and divided into the normal range (> 80°) and the case (< 79°) groups. For case–control analysis, scores of 1 to 6 were used as controls, and scores of 7 as cases. We also distinguished left hip dysplasia (LHD), right hip dysplasia (RHD), and total hip dysplasia (THD) phenotypes based on these criteria. This enabled different hypotheses to identify the differences based on how the case group was divided.
Score | 1 (Normal) | 2 | 3 | 4 | 5 | 6 | 7 (Disease) |
---|---|---|---|---|---|---|---|
Norberg angle | 105 ↑ | 100–104 | 95–99 | 90–94 | 85–89 | 80–84 | 79 ↓ |
Genomic DNA was extracted from blood collected from 69 army dogs using standard protocols [6]. Two breeds were genotyped using an Illumina CanineHD BeadChip (Illumina, San Diego, CA, USA) array, which contains 165,929 genetic variants and genotypes single nucleotide polymorphisms (SNPs) in the dog genome. The standardization level of genetic variants determined using the Illumina CanineHD BeadChip was managed using PLINK1.9 software [7] The Korean army dogs consisted of Shepherd and Malinois groups. When genetic variation was controlled for, almost all of the mutations were removed, so missing genotypes were limited to 10%. Finally, the minor allele frequency was set at less than 0.05 110,356 SNPs remaining after quality control were used for analysis.
Case–control association analysis was performed for the GWAS for HD in dogs. Using the phenotype information for Shepherds and Malinois, principal component analysis was performed to understand population stratification (Fig. 1). We used GWAS statistical model following below
Where, Y is phenotype, X and Z are design matrices, β is a vector of fixed effects. u is additive genetic effect for each marker. and e is vector of residual.
In this analysis, the eigenvalue of the principal component was set to the covariance to correct for population stratification. We assumed that there are no environmental effects on the phenotype, and a correlation analysis between genotype and phenotype was performed using the association analysis option, which has a case–control function on Plink v 1.9 [7]. To test for statistical significance, the chi-square test was performed, and the p-value obtained was corrected using the Bonferroni correction. The risk allele (minor allele) was used to calculate the odds ratio for predicting disease. We used the -assoc option of Plink v1.9 to perform the GWAS, converted the p-value to -log p, and made a Manhattan plot using R-software.
RESULTS AND DISCUSSION
Studies of HD have suggested that SNPs present on many chromosomes have a polygenic effect on the expression of HD [8]. HD may be the most common musculoskeletal disorder caused by a genetic effect, but it is also strongly influenced by non-genetic factors [9]. For example, it is more likely to occur in large dogs than small dogs and is more frequent in old dogs than in young dogs because of bone degeneration. In this study, 69 Korean army dogs [Shepherd (n = 57) and Malinois (n = 12)] were classified according to the Norberg angle standards shown in Table 1. As shown in Table 2, the control group was classified as a relatively normal group with scores of 1 to 6, and the case group was classified as the disease group with a score of 7. In fact, setting the criteria for controls and cases was difficult and involved a subjective decision regarding whether only a 7 would be considered a case or whether scores of 5 and 6 could also be classified as cases. We assigned only scores of 7 to the case group because the phenotype information was based on the evaluation process for selection before military service. Ultimately, there were 62 controls and 7 cases for LHD or RHD, whereas the THD group comprised 58 controls and 11 cases. Interestingly, all of the cases were Shepherds, perhaps due to the small sample of Malinois.
Each of the given phenotypes was scored as 1 (control) or 2 (case), and GWAS analyses were performed for left or right hip and for both hips. Manhattan plots for LHD, RHD, and THD suggested SNP effects based on the Bonferroni correction criterion (0.05/number of SNPs tested). The results, presented in Figs. 2–4, revealed a single candidate SNP on chromosome 4 in the LHD group, whereas there were no significant SNPs for RHD or THD. The BICF2S2352810 (4:34585340) SNP was significantly detected in LHD. This SNP is located in the multimerin 2 (MMRN2) gene, but it is unclear whether this gene affects HD directly. The MMRN2 gene is associated with gonadotropin-releasing hormone (GnRH) expression via the integrin pathway. GnRH regulates a variety of reproductive activities and affects the secretion of growth hormone (GH); activating GnRH inhibits the release of GH [10–12]. GH secretion may adversely affect bone development and may thereby ultimately contribute to HD expression. In addition, the ENSCAFG00000016100 gene (4:34756078–34763159) located near the BICF2S2352810 SNP controls calcium excretion and might be associated with this gene in a larger sample. Among the pathways between these genes, there is arginine biosynthesis, which acts as a stimulator of GH. Therefore, this function estimated that it can be involved in bone growth and development. In addition, two suggestive SNPs, one of BICF2P650805 (7:79852044) and one of BICF2G630691124 (18:45206432), were identified for RHD traits, and one suggestive SNP of BICF2S2352810 (4:34585340) was identified for THD traits (Figs. 3 and 4, Table 3). These SNPs were located in the cap-binding complex-dependent translation initiation factor (CTIF), CD151 molecule (CD151), tetraspanin 4 (TSPAN4), and MMRN2 genes. However, these genes are not considered direct factors in the development of HD. For the RHD trait, the DYM (7:79286614–79629043) and ALX4 (18:44922150–44967550) genes are located adjacent to the BICF2P650805 and BICF2G630691124 SNPs, respectively, and their functions are related to bone and muscle development. These functions related to candidate genes need to be confirmed as causal mutations using a larger sample size. In addition, it is necessary to consider the different aspects of LHD, RHD, and THD. We anticipate that this may be the result of problems such as ambiguity in the criteria for dividing phenotypes and small sample sizes. Similar patterns may be identified if more clear phenotypes or sufficient sample sizes can be obtained through the further studies.
Several studies based on various approaches have examined HD in dogs. Huang, Hayward [8] performed a GWAS of HD in 921 dogs of various breeds and found that the CTBP2 gene associated with the BICF2G630265083 SNP on chromosome 28 had a major effect on HD compared with other genes. In the GWAS results for HD in Korean army dogs, only one SNP on chromosome 4 had a significant effect. In addition, the BICF2S23731107 (28:37925846) SNP on chromosome 28 was found to have a suggestive effect; it is approximately 3 Mb from the CTBP2 gene (28:34260856–34415696) but has potential effects due to linkage disequilibrium relationships. However, there is still much ambiguity regarding HD expression, especially with our results and with the explanation of HD from previous results and variation in the CTBP2 gene and BICF2S23731107 (28:37925846) SNP presented in the current study. The possibility of additional causal mutations between the two cannot be ruled out. If causal mutations were found, it would facilitate our understanding of HD expression.
Recently, it was suggested that GWASs that assume polygenic traits not only have difficulty identifying specific causes and treatments for diseases but also are limited for understanding the population genetic structure of the population with a particular disease [13]. An alternative approach is the next-generation sequencing (NGS) method, in which rare variants at very low frequencies are associated with disease, and the effects of the rare variants can be combined with the common variant. Therefore, more insight can be achieved by combining various approaches, such as NGS and other methods, in future research [14].
CONCLUSION
This study used a GWAS to identify genes related to HD, a disease directly implicated in the ability of Korean army dogs to perform their missions. We identified the BICF2S2352810 SNP; however, because HD is the result of polygenic effects, it is difficult to explain HD expression using only a single SNP. Therefore, although the GWAS results did not show any direct significance, we speculate that the influence of SNPs on chromosomes 7 and 18 is derives from genes involved in bone growth, such as DYM and ALX4. The use of NGS in future studies to identify rare variants associated with HD expression might suggest better solutions for the prevention and treatment of HD.