Article

Enhancing Animal Breeding through Quality Control in Genomic Data - A Review

Jungjae Lee1, Jong Hyun Jung2, Sang-Hyon Oh3,*
Author Information & Copyright
1Jenomics Jenetics Company, Pyeongtaek 17869,, Korea.
2Jung P&C Institute, Yongin 16950, Korea.
3Division of Animal Science, Gyeongsang National University, Jinju 52725, Korea.
*Corresponding Author: Sang-Hyon Oh, Division of Animal Science, Gyeongsang National University, Jinju 52725, Korea, Republic of. E-mail: shoh@gnu.ac.kr.

© Copyright 2024 Korean Society of Animal Science and Technology. This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

High-throughput genotyping and sequencing has revolutionized animal breeding by providing access to vast amounts of genomic data to facilitate precise selection for desirable traits. This shift from traditional methods to genomic selection provides dense marker information for predicting genetic variants. However, the success of genomic selection heavily depends on the accuracy and quality of the genomic data. Inaccurate or low-quality data can lead to flawed predictions, compromising breeding programs and reducing genetic gains. Therefore, stringent quality control (QC) measures are essential at every stage of data processing. Quality control in genomic data involves managing single nucleotide polymorphism (SNP) quality, assessing call rates, and filtering based on minor allele frequency (MAF) and Hardy-Weinberg equilibrium (HWE). High-quality SNP data is crucial because genotyping errors can bias the estimates of breeding values. Cost-effective low-density genotyping platforms often require imputation to deduce missing genotypes. QC is vital for genomic selection, genome-wide association studies (GWAS), and population genetics analyses because it ensures data accuracy and reliability. This paper reviews QC strategies for genomic data and emphasizes their applications in animal breeding programs. By examining various QC tools and methods, this review highlights the importance of data integrity in achieving successful outcomes in genomic selection, GWAS, and population analyses. Furthermore, this review covers the critical role of robust QC measures in enhancing the reliability of genomic predictions and advancing animal breeding practices.

Keywords: Animal Breeding; Genomic Selection; Quality Control; Single Nucleotide Polymorphism; Genome-Wide Association Studies