RESEARCH ARTICLE

Genome analysis of Lactococcus taiwanensis strain K_LL001 with potential cellulose degrading functions

Eun Sol Kim1,2,#https://orcid.org/0000-0001-8801-421X, Jin Ho Cho3,#https://orcid.org/0000-0001-7151-0778, Minho Song4,#https://orcid.org/0000-0002-4515-5212, Sheena Kim1https://orcid.org/0000-0002-5410-1347, Gi Beom Keum1https://orcid.org/0000-0001-6006-9577, Hyunok Doo1https://orcid.org/0000-0003-4329-4128, Jinok Kwak1https://orcid.org/0000-0003-1217-3569, Sriniwas Pandey1https://orcid.org/0000-0002-6947-3469, Sumin Ryu1https://orcid.org/0000-0002-1569-3394, Yejin Choi1https://orcid.org/0000-0002-7434-299X, Juyoun Kang1https://orcid.org/0000-0002-3974-2832, Hyeun Bum Kim1,*https://orcid.org/0000-0003-1366-6090, Ju-Hoon Lee5,*https://orcid.org/0000-0003-0405-7621
Author Information & Copyright
1Department of Animal Biotechnology, Dankook University, Cheonan 31116, Korea
2Division of Infectious Diseases, Department of Pediatrics, University of North Carolina at Chapel Hill, North Carolina, 27599-7509, USA
3Division of Food and Animal Science, Chungbuk National University, Cheongju 28644, Korea
4Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
5Department of Food and Animal Biotechnology, Department of Agricultural Biotechnology, Research Institute of Agriculture and Life Sciences, Center for Food and Bioconvergence, Seoul National University, Seoul 08826, Korea
*Corresponding author: Hyeun Bum Kim, Department of Animal Biotechnology, Dankook University, Cheonan 31116, Korea. Tel: +82-41-550-3653, E-mail: hbkim@dankook.ac.kr
*Corresponding author: Ju-Hoon Lee Department of Food and Animal Biotechnology, Department of Agricultural Biotechnology, Research Institute of Agriculture and Life Sciences, Center for Food and Bioconvergence, Seoul National University, Seoul 08826, Korea. Tel: +82-2-880-4854, E-mail: juhlee@snu.ac.kr

# These authors contributed equally to this work.

© Copyright 2025 Korean Society of Animal Science and Technology. This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Received: Aug 10, 2023; Revised: Sep 23, 2023; Accepted: Oct 10, 2023

Published Online: Jan 31, 2025

Abstract

Lactococcus taiwanensis starin K_LL001 was isolated from the gut of grasshopper (Oxya chinensis sinuosa). In this study, we presented the complete genome sequence of L. taiwanensis stain K_LL001. The genome of K_LL001 genome was composed of 1 circular chromosome without plasmids. The length of the whole genome was 2,018,259bp, guanin + cytosine (G+C) content (%) was 38.75%, with 2,021 predicted protein-coding sequences (CDS). The most abundant CAZyme class in L. taiwanensis strain K_LL001 was glycoside hydrolases (GH) class. GHs is the key enzymes involved in carbohydrate metabolism, and they catalyze the hydrolysis of glycosidic bonds in complex carbohydrates such as cellulose, hemicellulose, and starch. Moreover, L. taiwanensis strain K_LL001 has genes encoding enzymes which can catalyze the transformation of one glycoside to another. Overall, this study will contribute to a further understanding of L. taiwanensis strain K_LL001 at the genomic level and provide a theoretical basis for its future application in swine industry.

Keywords: Lactococcus taiwanensis; Pig; Grasshopper; Glycoside hydrolases; Carbohydrates

Beneficial microorganisms can colonize the host’s intestinal tract and offer benefits to the host due to their unique abilities such as the production of digestive enzymes that enhance feed digestion and absorption [1]. Complex carbohydrate, such as cellulose, cannot be digested by pigs but can only be metabolized by the swine gut microbiota, serving as an important energy source for pigs. Since a significant quantity of cellulose is present in nursery and finisher pig feed, it is desirable to enhance its utilization for improved energy efficiency. As a result, there is a growing interest in researching bacteria associated with cellulose utilization in the swine industry [2], given the substantial cellulose content in swine feed. Insects, such as termites (Isoptera), bookworms (Lepidoptera), and others have been found to harbor symbiotic microflora in their guts, which is responsible for the digestion of cellulosic feed [3]. In this study, we isolated Lactococcus taiwanensis from the gut of a grasshopper (Oxya chinensis sinuosa). It exhibited a low DNA sequence similarity with Lactococcus lactis spp. [4]. Due to its recent discovery, there is relatively limited genomic information available for L. taiwanensis. The aim of this study was to contribute to a more comprehensive understanding of L. taiwanensis at the genomic level.

L. taiwanensis strain K_LL001 was isolated from the gut of a grasshopper (Oxya chinensis sinuosa), collected from the local grasshopper farm in Yangyang, Gangwon-do, Korea. The K_LL001 was grown in MRS broth (BD Difco™, Franklin Lakes, NJ, USA) at 37°C for 24 hours. Genomic DNA was extracted using the MagAttract HMW DNA Kit (QIAGEN, Hilden, Germany), according to the manufacturer’s instructions. The complete genome of the L. taiwanensis strain K_LL001 was sequenced using the PacBio RS II (Pacific Biosciences, Menlo Park, CA, USA) platform at Insilicogen (Yongin, Korea). Library preparation was performed using SMRTbell™ Template Prep Kit 1.0 following the manufacturer’s instructions (Pacific Biosciences). A total number of 139,220 long read sequences (854,450,914 base pairs) were produced after subreads filtering sequences. De novo assembly of the gene sequences were performed using the hierarchical genome assembly process (HGAP v2.3.0) workflow, and further polished with Quiver. Quality Assessment Tool for Genome Assemblies (QUAST) v5.2.0 was used for evaluating genome assembly [5]. Benchmarking Universal Single-Copy Orthologs (BUSCO) v5.4.7 was used for assessing genome completeness and contamination [6]. Prediction of protein coding genes, rRNA and tRNA genes were identified through Rapid Annotation using Subsystem Technology (RAST) server v2.0. Clusters of Orthologous Groups (COGs)-based EggNOG-mapper v2.0 was used to predict functional categorization of protein coding genes [7]. Virulence Factor Database (VFDB) and the Comprehensive Antibiotic Resistance Database (CARD) was used to predict potential virulence factors and antibiotic resistances [8,9]. For the identification of Carbohydrate-Active enzyme (CAZyme), data were submitted to automated carbohydrate-active enzyme and substrate annotation (dbCAN3) website [10].

The complete genome of the L. taiwanensis strain K_LL001 contained one circular chromosome (2,018,259 bp) with a guanine + cytosine (GC) content of 38.75% and no plasmid was found. A total of 2,021 predicted protein-coding sequences, 19 rRNA genes, and 60 tRNA genes were identified in L. taiwanensis strain K_LL001 (Table 1). L. taiwanensis strain K_LL001 had no virulence factor and antibiotic resistance gene. The genome features and genome map of L. taiwanensis strain K_LL001 were illustrated in Fig. 1.

Table 1. Genome features of Lactococcus taiwanensis strain K_LL001
Property Term
Library used PacBio SMRTbell™ library
Sequencing platforms PacBio RS II sequencer
Average genome coverage 327x
Chromosome length (bp) 2,018,259 bp
No. of contigs 1
G + C content (%) 38.75%
Protein-coding genes 2,021
rRNA genes 19
tRNA genes 60
GenBank accession no. SAMN17922033

bp, base pair; G + C, guanine + cytosine.

Download Excel Table
jast-67-1-273-g1
Fig. 1. Genome map of Lactococcus taiwanensis K_LL001. (A) From the outside to the inside: the outer circle indicates all annotated open reading frames (ORFs), while the inner circle in red represents guanine + cytosine (GC) content. The annotated ORFs are color-coded based on Clusters of Orthologous Groups (COG) assignments, and rRNAs and tRNA operons are denoted by orange and sky-blue arrows. The inner circle indicates GC skew by pink and green peaks, respectively. (B) Functional categories of predicted protein-coding genes according to COG.
Download Original Figure

All predicted proteins were subjected to the COG database for functional classification assignment. Top four classifications except S (uncharacterized genes) were J, K, E, and F. The genes categorized under Category J and Category K are associated with translation and transcription of genes in the bacterium. Category E (Amino acid transport and metabolism) and Category F (Nucleotide transport and metabolism) are involved in transport and metabolize amino acids and nucleotides, which are key for bacterial growth and survival.

The summarized distribution of predicted CAZyme was as follows: auxiliary activity (AA) 39; carbohydrate esterases (CE) 34, glycoside hydrolases (GH) 587, glycosyl transferases (GT) 189, carbohydrate-binding module (CBM) 22, and polysaccharide lyases (PL) 2. The CAZyme class mostly possessed in L. taiwanensis strain K_LL001 was GH class. According to Architecture et Fonction des Macromolécules Biologiques (AFMB) laboratory, GHs are the key enzymes involved in carbohydrate metabolism, and they catalyze the hydrolysis of glycosidic bonds in complex carbohydrates such as cellulose, hemicellulose, and starch. In addition, it was verified that L. taiwanensis strain K_LL001 had enzymes which can catalyze the transformation of one glycoside to another. For instance, glycosyltransferase (EC 2.4) is enzyme that catalyze the formation of the glycosidic linkage to form a glycoside, and GH92 family glycosyl hydrolase (EC 3.2.1) genes are a group of glycosyl hydrolases that catalyze the hydrolysis of specific glycosidic bonds in carbohydrates. Moreover, L. taiwanensis strain K_LL001 had no virulence factors or antibiotic-resistant genes. Overall, the genomic characteristics of L. taiwanensis strain K_LL001 suggest that it could be used as probiotics to increase swine performance through enhanced carbohydrate utilization in feed.

NUCLEOTIDE SEQUENCE ACCESSION NUMBER

The complete genome sequences of of Lactococcus taiwanensis strain K_LL001 were deposited in GenBank under the accession numbers NZ_CP070381. The BioSample accession number is SAMN17922033, and BioProject accession number is PRJNA702013.

Competing interests

No potential conflict of interest relevant to this article was reported.

Funding sources

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2021R1I1A3059910 and NRF-2019M3A9F3065227).

Acknowledgements

Not applicable.

Availability of data and material

Upon reasonable request, the datasets of this study can be available from the corresponding author.

Authors’ contributions

Conceptualization: Kim HB, Lee JH.

Data curation: Kim ES, Kim HB.

Formal analysis: Kim ES.

Methodology: Keum GB, Kwak J, Pandey S, Ryu S.

Software: Doo H, Choi Y, Kang J.

Validation: Cho JH, Song M, Kim S, Kim HB.

Writing - original draft: Kim ES.

Writing - review & editing: Kim ES, Cho JH, Song M, Kim S, Keum GB, Doo H, Kwak J, Pandey S, Ryu S, Choi Y, Kang J, Kim HB, Lee JH.

Ethics approval and consent to participate

This article does not require IRB/IACUC approval because there are no human and animal participants.

REFERENCES

1.

Keum GB, Pandey S, Kim ES, Doo H, Kwak J, Ryu S, et al. Understanding the diversity and roles of the ruminal microbiome. J Microbiol. 2024; 62:217-30

2.

Varel VH. Activity of fiber-degrading microorganisms in the pig large intestine. J Anim Sci. 1987; 65:488-96

3.

Gupta P, Samant K, Sahu A. Isolation of cellulose-degrading bacteria and determination of their cellulolytic potential. Int J Microbiol. 2012; 2012:578925

4.

Chen YS, Chang CH, Pan SF, Wang LT, Chang YC, Wu HC, et al. Lactococcus taiwanensis sp. nov., a lactic acid bacterium isolated from fresh cummingcordia. Int J Syst Evol Microbiol. 2013; 63:2405-9

5.

Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013; 29:1072-5

6.

Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness.In In: Kollmar M, editor.editor Gene prediction: methods and protocols. New York, NY: Humana. 2019; p p. 227-45

7.

Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008; 9:75

8.

Chen L, Yang J, Yu J, Yao Z, Sun L, Shen Y, et al. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 2005; 33Suppl 1:D325-8

9.

Alcock BP, Raphenya AR, Lau TTY, Tsang KK, Bouchard M, Edalatmand A, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020; 48:D517-25

10.

Ausland C, Zheng J, Yi H, Yang B, Li T, Feng X, et al. dbCAN-PUL: a database of experimentally characterized CAZyme gene clusters and their substrates. Nucleic Acids Res. 2021; 49:D523-8