Title A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms
Authors Zhou, Yizhuang
Zhang, Wenting
Wu, Huixian
Huang, Kai
Jin, Junfei
Affiliation Guilin Med Univ, Affiliated Hosp, Lab Hepatobiliary & Pancreat Surg, Guilin 541001, Guangxi, Peoples R China
Peking Univ, Acad Adv Interdisciplinary Studies, Peking Tsinghua Ctr Life Sci, Beijing 100871, Peoples R China
Guilin Med Univ, China USA Lipids Hlth & Dis Res Ctr, Guilin 541001, Guangxi, Peoples R China
Guilin Med Univ, Guangxi Key Lab Mol Med Liver Injury & Repair, Guilin 541001, Guangxi, Peoples R China
Keywords Composition
Species delineation
Metagenomic binning
Strain typing
TETRA
TZMD
Taxonomy
Issue Date 2019
Publisher BMC GENOMICS
Abstract Background Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. Results Here, we initially determine that the "tetranucleotide-derived z-value Pearson correlation coefficient" (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called "Tetranucleotide-derived Z-value Manhattan Distance" (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. Conclusions Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future.
URI http://hdl.handle.net/20.500.11897/553328
ISSN 1471-2164
DOI 10.1186/s12864-019-6119-x
Indexed SCI(E)
EI
Appears in Collections: 前沿交叉学科研究院

Files in This Work
There are no files associated with this item.

Web of Science®


0

Checked on Last Week

Scopus®



Checked on Current Time

百度学术™


0

Checked on Current Time

Google Scholar™





License: See PKU IR operational policies.