Title | A high-resolution genomic composition-based method with the ability to distinguish similar bacterial organisms |
Authors | Zhou, Yizhuang Zhang, Wenting Wu, Huixian Huang, Kai Jin, Junfei |
Affiliation | Guilin Med Univ, Affiliated Hosp, Lab Hepatobiliary & Pancreat Surg, Guilin 541001, Guangxi, Peoples R China Peking Univ, Acad Adv Interdisciplinary Studies, Peking Tsinghua Ctr Life Sci, Beijing 100871, Peoples R China Guilin Med Univ, China USA Lipids Hlth & Dis Res Ctr, Guilin 541001, Guangxi, Peoples R China Guilin Med Univ, Guangxi Key Lab Mol Med Liver Injury & Repair, Guilin 541001, Guangxi, Peoples R China |
Keywords | Composition Species delineation Metagenomic binning Strain typing TETRA TZMD Taxonomy |
Issue Date | 2019 |
Publisher | BMC GENOMICS |
Abstract | Background Genomic composition has been found to be species specific and is used to differentiate bacterial species. To date, almost no published composition-based approaches are able to distinguish between most closely related organisms, including intra-genus species and intra-species strains. Thus, it is necessary to develop a novel approach to address this problem. Results Here, we initially determine that the "tetranucleotide-derived z-value Pearson correlation coefficient" (TETRA) approach is representative of other published statistical methods. Then, we devise a novel method called "Tetranucleotide-derived Z-value Manhattan Distance" (TZMD) and compare it with the TETRA approach. Our results show that TZMD reflects the maximal genome difference, while TETRA does not in most conditions, demonstrating in theory that TZMD provides improved resolution. Additionally, our analysis of real data shows that TZMD improves species differentiation and clearly differentiates similar organisms, including similar species belonging to the same genospecies, subspecies and intraspecific strains, most of which cannot be distinguished by TETRA. Furthermore, TZMD is able to determine clonal strains with the TZMD = 0 criterion, which intrinsically encompasses identical composition, high average nucleotide identity and high percentage of shared genomes. Conclusions Our extensive assessment demonstrates that TZMD has high resolution. This study is the first to propose a composition-based method for differentiating bacteria at the strain level and to demonstrate that composition is also strain specific. TZMD is a powerful tool and the first easy-to-use approach for differentiating clonal and non-clonal strains. Therefore, as the first composition-based algorithm for strain typing, TZMD will facilitate bacterial studies in the future. |
URI | http://hdl.handle.net/20.500.11897/553328 |
ISSN | 1471-2164 |
DOI | 10.1186/s12864-019-6119-x |
Indexed | SCI(E) EI |
Appears in Collections: | 前沿交叉学科研究院 |