We compared the species names in the Reptile Database, a dedicated taxonomy database, with those in the NCBI taxonomy database, which provides the taxonomic backbone for the GenBank sequence database. About 67% of the ~11,000 known reptile species are represented in GenBank by at least one DNA sequence and a binary species name. However, a common problem arises through the submission of preliminary species names (such as “Pelomedusa sp. A CK-2014”) to GenBank and thus to the NCBI taxonomy. These names cannot be assigned to any accepted species name and thus create a disconnect between DNA sequences and species. Although these names of unknown taxonomic meaning are sometimes updated, they often remain in GenBank, which now contains sequences from ~1,300 such “putative” reptile species tagged with informal names (~15% of its reptile names). We estimate that NCBI/GenBank probably contains tens of thousands of such “disconnected” entries. We encourage sequence submitters to update informal species names once the corresponding formal names have been published; otherwise the disconnect will cause increasing confusion and possibly misleading taxonomic conclusions.
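The kind of name comparison summarized above can be illustrated with a minimal Python sketch. It is not the procedure used in this study: the marker list `INFORMAL_TOKENS`, the regular expression `BINOMIAL`, the function names, and the toy input lists (including the made-up name "Examplus fictus") are all assumptions introduced purely for illustration.

```python
import re

# Hypothetical markers of informal names; an assumption, not the authors' rule set.
INFORMAL_TOKENS = {"sp.", "cf.", "aff."}

# A plain two-word name: capitalized genus followed by a lowercase epithet.
BINOMIAL = re.compile(r"^[A-Z][a-z]+ [a-z][a-z-]+$")


def is_informal(name: str) -> bool:
    """Return True if the name looks like a preliminary, non-binary name."""
    name = name.strip()
    if any(token in INFORMAL_TOKENS for token in name.split()):
        return True
    return BINOMIAL.match(name) is None


def split_names(sequence_db_names, accepted_names):
    """Sort sequence-database names into matched, informal, and unmatched."""
    accepted = set(accepted_names)
    matched, informal, unmatched = [], [], []
    for name in sequence_db_names:
        if name in accepted:
            matched.append(name)       # connects directly to an accepted species
        elif is_informal(name):
            informal.append(name)      # "disconnected" placeholder name
        else:
            unmatched.append(name)     # binary name needing manual review
    return matched, informal, unmatched


# Toy data for illustration only.
sequence_db_names = [
    "Pelomedusa subrufa",
    "Pelomedusa sp. A CK-2014",
    "Goggia sp.",
    "Examplus fictus",  # made-up name standing in for a synonym or misspelling
]
accepted_names = ["Pelomedusa subrufa"]

matched, informal, unmatched = split_names(sequence_db_names, accepted_names)
informal_share = len(informal) / len(sequence_db_names)
```

Names that land in the `unmatched` group would still need manual checking against synonym lists, which this sketch does not attempt.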