Nothing Special   »   [go: up one dir, main page]

DNA Barcoding, An Effective Tool For Species Identification: A Review

Download as pdf or txt
Download as pdf or txt
You are on page 1of 15

Molecular Biology Reports (2023) 50:761–775

https://doi.org/10.1007/s11033-022-08015-7

REVIEW

DNA barcoding, an effective tool for species identification: a review


Sandeep Antil1 · Jeeva Susan Abraham1 · S. Sripoorna1 · Swati Maurya1 · Jyoti Dagar1 · Seema Makhija1 ·
Pooja Bhagat1 · Renu Gupta2 · Utkarsh Sood3 · Rup Lal3 · Ravi Toteja1

Received: 4 May 2022 / Accepted: 7 October 2022 / Published online: 29 October 2022
© The Author(s), under exclusive licence to Springer Nature B.V. 2022

Abstract
DNA barcoding is a powerful taxonomic tool to identify and discover species. DNA barcoding utilizes one or more standard-
ized short DNA regions for taxon identification. With the emergence of new sequencing techniques, such as Next-generation
sequencing (NGS), ONT MinION nanopore sequencing, and Pac Bio sequencing, DNA barcoding has become more accu-
rate, fast, and reliable. Rapid species identification by DNA barcodes has been used in a variety of fields, including forensic
science, control of the food supply chain, and disease understanding. The Consortium for Barcode of Life (CBOL) presents
various working groups to identify the universal barcode gene, such as COI in metazoans; rbcL, matK, and ITS in plants; ITS
in fungi; 16S rRNA gene in bacteria and archaea, and creating a reference DNA barcode library. In this article, an attempt has
been made to analyze the various proposed DNA barcode for different organisms, strengths & limitations, recent advance-
ments in DNA barcoding, and methods to speed up the DNA barcode reference library construction. This study concludes
that constructing a reference library with high species coverage would be a major step toward identifying species by DNA
barcodes. This can be achieved in a short period of time by using advanced sequencing and data analysis methods.

Keywords Biodiversity · DNA barcoding · Next-generation sequencing (NGS) · Oxford Nanopore Technologies (ONT)’s
MinION™ · PacBio sequencing · Consortium for barcode of life (CBOL)

Introduction among living organisms from all sources comprising ter-


restrial, marine, and other aquatic ecosystems and ecologi-
The existence of life is one of the most unique aspects cal complexes of which they are a part; it encompasses
of Earth, and the diversity of life is the most astonishing diversity within species, between species, and within eco-
feature of life. Biological diversity refers to the variation systems [1]. Biodiversity plays a key role in maintaining

* Ravi Toteja Renu Gupta


ravitoteja@andc.du.ac.in rgupta17@maitreyi.du.ac.in
Sandeep Antil Utkarsh Sood
sandeep@andc.du.ac.in c_utkarsh.sood@teri.res.in
Jeeva Susan Abraham Rup Lal
jeevasa@andc.du.ac.in rup.lal@teri.res.in
S. Sripoorna 1
Acharya Narendra Dev College, University of Delhi,
sripoorna@andc.du.ac.in
New Delhi, Delhi, India
Swati Maurya 2
Maitreyi College, University of Delhi, New Delhi,
swatimaurya@andc.du.ac.in
Delhi 110 021, India
Jyoti Dagar 3
The Energy and Resources Institute, IHC Complex,
jyotidagar@andc.du.ac.in
New Delhi 110003, India
Seema Makhija
seemamakhija@andc.du.ac.in
Pooja Bhagat
poojabhagat@andc.du.ac.in

13
Vol.:(0123456789)
762 Molecular Biology Reports (2023) 50:761–775

ecological balance [2]. The diversity of species described satisfy three criteria (1) A distinct ‘barcoding gap’ between
so far is a very tiny portion of biodiversity present on the maximum intra-specific and inter-specific divergence within
earth (approximately 9 million). This means that it is very a group of organisms; (2) conserved flanking sites for cre-
hard to estimate the diversity of life that go extinct every ating universal PCR primers; (3) short sequence length to
day, as scientists have only described 10–15% of the total facilitate current capabilities of DNA extraction and ampli-
diversity of the earth [3]. A considerable portion around fication [8].
86–91% (~ 7.2 million) of diversity remains undescribed due This technique involves a collection of a sample from
to several reasons such as scarcity of funds for taxonomy, a the field, extracting DNA, selection of barcoding gene for
very less number of the trained taxonomist, and absence of amplification by using a universal primer (Table 1), ampli-
accurate species identification methods. For describing the fied DNA molecule is sequenced by Sanger sequencing or
remaining diversity an accurate method for taxonomic iden- High-throughput sequencing for assessing the diversity and
tification, trained taxonomist, and funding are required. Tra- analysis of obtained data by using data analysis software
ditionally, taxonomic assessment has relied on the basis of such as Mothur, Qiime2 etc. (Fig. 1) [10, 11].
morphological character which is time-consuming, requires The Consortium for the Barcode of Life (CBOL, https://​
taxonomic specialists, and gives false identification when www.​ibol.​org) is an international organisation that was
cryptic species and phenotypic plasticity are concerned [4, founded in 2004 to facilitate the establishment and use of
5]. Furthermore, globally, the number of traditional taxono- DNA barcodes as a global standard for biological species
mists is declining [6]. Therefore, the majority of the diver- identification. CBOL includes various group, such as Plant
sity of microorganisms and invertebrates may have to be working group for plants, Protist working group for eukar-
distinguished solely by DNA-based molecular techniques, yotic microorganism, Fungal working group for fungi, to
without accompanying live cultures or physical specimens. identify the universal barcode gene and creating a reference
These molecular techniques have several advantages over DNA barcode library [12]. A reference database, Barcode of
traditional approaches. Because, molecular tools are stand- life data system (BOLD, http://​www.​bolds​ystems.​org), has
ardized tools which allow direct comparison among different been developed that aids in acquiring, storage, analysis, and
users and they do not require taxonomic expertise, can be publication of DNA barcode and allows a significant number
applied to environmental samples which comprise a mix of of species to be identified [13]. The present study aims to
several species, like soil or a water sample and can be used review the various proposed/available DNA barcodes the
in early warning allowing detection of low concentration of for animals, plants, fungi, bacteria, virus, and protists (more
potential invaders, or even imprints of potential invader [7]. specifically ciliates). Over the period of time, significant
Over the last decade, DNA barcoding emerges as a new advances have been made in DNA barcoding. One impor-
molecular tool for taxonomists to identify species. DNA tant advancement in barcoding by mitogenomics and nuclear
barcoding utilizes one or more standardized short genetic ribosomal RNA repeats obtained by genome skimming. The
markers in an organism’s DNA to recognize it as belong- same has been discussed in the present review along with the
ing to a particular species, and through this strategy, DNA mitogenomics approach for species identification. In the end,
sample from the unidentified species is compared to identi- strengths and limitations of the technique have also been
fied sequences present in a DNA barcode reference library, briefly described.
developed by Hebert and his collaborators [8]. DNA barcod-
ing is based on the principle of barcoding gap that refers to
the difference between mean intra- and interspecific genetic Barcodes for identification of animals
distances. The wider the barcoding gap is, the more reliable
species discrimination will be achieved. DNA barcoding is Hebert et al. had suggested a 650 bp fragment of the mito-
budget-friendly, less time-consuming, objective method, and chondrial cytochrome-c oxidase subunit 1 (COI) gene as a
a powerful tool for species identification when cryptic spe- universal marker or ‘DNA barcode’ for global biological
cies and phenotypic plasticity is a concern or morphology identification of animal species. COI gene is a mitochon-
keys are not available [9]. Due to the better precision and drial gene that is highly conserved [14], codes for respira-
ease of DNA barcoding, this technique is gaining popular- tory electron transport chain protein that reduce molecular
ity, and it can be used to identify species in any stage of oxygen into water, present in all aerobic organisms. Mito-
life (i.e. both adults and immature stage including eggs). chondrial genes are preferred over nuclear genes because
DNA barcoding mostly differs from other molecular tools mitochondrial genes are generally haploid, lack introns,
by use of standard markers, such as COI in metazoans; rbcL, and contain limited recombination. Mitochondria repro-
matK, and ITS in plants; ITS in fungi; 16S rRNA gene in duce by binary fission and without sexual recombination,
bacteria and archaea [7]. For DNA barcoding, the selection so the mitochondrial genes are subjected less to insertions,
of the barcoding gene is crucial. A barcoding gene must deletions or other large-scale rearrangements that introduce

13
Molecular Biology Reports (2023) 50:761–775 763

Table 1  Represents list of barcoding gene for species identification and their primer pairs
Organisms Barcode gene Primers (5′–3′) References

Animals COI F-GGT​CAA​CAA​ATC​ATA​AAG​ATA​TTG​G [15]


R-TAA​ACT​TCA​GGG​TGA​CCA​AAA​AAT​CA
COII F-TCA​CWA​TAC​TAY​TAA​CAG​ATC​GCA​A [95]
R-AAT​AGC​TGT​ACA​GTG​GGT​
Plant COI F-CAA​CAT​TT ATT​TTG​ATT​TTT​TGG​ [16]
R-TCC​AAT​GCA​CTA​ATC​TGC​CAT​ATT​A
RbcL F-ATG​TCA​CCA​CAA​ACA​GAA​AC [19]
R-TCG​CAT​GTA​CCT​GCA​
MatK F-CCT​CAT​CTG​GAA​ATC​TTG​GTT​ [19]
R-GCT​TAT​AAT​GAG​AAA​GAT​TTC​TGC​
PsbA-trnH F-CGC​GCA​TGG​TGG​ATT​CAC​AATCC​ [19]
R-GTT​ATG​CAT​GAA​CGT​AAT​GCTC​
rpoC1 F-CCSATT​GTA​TGG​GAA​ATA​CTT​ [96]
R-CTT​ACA​AAC​TAA​TGG​ATG​TAA​
rpoB F1-AAG​TGC​ATT​GTT​GGA​ACT​GG [19]
F2-ATG​CAA​CGT​CAA​GCA​GTT​CC
R1-CCG​TAT​GTG​AAA​AGA​AGT​ATA​
R2-GAT​CCC​AGC​ATC​ACA​ATT​CC
atpF-atpH F-ACT​CGC​ACA​CAC​TCC​CTT​TCC​ [19]
R-GCT​TTT​ATG​GAA​GCT​TTA​ACAAT​
psbK-psbI F-TTA​GCC​TTT​GTT​TGG​CAA​G [19]
R-AGA​GTT​TGA​GAG​TAA​GCA​T
ITS F-ATG​CGA​TAC​TTG​GTG​TGA​AT [9]
R-GAC​GCT​TCT​CCA​GAC​TAC​AAT​
Fungi ITS F-TCC​TCC​GCT​TAT​TGA​TAT​GC [21, 24]
R-GGA​AGT​AAA​AGT​CGT​AAC​AAGG​
D1-D2 F-ACC​CGC​TGA​ACT​TAAGC​ [22]
R-TCC​TGA​GGG​AAA​CTTCG​
RPB1 F-GAR​TGY​CCDGGDCAY​TTY​GG [24]
R-CCNGCDATNTCR​TTR​TCC​ATR​TA
RPB2 F-GAY​GAY​MGWG​ATC​AYT​TYG​G
R-CCC​ATW​GCY​TGC​TTMCCCAT​
COI F1-TTA​CAA​GGT​GAT​CAT​CAA​TT [23]
F2-GTA​TTA​AAA​TTT​CTA​TCT​GTAAG​
R1-TTT​CTA​TCT​GTA​AGT​AAC​AT
R2-TTT​ACA​AGG​TGA​TCAA​
ACT​ F-ATG​TGC​AAG​GCC​GGT​TTC​G [97]
R-TAC​GAG​TCC​TTC​TGG​CCC​AT
TEF1a F-GCY​CCY​GGHCAY​CGT​TTYAT​ [98]
R-ACHGTR​CCR​ATA​CCA​CCR​ATC​TT

13
764 Molecular Biology Reports (2023) 50:761–775

Table 1  (continued)
Organisms Barcode gene Primers (5′–3′) References

Protist ITS F-CGT​AAC​AAG​GTT​TCC​GTA​GG [45]


R-TCC​TCC​GCT​TAC​TGA​TAT​GC
V4 F-CCA​GCA​SCYG​CGG​TAA​TTC​C [45]
R-ACT​TTC​GTT​CTT​GAT​YRA​
V9 F-GTA​CAC​ACC​GCC​CGTC​ [42]
R-TGA​TCC​TTC​TGC​AGG​TTC​ACC​TAC​
D1-D2/D2-D3 F-AGC​GGA​GGA​AAA​GAA​ACT​A [42]
R-ACG​ATC​GAT​TTG​CAC​GTC​AG
D3-D5 F-GAC​CCG​TCT​TGA​AAC​ACG​GA [50]
R-TCG​GAA​GGA​ACC​AGC​TAC​TA
COI F-GWT​GRG​CKATG​ATY​ACACC​ [5]
R-ACC​ATR​TAC​ATA​TGA​TGW​CC
RbcL F-TTA​ACC​TCC​ATC​GTG​GGT​AACG​ [99]
R-CAG​GCA​TAG​AAG​CCC​AAT​CTTG​
H4 F-GGT​ATT​ACT​AAG​CCC​GCT​ATC​AGA​AGA​ [53]
R-GGT​CTT​TCT​TCT​GGC​GTG​TTC​AGT​GTA​
Spliced F-GTA​TAA​GAG​ACA​GNNNNNNN [44]
leader RNA gene R-TCA​GTT​TCT​GTA​
Archaea 16S F-GCY TAA AGS RIC CGT AGC​ [26]
R-TTM GGG GCA TRC IKA CCT​
Type II chaperonin (High F-GGC CCG AAG GGC ATG GAC AAG ATG​ [26]
GC primers) R-GGC ATG TCG TCG ATG CCC TTC TG
Type II chaperonin F-GGI CCI MRR GGI ITI GAY AAR ATG​ [26]
(Universal primers) R-GCI AII TCR TCI ATI CCY TTY TG
Bacteria RpoB F-TTT​CCC​TAC​ACG​ACG​CTC​TTC​CGA​TCT​GGY​TWY​GAA​GTNCGH- [30]
GACGTDCA
R-GGA​GTT​CAG​ACG​TGT​GCT​CTT​CCG​ATC​TTG​ACG​YTG​CATGT TBGM-
RCC​CAT​MA
16S rRNA gene F-GAG​TTT​GAT​CCT​GGC​TCA​G [29]
R-GTA​TTA​CCG​CGG​CTG​CTG​
cpn60 F1-TCG​TCG​GCA​GCG​TCA​GAT​GTG​TAT​AAG​AGA​CAGGAIIIIGCIGGI- [33]
GAYGGIACIACIAC
F2-GCT​TCG​TGG​GCT​CGG​AGA​TGT​GTA​ TAA​GAG​ACAGYKIYKITCIC-
CRAAICC IGGIGCYTT​
R1-TCG​TCG​GCA​GCG​TCA​GAT​GTG TAT​AAG​AGA​CAG​GAIIIIGCIGGY​GAC​
GGYACSACSAC
R2-GCT​TCG​TGG​GCT​CGG​AGA​TGT​GTA​ TAA​GAG​ACA​GCG​RCG​RTC​RCC​
GAA​GCCSGGIGCCTT​
Tuf F-GCT​CCT​GAA​GAA​ARA​GAA​CGTGG​ [31]
R-ACTTGDCCT​CTT​TCKACT​CTA​CCAGT​
RIF F-TAC​GGC​TTC​GAC​ACC​TTC​G [32]
R-CGG​TGA​TCT​TCT​TGT​TGG​CG
Gnd F-CGC​GGA​TCC​GGW​CCWWSWAT​WAT​GCC​WGG​WGG​ [31]
R-CGC​GGG​CCC​GTA​TGW​GCW​CCA​AAA​TAA​TCW​CKTTG​WGC​TTG​

The lower-case letters indicate nucleotides added to the genomic sequence to facilitate PCR; D = A/G/T, Y = C/T, R = A/G, W = A/T, M = A/C,
B = C/G/T, V = A/C/G, H = A/C/T, K = G/T, N = A/G/C/T

more ambiguous variation in the sequence. The mitochon- barcoding gene for moths, butterflies, collembolans, beetles,
drial genome evolves at a higher rate than the nuclear bats, spiders, wasps, ants, fishes, Reptilia, birds, chickens,
genome. Therefore, mitochondrial genomic sequences are musk deer, fruit fly, and crustacean larvae [4, 16]. Some pri-
more informative in differentiating or distinguishing closely mate taxonomists recommend that ND5 (Mitochondrial gene
related species [15]. So far COI gene has been used as encoding NADH:Ubiquinone Oxidoreductase Core Subunit

13
Molecular Biology Reports (2023) 50:761–775 765

species [19]. Comparatively, 600–800 base-pair region of


matK in association with rbcL gives the most satisfactory
result and designated as core barcoding gene, while psbA-
trnH work as a good marker for other plant species and iden-
tified as an important supplementary marker, but there is no
single marker for identifying all the plant species [20].

Barcodes for identification of fungi

Identification of fungi through morphological methods is


often difficult because they only occasionally display mor-
phological characters suitable for identification. A molecu-
lar tool such as DNA barcoding is the best way to evaluate
fungal diversity. The ITS, D1-D2 region of the large subu-
nit of ribosomal RNA gene, RPB1 and RPB2 of the large
subunit of RNA polymerase II, γ-actin (ACT), β -tubulin
II (TUB2), translation elongation factor 1-α (TEF/α), DNA
topoisomerase I (TOPI), phosphoglycerate kinase (PKG) are
used as a barcode for identifying fungal species [20–22].
COI has a higher resolution in few groups of related species
such as Penicillium, and Entolomasarcopum but in other
groups it may not give satisfactory results [23]. Schoch
and his group has proposed ITS as a universal barcode for
the identification of fungi [24]. The length of ITS region
in fungi is around 600 bp long, with two variable spacers,
ITS-1 and ITS-2, interrupted by the highly conserved 5.8S
Fig. 1  Schematic representation of the methodology for DNA barcod-
ing rRNA gene. Another significant benefit of utilizing ITS as
a barcode is that each haploid genome often contains sev-
eral tandemly repeated copies of the ribosomal rRNA gene
5) and COII should be used as a barcode in primates species cluster (including ITS), allowing it to be amplified even from
delineation and suggest that these two genes should be more small amounts of biological materials [6]. Stielow et al. have
appropriate markers than COI due to a more pronounced assessed the potentiality of D1-D2 region of LSU, β-tubulin
barcoding gap [17]. II (TUB2), γ-actin (ACT), translation elongation factor 1-α
(TEF/α), the second largest subunit of RNA-polymerase II
(RPB2), DNA topoisomerase I (TOPI), phosphoglycerate
Barcodes for identification of plants kinase (PKG), hypothetical protein LNS2 as an alternative
DNA barcode. Among these genes TEF/α has the potential
For plant species identification, the selection of barcod- as a secondary DNA barcode due to sufficient intra- and
ing genes remains very controversial. Plant mitochondrial inter-specific variation, while TOPI and PKG show high
genome exhibits a low rate of mutation (nucleotide substitu- resolution for the phylum Ascomycota, and TOPI and LNS2
tion) that restricts COI as a universal plant barcode. Plant for the subphylum Pucciniomycotina [22].
taxonomists have spent a large amount of time and found
the Chloroplast genome as an alternative to the mitochon-
drial genome. In 2009, CBOL plant working group pro- Barcodes for identification of archaea
posed seven potential barcodes such as rbcL (large subunit
of ribulose 1,5 bisphosphate carboxylase), matK (maturase Archaea is a major component of microbial diversity and
K), psbA-trnH (intergenic spacer region, rpoC1 (RNA poly- has a prominent place in the Tree of Life [25]. 16S rRNA
merase C1), rpoB (RNA polymerase B), atpF-atpH (encodes gene has been widely utilised as a barcode for evaluating
for ATP synthase subunits CFO I and CFO III) and psbK- the diversity of archaea [25, 26]. The 16S rRNA gene is not
psbI (encodes for polypeptide K and L of photosystem II) sensitive enough to discriminate closely related microbes,
[18]. Nuclear gene ITS (internal transcribed spacer) and all particularly at the species level [27]. In a study, type 2 chap-
the chloroplast barcodes have been positively tested in plant eronin or thermosome (e.g. TCP-1 ring complex/chaperonin

13
766 Molecular Biology Reports (2023) 50:761–775

containing TCP-1), which are present in both archaea and identification [12]. The hypervariable V4 region of 18S
eukaryotic cytoplasm, proposed as a potential complemen- rRNA gene is proposed as the universal eukaryotic pre-
tary barcode for 16S rRNA gene to assess the archaeal diver- barcode, while group-specific barcode is defined separately
sity since it has larger barcoding gap and generate more for each significant protistan lineage [34]. So far, ITS,
OTUs (operational taxonomic units) than 16S rRNA gene COI, rbcL,18S rRNA gene, 28S rRNA gene region have
[28]. been proposed as a protistan DNA barcode [35–37]. ITS,
the universal barcode in fungi, also has high discriminatory
power for ciliates, dinoflagellates, and oomycetes [20, 37,
Barcodes for identification of bacteria 38]. Mitochondrial COI, which is the universal barcode for
animals and default barcode for other organisms as well, is
16S rRNA gene is a universal marker as it is highly con- also positively tested in protist [36]. Hypervariable regions
served in all the species of bacteria. The length of the 16S V4 and V9 of 18S ribosomal RNA gene are promising bar-
rRNA gene is 1600 base pairs and contains nine hypervari- codes to access the diversity and phylogenetic relationship
able regions of V1–V9. More conservative regions are valu- of diatoms, dinoflagellates and ciliates [39–41]. D1-D2 and/
able for identifying higher-ranking taxa, whilst more rapidly or D2-D3 regions at 5′ end of large subunit of rRNA gene
evolving ones can aid in genus or species identification. The serve as potential barcodes for many protists lineages such
V2–V3 region of 16S rRNA gene has higher resolution for as diatoms, ciliates, and dinoflagellates [35, 42, 43]. Some
identifying lower-ranked taxa (species and genus) [29]. The group-specific barcodes such as rbcL and spliced leader
diversity of bacteria can also be accessed by using COI, RNA gene are also utilized in photosynthetic protists and
rpoB, cpn60 (encodes for chaperonin protein), tuf (elon- trypanosomatids, respectively [44].
gation factor), RIF (Replication initiation factor), and gnd
(Gluconate-6-phosphate dehydrogenase) gene as barcode
[30–33]. These genes have several benefits over frequently Barcodes for identification of ciliates
used 16S rRNA gene i.e. as they are frequently found in sin-
gle copies in bacterial genome, and develop silent mutations Large public reference libraries of DNA barcodes are being
owing to codon degeneracy, resulting in improved species developed for animals, plants, and fungi, but no universal
resolution. Of these cpn60 gives better results and can be barcode has been accepted for ciliates species identification
used as a possible alternative for assessing bacterial diversity [37]. Various barcodes for ciliate identification are (1) mito-
[20, 26] and cpn60 is the only target that can be addressed chondrial cytochrome c oxidase subunit I gene (COI gene);
with ‘universal’ PCR primers, and a curated sequence data- (2) hypervariable regions of the small subunit (SSU) rRNA
base, cpnDB, is available. For closely related species, the gene such as V4 and V9 region; (3) ITS region; (4) D1-D2
cpn60 gene has stronger discriminating power than the 16S regions of the large subunit of rRNA gene (LSU) and (5)
rRNA gene, and the uniform size and sequence variability of histone H4.
the cpn60 ‘universal target’ (UT) make sequence compari-
sons and other bioinformatics tasks easier [26].
Mitochondrial cytochrome‑c oxidase
subunit 1 gene (COI)
Barcodes for identification of viruses
Within ciliates, taxonomic and molecular phylogenetic
Viruses are the most abundant (approx. 10–12 times higher studies using COI gene have been used in Paramecium, Tet-
than the total no. of cells) life forms on earth. So far, there rahymena, Carchesium, Miamiensis, Sterkiella and Pseu-
is no standardized barcode fragment for detection of viruses dokeronopsis [5, 45]. All the above studies prove that the
[20]. highly variable COI gene of ciliates can identify closely
related species and cryptic species since it has a distinct
barcode gap between maximum intraspecific and minimum
Barcodes for identification of protist inter-specific genetic divergence (Table 1). Within ciliates,
the COI gene have been successfully sequenced from Tet-
CBOL has initiated a Protist working group (ProWG) to rahymena and Paramecium[4, 45]. The COI gene (average
identify barcode region across all protist lineages and set- 2000–2200 nucleotides long) have been found to be widely
ting up a reference DNA barcode library. CBOL ProWG has dissimilar from other eukaryotes as it includes > 300 nucleo-
introduced a 2-step pipeline for protists: first, the universal tides long insert region which has exceptional variation in a
pre-barcode to be used for preliminary identification; sec- genetic distance value and intraspecific genetic divergence
ond, a group-specific barcode to be applied for species-level [46]. This insert region is used as a barcode to discriminate

13
Molecular Biology Reports (2023) 50:761–775 767

closely related species based on genetic divergence [8]. Ear- the most effective for revealing interspecific relationship. On
lier studies have shown that the COI gene of ciliates has high the other hand, the V9 region seems appropriate at the fam-
intraspecific genetic divergence than nuclear gene [5, 46]. ily level or higher [47]. It is recommended to use V4 and V9
Park et al. (2019) have reported a 478 bp long COI sequence together to assess the diversity and phylogenetic relationship
of 69 population of spirotricheans ciliates, which has maxi- of eukaryotic microbes [39].
mal intraspecific genetic divergence ranging from 0 to 14.8%
and minimal interspecific genetic variation, i.e.,13.6–47.3%. Internal transcribed spacer (ITS) region
They identified three putative cryptic species, Caudiholos-
tichaylvatica, Diophrys scutum, and Euplotes vannus [5]. Internal transcribed spacer (ITS) and the external transcribed
COI nucleotide tree has a higher resolution to discriminate region (ETR) are the flanking regions of the SSU, and the
closely related and sibling species at and below the spe- 5.8S rRNA is a non-coding part of LSU rRNA. ITS1 is
cies level. Recently, Zhang et al. [36] studied the phyloge- present between SSU rRNA and 5.8S rRNA, and ITS2 is
netic relationship of subclass scuticociliates with the usage present between 5.8S rRNA and LSU rRNA [45]. Various
of nuclear SSU-rRNA gene, mitochondrial SSU-rRNA studies suggest that ITS region has the potential of prom-
gene and COI gene as a molecular marker and showed that ising barcode for ciliate identification and investigation of
sequence divergence of COI (average 24%) is more signifi- intraspecific genetic diversity at species and population
cant than mtdSSU-rRNA gene (average 21%) and nSSU- levels since they shows much higher rate of evolutionary
rRNA gene (average 11.5%). They proved that COI is a bet- changes (> 100 times) than the coding regions of the ribo-
ter choice as a molecular marker to examine phylogenetic somal subunit [34, 48, 49]. Usually, phylogenetic trees of
relationships than mtdSSU-rRNA gene and nSSU-rRNA ITS1-5.8S-ITS2 region usually do not differ significantly
gene[36]. However, consortium for the barcode of life does from those inferred from the 18S rRNA gene, implying that
not consider COI as an appropriate barcode for uncovering the ITS region is a viable proxy for genealogical studies.
ciliates species because of issues like the absence of func- Although both the ITS1 and ITS2 have sufficient conserved
tional mitochondria in some ciliates from the anoxic envi- and variable region, but ITS2 seems to have more informa-
ronment e.g., ciliates belonging to Metopusand Trimyema tion and may be more valuable for comparisons at the family,
genus and presence of heteroplasmy [4, 5]. order, and even higher level. Moreover, the secondary struc-
ture of the ITS2 molecule has been employed to improve the
quality of species-level phylogenetic reconstructions. Apart
Small subunit (SSU) rRNA gene from phylogenetic reconstructions, the compensatory base
changes (CBCs) in the ITS2 region correlate with sexual
SSU rRNA gene was the first and widely used molecular incompatibility and so can be used for species discrimination
marker in genealogy and systematics study of ciliates because [48]. More and more studies suggest that using both primary
it can be sequenced accurately, universally, availability of sequence and secondary structure of ITS2 produce higher
diverse and large database from NCBI, and includes both con- phylogenetic resolution [34, 49]. Zhan et al. used ITS1-5.8S-
served and variable nucleotide sequences allowing combined ITS2 and the ITS2 as a barcode to delimitates Pseudokero-
phylogenetic reconstruction and biota recognition at various nopsis species and found that both the ITS1-5.8S-ITS2 and
taxonomic levels. Within ciliates, the average size of 18S the ITS2 regions shows similar levels of genetic variation
rRNA gene is ~ 1771 bp long except in litostomatea which has and substantial gaps between intraspecific and interspecific
1635-1641 bp [45], but this entire region of 18S rRNA gene distance (0.52–3.72% for ITS2; 0.42–3.84% for ITS-5.8S-
is not used for species identification. Only the hypervariable ITS2). Additionally, they also proposed a genetic divergence
regions (V1–V5 and V7–V9) of 18S rRNA gene are used for of 1.5% as an ideal threshold of ITS1-5.8S-ITS2 and ITS2
species identification. Among them, V4 and V9 hypervariable to distinguish Pseudokeronopsis species and also suggested
regions are considered the famous barcoding gene. The hyper- the ITS1-5.8S-ITS2 can be used as an ideal SGS (Second
variable region V9 is immensely used as a genetic marker for generation sequencing) metabarcode for assessing ciliates
evaluating eukaryotic diversity and also a prime candidate for environmental diversity [34].
assessing protist lineage richness, while the V4 region of SSU
rRNA gene is the primary candidate for studying the phyloge-
netic relationship of eukaryotes. V4 region is more extensive, Large subunit (LSU) of rRNA gene
more variable, and show better resolution to explore the evo-
lutionary relationship of eukaryotes than the V9 region [39]. LSU rRNA gene is a good barcoding gene for discriminat-
The secondary structure of hypervariable region V9, V7, V4, ing closely related taxa because it has a higher evolutionary
V2 of 18SrRNA gene in urostylids shows a high degree of rate than SSU. Similar to SSU, LSU rRNA gene has variable
variability and provides further evidence that the V4 region is region such as D1-D12, of which D1-D3 region show much

13
768 Molecular Biology Reports (2023) 50:761–775

higher variation than other variables such as D4, D5, D7, diversity instead of identifying individual taxa [54]. This
D8, D12 [50]. Over the last decade, the D1-D2 region of advancement has overcome the limitation of traditional
LSU rRNA gene emerged as a promising barcode marker for DNA barcoding, such as extensive sampling efforts. Meta-
species identification up to species level. Santoferrara et al. barcoding relies on the shorter DNA fragments instead of
has proposed D1–D2 region of LSU rRNA gene with a 1% whole 658 bp fragment (standard barcodes) used in classical
threshold value (for tintinnid) as a barcoding marker for cili- DNA barcoding. Metabarcoding approaches on environmen-
ate species identification and potentiality of this marker fur- tal and faecal samples have revealed population structure in
ther assessed by Stoeck et al., Zhao et al., Forster et al. [37, a variety of species [55]. The main problem associated with
42, 51, 52]. D1–D2 region of LSU rRNA gene has several standard barcodes is length, i.e., longer than 500 bp used in
advantages over other frequently used markers such as show- the traditional approach for achieving high discriminatory
ing a clear barcoding gap, rapid evolutionary rate enough to power at the species level. Unfortunately, metabarcoding
provide higher diversity resolution than SSU and higher uni- assess the diversity up to family, order, or higher taxonomic
versality and constant threshold value than COI [51]. LSU level from environmental DNA sample [56, 57]. One of the
has less intra-clonal and intraindividual variability [45]. One most challenging aspects of metabarcoding on which their
study suggested that the D2 region is a suitable marker for accuracy depends is to find new and acceptable primer pair
discriminating all Frontonia morphospecies since it shows a and their corresponding markers. An ideal metabarcoding
clear barcoding gap with a threshold of 4.5%, while the D1 marker should have a short length (e.g., 100 bp) for easy
region alone is not ideal for determining because it shows sequencing, good conserved flanking primer binding sites
the overlap between intraspecific and interspecific genetic to minimise taxonomic bias during PCR amplification,
divergence [37]. So far, D1–D2 region of LSU together have and a sufficiently variable intervening sequence for species
been used as a marker for diatoms, dinoflagellates, tintinnid identification [58]. V4 region is the primary choice meta-
ciliates, Paramecium and Frontonia species [37, 42, 51, 52]. barcode for assessing the richness and phylogenetic rela-
All the above-discussed features such as higher universality, tionship of eukaryotic microorganisms, while COI is widely
conserved primers for its amplification in ciliates and con- used for animals [34]. Primers with fewer template–primer
stant threshold value as well as the presence of high quality mismatches are better for quantitative DNA metabarcod-
manually curated databases (i.e., SILVA), makes hypervari- ing, especially for species of higher relative abundance in
able D1–D2 region of LSU rRNA gene promising DNA bar- a sample. Barcode of life DATABase (BOLD) system has
codes for ciliates species delineation [37]. a primer database (http://​bolds​ystem.​org/​index.​php/​Public_​
Primer_​Prime​rSear​ch) that store all the published primers.
Researchers can either determine the primers of their interest
Histone H4 by searching in primer database or design their primer by
using software like Primer3, QPRIMER, UniPrime, Prima-
The histone H4 is known to be a highly conserved protein clade, Amplicon program, Primer Hunter, Greene SCPrimer
among all eukaryotes with the exception of the high degree andecoPrimer etc.
of variation observed in the ciliate species [45]. The his- Next-generation sequencing (NGS) is a cost and time-
tone protein is responsible for the organization of eukaryotic saving high throughput platform and generate millions of
chromatin. The ciliate histone H4 encoded by the macronu- reads in a single run for only one environmental sample.
clear gene. Due to considerable difference within ciliates, Braukmann et al. compare the performance of three Next-
histone H4 is considered an excellent molecular marker to generation platforms, namely Illumina MiSeq, Ion Tor-
study phylogenetic relationships and can be used as DNA rentS5, and Ion Torrent PGM, and showed that they per-
Barcoding [53]. form equally well for species recovery, although MiSeq is
often recommended because of its low error rate and well-
established bioinformatics methods [59]. Illumina NovaSeq
Advancement in DNA barcoding is the recent advancement in sequencing technology with the
same sequencing depth as MiSeq but assesses more meta-
By using DNA metabarcoding and microarray, it is very zoan diversity. One of the known limitations of NGS for
feasible to develop a powerful taxonomic identification metabarcoding is the generation of short read length, i.e.,
tool. The development of metabarcoding was compelled 400 bp [7]. The development of Illumina MiSeq overcome
by the growth of next-generation sequencing technologies this limitation of short read length by generating longer
capable of producing millions of sequences at a compara- sequence reads (600–800 bp) that provide better taxonomic
bly low price. The Metabarcoding approach uses the same resolution and phylogenetic inference [7, 54]. Metabarcod-
general principle as the traditional DNA barcoding, but ing data has significantly improved the estimates of micro-
this approach focuses on assessing the community’s whole bial communities and offered precise information about the

13
Molecular Biology Reports (2023) 50:761–775 769

structure and spatiotemporal turnover of microbial popu- 68]. Nanopore sequencing has also proven to be a very ver-
lations, particularly in the ocean. According to some esti- satile technology, e.g., allowing for whole genome sequenc-
mates, there are 50,000 to 100,000 protist OTUs (operational ing and assembly of fungal and human genomes, as well as
taxonomic unit) in the world’s oceans, which is five to ten sequencing full-length RNA transcripts using both direct
times the number of bacteria and archaea combined. These RNA and cDNA sequencing [69].
OTUs have different distribution patterns, with varied ocean PacBio sequencing, which is a single molecule real time
regions have various ecosystems in terms of taxonomic com- sequencing, is an alternative DNA barcoding approach for
position and relative abundances. The metabarcoding data large sample sizes: its workflow simplifies and reduces post-
also used to relate microbial community distribution patterns sequencing manipulation, generating longer read length and
with assembly mechanisms [54]. faster running time that provide better taxonomic resolution
Microarray or biochip, or gene chip are other high- [70]. Due to longer reads of PacBio sequencing, one can
throughput platforms for identifying species. The ability to sequence through longer repetitive sequences and detect
identify thousands of targets in a single hybridization experi- mutations, many of which are linked to disease. Further-
ment makes microarray one of the most potent molecular more, because of its potential to sequence full-length tran-
tools [60]. A microarray made up of a DNA barcode that scripts, it is beneficial for identifying gene isoforms and
may be used to design probe sequences in microarray analy- allows reliable discoveries of novel genes and novel iso-
sis. A DNA microarray containing a species-specific oli- forms of annotated genes. Furthermore, PacBio’s sequencing
gonucleotide probe is a viable alternative to the traditional technique can be used to detect base modification such as
Sanger sequencing for identifying species in food sample. methylation [71]. PacBio sequencing also has some draw-
Several commercial DNA chips are available to identify backs including costly, high error rate, and low throughput
animal species in food samples (e.g. CarnoCheck DNA- [71, 72]. The High sequencing error rate can be reduced
Chip, Greiner Bio-One, Austria; LCD Array Kit MEAT by re-sequencing of circular molecules several times. So
5.0, Chipron, Germany) [61]. Fish species are identified in far PacBio sequencing has been used successfully in meta-
both culinary and forensic samples using 16 S rRNA gene, barcoding analysis of arthropods and fungi [72]. Several
Cytochrome b, and COI derived probes [61–63]. Shortly, the researchers suggested that to use PacBio sequencing along
microarray-based identification approach will play a more with SGS since both of them are highly complementary in
prominent role in molecular species identification [56]. term of their advantage [70, 71].
Third generation sequencing such as Oxford Nanopore MALDI–TOF MS (Matrix-assisted laser desorption/
Technologies (ONT)’s MinION™ and PacBio sequencing ionization time of flight mass spectrometry) is being more
is an another sequencing advancement that makes DNA bar- commonly employed as a novel tool for barcoding, however
coding more feasible [64]. MinION nanopore sequencing this method should be based on accurate species identifica-
overcome the limitation associated with the Sanger sequenc- tion both morphologically and genetically. This approach
ing and NGS. Sanger sequencing is costly and requires well is extensively used to identify arthropods [73]. Other than
equipped molecular laboratory and ABI sequencer. On the arthropods, MALDI TOF MS has been successfully used in
other hand, next generation sequencing is cost-effective identification of bacteria and archaea [74].
only when large numbers of specimens are barcoded simul- DNA barcoding in combination with nanotechnology is
taneously, generate sequence reads with high accuracy, another novel approach that has been shown to be highly
also requires expensive equipment in laboratory and has sensitive, allowing for rapid uniplex and multiplex detection
long sequencing run time [65]. ONT MinION™ nanopore of pathogens in food, blood, and other samples [75]. Nano-
sequencing, introduced in 2014, is authentic, quick, third based detection methods increase the sensitivity level up to
generation sequencing, cost-effective, generate long reads, ten times as compare to PCR and other detection methods
enables real time analysis and do not require well-equipped such as radio-immunoassay, microarrays, enzyme-linked
molecular laboratory [64]. Various studies proposed that immunosorbent assay (ELISA) etc. Gold nanoparticles and
complete genome sequence of microbes can be obtained magnetic nanoparticles based “fluorescent bio-barcode DNA
by using multiplexed reads from a single MinION™ run assay” has been used to probe the Salmonella enteritidis
in combination with matched Illumina short reads such as genes [76]. Another bacterial gene Exotoxin A has been
Staphylococcus aureus, Klebsiella pneumoniae, and multid- detected by using magnetic and gold nanoparticles-based
rug resistance encoding plasmid [64, 66]. With the introduc- fluorescence bio-barcode DNA assay [77]. Recently, Ding
tion of MinION nanopore sequencing several full plasmid et al. (2021) identified the DNA marker in liquors, condi-
sequences can now be obtained in a single MinION run ments and milk by using gold nanoparticles [78]. Valen-
using a quick barcoding methodology. MinION™ has also tini et al. (2017) introduced a new approach, NanoTracer
been successfully used in bacterial and plant identification, that streamlines all the analytical steps involved with tradi-
microbiome characterisation, and DNA fingerprinting [67, tional DNA barcoding and enabling it sequencing-free and

13
770 Molecular Biology Reports (2023) 50:761–775

accessible outside the specialized laboratories. NanoTracer data are used, even if the species is represented by a single
enables quick naked eye molecular validation of any food specimen [85].
with simple and inexpensive processing and limited instru- All of the advancements discussed above, particularly
mentation [79]. Species-specific lateral flow dipstick (LFD) HTS sequencing, whole genome sequencing, and metagen-
assays developed by Taboada et al. (2017) for identifying omics, have been viewed as a threat to DNA barcoding. HTS
Atlantic cod, Pacific cod, Alaska pollock and ling in food sequencing, whole genome sequencing, and metagenomics
products, using gold nanoparticles to enable visual identifi- produce massive amounts of genomic data. The genomic
cation with high sensitivity even for processed samples [80]. data analysis takes more time, requires more bioinformatic
expertise compared to standardized DNA barcodes, requires
more energy for data computation and storage, and is dif-
ficult to control quality when shared [86]. Therefore, DNA
Alternatives to DNA barcoding barcoding remains the preferred method for species iden-
tification and biomonitoring, while genomics is useful for
Dip-stick approach is a recent innovation in which lateral understanding genome complexity, diversity, and function.
flow assay combined with species specific primer to detect Rather than being a threat, barcoding and genomics have
wide variety of species from environmental samples [55]. clear mutual benefits, with DNA barcoding establishing a
Non-targeted NGS is an alternative to DNA barcoding platform for well-identified samples in genome sequencing
for species identification, phylogenetics, and phylogeog- projects and genomic studies contributing insights that may
raphy. Non-targeted NGS methods, such as whole genome identify new barcode regions in groups where the standard
sequencing, metagenomics and mitogenomics, do not rely regions are suboptimal [55].
on amplification. Therefore, problems like primer biases and
non-standard amplification have no effect on these methods
[81]. Reference library construction
Mitogenomics is a variant of metagenomics, shotgun
sequencing approach that uses mitochondrial genomes as Currently, DNA Barcoding (Metabarcoding) is the most
references rather than nuclear genomes. Mitogenomes are effective approach for identifying species, and its accuracy is
easily amenable to genome skimming, in which a high copy relied on the resolution of DNA barcodes and the reference
region of the genome is assembled into longer contigs from library. BOLD is the largest reference library or database and
low coverage shotgun sequencing of a specimen mixture its growth has been exponential over the last decades. The
[82]. This method is desirable because of its advantages. International Barcode of Life Consortium (iBOL) launches
Firstly, a mitogenome and its genes are commonly used several projects to expand the DNA barcode reference
molecular markers. Secondly, the mitogenomes structure library or database, including 500K (completed in 2015),
are conserved, whereas sequences can be extremely diverse. BIOSCAN (launched in 2019), and the Earth Biogenome
Thirdly, mitogenomes are small and easy to obtain and can Project [55]. Despite this, very few such libraries have been
be reconstructed directly using bioinformatics methods. developed.
Fourthly, large numbers of mitogenomes are available in Constructing a reference library with extensive species
public databases [83]. Furthermore, this approach is not coverage presents several challenges. The first challenge
affected by problems like Primer biases and non-specific is the high expense of collecting raw data, which can be
amplification. Several studies have shown that mitogenom- accomplished through DNA sequencing. Conventional
ics outperforms metabarcoding in terms of discriminatory sanger sequencing is expensive and of low efficiency [11].
power [83, 84]. However, the utility of mitogenomic is lim- This obstacle must be overcome by acquiring NGS and third
ited as it is quite expensive because each sample requires an generation sequencing platforms such as PacBio and Nanop-
individually prepared library, samples must be sequenced ore. Another challenge is selecting a critical sequencing plat-
more deeply than for metabarcoding, and assembling a form for obtaining high quality results at a low cost, which
mitogenome reference database incurs additional costs for can be accomplished by taking into consideration base qual-
specimen acquisition, sequencing, and assembly [84]. It has ity, data sizes, sequencing depth, and cost efficiency [87].
been found that phylogentics constructed on the basis of There are several NGS platforms but the most appropri-
mitogenomics or nuclear ribosomal RNA repeats are well ate choice for DNA barcoding is Roche-454 [88], which is
resolved and with this, one can distinguish between closely no longer available. In terms of high base quality and low
related species. cost, the Illumina system and Ion Torrent S5 platform are
Bayesian inference under the multispecies coalescent currently the most suitable NGS platform for conventional
model is also an alternative to DNA barcoding. This method DNA barcoding than third generation sequencing platforms
can discriminate species with high power when multi-locus [87]. Several studies have compared the performance of the

13
Molecular Biology Reports (2023) 50:761–775 771

Illumina and Ion torrent platforms, but researchers are still This technique can also help to settle enduring nomen-
unsure which one is better suitable for DNA barcoding. Both clatural debates, leading to the taxonomic revision of
the Illumina system and the Ion Torrent S5 generate massive inadequately defined morphospecies. DNA barcoding
amounts of data, posing new challenges for data analysis approach is also widely used by ecologist due to several
[89, 90]. Several software packages have been developed, reasons. First, the diversity of ecologically essential life
including Vsearch, Usearch, Mothur, Zotu, DADA2, and forms such as ciliates and nematodes are mostly unknown
others. However, these current softwares is not perfect for and the DNA barcoding approach is a better way to assess
creating DNA barcodes, and it was not designed for con- the biodiversity of such life forms [34, 92]. Second, DNA
ventional DNA barcode data analysis. A new data analysis barcode can also detect endangered species from hair and
method called Cotu has been developed for conventional faeces sample left behind by animals [57]. Third, illegal
DNA barcodes, and its performance outperforms other com- trade in animal by-product can be monitored with the help
monly used methods like Zotu and DADA2 [87]. However, of DNA barcoding technique [93]. Fourth, DNA barcoding
more research is needed to confirm and adopt Cotu for data can be advantageous in the field of biosecurity. This is one
analysis. Using an appropriate NGS platform and advanced of the available technique to identify invasive species at a
data analysis methods, a regional or even global DNA bar- very early stage of their life cycle, such as an egg or larval
coding reference library with high species coverage is likely stage [7]. Fifth, the past environment can be reconstructed
to be developed within a few years. by using this technique. Finally, by using the DNA barcod-
The majority of current work on DNA barcoding has ing approach diet of animals can be analysed from faeces
been done in Europe and North America, which could be or stomach content [57]. Within the food industry, DNA
another reason for the limited reference library/database. barcoding reveals mislabelling of processed food that
Financial assistance is also required for the creation of a may lead to health hazards. Recently COI gene is used as
high-quality reference library. Funding for DNA barcoding a DNA barcode to reveal mislabelling of seafood in the
research should encourage the creation and curation of a European market [94]. DNA barcoding can be highly use-
reference library. A large number of national and global col- ful in forensic science [20, 57]. Some species of plants are
laborations will aid in financial support as well as to com- poisonous in nature, such as Datura sp., Brugmansia sp.,
bine local knowledge on species identification with sequenc- and Cannabis sativa, which cause serious health problems
ing capacity [91]. There are several curated natural history to humans and animals when ingested. Rapid identification
museums around the world that house a large number of of the poisonous plant is required for appropriate treat-
vouchered specimens. Obtaining DNA barcoding data from ment, and identification from vomited or excreted sam-
these vouchered specimens should significantly improve ples by visual observation is not feasible because most of
the quality of the reference database [55]. Another possible the plant part can be degraded. So, DNA barcoding will
step would be to incorporate reference barcodes on a regu- be useful for identification from these degraded samples.
lar basis. To improve the reference barcode library, make it Recently rbcL and ITS2 genes are used as a barcoding
mandatory to submit the reference barcode when describing marker for identifying poisonous plant species [20].
a new species. DNA barcoding tool overcomes the limitation of the
classical identification method, but this approach itself has
certain restrictions. One of the most significant drawbacks
Strength and limitation of DNA barcoding of the DNA barcoding method is that there is no universal
primer or universal gene found in all forms of life and has
Apart from taxonomists, the DNA barcoding technique enough sequence divergence to allow for species differen-
can benefit scientists from other fields such as biotech- tiation [56]. Very less number of reference DNA barcode
nology, food industries, forensic science, and animal diet library, and Loss of quantitative information due to primer
[57]. Taxonomist uses a sensu-stricto (refers to the identi- and polymerase biases [84]. DNA barcoding distinguishes
fication of species level using a single standardized DNA species based on intraspecific and interspecific genetic vari-
fragment) approach of DNA barcoding, while other sci- ation, although the ranges of such variation are unclear and
entists use a sensu-lato (refers to the identification of any may differ between taxa [31]. The existence of pseudogenes
taxonomic group using any DNA fragment) approach. The and heteroplasmy reduces the accuracy of DNA barcoding
main application of DNA barcode in taxonomy to acceler- and increases the complexity of database. A pseudogene can
ate the species identification and revealing cryptic species. result in the erroneous division of single species into sev-
DNA barcode data can provide a comprehensive founda- eral species. Pseudogenes can produce heteroplasmy, which
tion for organizing and identifying species-rich groups in causes more than one kind of mtDNA to coexist in the same
the tree of life, serving as a good starting point for tax- individual and limiting species identification by DNA bar-
onomy, biodiversity assessments, and biomonitoring [55]. coding [56].

13
772 Molecular Biology Reports (2023) 50:761–775

Conclusion Consent to publish Not applicable.

Through the rapid development in the last 2 decades, DNA


barcoding has emerged as a highly effective molecular tool
for taxonomic classification. It relies on barcoding gap References
within a short and standardized region of the genome for
1. Feio MJ, Filipe AF, Garcia-Raventós A et al (2020) Advances in
assessing species diversity and phylogenetic relationship. the use of molecular tools in ecological and biodiversity assess-
The DNA barcoding allows more accurate and cost-effective ment of aquatic ecosystems. Avanços no uso de ferramentas
biodiversity characterization and its use in accelerating spe- moleculares na avaliação ecológica e biodiversidade dos ecos-
cies discovery is becoming increasingly important, given the sistemas aquáticos. Limnetica. https://​doi.​org/​10.​23818/​limn.​39.​
27
current threats to biodiversity and elevated rates of extinc- 2. Schweiger AK, Cavender-Bares J, Townsend PA et al (2018) Plant
tion. Several DNA barcodes have been extensively used for spectral diversity integrates functional and phylogenetic compo-
biological species identification, including the mitochondrial nents of biodiversity and predicts ecosystem function. Nat Ecol
COI gene, rbcL, matK, trnH-psbA, 16S rRNA, V4, D1–D2 Evol 2:976–982. https://​doi.​org/​10.​1038/​s41559-​018-​0551-1
3. Rico-Sánchez AE, Sundermann A, López-López E et al (2020)
region, and ITS (nuclear internal transcribed spacer). But Biological diversity in protected areas: not yet known but already
there is no single barcode for all the species and it is very threatened. Glob Ecol Conserv 22:e01006. https://​doi.​org/​10.​
hard to find because of differences in evolutionary rates. 1016/j.​gecco.​2020.​e01006
Over the years, the DNA Barcoding approach has become 4. Chantangsi C, Lynn DH, Brandl MT et al (2007) Barcoding cili-
ates: a comprehensive study of 75 isolates of the genus Tetrahy-
more accurate, sensitive and faster due to several advance- mena. Int J Syst Evol Microbiol 57:2412–2423. https://d​ oi.o​ rg/1​ 0.​
ment such as next generation sequencing, third generation 1099/​ijs.0.​64865-0
sequencing, and Nanotracer. A large-scale DNA barcoding 5. Park M-H, Jung J-H, Jo E et al (2019) Utility of mitochondrial
research using an appropriate NGS platform and advanced CO1 sequences for species discrimination of Spirotrichea ciliates
(Protozoa, Ciliophora). Mitochondrial DNA Part A 30:148–155.
data analysis methods will surely help to create a reference https://​doi.​org/​10.​1080/​24701​394.​2018.​14645​63
DNA barcode library of all organisms in order to avoid 6. Xu J (2017) Fungal DNA barcoding. In: The 6th international
misidentification and definitely simplify the interpretation barcode of life conference 01:913–932. https://​doi.​org/​10.​1139/​
of sequencing results. Also, barcoding by mitogenome and gen-​2016-​0046@​gen-​iblf.​issue​01
7. Comtet T, Sandionigi A, Viard F, Casiraghi M (2015) DNA (meta)
snRNAs can upgrade current barcoding strategies. barcoding of biological invasions: a powerful tool to elucidate
invasion processes and help managing aliens. Biol Invasions
Acknowledgements The authors appreciate the facilities provided by 17:905–922. https://​doi.​org/​10.​1007/​s10530-​015-​0854-y
the Principal, Acharya Narendra Dev College, University of Delhi for 8. Hebert PDN, Cywinska A, Ball SL, deWaard JR (2003) Biologi-
carrying out the present study. The authors also thankfully acknowl- cal identifications through DNA barcodes. Proc R Soc Lond B
edge the financial support provided by CSIR (Council of Scientific and 270:313–321. https://​doi.​org/​10.​1098/​rspb.​2002.​2218
Industrial Research) to Sandeep Antil, Jeeva Susan Abraham, Swati 9. Yao H, Song J, Liu C et al (2010) Use of ITS2 region as the uni-
Maurya, and UGC (University Grants Commission) to Sripoorna versal DNA barcode for plants and animals. PLoS ONE 5:e13102.
Somasundaram and DST-SERB (Department of Science and Technol- https://​doi.​org/​10.​1371/​journ​al.​pone.​00131​02
ogy-Science and Engineering Research Board) to Jyoti Dagar. 10. Stoeck T, Behnke A, Christen R et al (2009) Massively paral-
lel tag sequencing reveals the complexity of anaerobic marine
Authors contribution All authors contributed to designing the study. protistan communities. BMC Biol 7:72. https://​doi.​org/​10.​1186/​
The first draft of manuscript was written by Sandeep Antil and all the 1741-​7007-7-​72
authors commented on previous version of the manuscript. All authors 11. Sire L, Gey D, Debruyne R et al (2019) The challenge of DNA
read and approved the final manuscript. barcoding saproxylic beetles in natural history collections—
exploring the potential of parallel multiplex sequencing with
Funding This study sponsored by CSIR (Council of Scientific and illumina MiSeq. Front Ecol Evol. https://​doi.​org/​10.​3389/​fevo.​
Industrial Research), UGC (University Grants Commission), DST- 2019.​00495
SERB (Department of Science and Technology-Science and Engineer- 12. Pawlowski J, Audic S, Adl S et al (2012) CBOL Protist Working
ing Research Board) and DBT (Department of Biotechnology) STAR Group: barcoding eukaryotic richness beyond the animal, plant,
College Scheme. and fungal kingdoms. PLoS Biol 10:e1001419. https://d​ oi.o​ rg/1​ 0.​
1371/​journ​al.​pbio.​10014​19
Data availability Not applicable. 13. Ratnasingham S, Hebert PDN (2007) bold: the barcode of life
data system. Mol Ecol Notes 7:355–364. https://d​ oi.o​ rg/1​ 0.1​ 111/j.​
Declarations 1471-​8286.​2007.​01678.x
14. Mueller RL (2006) Evolutionary rates, divergence dates, and the
Conflict of interest The authors declare that there are no conflicts of performance of mitochondrial genes in Bayesian phylogenetic
interest. analysis. Syst Biol 55:289–300. https://​doi.​org/​10.​1080/​10635​
15050​05416​72
Ethical approval This article does not contain any studies conducted 15. Hebert PDN, Ratnasingham S, de Waard JR (2003) Barcoding
on humans and or vertebrate animals. animal life: cytochrome c oxidase subunit 1 divergences among
closely related species. Proc R Soc Lond B 270:S96–S99. https://​
Consent to participate Not applicable. doi.​org/​10.​1098/​rsbl.​2003.​0025

13
Molecular Biology Reports (2023) 50:761–775 773

16. Hussain K, Rashid K, Hafeez F et al (2020) Molecular identifica- 33. Vancuren SJ, Santos SJD, Hill JE, Team the MMLP (2020) Evalu-
tion of sugarcane black bug (Cavelarius excavates) from Paki- ation of variant calling for cpn60 barcode sequence-based micro-
stan using cytochrome C oxidase I (COI) gene as DNA barcode. biome profiling. PLoS ONE 15:e0235682. https://d​ oi.o​ rg/1​ 0.1​ 371/​
Int J Trop Insect Sci 40:1119–1124. https://​doi.​org/​10.​1007/​ journ​al.​pone.​02356​82
s42690-​020-​00144-5 34. Zhan Z, Li J, Xu K (2019) Ciliate Environmental diversity can be
17. Jackson A, Nijman V (2020) DNA barcoding of primates and the underestimated by the V4 Region of SSU rDNA: insights from
selection of molecular markers using African Great Apes as a species delimitation and multilocus phylogeny of pseudokeronop-
model. J Anthropol Sci. https://​doi.​org/​10.​4436/​JASS.​98017 sis (Protist, Ciliophora). Microorganisms 7:493. https://d​ oi.o​ rg/1​ 0.​
18. Group CPW, Hollingsworth PM, Forrest LL et al (2009) A DNA 3390/​micro​organ​isms7​110493
barcode for land plants. PNAS 106:12794–12797. https://​doi.​org/​ 35. Lin S, Hu Z, Deng Y et al (2020) An assessment on the intrap-
10.​1073/​pnas.​09058​45106 opulational and intraindividual genetic diversity in LSU rDNA in
19. Soledispa P, Santos-Ordóñez E, Miranda M et al (2021) Molecular the harmful algal blooms-forming dinoflagellate Margalefidinium
barcode and morphological analysis of Smilax purhampuy Ruiz. (= Cochlodinium) fulvescens based on clonal cultures and bloom
Ecuador PeerJ 9:e11028. https://​doi.​org/​10.​7717/​peerj.​11028 samples from Jiaozhou Bay. China Harmful Algae 96:101821.
20. Ahmed S, Ibrahim M, Nantasenamat C et al (2022) Pragmatic https://​doi.​org/​10.​1016/j.​hal.​2020.​101821
applications and universality of DNA barcoding for substantial 36. Zhang T, Fan X, Gao F et al (2019) Further analyses on the phy-
organisms at species level: a review to explore a way forward. logeny of the subclass Scuticociliatia (Protozoa, Ciliophora) based
Biomed Res Int 2022:e1846485. https://​doi.​org/​10.​1155/​2022/​ on both nuclear and mitochondrial data. Mol Phylogenet Evol
18464​85 139:106565. https://​doi.​org/​10.​1016/j.​ympev.​2019.​106565
21. Bradshaw M, Grewe F, Thomas A et al (2020) Characterizing 37. Zhao Y, Yi Z, Gentekaki E et al (2016) Utility of combining
the ribosomal tandem repeat and its utility as a DNA barcode morphological characters, nuclear and mitochondrial genes: An
in lichen-forming fungi. BMC Evol Biol 20:2. https://​doi.​org/​10.​ attempt to resolve the conflicts of species identification for cili-
1186/​s12862-​019-​1571-4 ated protists. Mol Phylogenet Evol 94:718–729. https://​doi.​org/​
22. Stielow JB, Lévesque CA, Seifert KA et al (2015) One fungus, 10.​1016/j.​ympev.​2015.​10.​017
which genes? Development and assessment of universal primers 38. Stern RF, Andersen RA, Jameson I et al (2012) Evaluating the
for potential secondary fungal DNA barcodes. Persoonia 35:242– ribosomal internal transcribed spacer (ITS) as a candidate dino-
263. https://​doi.​org/​10.​3767/​00315​8515X​689135 flagellate barcode marker. PLoS ONE 7:e42780. https://​doi.​org/​
23. Aoki W, Watanabe M, Watanabe M et al (2020) Discrimination 10.​1371/​journ​al.​pone.​00427​80
between edible and poisonous mushrooms among Japanese Ento- 39. Choi J, Park JS (2020) Comparative analyses of the V4 and V9
loma sarcopum and related species based on phylogenetic analy- regions of 18S rDNA for the extant eukaryotic community using
sis and insertion/deletion patterns of nucleotide sequences of the the Illumina platform. Sci Rep 10:6519. https://​doi.​org/​10.​1038/​
cytochrome oxidase 1 gene. Genes Genet Syst. https://​doi.​org/​10.​ s41598-​020-​63561-z
1266/​ggs.​19-​00032 40. Mordret S, Piredda R, Vaulot D et al (2018) dinoref: a curated
24. Schoch CL, Seifert KA, Huhndorf S et al (2012) Nuclear riboso- dinoflagellate (Dinophyceae) reference database for the 18S rRNA
mal internal transcribed spacer (ITS) region as a universal DNA gene. Mol Ecol Resour 18:974–987. https://d​ oi.o​ rg/1​ 0.1​ 111/1​ 755-​
barcode marker for Fungi. PNAS 109:6241–6246. https://​doi.​org/​ 0998.​12781
10.​1073/​pnas.​11170​18109 41. Zimmermann J, Jahn R, Gemeinholzer B (2011) Barcoding dia-
25. Adam PS, Borrel G, Brochier-Armanet C, Gribaldo S (2017) The toms: evaluation of the V4 subregion on the 18S rRNA gene,
growing tree of Archaea: new perspectives on their diversity, including new primers and protocols. Org Divers Evol 11:173.
evolution and ecology. ISME J 11:2407–2425. https://​doi.​org/​10.​ https://​doi.​org/​10.​1007/​s13127-​011-​0050-6
1038/​ismej.​2017.​122 42. Forster D, Filker S, Kochems R et al (2019) A comparison of
26. Chaban B, Hill JE (2012) A ‘universal’ type II chaperonin PCR different ciliate metabarcode genes as bioindicators for environ-
detection system for the investigation of Archaea in complex mental impact assessments of salmon aquaculture. J Eukaryot
microbial communities. ISME J 6:430–439. https://​doi.​org/​10.​ Microbiol 66:294–308. https://​doi.​org/​10.​1111/​jeu.​12670
1038/​ismej.​2011.​96 43. Hamsher SE, Evans KM, Mann DG et al (2011) Barcoding dia-
27. Zeigler DRY (2003) Gene sequences useful for predicting relat- toms: exploring alternatives to COI-5P. Protist 162:405–422.
edness of whole genomes in bacteria. Int J Syst Evol Microbiol https://​doi.​org/​10.​1016/j.​protis.​2010.​09.​005
53:1893–1900. https://​doi.​org/​10.​1099/​ijs.0.​02713-0 44. Cuypers B, Domagalska MA, Meysman P et al (2017) Multiplexed
28. Liu Y-F, Mbadinga SM, Gu J-D, Mu B-Z (2017) Type II chap- spliced-leader sequencing: a high-throughput, selective method
eronin gene as a complementary barcode for 16S rRNA gene in for RNA-seq in trypanosomatids. Sci Rep 7:3725. https://​doi.​org/​
study of Archaea diversity of petroleum reservoirs. Int Biodeterior 10.​1038/​s41598-​017-​03987-0
Biodegrad 123:113–120. https://​doi.​org/​10.​1016/j.​ibiod.​2017.​04.​ 45. Abraham JS, Sripoorna S, Maurya S et al (2019) Techniques and
015 tools for species identification in ciliates: a review. Int J Syst Evol
29. Bukin YS, Galachyants YP, Morozov IV et al (2019) The effect of Microbiol 69:877–894. https://​doi.​org/​10.​1099/​ijsem.0.​003176
16S rRNA region choice on bacterial community metabarcoding 46. Strüder-Kypke MC, Lynn DH (2010) Comparative analysis of the
results. Sci Data 6:190007. https://​doi.​org/​10.​1038/​sdata.​2019.7 mitochondrial cytochrome c oxidase subunit I (COI) gene in cili-
30. Ogier J-C, Pagès S, Galan M et al (2019) rpoB, a promising ates (Alveolata, Ciliophora) and evaluation of its suitability as a
marker for analyzing the diversity of bacterial communities by biodiversity marker. Syst Biodivers 8:131–148. https://d​ oi.o​ rg/1​ 0.​
amplicon sequencing. BMC Microbiol 19:171. https://​doi.​org/​10.​ 1080/​14772​00090​35077​44
1186/​s12866-​019-​1546-z 47. Wang P, Gao F, Huang J et al (2015) A case study to estimate the
31. Purty RS, Chatterjee S (2016) DNA barcoding: an effective tech- applicability of secondary structures of SSU-rRNA gene in taxon-
nique in molecular taxonomy. Austin J Biotechnol Bioeng 3:1059 omy and phylogenetic analyses of ciliates. Zool Scr 44:574–585.
32. Schneider KL, Marrero G, Alvarez AM, Presting GG (2011) Clas- https://​doi.​org/​10.​1111/​zsc.​12122
sification of plant associated bacteria using RIF, a computationally 48. Shazib SUA, Vďačný P, Kim JH et al (2016) Molecular phy-
derived DNA marker. PLoS ONE 6:e18496. https://​doi.​org/​10.​ logeny and species delimitation within the ciliate genus Spi-
1371/​journ​al.​pone.​00184​96 rostomum (Ciliophora, Postciliodesmatophora, Heterotrichea),

13
774 Molecular Biology Reports (2023) 50:761–775

using the internal transcribed spacer region. Mol Phylogenet MinION barcoding sequencing data. GigaScience. https://d​ oi.o​ rg/​
Evol 102:128–144. https://​doi.​org/​10.​1016/j.​ympev.​2016.​05.​ 10.​1093/​gigas​cience/​gix132
041 67. Parker J, Helmstetter AJ, Devey D et al (2017) Field-based
49. Sun P, Clamp JC, Xu D et al (2013) An ITS-based phylogenetic species identification of closely-related plants using real-time
framework for the genus Vorticella: finding the molecular and nanopore sequencing. Sci Rep 7:8345. https://​doi.​org/​10.​1038/​
morphological gaps in a taxonomically difficult group. Proc R s41598-​017-​08461-5
Soc B 280:20131177. https://​doi.​org/​10.​1098/​rspb.​2013.​1177 68. Srivathsan A, Baloğlu B, Wang W et al (2018) A MinION™-
50. Wylezich C, Nies G, Mylnikov AP et al (2010) An evaluation of based pipeline for fast and cost-effective DNA barcoding. Mol
the use of the LSU rRNA D1–D5 domain for DNA-based tax- Ecol Resour. https://​doi.​org/​10.​1111/​1755-​0998.​12890
onomy of eukaryotic protists. Protist 161:342–352. https://​doi.​ 69. Burton AS, Stahl SE, John KK et al (2020) Off earth identification
org/​10.​1016/j.​protis.​2010.​01.​003 of bacterial populations using 16S rDNA nanopore sequencing.
51. Santoferrara LF, McManus GB, Alder VA (2013) Utility of Genes 11:76. https://​doi.​org/​10.​3390/​genes​11010​076
genetic markers and morphology for species discrimination within 70. Zhang H, Xu H, Liu H et al (2020) PacBio single molecule long-
the order Tintinnida (Ciliophora, Spirotrichea). Protist 164:24–36. read sequencing provides insight into the complexity and diversity
https://​doi.​org/​10.​1016/j.​protis.​2011.​12.​002 of the Pinctada fucata martensii transcriptome. BMC Genomics
52. Stoeck T, Przybos E, Dunthorn M (2014) The D1–D2 region of 21:481. https://​doi.​org/​10.​1186/​s12864-​020-​06894-3
the large subunit ribosomal DNA as barcode for ciliates. Mol Ecol 71. Rhoads A, Au KF (2015) PacBio sequencing and its applications.
Resour 14:458–468. https://​doi.​org/​10.​1111/​1755-​0998.​12195 Genomics Proteomics Bioinformatics 13:278–289. https://d​ oi.o​ rg/​
53. Tasneem F, Shakoori FR (2017) Phylogenetic relationship of 10.​1016/j.​gpb.​2015.​08.​002
locally isolated paramecium species inferred from histone H4 72. Tedersoo L, Tooming-Klunderud A, Anslan S (2018) PacBio
genes. Pak J Zool. https://d​ oi.o​ rg/1​ 0.1​ 7582/j​ ourna​ l.p​ jz/2​ 017.4​ 9.5.​ metabarcoding of Fungi and other eukaryotes: errors, biases and
1767.​1774 perspectives. New Phytol 217:1370–1385. https://d​ oi.o​ rg/1​ 0.1​ 111/​
54. Santoferrara L, Burki F, Filker S et al (2020) Perspectives from nph.​14776
ten years of protist studies by high-throughput metabarcoding. 73. Yssouf A, Almeras L, Raoult D, Parola P (2016) Emerging tools
J Eukaryot Microbiol 67:612–622. https://​doi.​org/​10.​1111/​jeu.​ for identification of arthropod vectors. Fut Microbiol 11:549–566.
12813 https://​doi.​org/​10.​2217/​fmb.​16.5
55. Grant DM, Brodnicke OB, Evankow AM et al (2021) The future 74. Dridi B, Raoult D, Drancourt M (2012) Matrix-assisted laser des-
of DNA barcoding: reflections from early career researchers. orption/ionization time-of-flight mass spectrometry identification
Diversity 13:313. https://​doi.​org/​10.​3390/​d1307​0313 of Archaea: towards the universal identification of living organ-
56. Gong S, Ding Y, Wang Y et al (2018) Advances in DNA barcod- isms. APMIS 120:85–91. https://​doi.​org/​10.​1111/j.​1600-​0463.​
ing of toxic marine organisms. Int J Mol Sci 19:2931. https://​doi.​ 2011.​02833.x
org/​10.​3390/​ijms1​91029​31 75. Munir S, Ahmed S, Ibrahim M et al (2020) A spellbinding inter-
57. Valentini A, Pompanon F, Taberlet P (2009) DNA barcoding play between biological barcoding and nanotechnology. Front
for ecologists. Trends Ecol Evol 24:110–117. https://​doi.​org/​10.​ Bioeng Biotechnol 8:883. https://​doi.​org/​10.​3389/​fbioe.​2020.​
1016/j.​tree.​2008.​09.​011 00883
58. Liu M, Clarke LJ, Baker SC et al (2020) A practical guide to 76. Yin H, Jia M, Yang S et al (2012) A nanoparticle-based bio-
DNA metabarcoding for entomological ecologists. Ecol Entomol barcode assay for ultrasensitive detection of ricin toxin. Toxicon
45:373–385. https://​doi.​org/​10.​1111/​een.​12831 59:12–16. https://​doi.​org/​10.​1016/j.​toxic​on.​2011.​10.​003
59. Braukmann TWA, Ivanova NV, Prosser SWJ et al (2019) Meta- 77. Amini B, Kamali M, Salouti M, Yaghmaei P (2017) Fluorescence
barcoding a diverse arthropod mock community. Mol Ecol Resour bio-barcode DNA assay based on gold and magnetic nanoparticles
19:711–727. https://​doi.​org/​10.​1111/​1755-​0998.​13008 for detection of Exotoxin A gene sequence. Biosens Bioelectron
60. Medlin LK, Orozco J (2017) Molecular techniques for the detec- 92:679–686. https://​doi.​org/​10.​1016/j.​bios.​2016.​10.​030
tion of organisms in aquatic environments, with emphasis on 78. Ding S, Wang L, He Z et al (2021) Identifying exogenous DNA
harmful algal bloom species. Sensors 17:1184. https://​doi.​org/​ in liquid foods by gold nanoparticles: potential applications in
10.​3390/​s1705​1184 traceability. ACS Food Sci Technol 1:605–613. https://​doi.​org/​
61. Kappel K, Eschbach E, Fischer M, Fritsche J (2020) Design of a 10.​1021/​acsfo​odsci​tech.​1c000​48
user-friendly and rapid DNA microarray assay for the authentica- 79. Valentini P, Galimberti A, Mezzasalma V et al (2017) DNA
tion of ten important food fish species. Food Chem 311:125884. barcoding meets nanotechnology: development of a universal
https://​doi.​org/​10.​1016/j.​foodc​hem.​2019.​125884 colorimetric test for food authentication. Angew Chem Int Ed
62. Kochzius M, Seidel C, Antoniou A et al (2010) Identifying fishes 56:8094–8098. https://​doi.​org/​10.​1002/​anie.​20170​2120
through DNA barcodes and microarrays. PLoS ONE 5:e12620. 80. Taboada L, Sánchez A, Pérez-Martín RI, Sotelo CG (2017) A new
https://​doi.​org/​10.​1371/​journ​al.​pone.​00126​20 method for the rapid detection of Atlantic cod (Gadus morhua),
63. Teletchea F, Bernillon J, Duffraisse M et al (2008) Molecular Pacific cod (Gadus macrocephalus), Alaska pollock (Gadus chal-
identification of vertebrate species by oligonucleotide microarray cogrammus) and ling (Molva molva) using a lateral flow dipstick
in food and forensic samples. J Appl Ecol 45:967–975. https://d​ oi.​ assay. Food Chem 233:182–189. https://​doi.​org/​10.​1016/j.​foodc​
org/​10.​1111/j.​1365-​2664.​2007.​01415.x hem.​2017.​04.​087
64. Liao Y-C, Cheng H-W, Wu H-C et al (2019) Completing circular 81. Schlenker C, Brooks J, Oostra K, McLaughlin R (2017) whole
bacterial genomes with assembly complexity by using a sampling sample next-generation DNA sequencing method: an alternative
strategy from a single MinION run with barcoding. Front Micro- to DNA barcoding. In: FoodSafetyTech. https://​foods​afety​tech.​
biol 10:2068. https://​doi.​org/​10.​3389/​fmicb.​2019.​02068 com/​featu​re_​artic​le/​whole-​sample-​next-​gener​ation-​dna-​seque​
65. Wang WY, Srivathsan A, Foo M et al (2018) Sorting specimen- ncing-​method-​alter​native-​dna-​barco​ding/. Accessed 19 Jul 2022
rich invertebrate samples with cost-effective NGS barcodes: 82. Crampton-Platt A, Yu DW, Zhou X, Vogler AP (2016) Mitochon-
validating a reverse workflow for specimen processing. Mol Ecol drial metagenomics: letting the genes out of the bottle. GigaS-
Resour 18:490–501. https://​doi.​org/​10.​1111/​1755-​0998.​12751 cience. https://​doi.​org/​10.​1186/​s13742-​016-​0120-y
66. Li R, Xie M, Dong N et al (2018) Efficient generation of com- 83. Jiang M, Xu S-F, Tang T-S et al (2022) Development and evalu-
plete sequences of MDR-encoding plasmids by rapid assembly of ation of a meat mitochondrial metagenomic (3MG) method for

13
Molecular Biology Reports (2023) 50:761–775 775

composition determination of meat from fifteen mammalian and 93. Hou F, Wen L, Peng C, Guo J (2018) Identification of marine
avian species. BMC Genomics 23:36. https://​doi.​org/​10.​1186/​ traditional Chinese medicine dried seahorses in the traditional
s12864-​021-​08263-0 Chinese medicine market using DNA barcoding. Mitochondrial
84. Ji Y, Huotari T, Roslin T et al (2020) SPIKEPIPE: a metagen- DNA Part A 29:107–112. https://d​ oi.o​ rg/1​ 0.1​ 080/2​ 47013​ 94.2​ 016.​
omic pipeline for the accurate quantification of eukaryotic spe- 12484​30
cies occurrences and intraspecific abundance change using DNA 94. Minoudi S, Karaiskou N, Avgeris M et al (2020) Seafood mis-
barcodes or mitogenomes. Mol Ecol Resour 20:256–267. https://​ labeling in Greek market using DNA barcoding. Food Control
doi.​org/​10.​1111/​1755-​0998.​13057 113:107213. https://​doi.​org/​10.​1016/j.​foodc​ont.​2020.​107213
85. Yang Z, Rannala B (2016) Species identification by bayesian fin- 95. Burrell AS, Jolly CJ, Tosi AJ, Disotell TR (2009) Mitochondrial
gerprinting: a powerful alternative to DNA barcoding. 041608 evidence for the hybrid origin of the kipunji, Rungwecebus kipunji
86. Obringer R, Rachunok B, Maia-Silva D et al (2021) The over- (Primates: Papionini). Mol Phylogenet Evol 51:340–348. https://​
looked environmental footprint of increasing Internet use. Resour doi.​org/​10.​1016/j.​ympev.​2009.​02.​004
Conserv Recycl 167:105389. https://​doi.​org/​10.​1016/j.​resco​nrec.​ 96. Osathanunkul M, Suwannapoom C, Osathanunkul K et al (2016)
2020.​105389 Evaluation of DNA barcoding coupled high resolution melting for
87. Liu Y, Xu C, Sun Y et al (2021) Method for quick DNA barcode discrimination of closely related species in phytopharmaceuticals.
reference library construction. Ecol Evol 11:11627–11638. https://​ Phytomedicine 23:156–165. https://​doi.​org/​10.​1016/j.​phymed.​
doi.​org/​10.​1002/​ece3.​7788 2015.​11.​018
88. Guo J, Cheng T, Xu H et al (2019) An efficient and cost-effec- 97. Carbone I, Kohn LM (1999) A method for designing primer sets
tive method for primer-induced nucleotide labeling for massive for speciation studies in filamentous ascomycetes. Mycologia
sequencing on next-generation sequencing platforms. Sci Rep 91:553–556. https://​doi.​org/​10.​1080/​00275​514.​1999.​12061​051
9:3125. https://​doi.​org/​10.​1038/​s41598-​019-​38996-8 98. Rehner SA, Buckley E (2005) A Beauveria phylogeny inferred
89. Marine RL, Magaña LC, Castro CJ et al (2020) Comparison of from nuclear ITS and EF1-α sequences: evidence for cryptic
Illumina MiSeq and the Ion Torrent PGM and S5 platforms for diversification and links to Cordyceps teleomorphs. Mycologia
whole-genome sequencing of picornaviruses and caliciviruses. 97:84–98. https://​doi.​org/​10.​1080/​15572​536.​2006.​11832​842
J Virol Methods 280:113865. https://​doi.​org/​10.​1016/j.​jviro​met.​ 99. Siddiqui ZH, Abbas ZK, Hakeem KR et al (2020) A molecu-
2020.​113865 lar assessment of red algae with reference to the utility of DNA
90. Speranskaya AS, Khafizov K, Ayginin AA et al (2018) Com- barcoding. In: Trivedi S, Rehman H, Saggu S et al (eds) DNA
parative analysis of Illumina and Ion Torrent high-throughput barcoding and molecular phylogeny. Springer, Cham, pp 103–118
sequencing platforms for identification of plant components in
herbal teas. Food Control 93:315–324. https://​doi.​org/​10.​1016/j.​ Publisher’s Note Springer Nature remains neutral with regard to
foodc​ont.​2018.​04.​040 jurisdictional claims in published maps and institutional affiliations.
91. Janzen DH, Hallwachs W, Pereira G et al (2020) Using DNA-
barcoded Malaise trap samples to measure impact of a geother- Springer Nature or its licensor (e.g. a society or other partner) holds
mal energy project on the biodiversity of a Costa Rican old- exclusive rights to this article under a publishing agreement with the
growth rain forest. Genome 63:407–436. https://​doi.​org/​10.​1139/​ author(s) or other rightsholder(s); author self-archiving of the accepted
gen-​2020-​0002 manuscript version of this article is solely governed by the terms of
92. Martínez-Arce A, De Jesús-Navarrete A, Leasi F (2020) DNA such publishing agreement and applicable law.
barcoding for delimitation of putative mexican marine nematodes
species. Diversity 12:107. https://​doi.​org/​10.​3390/​d1203​0107

13

You might also like