<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">Phyton</journal-id>
<journal-id journal-id-type="nlm-ta">Phyton</journal-id>
<journal-id journal-id-type="publisher-id">Phyton</journal-id>
<journal-title-group>
<journal-title>Phyton-International Journal of Experimental Botany</journal-title>
</journal-title-group>
<issn pub-type="epub">1851-5657</issn>
<issn pub-type="ppub">0031-9457</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">61424</article-id>
<article-id pub-id-type="doi">10.32604/phyton.2025.061424</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Chloroplast Genome Sequence Characterization and Phylogenetic Analysis of <italic>Pyrola Atropurpurea</italic> Franch</article-title>
<alt-title alt-title-type="left-running-head">Chloroplast Genome Sequence Characterization and Phylogenetic Analysis of <italic>Pyrola Atropurpurea</italic> Franch</alt-title>
<alt-title alt-title-type="right-running-head">Chloroplast Genome Sequence Characterization and Phylogenetic Analysis of <italic>Pyrola Atropurpurea</italic> Franch</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western">
<surname>Sheng</surname>
<given-names>Wentao</given-names>
</name>
<email>shengwentao2003@163.com</email>
</contrib>
<aff id="aff1"><institution>Department of Biological Technology, Nanchang Normal University</institution>, <addr-line>Nanchang, 330032</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Wentao Sheng. Email: <email>shengwentao2003@163.com</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2025</year>
</pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>06</day><month>03</month><year>2025</year>
</pub-date>
<volume>94</volume>
<issue>2</issue>
<fpage>331</fpage>
<lpage>345</lpage>
<history>
<date date-type="received">
<day>24</day>
<month>11</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>1</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2025 The Author.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_Phyton_61424.pdf"></self-uri>
<abstract>
<p><italic>Pyrola atropurpurea</italic> Franch is an important annual herbaceous plant. Few genomic analyses have been conducted on this plant, and chloroplast genome research will enrich its genomics basis. This study is based on high-throughput sequencing technology and Bioinformatics methods to obtain the sequence, structure, and other characteristics of the <italic>P. atropurpurea</italic> chloroplast genome. The result showed that the chloroplast genome of <italic>P. atropurpurea</italic> has a double-stranded circular structure with a total length of 172,535 bp and a typical four-segment structure. The genome has annotated a total of 132 functional genes, including 43 tRNAs, 8 rRNAs, 76 protein-coding genes, and 5 pseudo-genes. In total, 358 SSR loci were checked out, mainly composed of mononucleotide and trinucleotide repeat. There are three types of scattered repetitive sequences, totaling 4223, including 2452 forward repeats, 1763 palindrome repeats, and eight reverse repeats. The optimal codon usage frequency is relatively high with AT usage preference in this genome. Chloroplast genome comparative analysis in the family Ericaceae shows that the overall sequence is more complex, and there are more variations in the gene interval region. The collinearity analysis indicated that there is a complex rearrangement of species between different genera in Ericaceae. The selection pressure analysis showed that the protein-encoding genes <italic>rpl33</italic> and <italic>rps16</italic> were positively selected among the seven medicinal plants in Ericaceae. The maximum likelihood tree shows that the genetic relationship among <italic>P. atropurpurea</italic>, <italic>Pyrola rotundifolia</italic>, and <italic>Chimaphila japonica</italic> is relatively close. Therefore, an important data basis was provided for species identification, genetic diversity, and phylogenetic studies of <italic>P. atropurpurea</italic> and even this genus of plants.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd><italic>Pyrola atropurpurea</italic></kwd>
<kwd>chloroplast genome</kwd>
<kwd>scattered repeat sequence</kwd>
<kwd>collinearity analysis</kwd>
<kwd>genetic relationship</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>Education Reform Program of Jiangxi Provincial Department of Education</funding-source>
<award-id>JXJG-22-23-3</award-id>
<award-id>JXJG-23-23-5</award-id>
</award-group>
<award-group id="awg2">
<funding-source>Discipline Construction Project of Nanchang Normal University</funding-source>
<award-id>100/20149</award-id>
</award-group>
<award-group id="awg3">
<funding-source>Jiangxi Province Key Laboratory of Oil Crops Biology</funding-source>
<award-id>YLKFKT202203</award-id>
</award-group>
<award-group id="awg4">
<funding-source>Education Reform Program of Nanchang Normal University</funding-source>
<award-id>NSJG-21-25</award-id>
</award-group>
<award-group id="awg5">
<funding-source>Nanchang Key Laboratory of Comprehensive Research and Development</funding-source>
<award-id>32060078</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Chloroplasts are organelles involved in photosynthesis and provide the necessary energy for plant life activities [<xref ref-type="bibr" rid="ref-1">1</xref>]. Chloroplasts have a relatively independent genetic system, consisting of a circular and structurally stable genome, known as the chloroplast genome [<xref ref-type="bibr" rid="ref-2">2</xref>,<xref ref-type="bibr" rid="ref-3">3</xref>]. In comparative analysis with the nuclear genome, the chloroplast DNA molecules are relatively small and generally between 115 and 165 kb in length [<xref ref-type="bibr" rid="ref-4">4</xref>]. Due to its high degree of conservatism and moderate evolutionary rate, chloroplast genomes have been widely utilized in plant discrimination, phylogenetic analysis, and genetic evolution research [<xref ref-type="bibr" rid="ref-5">5</xref>&#x2013;<xref ref-type="bibr" rid="ref-7">7</xref>]. Chloroplast-based genetic engineering is playing an increasingly important role in germplasm resource protection and variety breeding. Currently, chloroplast genomes of commonly used ethnic Chinese medicinal materials such as <italic>Platycodon grandiflorus</italic> [<xref ref-type="bibr" rid="ref-8">8</xref>], <italic>Rubia cordifolia</italic> [<xref ref-type="bibr" rid="ref-9">9</xref>], and <italic>Cynanchum wallichii</italic> [<xref ref-type="bibr" rid="ref-10">10</xref>] have been reported one after another.</p>
<p>There are about 30 species of the genus <italic>Pyrola</italic> plants in the world, mainly distributed in the northern temperate and northern cold regions of the Earth [<xref ref-type="bibr" rid="ref-11">11</xref>]. This genus of plants has a wide variety in China, with a total of 27 species and 3 varieties, mainly distributed in the southwest and northeast regions. Among them, there are 17 species, and the three variants are unique plants of the genus <italic>Pyrola</italic> in China, including the common <italic>P. decorata</italic>, <italic>P. calliantha</italic>, and <italic>P. atropurpura</italic>. The plants of the genus <italic>Pyrola</italic> are widely used in Chinese folk herbal medicine and also play a very important role in Miao ethnic medicine and Tibetan medicine [<xref ref-type="bibr" rid="ref-12">12</xref>]. <italic>P. atropurpura</italic> is often used to treat muscle and bone pain, moisten the lungs, relieve cough, and nourish the liver and kidneys, and its research at home and abroad mainly focuses on the study of its chemical components [<xref ref-type="bibr" rid="ref-13">13</xref>,<xref ref-type="bibr" rid="ref-14">14</xref>]. Up to now, there are few reports on the chloroplast genome of plants in the genus <italic>Pyrola</italic>, and only 10 single nucleotide sequences of <italic>Pyrola</italic> were acquired from GenBank (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/?term=Pyrola%20atropurpurea">https://www.ncbi.nlm.nih.gov/nuccore/?term=Pyrola%20atropurpurea</ext-link>) (accessed on 13 January 2025), the only chloroplast genome with <italic>Pyrola rotundifolia</italic> (KU833271.1) was registered in NCBI. In this study, we successfully assembled the <italic>P. atropurpura</italic> chloroplast genome, the genome structural characteristics, codon bias, repeat sequences, and phylogenetic information were deeply explored, in order to provide chloroplast genome information for the genetic background, molecular evolution, and phylogeny of <italic>P. atropurpura</italic>, and to promote the protection of <italic>P. atropurpura</italic> germplasm resources and genetic engineering research.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Materials and Methods</title>
<sec id="s2_1">
<label>2.1</label>
<title>Materials</title>
<p>The <italic>P. atropurpura</italic> leaves were collected from the campus of Guizhou University (26&#x00B0;25<sup>&#x2032;</sup>39.62<sup>&#x2033;</sup>N, 106&#x00B0;40<sup>&#x2032;</sup>5.81<sup>&#x2033;</sup>E). Healthy and tender leaves were selected, washed 3&#x2013;5 times with distilled water, dried, and stored in a &#x2212;80&#x00B0;C refrigerator for later use.</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Extraction, Sequencing, and Assembly of Chloroplast Genomic DNA</title>
<p>The leaves were rapidly frozen in liquid nitrogen and ground into powder, then their genomic DNA was extracted using the modified CTAB method [<xref ref-type="bibr" rid="ref-15">15</xref>]. Preparation of 350 bp DNA fragments was done using a Covaris ultrasonic crusher, followed by end repair and tail addition. The construction of the sequencing library was completed and its sequencing was performed on the Illumina HiSeq X Ten platform using a Paired-End (PE) PE150 sequencing strategy. We obtained 6.75 G of raw data with high throughput sequencing, removed joints and low-quality data regions, and accumulated 27,130,560 clean reads. Using NOVOPlasty was to splice chloroplast genomes with default parameters [<xref ref-type="bibr" rid="ref-16">16</xref>].</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Annotation of Chloroplast Genome</title>
<p>The annotation of the <italic>P. atropurpura</italic> chloroplast genome was performed using the Plastid Genome Annotator (<ext-link ext-link-type="uri" xlink:href="https://github.com/quxiaojian/PGA">https://github.com/quxiaojian/PGA</ext-link>) (accessed on 13 January 2025) [<xref ref-type="bibr" rid="ref-17">17</xref>], and manually adjusted; tRNA annotation was made using tRNAscan-SE (<ext-link ext-link-type="uri" xlink:href="https://trna.ucsc.edu/tRNAscan-SE/">https://trna.ucsc.edu/tRNAscan-SE/</ext-link>) (accessed on 13 January 2025) and ARAGORN tools (<ext-link ext-link-type="uri" xlink:href="https://packages.debian.org/bullseye/aragorn">https://packages.debian.org/bullseye/aragorn</ext-link>) (accessed on 13 January 2025); and a circle diagram was generated using Organellar Genome DRAW [<xref ref-type="bibr" rid="ref-18">18</xref>].</p>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>Codon Usage and Repeated Sequence Analysis</title>
<p>The CodonW1.4.2 software [<xref ref-type="bibr" rid="ref-19">19</xref>] was used to statistically analyze the codon preferences of the <italic>P. atropurpura</italic> chloroplast genome. Using the Reputer software [<xref ref-type="bibr" rid="ref-20">20</xref>], scattered repeat sequences were detected, with a minimum 30 bp repeat length, a minimum permutation value of 50, and a maximum base mismatch of 3. MISA software [<xref ref-type="bibr" rid="ref-21">21</xref>] was made to analyze simple repetitive sequences in the <italic>P. atropurpura</italic> chloroplast genome, the minimum repeat value for single nucleotides was set to 10, for dinucleotides to five, for trinucleotides to four, and tetra-, penta-, and hexanucleotides to three.</p>
</sec>
<sec id="s2_5">
<label>2.5</label>
<title>Chloroplast Genome Comparison of the Family Ericaceae</title>
<p>The chloroplast sequences of seven representative Chinese medicinal plants from the family Ericaceae, including <italic>Pyrola atropurpurea</italic> (PP473790), <italic>Pyrola rotundifolia</italic> (KU833271.1), <italic>Chimaphila japonica</italic> (MG461316.1), <italic>Gaultheria sinensis</italic> (OM048872.1), <italic>Agapetes malipoensis</italic> (NC_058759.1), <italic>Rhododendron simsii</italic> (MW030509.1), and <italic>Vaccinium bracteatum</italic> (LC521967.1) were downloaded from GenBank. The contraction and expansion of LSC, SSC, and IR region boundaries were visualized in the Ericaceae chloroplast genomes using IRscope software [<xref ref-type="bibr" rid="ref-22">22</xref>]. The chloroplast genome rearrangement and collinearity in Ericaceae species were detected using the Mauve multiplex genome alignment method in Geneous10.2.2 software [<xref ref-type="bibr" rid="ref-23">23</xref>]. The Ka/Ks values were calculated with Ka/Ks Calculator v2.0 (<ext-link ext-link-type="uri" xlink:href="https://sourceforge.net/projects/kakscalculator2/">https://sourceforge.net/projects/kakscalculator2/</ext-link>) (accessed on 13 January 2025). The Pi value of chloroplast protein-coding genes (PCGs) was analyzed using DnaSP software [<xref ref-type="bibr" rid="ref-24">24</xref>].</p>
</sec>
<sec id="s2_6">
<label>2.6</label>
<title>Phylogenetic Analysis</title>
<p>The phylogenetic relationship was conducted on all 41 chloroplast genomes in the Ericaceae family, using <italic>Magnolia officinalis</italic> (NC_020316.1) as an outgroup. PCGs alignment was performed using the MAFFT website (<ext-link ext-link-type="uri" xlink:href="https://mafft.cbrc.jp/alignment/server/index.html">https://mafft.cbrc.jp/alignment/server/index.html</ext-link>) (accessed on 13 January 2025) [<xref ref-type="bibr" rid="ref-25">25</xref>], all missing sites were filtered out using TBtools software [<xref ref-type="bibr" rid="ref-26">26</xref>], the optimal model was calculated using ModelTest NG software [<xref ref-type="bibr" rid="ref-27">27</xref>], and this model was determined on the Akaike information criteria. A phylogenetic tree was built using RAXML-n [<xref ref-type="bibr" rid="ref-28">28</xref>] based on the maximum likelihood (ML) method, and the tree-building model was GTR&#x002B;I&#x002B;G4.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Results</title>
<sec id="s3_1">
<label>3.1</label>
<title>Chloroplast Genome Structure</title>
<p>The total length of the <italic>P. atropurpura</italic> chloroplast genome (GenBank accession number: PP473790) is 172,535 bp, with a GC value of 34.95%. The genome has a typical tetrad structure, that is, the entire circular genome can be partitioned into a large single copy (LSC), a small single copy (SSC), and two inverted repeats (IR) regions (<xref ref-type="fig" rid="fig-1">Fig. 1</xref>). The LSC, SSC, and IR region lengths are 105,081, 11,700, and 27,877 bp, with GC values of 34.72%, 27.99%, and 38.74%, respectively. In total, 132 genes were annotated comprising 43 tRNA, 8 rRNA, 76 PCGs, and 5 pseudogenes (Table S1). In tRNA, <italic>trnA-UGC</italic>, <italic>trnC-GCA</italic>, <italic>trnH-GUG</italic>, <italic>trnI-CAU</italic>, <italic>trnI-GAU</italic>, <italic>trnL-CAA</italic>, <italic>trnL-UAG</italic>, <italic>trnN-GUU</italic>, <italic>trnR-ACG</italic>, and <italic>trnV-GAC</italic> each have two copies. And <italic>trnA-UGC, trnG-GCC</italic>, <italic>trnI-GAU</italic>, <italic>trnK-UUU</italic>, <italic>trnL-UAA</italic> and <italic>trnV-UAC</italic> each own one intron. There are four types of rRNA, each with two copies, situated in the IR region. In the encoded protein genes, there are two copies each of ribosomal protein subunit <italic>rpl32</italic>, and <italic>rps7</italic>, and unknown functional protein <italic>ycf15</italic>, <italic>atpF</italic>, <italic>clpP</italic>, <italic>rpoC1</italic>, <italic>ndhB</italic>, <italic>petB</italic>, <italic>petD</italic>, <italic>rpl16</italic>, <italic>rpl2</italic>, <italic>rps12</italic>, and <italic>rps16</italic> all own one intron, while <italic>ycf3</italic> has two introns (Table S2). In the deeply explored chloroplast genome, all <italic>ndh</italic> genes (<italic>ndhK</italic>, <italic>ndhD</italic>, <italic>ndhG</italic>, <italic>ndhA</italic>, and <italic>ndhH</italic>) except <italic>ndhB</italic>, are pseudogenes.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>The <italic>P. atropurpura</italic> chloroplast genome map in the genus <italic>Pyrola</italic></title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f001.tif"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Repeat Sequence</title>
<p>In total, 4223 scattered repeat sequences were predicted in the <italic>P. atropurpura</italic> chloroplast genome (Fig. S1). Among them, there were 2452 forward repeats (58.06%), 1763 palindromic repeats (41.75%), and eight reverse repeat (0.19%), and no complementary repeats were found. And 358 SSRs were checked in the <italic>P. atropurpura</italic> chloroplast genome (<xref ref-type="fig" rid="fig-2">Fig. 2</xref>), including 181 single nucleotide repeats, 14 dinucleotide repeats, 139 trinucleotide repeats, 13 tetranucleotide repeats, 3 pentanucleotide repeats, and 8 hexanucleotide repeats. Most SSRs are positioned in the LSC region (244), with only a few were distributed in the SSC (78) and IR region (36) (Fig. S2). In addition, the majority of SSRs are located in 260 intergenic regions (72.62%), followed by 14 in gene coding regions (3.91%) and 84 in intron regions (23.46%), indicating that SSRs are mainly distributed in intergenic regions (Fig. S3). These SSRs are mainly single base repeats composed of A or T, indicating that the SSRs of this genome have a strong preference for A and T.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Distribution of SSR type in the chloroplast genome of <italic>P. atropurpura</italic></title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f002.tif"/>
</fig>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Codon Preference and Nucleotide Polymorphism</title>
<p>The relative usage of synonymous codons (RSCU) analysis is based on <italic>P. atropurpura</italic> chloroplast genome, 71 coding DNA sequences (CDS) larger than 200 bp were obtained. The research results indicate that the chloroplast genome contains a total of 16,561 codons. Among them, there are 1721 codons encoding leucine (Leu), accounting for the highest proportion of 10.39%. The codon encoding cysteine (Cys) is 188, accounting for the smallest proportion at 1.14% (<xref ref-type="table" rid="table-1">Table 1</xref>). At the same time, 31 codons with RSCU greater than 1 were detected, with all codons ending in A/U except UUG and AUG. The RSCU value of codon AUG encoding methionine was the highest, at 6.7991 (<xref ref-type="table" rid="table-1">Table 1</xref> and Fig. S4).</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>The RSCU value of <italic>P. atropurpura</italic> chloroplast genome</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Amino acid</th>
<th>Symbol</th>
<th>Codon</th>
<th>Number</th>
<th>RSCU</th>
<th>Amino acid</th>
<th>Symbol</th>
<th>Codon</th>
<th>Number</th>
<th>RSCU</th>
</tr>
</thead>
<tbody>
<tr>
<td>&#x002A;</td>
<td>Ter</td>
<td>UAA</td>
<td>45</td>
<td>1.7763</td>
<td>M</td>
<td>Met</td>
<td>AUU</td>
<td>2</td>
<td>0.0364</td>
</tr>
<tr>
<td>&#x002A;</td>
<td>Ter</td>
<td>UAG</td>
<td>13</td>
<td>0.5133</td>
<td>M</td>
<td>Met</td>
<td>CUG</td>
<td>3</td>
<td>0.0546</td>
</tr>
<tr>
<td>&#x002A;</td>
<td>Ter</td>
<td>UGA</td>
<td>18</td>
<td>0.7104</td>
<td>M</td>
<td>Met</td>
<td>GUG</td>
<td>1</td>
<td>0.0182</td>
</tr>
<tr>
<td>A</td>
<td>Ala</td>
<td>GCA</td>
<td>305</td>
<td>1.1664</td>
<td>M</td>
<td>Met</td>
<td>UUG</td>
<td>3</td>
<td>0.0546</td>
</tr>
<tr>
<td>A</td>
<td>Ala</td>
<td>GCC</td>
<td>153</td>
<td>0.5852</td>
<td>N</td>
<td>Asn</td>
<td>AAC</td>
<td>182</td>
<td>0.4828</td>
</tr>
<tr>
<td>A</td>
<td>Ala</td>
<td>GCG</td>
<td>111</td>
<td>0.4244</td>
<td>N</td>
<td>Asn</td>
<td>AAU</td>
<td>572</td>
<td>1.5172</td>
</tr>
<tr>
<td>A</td>
<td>Ala</td>
<td>GCU</td>
<td>477</td>
<td>1.824</td>
<td>P</td>
<td>Pro</td>
<td>CCA</td>
<td>207</td>
<td>1.2</td>
</tr>
<tr>
<td>C</td>
<td>Cys</td>
<td>UGC</td>
<td>43</td>
<td>0.4574</td>
<td>P</td>
<td>Pro</td>
<td>CCC</td>
<td>119</td>
<td>0.69</td>
</tr>
<tr>
<td>C</td>
<td>Cys</td>
<td>UGU</td>
<td>145</td>
<td>1.5426</td>
<td>P</td>
<td>Pro</td>
<td>CCG</td>
<td>68</td>
<td>0.3944</td>
</tr>
<tr>
<td>D</td>
<td>Asp</td>
<td>GAC</td>
<td>118</td>
<td>0.4112</td>
<td>P</td>
<td>Pro</td>
<td>CCU</td>
<td>296</td>
<td>1.716</td>
</tr>
<tr>
<td>D</td>
<td>Asp</td>
<td>GAU</td>
<td>456</td>
<td>1.5888</td>
<td>Q</td>
<td>Gln</td>
<td>CAA</td>
<td>485</td>
<td>1.598</td>
</tr>
<tr>
<td>E</td>
<td>Glu</td>
<td>GAA</td>
<td>635</td>
<td>1.5544</td>
<td>Q</td>
<td>Gln</td>
<td>CAG</td>
<td>122</td>
<td>0.402</td>
</tr>
<tr>
<td>E</td>
<td>Glu</td>
<td>GAG</td>
<td>182</td>
<td>0.4456</td>
<td>R</td>
<td>Arg</td>
<td>AGA</td>
<td>299</td>
<td>1.7136</td>
</tr>
<tr>
<td>F</td>
<td>Phe</td>
<td>UUC</td>
<td>264</td>
<td>0.594</td>
<td>R</td>
<td>Arg</td>
<td>AGG</td>
<td>88</td>
<td>0.504</td>
</tr>
<tr>
<td>F</td>
<td>Phe</td>
<td>UUU</td>
<td>625</td>
<td>1.406</td>
<td>R</td>
<td>Arg</td>
<td>CGA</td>
<td>264</td>
<td>1.5126</td>
</tr>
<tr>
<td>G</td>
<td>Gly</td>
<td>GGA</td>
<td>474</td>
<td>1.5568</td>
<td>R</td>
<td>Arg</td>
<td>CGC</td>
<td>66</td>
<td>0.378</td>
</tr>
<tr>
<td>G</td>
<td>Gly</td>
<td>GGC</td>
<td>147</td>
<td>0.4828</td>
<td>R</td>
<td>Arg</td>
<td>CGG</td>
<td>57</td>
<td>0.3264</td>
</tr>
<tr>
<td>G</td>
<td>Gly</td>
<td>GGG</td>
<td>196</td>
<td>0.6436</td>
<td>R</td>
<td>Arg</td>
<td>CGU</td>
<td>273</td>
<td>1.5642</td>
</tr>
<tr>
<td>G</td>
<td>Gly</td>
<td>GGU</td>
<td>401</td>
<td>1.3168</td>
<td>S</td>
<td>Ser</td>
<td>AGC</td>
<td>73</td>
<td>0.3882</td>
</tr>
<tr>
<td>H</td>
<td>His</td>
<td>CAC</td>
<td>94</td>
<td>0.4784</td>
<td>S</td>
<td>Ser</td>
<td>AGU</td>
<td>250</td>
<td>1.3284</td>
</tr>
<tr>
<td>H</td>
<td>His</td>
<td>CAU</td>
<td>299</td>
<td>1.5216</td>
<td>S</td>
<td>Ser</td>
<td>UCA</td>
<td>209</td>
<td>1.1106</td>
</tr>
<tr>
<td>I</td>
<td>Ile</td>
<td>AUA</td>
<td>455</td>
<td>0.9627</td>
<td>S</td>
<td>Ser</td>
<td>UCC</td>
<td>155</td>
<td>0.8238</td>
</tr>
<tr>
<td>I</td>
<td>Ile</td>
<td>AUC</td>
<td>250</td>
<td>0.5289</td>
<td>S</td>
<td>Ser</td>
<td>UCG</td>
<td>86</td>
<td>0.4572</td>
</tr>
<tr>
<td>I</td>
<td>Ile</td>
<td>AUU</td>
<td>713</td>
<td>1.5084</td>
<td>S</td>
<td>Ser</td>
<td>UCU</td>
<td>356</td>
<td>1.8918</td>
</tr>
<tr>
<td>K</td>
<td>Lys</td>
<td>AAA</td>
<td>728</td>
<td>1.5706</td>
<td>T</td>
<td>Thr</td>
<td>ACA</td>
<td>241</td>
<td>1.142</td>
</tr>
<tr>
<td>K</td>
<td>Lys</td>
<td>AAG</td>
<td>199</td>
<td>0.4294</td>
<td>T</td>
<td>Thr</td>
<td>ACC</td>
<td>153</td>
<td>0.7252</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>CUA</td>
<td>232</td>
<td>0.8088</td>
<td>T</td>
<td>Thr</td>
<td>ACG</td>
<td>79</td>
<td>0.3744</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>CUC</td>
<td>113</td>
<td>0.3942</td>
<td>T</td>
<td>Thr</td>
<td>ACU</td>
<td>371</td>
<td>1.7584</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>CUG</td>
<td>94</td>
<td>0.3276</td>
<td>V</td>
<td>Val</td>
<td>GUA</td>
<td>350</td>
<td>1.4692</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>CUU</td>
<td>337</td>
<td>1.1748</td>
<td>V</td>
<td>Val</td>
<td>GUC</td>
<td>112</td>
<td>0.47</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>UUA</td>
<td>596</td>
<td>2.0778</td>
<td>V</td>
<td>Val</td>
<td>GUG</td>
<td>143</td>
<td>0.6004</td>
</tr>
<tr>
<td>L</td>
<td>Leu</td>
<td>UUG</td>
<td>349</td>
<td>1.2168</td>
<td>V</td>
<td>Val</td>
<td>GUU</td>
<td>348</td>
<td>1.4608</td>
</tr>
<tr>
<td>M</td>
<td>Met</td>
<td>AUA</td>
<td>0</td>
<td>0</td>
<td>W</td>
<td>Trp</td>
<td>UGG</td>
<td>270</td>
<td>1</td>
</tr>
<tr>
<td>M</td>
<td>Met</td>
<td>AUC</td>
<td>2</td>
<td>0.0364</td>
<td>Y</td>
<td>Tyr</td>
<td>UAC</td>
<td>113</td>
<td>0.3662</td>
</tr>
<tr>
<td>M</td>
<td>Met</td>
<td>AUG</td>
<td>372</td>
<td>6.7991</td>
<td>Y</td>
<td>Tyr</td>
<td>UAU</td>
<td>504</td>
<td>1.6338</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-1fn1" fn-type="other">
<p>Note: &#x002A; menas stop codon.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>The nucleotide polymorphism (Pi) value of chloroplast protein-coding gene sequences of seven <italic>Pyrola</italic> species was analyzed using DnaSP software (<xref ref-type="fig" rid="fig-3">Fig. 3</xref>). The complete length of the aligned sequences was 57,658 bp, and a total of 5213 polymorphic sites were discriminated. The Pi value ranged from 0 to 0.3019, with an average of 0.0479. Among the seven highly variable hotspots that were identified (Pi value &#x003E; 0.11), two genes (<italic>trnT-UGU</italic>, and <italic>trnF-GAA</italic>) are situated in the LSC region, and five genes (<italic>ccsA</italic>, <italic>psaC</italic>, <italic>ndhE</italic>, <italic>ndhI</italic>, and <italic>rps15</italic>) are seated in the SSC region. No nucleotide polymorphism sites were detected in the IR region, demonstrating that the nucleotide polymorphism in the LSC and SSC regions is significantly higher than in the IR region.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Divergent hot-spot nucleotide sites in Ericaceae species&#x2019; chloroplast genomes</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f003.tif"/>
</fig>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Structure Variation of Chloroplast Genome</title>
<p>To compare the chloroplast genome differences between <italic>P. atropurpura</italic> and the representative group of Ericaceae species, we calculated the basic information of chloroplast genomes (<xref ref-type="table" rid="table-2">Table 2</xref>). The <italic>P. atropurpura</italic> chloroplast genomes and its six closely related species had significant differences, with a total length range of 151,656 bp (<italic>Chimaphila japonica</italic>) to 176,632 bp (<italic>Gaultheria sinensis</italic>), all of which are typical tetrad structures. The length compositions of LSC, SSC, and IR in the above-mentioned genome are 78,997 to 109,173 bp, 2979 to 11,946 bp, and 1717 to 33,332 bp, respectively. The total number of genes is 102 to 146, PCGs are 63 to 94, tRNA genes are 4 to 8, and rRNA numbers are 30 to 43. In the variation of GC content, the GC values ranged from 35.0% to 36.8%. Boundary analysis showed obvious differences in the transition regions of the four boundary zones (<xref ref-type="fig" rid="fig-4">Fig. 4</xref>), which further demonstrated the significant differences in Ericaceae chloroplast genome sequences.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Comparison of seven Ericaceae chloroplast genomes</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Genome structure</th>
<th><italic>Pyrola atropurpurea</italic>, PP473790</th>
<th><italic>Pyrola rotundifolia</italic>, KU833271.1</th>
<th><italic>Chimaphila japonica</italic>, MG461316.1</th>
<th><italic>Gaultheria sinensis</italic>, OM048872.1</th>
<th><italic>Agapetes malipoensis</italic>, NC_058759.1</th>
<th><italic>Rhododendron simsii</italic>, MW030509.1</th>
<th><italic>Vaccinium bracteatum</italic>, LC521967.1</th>
</tr>
</thead>
<tbody>
<tr>
<td>Genome size/bp</td>
<td>172,535</td>
<td>168,995</td>
<td>151,656</td>
<td>176,632</td>
<td>172,729</td>
<td>152,214</td>
<td>174,404</td>
</tr>
<tr>
<td>LSC length/bp</td>
<td>105,081</td>
<td>109,173</td>
<td>100,403</td>
<td>107,395</td>
<td>105,281</td>
<td>78,997</td>
<td>106,565</td>
</tr>
<tr>
<td>SSC length/bp</td>
<td>11,700</td>
<td>11,946</td>
<td>7699</td>
<td>2573</td>
<td>3030</td>
<td>69,783</td>
<td>2979</td>
</tr>
<tr>
<td>IR length/bp</td>
<td>27,877</td>
<td>23,938</td>
<td>21,777</td>
<td>33,332</td>
<td>32,209</td>
<td>1717</td>
<td>32,430</td>
</tr>
<tr>
<td>GC content (%)</td>
<td>35.0</td>
<td>35.7</td>
<td>36.4</td>
<td>36.7</td>
<td>36.7</td>
<td>35.7</td>
<td>36.8</td>
</tr>
<tr>
<td>Number of genes</td>
<td>132</td>
<td>146</td>
<td>102</td>
<td>135</td>
<td>128</td>
<td>111</td>
<td>117</td>
</tr>
<tr>
<td>Protein-coding gene</td>
<td>81</td>
<td>94</td>
<td>63</td>
<td>88</td>
<td>89</td>
<td>75</td>
<td>79</td>
</tr>
<tr>
<td>rRNA</td>
<td>8</td>
<td>9</td>
<td>8</td>
<td>8</td>
<td>8</td>
<td>4</td>
<td>8</td>
</tr>
<tr>
<td>tRNA</td>
<td>43</td>
<td>43</td>
<td>31</td>
<td>39</td>
<td>41</td>
<td>32</td>
<td>30</td>
</tr>
</tbody>
</table>
</table-wrap><fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Boundary comparison of Ericaceae chloroplast genome</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f004.tif"/>
</fig>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>Genome Diversity of Chloroplast Genome</title>
<p>The seven Ericaceae chloroplast genomes were compared with the Mauve software (<ext-link ext-link-type="uri" xlink:href="https://github.com/MauveSoftware/novu">https://github.com/MauveSoftware/novu</ext-link>) (accessed on 13 January 2025). It was found that the chloroplast genome sequences had high variation, with higher variation in non-coding regions than in coding regions, and higher variation in LSC and SSC regions than in IR regions. Compared to other Ericaceae species, the sequence variation of <italic>Chimaphila japonica</italic> and <italic>Rhododendron simsii</italic> is relatively high. The gene number and order of the chloroplast genome in Ericaceae are variable, and an obvious gene rearrangement phenomenon was observed (<xref ref-type="fig" rid="fig-5">Fig. 5</xref>).</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Alignment of chloroplast genomes structure in Ericaceae</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f005.tif"/>
</fig>
<p>To investigate the evolutionary characteristics of the Ericaceae family in the chloroplast genome, Ka/Ks calculations were performed on 68 common PCGs of the <italic>P. atropurpura</italic> chloroplast genome and its six closely related species (<xref ref-type="fig" rid="fig-6">Fig. 6</xref>). The genes without numerical values in <xref ref-type="fig" rid="fig-6">Fig. 6</xref> indicate that the Ka/Ks value is zero. Most genes related to the photosystem system, such as <italic>petA</italic>, <italic>psbA</italic>, <italic>atpA</italic>, <italic>atpB</italic>, <italic>atpE</italic>, <italic>atpI</italic>, <italic>rbcL</italic>, <italic>psbE</italic>, <italic>psbB</italic>, <italic>psbC</italic>, <italic>psbD</italic>, <italic>psbF</italic>, <italic>ndhB</italic>, <italic>ndhJ</italic>, etc., have Ka/Ks values less than 1, showing that these key genes have been purified and selected. The above genes play a very important role in the functioning of the chloroplast genome and are therefore relatively conserved in evolution. Moreover, the Ka/Ks values of <italic>rpl33</italic> and <italic>rps16</italic> genes were higher among the five species, demonstrating that these genes were positively selected.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Ka/Ks values of common PCGs in <italic>P. atropurpura</italic></title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f006.tif"/>
</fig>
</sec>
<sec id="s3_6">
<label>3.6</label>
<title>Molecular Phylogeny</title>
<p>Based on the PCGs of chloroplast genome, a phylogenetic tree of 41 Ericaceae species was built using the ML method of the IQ-TREE software (<xref ref-type="fig" rid="fig-7">Fig. 7</xref>). <italic>Magnolia officinalis</italic> (NC_020316.1) was set as the out-group. And the Ericaceae species were mainly classified into four cluster groups, namely Cluster Groups I, II, III, and IV. The three species of the genera <italic>Pyrola</italic> and <italic>Chimaphila</italic> are distributed in Cluster I. Cluster Group II is composed of thirteen species in the genus <italic>Rhododendron</italic>. The thirteen species of the genus <italic>Gaultheria</italic> are located in Cluster III. And Cluster Group IV is mainly made up of the species in the genus <italic>Vaccinium</italic>. The phylogenetic result indicated that <italic>P. atropura</italic>, <italic>Pyrola rotundifolia</italic> and <italic>Chimaphila japonica</italic> are clustered together with a 100% support rate, and they have the closest genetic relationship.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>The phylogenetic tree of 41Ericaceae species based on chloroplast genome PCGs</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Phyton-94-61424-f007.tif"/>
</fig>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Discussion</title>
<p>Chloroplasts are important plant organelles, widely involved in processes such as photosynthesis and energy conversion. It belongs to matrilineal inheritance, with a genome much smaller and more conserved than the nuclear genome, and is therefore widely used in the study of plant phylogenetics [<xref ref-type="bibr" rid="ref-29">29</xref>]. It is also widely used as a chloroplast DNA barcode for species identification and classification research [<xref ref-type="bibr" rid="ref-30">30</xref>]. <italic>P. atropura</italic> chloroplast genome was successfully sequenced, assembled, and annotated, obtaining a genome of 172,535 bp, which is similar in size and number of genes to the reported chloroplast genomes of <italic>Pyrola</italic> [<xref ref-type="bibr" rid="ref-31">31</xref>], this shows that the chloroplast genome has a highly conserved characteristic in this genus.</p>
<p>SSR markers have advantages such as co-dominant inheritance, polymorphism, good repeatability, and easy operation, and are widely used in molecular genetic breeding, population genetic diversity analysis, evolutionary processes, and identification of closely related species [<xref ref-type="bibr" rid="ref-32">32</xref>]. And 358 SSR loci were distributed in the <italic>P. atropura</italic> chloroplast genome, with the highest number of single nucleotide repeat sequences (181), accounting for the highest proportion (50.56%). SSR loci tend to use A/T bases, which are similar to the sequence characteristics of other plants in the Ericaceae family. This result further confirms the view that polyA and polyT repeats are common in chloroplast SSRs, and they rarely contain C or G tandem repeats [<xref ref-type="bibr" rid="ref-33">33</xref>]. Complex repeats refer to highly repetitive DNA fragments in the genome, which play a crucial role in evolution and the formation of complex genome structures [<xref ref-type="bibr" rid="ref-34">34</xref>]. The scattered repeat sequences in the <italic>P. atropura</italic> chloroplast genome include three types: forward repeat, reverse repeat, and palindromic repeat. A total of 4223 complex repeats were discriminated, resulting in the high heterogeneity of this chloroplast genome. Rich dispersed repetitive sequences have also been found in other species of Ericaceae, such as the genus <italic>Rhododendron</italic> [<xref ref-type="bibr" rid="ref-35">35</xref>]. And the heterogeneity is also reflected in the collinearity analysis of the <italic>P. atropura</italic> chloroplast genome structure.</p>
<p>Codon preference is an important genome evolution feature in organisms, and RSCU value is an important parameter for evaluating the degree of codon preference [<xref ref-type="bibr" rid="ref-36">36</xref>]. There are a total of 64 codons in the <italic>P. atropura</italic> chloroplast genome, with 31 preferred codons and a preference for using A and U bases. This result is similar to the codon usage preference of nine plant chloroplast genomes in the genera <italic>Glycine</italic> [<xref ref-type="bibr" rid="ref-37">37</xref>] and also verifies the theory that the closer the species are, the more similar the codon usage patterns [<xref ref-type="bibr" rid="ref-38">38</xref>].</p>
<p>The IR boundaries expansion and contraction in chloroplast genomes are common phenomena in plant evolution [<xref ref-type="bibr" rid="ref-39">39</xref>]. The IR region length in most photosynthetic plant chloroplast genomes varies from 5 to 76 kb and may undergo multiple reductions and expansions during plant evolution. Therefore, the expansion, reduction, or IR region loss is the main cause of length differences in the chloroplast genome and structural variation, and it is also an important feature for distinguishing specific taxa [<xref ref-type="bibr" rid="ref-40">40</xref>]. Studies have shown that there is a typical feature in the plastid genomes of the Ericaceae family, specifically the large expansion of the IR region (approximately 10 kb). The IR region length of <italic>P. atropura</italic> is 27,877 bp, which is similar to that of <italic>Pyrola rotundifolia</italic> and <italic>Agapetes malipoensis</italic>. During long-term evolution, the IR regions of Ericaceae family plants have decreased or increased, indicating that the IR region may be necessary for their growth and development, and also plays an important role in maintaining the chloroplast genome stability. Moreover, there are many heterotrophic groups in this family of species [<xref ref-type="bibr" rid="ref-41">41</xref>].</p>
<p>The colinear alignment in this study showed that the <italic>P. atropura</italic> chloroplast genome was highly different from its closely related species. This result is also consistent with the IR boundary and genomic feature statistics, further verifying the significant differences between species in this family. There were varying degrees of variation in the gene regions of <italic>trnT-UGU</italic>, <italic>trnF-GAA</italic>, <italic>ccsA, psaC, ndhE, ndhI</italic>, and <italic>rps15</italic>. It is expected that molecular markers for interspecific identification and phylogenetic analysis of <italic>Pyrola</italic> plants can be developed from these regions. Ericaceae is a globally distributed family, particularly common in tropical mountainous areas, comprising approximately 125 genera and over 3500 species [<xref ref-type="bibr" rid="ref-42">42</xref>]. This group has complex systematic relationships, including autotrophic and heterotrophic plants. The Ericaceae family is divided into 9 subfamilies, including Enkianthoideae, Pyroloidae, Monotropoideae, Arbutoideae, Cassiopoideae, Ericoideae, Harrimanelloideae, Eparidoideae, and Vaccinoideae. Among them, the Monotropoideae subfamily is heterotrophic, parasitic on fungi, without chlorophyll, while the others are autotrophic groups [<xref ref-type="bibr" rid="ref-43">43</xref>]. One view holds that the subfamily Arbutoideae and Monotropoideae are sister groups, the branch was formed by them and the subfamily Pyroloidae are sister groups, this large branch and other groups of Ericaceae are sister groups, and Enkianthoideae is located at the most basic position [<xref ref-type="bibr" rid="ref-44">44</xref>,<xref ref-type="bibr" rid="ref-45">45</xref>]. Another view is that Enkianthoideae is the most basic group, followed by the subfamily Monotropoideae and Arbutoideae, and the subfamily Pyroloidae is the sister group of other groups [<xref ref-type="bibr" rid="ref-46">46</xref>]. Based on the above research, the phylogenetic relationships within the subfamilies have not been thoroughly resolved. This study found that the 41 species already published by NCBI can be divided into four groups by using PCGs of chloroplast genomes, namely, the genus <italic>Pyrola</italic> and <italic>Chimaphila</italic> as group one, the genus <italic>Rhododendron</italic> as group two, the genus <italic>Gaultheria</italic> as group three, and the genus <italic>Vaccinium</italic> as group four. The above data provides information for further species differentiation in this family.</p>
<p>In summary, the <italic>P. atropura</italic> chloroplast genome length is 172,535 bp and exhibits a typical tetrad structure. The GC content of the entire genome is 34.95%. And 132 genes were annotated, comprising 76 protein-coding, 43 tRNA, and 8 rRNA genes. It is preferred to use codons ending in A/U. A total of 358 SSR loci were detected, with single nucleotide repeat sequences being the predominant SSR loci; and 4223 scattered repeat sequences were discriminated. The size of the IR, SSR, and LSR region was different from that of plants between <italic>P. atropurpura</italic> and the other five Ericaceae species, and the SSC and IR region variation degree is higher than that in the LSC region. The relationship between <italic>P. atropura</italic>, <italic>Pyrola rotundifolia</italic>, and <italic>Chimaphila japonica</italic> is the closest.</p>
</sec>
<sec sec-type="supplementary-material" id="s5">
<title>Supplementary Materials</title>
<supplementary-material id="SD1">
<label>Figure S1</label>
<caption><title>Distribution of long fragment repeat in the chloroplast genome of <italic>P. atropurpura</italic>. F: forward repeats; P: palindromic repeats; R: reverse repeat; C: complementary repeats</title></caption>
<media xlink:href="Phyton-94-61424-s001.tif"/>
</supplementary-material>
<supplementary-material id="SD2">
<label>Figure S2</label>
<caption><title>Distribution of SSR in the chloroplast structure region of <italic>P. atropurpura</italic></title></caption>
<media xlink:href="Phyton-94-61424-s002.tif"/>
</supplementary-material>
<supplementary-material id="SD3">
<label>Figure S3</label>
<caption><title>Distribution of SSR in the chloroplast genome region of <italic>P. atropurpura</italic></title></caption>
<media xlink:href="Phyton-94-61424-s003.tif"/>
</supplementary-material>
<supplementary-material id="SD4">
<label>Figure S4</label>
<caption><title>RSCU statistics of chloroplast genome of <italic>P. atropurpura</italic></title></caption>
<media xlink:href="Phyton-94-61424-s004.tif"/>
</supplementary-material>
<supplementary-material id="SD5">
<media xlink:href="Phyton-94-61424-s005.docx"/>
</supplementary-material>
<supplementary-material id="SD6">
<media xlink:href="Phyton-94-61424-s006.docx"/>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>Not applicable.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>This work was supported by the Education Reform Program of Jiangxi Provincial Department of Education (JXJG-22-23-3, JXJG-23-23-5), the &#x201C;Biology and Medicine&#x201D; Discipline Construction Project of Nanchang Normal University (100/20149), Jiangxi Province Key Laboratory of Oil Crops Biology (YLKFKT202203), the Education Reform Program of Nanchang Normal University (NSJG-21-25) and Nanchang Key Laboratory of Comprehensive Research and Development of <italic>Brasenia schreberi</italic> (32060078).</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The datasets generated and analysed during the current study are available in the NCBI public database (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/nuccore/PP473790.1/">https://www.ncbi.nlm.nih.gov/nuccore/PP473790.1/</ext-link>) (accessed on 13 January 2025), and the corresponding accession number was PP473790.1.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>The plant materials in this research do not comprise any endangered wild species that are at risk of extinction. All methods and materials adhered strictly to the relevant legislative frameworks.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The author declares no conflicts of interest to report regarding the present study.</p>
</sec>
<sec>
<title>Supplementary Materials</title>
<p>The supplementary material is available online at <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.32604/phyton.2025.061424">https://doi.org/10.32604/phyton.2025.061424</ext-link>.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>1.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lee</surname> <given-names>C</given-names></string-name>, <string-name><surname>Leonie</surname> <given-names>HL</given-names></string-name>, <string-name><surname>Tina</surname> <given-names>BS</given-names></string-name>, <string-name><surname>Enrique</surname> <given-names>LJ</given-names></string-name>, <string-name><surname>Julian</surname> <given-names>MH</given-names></string-name></person-group>. <article-title>Chloroplast development in green plant tissues: the interplay between light, hormone, and transcriptional regulation</article-title>. <source>New Phytol</source>. <year>2022</year>;<volume>233</volume>(<issue>5</issue>):<fpage>2000</fpage>&#x2013;<lpage>16</lpage>. doi:<pub-id pub-id-type="doi">10.1111/nph.17839</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>2.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dobrogojski</surname> <given-names>J</given-names></string-name>, <string-name><surname>Adamiec</surname> <given-names>M</given-names></string-name>, <string-name><surname>Luci&#x0144;ski</surname> <given-names>R</given-names></string-name></person-group>. <article-title>The chloroplast genome: a review</article-title>. <source>Acta Physiol Plant</source>. <year>2020</year>;<volume>42</volume>:<fpage>98</fpage>. doi:<pub-id pub-id-type="doi">10.1007/s11738-020-03089-x</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>3.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Kan</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Liao</surname> <given-names>XZ</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>JW</given-names></string-name>, <string-name><surname>Tembrock</surname> <given-names>LR</given-names></string-name>, <string-name><surname>Daniell</surname> <given-names>H</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Plant organellar genomes: much done, much more to do</article-title>. <source>Trends Plant Sci</source>. <year>2024</year>;<volume>29</volume>(<issue>7</issue>):<fpage>754</fpage>&#x2013;<lpage>69</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.tplants.2023.12.014</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>4.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wu</surname> <given-names>ZQ</given-names></string-name>, <string-name><surname>Liao</surname> <given-names>XZ</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>XN</given-names></string-name>, <string-name><surname>Tembrock</surname> <given-names>LR</given-names></string-name>, <string-name><surname>Broz</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Genomic architectural variation of plant mitochondria-A review of multichromosomal structuring</article-title>. <source>J Syst Evol</source>. <year>2022</year>;<volume>60</volume>(<issue>1</issue>):<fpage>160</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1111/jse.12655</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>5.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chong</surname> <given-names>X</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yan</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Comparative chloroplast genome analysis of 10 Ilex species and the development of species-specific identification markers</article-title>. <source>Ind Crops Prod</source>. <year>2022</year>;<volume>187</volume>:<fpage>115408</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.indcrop.2022.115408</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>6.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Xiao</surname> <given-names>SZ</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>P</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>YT</given-names></string-name>, <string-name><surname>Dai</surname> <given-names>XB</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>LK</given-names></string-name>, <string-name><surname>Heider</surname> <given-names>B</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Comparative analysis of chloroplast genomes of cultivars and wild species of sweet potato (<italic>Ipomoea batatas</italic> (L.) Lam)</article-title>. <source>BMC Genomics</source>. <year>2021</year>;<volume>22</volume>:<fpage>262</fpage>; <pub-id pub-id-type="pmid">33849443</pub-id></mixed-citation></ref>
<ref id="ref-7"><label>7.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yi</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>L</given-names></string-name>, <string-name><surname>Lv</surname> <given-names>Y</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>A scalable and robust chloroplast genotyping solution: development and application of SNP and InDel markers in the maize chloroplast genome</article-title>. <source>Genes</source>. <year>2024</year>;<volume>15</volume>(<issue>3</issue>):<fpage>293</fpage>. doi:<pub-id pub-id-type="doi">10.3390/genes15030293</pub-id>; <pub-id pub-id-type="pmid">38540352</pub-id></mixed-citation></ref>
<ref id="ref-8"><label>8.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Du</surname> <given-names>CH</given-names></string-name>, <string-name><surname>Zhan</surname> <given-names>HX</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>CL</given-names></string-name>, <string-name><surname>Li</surname> <given-names>RF</given-names></string-name></person-group>. <article-title>Yuan SJ. Comparative and phylogeny analysis of <italic>Platycodon grandiflorus</italic> complete chloroplast genomes</article-title>. <source>Chin Tradit Herb Drugs</source>. <year>2023</year>;<volume>54</volume>(<issue>15</issue>):<fpage>4981</fpage>&#x2013;<lpage>991</lpage>.</mixed-citation></ref>
<ref id="ref-9"><label>9.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>XY</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>BX</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>JZ</given-names></string-name></person-group>. <article-title>Complete chloroplast genome and phylogenetic analysisof <italic>Rubia cordifolia</italic></article-title>. <source>Acta Botanica Boreali-Occidentalia Sinica</source>. <year>2023</year>;<volume>43</volume>(<issue>11</issue>):<fpage>1</fpage>&#x2013;<lpage>10</lpage>.</mixed-citation></ref>
<ref id="ref-10"><label>10.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Geng</surname> <given-names>YM</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>XQ</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>TC</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>LP</given-names></string-name></person-group>. <article-title>Characterization and phylogenetic analysis of chloroplast genome of <italic>Cynanchum wallichii</italic> and <italic>Cynanchum otophyllum</italic></article-title>. <source>Acta Pharmaceutica Sinica</source>. <year>2024</year>;<volume>59</volume>(<issue>3</issue>):<fpage>764</fpage>&#x2013;<lpage>74</lpage>.</mixed-citation></ref>
<ref id="ref-11"><label>11.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dong</surname> <given-names>HJ</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>ZW</given-names></string-name>, <string-name><surname>Peng</surname> <given-names>H</given-names></string-name></person-group>. <article-title>Geographical distribution and floristic significance of <italic>Pyrola</italic> in China</article-title>. <source>Plant Sci J</source>. <year>2024</year>;<volume>42</volume>(<issue>1</issue>):<fpage>43</fpage>&#x2013;<lpage>7</lpage>. doi:<pub-id pub-id-type="doi">10.11913/PSJ.2095-0837.23018</pub-id>.</mixed-citation></ref>
<ref id="ref-12"><label>12.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>QB</given-names></string-name>, <string-name><surname>Li</surname> <given-names>TY</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>BR</given-names></string-name></person-group>. <article-title>Exploitation and utilization of <italic>Pyrola</italic> L. resources</article-title>. <source>J Hebei Agricul Sci</source>. <year>2008</year>;<volume>12</volume>(<issue>3</issue>):<fpage>113</fpage>&#x2013;<lpage>114 119</lpage>.</mixed-citation></ref>
<ref id="ref-13"><label>13.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhao</surname> <given-names>ZF</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>N</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>X</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>YL</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Q</given-names></string-name>, <string-name><surname>He</surname> <given-names>XR</given-names></string-name></person-group>. <article-title>Chemical constituents, biological activities and quality control of plants from genus <italic>Pyrola</italic></article-title>. <source>China J Chin Mater Med</source>. <year>2017</year>;<volume>42</volume>(<issue>4</issue>):<fpage>618</fpage>&#x2013;<lpage>27</lpage>.</mixed-citation></ref>
<ref id="ref-14"><label>14.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>XL</given-names></string-name>, <string-name><surname>She</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>T</given-names></string-name>, <string-name><surname>An</surname> <given-names>GG</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>QR</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>A comprehensive review of the genus <italic>Pyrola</italic> Herbs in traditional uses, phytochemistry and pharmacological activities</article-title>. <source>Curr Top Med Chem</source>. <year>2020</year>;<volume>20</volume>(<issue>1</issue>):<fpage>57</fpage>&#x2013;<lpage>77</lpage>; <pub-id pub-id-type="pmid">31797760</pub-id></mixed-citation></ref>
<ref id="ref-15"><label>15.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>LY</given-names></string-name>, <string-name><surname>Song</surname> <given-names>MS</given-names></string-name>, <string-name><surname>Zha</surname> <given-names>HG</given-names></string-name>, <string-name><surname>Li</surname> <given-names>ZM</given-names></string-name></person-group>. <article-title>A modified protocol for plant genome DNA extraction</article-title>. <source>Plant Divers Resour</source>. <year>2014</year>;<volume>36</volume>(<issue>3</issue>):<fpage>375</fpage>&#x2013;<lpage>80</lpage>. doi:<pub-id pub-id-type="doi">10.7677/ynzwyj201413156</pub-id>.</mixed-citation></ref>
<ref id="ref-16"><label>16.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dierckxsens</surname> <given-names>N</given-names></string-name>, <string-name><surname>Mardulyn</surname> <given-names>P</given-names></string-name>, <string-name><surname>Smits</surname> <given-names>G</given-names></string-name></person-group>. <article-title>NOVOPlasty: <italic>de novo</italic> assembly of organelle genomes from whole genome data</article-title>. <source>Nucleic Acids Res</source>. <year>2017</year>;<volume>45</volume>(<issue>4</issue>):<fpage>e18</fpage>. doi:<pub-id pub-id-type="doi">10.1093/nar/gkw955</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>17.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Qu</surname> <given-names>XJ</given-names></string-name>, <string-name><surname>Moore</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Li</surname> <given-names>DZ</given-names></string-name>, <string-name><surname>Yi</surname> <given-names>TS</given-names></string-name></person-group>. <article-title>PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes</article-title>. <source>Plant Methods</source>. <year>2019</year>;<volume>15</volume>(<issue>1</issue>):<fpage>50</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s13007-019-0435-7</pub-id>.</mixed-citation></ref>
<ref id="ref-18"><label>18.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lohse</surname> <given-names>M</given-names></string-name>, <string-name><surname>Drechsel</surname> <given-names>O</given-names></string-name>, <string-name><surname>Kahlau</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bock</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Organellar Genome DRAW&#x2014;a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets</article-title>. <source>Nucleic Acids Res</source>. <year>2013</year>;<volume>41</volume>(<issue>W1</issue>):<fpage>W575</fpage>&#x2013;<lpage>81</lpage>. doi:<pub-id pub-id-type="doi">10.1093/nar/gkt289</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>19.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Shields</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Sharp</surname> <given-names>PM</given-names></string-name></person-group>. <article-title>Synonymous codon usage in <italic>Bacillus subtilis</italic> reflects both translational selection and mutational biases</article-title>. <source>Nucleic Acids Res</source>. <year>1987</year>;<volume>15</volume>(<issue>19</issue>):<fpage>8023</fpage>&#x2013;<lpage>40</lpage>. doi:<pub-id pub-id-type="doi">10.1093/nar/15.19.8023</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>20.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kurtz</surname> <given-names>S</given-names></string-name>, <string-name><surname>Choudhuri</surname> <given-names>JV</given-names></string-name>, <string-name><surname>Ohlebusch</surname> <given-names>E</given-names></string-name>, <string-name><surname>Schleiermacher</surname> <given-names>E</given-names></string-name>, <string-name><surname>Stoye</surname> <given-names>J</given-names></string-name>, <string-name><surname>Giegerich</surname> <given-names>R</given-names></string-name></person-group>. <article-title>REPuter: the manifold applications of repeat analysis on a genomic scale</article-title>. <source>Nucleic Acids Res</source>. <year>2001</year>;<volume>29</volume>(<issue>22</issue>):<fpage>4633</fpage>&#x2013;<lpage>42</lpage>. doi:<pub-id pub-id-type="doi">10.1093/nar/29.22.4633</pub-id>; <pub-id pub-id-type="pmid">11713313</pub-id></mixed-citation></ref>
<ref id="ref-21"><label>21.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Thiel</surname> <given-names>T</given-names></string-name>, <string-name><surname>Michalek</surname> <given-names>W</given-names></string-name>, <string-name><surname>Varshney</surname> <given-names>RK</given-names></string-name>, <string-name><surname>Graner</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (<italic>Hordeum vulgare</italic> L.)</article-title>. <source>Theor Appl Genet</source>. <year>2003</year>;<volume>106</volume>:<fpage>411</fpage>&#x2013;<lpage>22</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s00122-002-1031-0</pub-id>.</mixed-citation></ref>
<ref id="ref-22"><label>22.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Amiryousefi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hyv&#x00D6;nen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Poczai</surname> <given-names>P</given-names></string-name></person-group>. <article-title>IRscope: an online program to visualize the junction sites of chloroplast genomes</article-title>. <source>Bioinformatics</source>. <year>2018</year>;<volume>34</volume>(<issue>17</issue>):<fpage>3030</fpage>&#x2013;<lpage>1</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/bty220</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>23.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kearse</surname> <given-names>M</given-names></string-name>, <string-name><surname>Moir</surname> <given-names>R</given-names></string-name>, <string-name><surname>Wilson</surname> <given-names>A</given-names></string-name>, <string-name><surname>Stones-Havas</surname> <given-names>S</given-names></string-name>, <string-name><surname>Cheung</surname> <given-names>M</given-names></string-name>, <string-name><surname>Sturrock</surname> <given-names>S</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data</article-title>. <source>Bioinformatics</source>. <year>2012</year>;<volume>28</volume>(<issue>12</issue>):<fpage>1647</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/bts199</pub-id>; <pub-id pub-id-type="pmid">22543367</pub-id></mixed-citation></ref>
<ref id="ref-24"><label>24.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Rozas</surname> <given-names>J</given-names></string-name>, <string-name><surname>Ferrer-Mata</surname> <given-names>A</given-names></string-name>, <string-name><surname>Sanchez-Delbarrio</surname> <given-names>JC</given-names></string-name>, <string-name><surname>Guirao-Rico</surname> <given-names>S</given-names></string-name>, <string-name><surname>Librado</surname> <given-names>P</given-names></string-name>, <string-name><surname>Ramos-Onsins</surname> <given-names>SE</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>DnaSP 6: DNA sequence polymorphism analysis of large data sets</article-title>. <source>Mol Phylogenet Evol</source>. <year>2017</year>;<volume>34</volume>(<issue>12</issue>):<fpage>3299</fpage>&#x2013;<lpage>302</lpage>. doi:<pub-id pub-id-type="doi">10.1093/molbev/msx248</pub-id>.</mixed-citation></ref>
<ref id="ref-25"><label>25.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Katoh</surname> <given-names>K</given-names></string-name>, <string-name><surname>Rozewicki</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yamada</surname> <given-names>KD</given-names></string-name></person-group>. <article-title>MAFFT on-line service: multiple sequence alignment, interactive sequence choice and visualization</article-title>. <source>Brief Bioinform</source>. <year>2019</year>;<volume>20</volume>(<issue>4</issue>):<fpage>1160</fpage>&#x2013;<lpage>6</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bib/bbx108</pub-id>; <pub-id pub-id-type="pmid">28968734</pub-id></mixed-citation></ref>
<ref id="ref-26"><label>26.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>CJ</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Thomas</surname> <given-names>HR</given-names></string-name>, <string-name><surname>Frank</surname> <given-names>MH</given-names></string-name>, <string-name><surname>He</surname> <given-names>YH</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>TBtools: an integrative toolkit developed for interactive analyses of big biological data</article-title>. <source>Mol Plant</source>. <year>2020</year>;<volume>13</volume>(<issue>8</issue>):<fpage>1194</fpage>&#x2013;<lpage>202</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.molp.2020.06.009</pub-id>; <pub-id pub-id-type="pmid">32585190</pub-id></mixed-citation></ref>
<ref id="ref-27"><label>27.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Darriba</surname> <given-names>D</given-names></string-name>, <string-name><surname>Posada</surname> <given-names>D</given-names></string-name>, <string-name><surname>Kozlov</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Stamatakis</surname> <given-names>A</given-names></string-name>, <string-name><surname>Morel</surname> <given-names>B</given-names></string-name>, <string-name><surname>Flouri</surname> <given-names>T</given-names></string-name></person-group>. <article-title>ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models</article-title>. <source>Mol Phylogenet Evol</source>. <year>2020</year>;<volume>37</volume>(<issue>1</issue>):<fpage>291</fpage>&#x2013;<lpage>4</lpage>. doi:<pub-id pub-id-type="doi">10.1093/molbev/msz189</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>28.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kozlov</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Darriba</surname> <given-names>D</given-names></string-name>, <string-name><surname>Flouri</surname> <given-names>T</given-names></string-name>, <string-name><surname>Morel</surname> <given-names>B</given-names></string-name>, <string-name><surname>Stamatakis</surname> <given-names>A</given-names></string-name></person-group>. <article-title>RAxMLNG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference</article-title>. <source>Bioinformatics</source>. <year>2019</year>;<volume>35</volume>(<issue>21</issue>):<fpage>4453</fpage>&#x2013;<lpage>5</lpage>. doi:<pub-id pub-id-type="doi">10.1093/bioinformatics/btz305</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>29.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kress</surname> <given-names>W</given-names></string-name>, <string-name><surname>Wurdack</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zimmer</surname> <given-names>E</given-names></string-name>, <string-name><surname>Weigt</surname> <given-names>L</given-names></string-name>, <string-name><surname>Janzen</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Use of DNA barcodes to identify flowering plants</article-title>. <source>Proc Natl Acad Sci U S A</source>. <year>2005</year>;<volume>102</volume>(<issue>23</issue>):<fpage>8369</fpage>&#x2013;<lpage>74</lpage>. doi:<pub-id pub-id-type="doi">10.1073/pnas.0503123102</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>30.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jansen</surname> <given-names>RK</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Raubeson</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Daniell</surname> <given-names>H</given-names></string-name>, <string-name><surname>Depamphilis</surname> <given-names>CW</given-names></string-name>, <string-name><surname>Leebens-Mack</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns</article-title>. <source>Proc Natl Acad Sci U S A</source>. <year>2007</year>;<volume>104</volume>(<issue>49</issue>):<fpage>19369</fpage>&#x2013;<lpage>74</lpage>. doi:<pub-id pub-id-type="doi">10.1073/pnas.0709121104</pub-id>; <pub-id pub-id-type="pmid">18048330</pub-id></mixed-citation></ref>
<ref id="ref-31"><label>31.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Logacheva</surname> <given-names>MD</given-names></string-name>, <string-name><surname>Schelkunov</surname> <given-names>MI</given-names></string-name>, <string-name><surname>Shtratnikova</surname> <given-names>VY</given-names></string-name>, <string-name><surname>Matveeva</surname> <given-names>MV</given-names></string-name>, <string-name><surname>Penin</surname> <given-names>AA</given-names></string-name></person-group>. <article-title>Comparative analysis of plastid genomes of non-photosynthetic Ericaceae and their photosynthetic relatives</article-title>. <source>Sci Rep</source>. <year>2016</year>;<volume>6</volume>(<issue>1</issue>):<fpage>30042</fpage>. doi:<pub-id pub-id-type="doi">10.1038/srep30042</pub-id>.</mixed-citation></ref>
<ref id="ref-32"><label>32.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kaur</surname> <given-names>S</given-names></string-name>, <string-name><surname>Panesar</surname> <given-names>PS</given-names></string-name>, <string-name><surname>Bera</surname> <given-names>MB</given-names></string-name>, <string-name><surname>Kaur</surname> <given-names>V</given-names></string-name></person-group>. <article-title>Simple sequence repeat markers in genetic divergence and marker-assisted selection of rice cultivars: a review</article-title>. <source>Crit Rev Food Sci Nutr</source>. <year>2015</year>;<volume>55</volume>(<issue>1</issue>):<fpage>41</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1080/10408398.2011.646363</pub-id>.</mixed-citation></ref>
<ref id="ref-33"><label>33.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>HF</given-names></string-name>, <string-name><surname>Cao</surname> <given-names>YZ</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>XM</given-names></string-name>, <string-name><surname>Sui</surname> <given-names>SZ</given-names></string-name></person-group>. <article-title>Establishment of novel simple sequence repeat (SSR) markers from Chimonanthus praecox transcriptome data and their application in the Identification of Varieties</article-title>. <source>Plants</source>. <year>2024</year>;<volume>13</volume>:<fpage>2131</fpage>. doi:<pub-id pub-id-type="doi">10.3390/plants13152131</pub-id>.</mixed-citation></ref>
<ref id="ref-34"><label>34.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ye</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>JS</given-names></string-name>, <string-name><surname>Li</surname> <given-names>JJ</given-names></string-name>, <string-name><surname>Vilain</surname> <given-names>E</given-names></string-name>, <string-name><surname>Sallam</surname> <given-names>T</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>A genome-wide spectrum of tandem repeat expansions in 338,963 humans</article-title>. <source>Cell</source>. <year>2024</year>;<volume>187</volume>(<issue>22</issue>):<fpage>6411</fpage>&#x2013;<lpage>2</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.cell.2024.09.045</pub-id>; <pub-id pub-id-type="pmid">39368475</pub-id></mixed-citation></ref>
<ref id="ref-35"><label>35.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mo</surname> <given-names>ZQ</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>CN</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>MS</given-names></string-name>, <string-name><surname>Milne</surname> <given-names>RI</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>JB</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Resolution, conflict and rate shifts: insights from a densely sampled plastome phylogeny for <italic>Rhododendron</italic> (Ericaceae)</article-title>. <source>Ann Bot</source>. <year>2022</year>;<volume>130</volume>(<issue>5</issue>):<fpage>687</fpage>&#x2013;<lpage>701</lpage>. doi:<pub-id pub-id-type="doi">10.1093/aob/mcac114</pub-id>.</mixed-citation></ref>
<ref id="ref-36"><label>36.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chakraborty</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yengkhom</surname> <given-names>S</given-names></string-name>, <string-name><surname>Uddin</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Analysis of Codon usage bias of chloroplast genes in <italic>Oryza</italic> species: codon usage of chloroplast genes in <italic>Oryza</italic> species</article-title>. <source>Planta</source>. <year>2020</year>;<volume>252</volume>(<issue>4</issue>):<fpage>67</fpage>. doi:<pub-id pub-id-type="doi">10.1007/s00425-020-03470-7</pub-id>; <pub-id pub-id-type="pmid">32989601</pub-id></mixed-citation></ref>
<ref id="ref-37"><label>37.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Xiao</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>S</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>T</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Comparative analysis of codon usage patterns in the chloroplast genomes of nine forage legumes</article-title>. <source>Physiol Mol Biol Plants</source>. <year>2024</year>;<volume>30</volume>(<issue>2</issue>):<fpage>153</fpage>&#x2013;<lpage>66</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s12298-024-01421-0</pub-id>.</mixed-citation></ref>
<ref id="ref-38"><label>38.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Parvathy</surname> <given-names>ST</given-names></string-name>, <string-name><surname>Udayasuriyan</surname> <given-names>V</given-names></string-name>, <string-name><surname>Bhadana</surname> <given-names>V</given-names></string-name></person-group>. <article-title>Codon usage bias</article-title>. <source>Mol Biol Rep</source>. <year>2022</year>;<volume>49</volume>(<issue>1</issue>):<fpage>539</fpage>&#x2013;<lpage>65</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s11033-021-06749-4</pub-id>.</mixed-citation></ref>
<ref id="ref-39"><label>39.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Tam</surname> <given-names>NFY</given-names></string-name>, <string-name><surname>Lam</surname> <given-names>W</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>TCH</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>SJL</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>CL</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Interpreting the complexities of the plastid genome in dinoflagellates: a mini-review of recent advances</article-title>. <source>Plant Mol Biol</source>. <year>2024</year>;<volume>114</volume>(<issue>6</issue>):<fpage>87</fpage>. doi:<pub-id pub-id-type="doi">10.1007/s11103-024-01511-3</pub-id>; <pub-id pub-id-type="pmid">39432142</pub-id></mixed-citation></ref>
<ref id="ref-40"><label>40.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>DM</given-names></string-name>, <string-name><surname>Pan</surname> <given-names>YG</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>HL</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>GF</given-names></string-name></person-group>. <article-title>Thirteen complete chloroplast genomes of the costaceae family: insights into genome structure, selective pressure and phylogenetic relationships</article-title>. <source>BMC Genomics</source>. <year>2024</year>;<volume>25</volume>:<fpage>68</fpage>. doi:<pub-id pub-id-type="doi">10.1186/s12864-024-09996-4</pub-id>.</mixed-citation></ref>
<ref id="ref-41"><label>41.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Gao</surname> <given-names>WJ</given-names></string-name>, <string-name><surname>Li</surname> <given-names>HL</given-names></string-name>, <string-name><surname>Song</surname> <given-names>WW</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>XQ</given-names></string-name></person-group>. <article-title>Plastid genome structural characteristics and phylogenetic relationships of Ericaceae</article-title>. <source>J West China Forest Sci</source>. <year>2023</year>;<volume>52</volume>(<issue>5</issue>):<fpage>20</fpage>&#x2013;<lpage>8</lpage>.</mixed-citation></ref>
<ref id="ref-42"><label>42.</label><mixed-citation publication-type="other"><person-group person-group-type="author"><collab>WFO</collab></person-group>. <article-title>World Flora Online</article-title>; <year>[cited 2024 Oct 24]</year>. Available from: <ext-link ext-link-type="uri" xlink:href="https://www.Worldfloraonline.org">https://www.Worldfloraonline.org</ext-link>.</mixed-citation></ref>
<ref id="ref-43"><label>43.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kron</surname> <given-names>KA</given-names></string-name>, <string-name><surname>Judd</surname> <given-names>WS</given-names></string-name>, <string-name><surname>Stevens</surname> <given-names>PF</given-names></string-name>, <string-name><surname>Crayn</surname> <given-names>DM</given-names></string-name>, <string-name><surname>Anderberg</surname> <given-names>AA</given-names></string-name>, <string-name><surname>Gadek</surname> <given-names>PA</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Phylogenetic classification of Ericaceae: molecular and morphological evidence</article-title>. <source>Bot Rev</source>. <year>2002</year>;<volume>68</volume>(<issue>3</issue>):<fpage>335</fpage>&#x2013;<lpage>423</lpage>. doi:<pub-id pub-id-type="doi">10.1663/0006-8101(2002)068[0335:PCOEMA]2.0.CO;2</pub-id>.</mixed-citation></ref>
<ref id="ref-44"><label>44.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Braukmann</surname> <given-names>T</given-names></string-name>, <string-name><surname>Stefanovic</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Plastid genome evolution in mycoheterotrophic Ericaceae</article-title>. <source>Plant Mol Biol</source>. <year>2012</year>;<volume>79</volume>(<issue>1&#x2013;2</issue>):<fpage>5</fpage>&#x2013;<lpage>20</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s11103-012-9884-3</pub-id>.</mixed-citation></ref>
<ref id="ref-45"><label>45.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Freudenstein</surname> <given-names>JV</given-names></string-name>, <string-name><surname>Broe</surname> <given-names>MB</given-names></string-name>, <string-name><surname>Feldenkris</surname> <given-names>ER</given-names></string-name></person-group>. <article-title>Phylogenetic relationships at the base of Ericaceae: implications for vegetative and mycorrhizal evolution</article-title>. <source>Taxon</source>. <year>2016</year>;<volume>65</volume>(<issue>4</issue>):<fpage>794</fpage>&#x2013;<lpage>804</lpage>. doi:<pub-id pub-id-type="doi">10.12705/654.7</pub-id>.</mixed-citation></ref>
<ref id="ref-46"><label>46.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Rose</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Kleist</surname> <given-names>TJ</given-names></string-name>, <string-name><surname>Lofstrand</surname> <given-names>SD</given-names></string-name>, <string-name><surname>L&#x00F6;fstrand</surname> <given-names>SD</given-names></string-name>, <string-name><surname>Drew</surname> <given-names>BT</given-names></string-name>, <string-name><surname>Sch&#x00F6;nenberger</surname> <given-names>J</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Phylogeny, historical biogeography, and diversification of angiosperm order Ericales suggest ancient Neotropical and East Asian connections</article-title>. <source>Mol Phylogenet Evol</source>. <year>2018</year>;<volume>122</volume>:<fpage>59</fpage>&#x2013;<lpage>79</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.ympev.2018.01.014</pub-id>; <pub-id pub-id-type="pmid">29410353</pub-id></mixed-citation></ref>
</ref-list>
</back></article>


