2. 广东药科大学中药学院,广州 510006;
3. 西北大学生命科学学院,西安 710069
2. College of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, China;
3. School of Life Science, Northwest University, Xi'an 710069, China
Polygonatum kingianum, also known as jiejiegao and xianren rice in China, belongs to the perennial herb of Polygonatum Mill in Asparagaceae. Its wild resources are widely distributed in southwest China. In addition, it is engaged in relevant artificial cultivation and planting industries in Yunnan, Guizhou, Sichuan[1–3]. Polygonatum kingianum is one of the source plants of Rhizoma Polygonati which has high medicinal and edible value[4]. As a medicinal plant, P. kingianum is recorded in various national medical books. Chinese ancient medical book (Ming Yi Bie Lu, Han Dynasty, 220—450 AD) listed Rhizoma Polygonati as the top grade. Modern pharmacological studies have found their main chemical components have anti-aging, anti-tumor, immune enhancement, sterilization and anti-inflammatory effects[5].
As an essential subcellular organelle of plants and algae, previous study has found that chloroplast is not only the main place for photosynthesis, but also participates energy transformation[6]. In addition, chlo-roplasts also have relatively independent genome. In most angiosperms, the chloroplast genome belongs to the maternal inheritance, which has the characteristics of stable structure, conserved coding region sequence, rich information[7–8]. The complete chloroplast genome has been widely used in plant system evolution[9–11], related species identification and genetic diversity analysis[12], chloroplast genetic engineering, etc[13].
Codon, also known as triplet code, as a bridge connecting nucleic acid and protein[14], is the basic unit of biological genetic information transmission. In the case of mutation pressure, natural selection and genetic drift, prokaryotic and eukaryotic organisms generally tend to use one or more specific synonymous codons called synonymous codon usage bias (CUB)[15]. Through the analysis of species codon usage bias, the optimal codon can be determined, which can improve the efficiency and accuracy of related gene expression products, infer the function and expression mode of unknown genes, and provide scientific basis for exploring species relationship and genetic evolution[16–17].
Until now, the codon usage bias of some species has been analyzed[18–20], but the research on the codon usage preference of P. kingianum has not been reported yet. In the present study, Illumina Hiseq was used technologies for complete chloroplast genome sequence of P. kingianum. Based on this, we analyzed the sequence characteristics and codon usage bias of chloroplast genome. It was of great significance to provide a scientific reference for the application and investigation of the chloroplast genome in P. kingianum.
1 Materials and methods 1.1 Material collection and sequencingPolygonatum kingianum was collected in Tengchong City of China. The total DNA was extracted from 100 mg of fresh and healthy leaves using the modified CTAB method. Then the complete cp genome was sequenced by using Illumina Hiseq 2000 sequencing platform. The reference specimen (Ji et Wang 2) was deposited at the Herbarium of Kunming Institute of Botany, Chinese Academy of Sciences.
1.2 Plastome assembly, annotation, and comparisonFirst, we assembled the complete chloroplast genome with a reference-based assembly strategies with GetOrganelle using the complete chloroplast genome sequence of P. kingianum (NCBI reference sequence: MN934979) as reference. Then, the assembly was edited and annotated according to the reference in Geneious V 10.2[21]. We generated a physical map of the cp genome using Organellar GenomeDRAW[22]. Finally, the complete chloroplast genome of P. kingianum was submitted to the NCBI (Accession: MW788495).
1.3 Microsatellite analysisPerl scripts from MISA were used to perform SSR identification with the default parameters (http://pgrc.ipk-gatersleben.de/misa/). The identification criteria were as follows: mono-nucleotide repeat motifs with at least 10 repeats, di-nucleotide repeat motifs with 5 repeats, trinucleotide repeat motifs with four repeats, tetra-, pentaand hexa-nucleotide repeat motifs with three repeats. Compound SSRs were defined as those with a < 100-nt interval between two repeat motifs[23–24].
1.4 Phylogenetic analysisIn order to explore the evolutionary relationships of P. kingianum, the whole chloroplast genome sequences of two genera of Heteropolygonatum and Polygonatum from NCBI along with the obtained chloroplast genome sequence in the present study, were analyzed for phylogenetic analysis. All of the chloroplast genome sequences were aligned with MAFFT[25] implemented in Geneious (10.0.5), and a maximum-likelihood phylogenetic analysis was per-formed in RAxML[26] under the GTR-GAMMA model with 1 000 bootstrap replicates.
1.5 Codon composition and optimal codon analysisBased on the consideration of reducing sample error and accurately counting the number of effective codons, we eliminated the sequences with length less than 300 bp, duplicate genes, and coding sequences (CDS) containing stop codon. Fifty-three CDS sequences (start codon: ATG; stop codons: TAA, TAA, TGA and TAG) of the chloroplast genome of P. kingianum were used as research samples to analyze the CUB ultimately[27]. In order to analyze the rule of gene base composition, Codon W 1.4.2 software was used for the analysis relative synonymous codon usage (RSCU). We also used the CUSP and CHIPS models in the online software EMBOSS to analyze the GC content of the first base of codon (GC1), the second base of codon (GC2), the third base of codon (GC3), total GC content (GC), and effective number of codons (ENC). The Pearson correlation analysis of the above parameters was carried out using SPSS 24.0 software.
The RSCU value of codon more than 1 are determined to be high frequency codons[28]. Then taking an ENC as the preferred standard, five genes with the highest ENC values and the lowest ENC values in 51 chloroplast genes were regarded as the high and low expression groups. The RSCU values of 2 datasets were calculated and compared by ΔRSCU (RSCU in high and low expression groups). The codons satisfying both ΔRSCU > 0.08 defined as high expression codons. Finally, by combining high frequency and high expression codons, the optimal codons was defined for the chloroplast genome of P. kingianum[29].
1.6 Neutrality plot analysisA neutral graph was drew to research the influence of mutation pressure and natural selection on the chloroplast codon usage pattern of P. kingianum. GC3 values were regarded as abscissa; the average values of GC1 and GC2 of each gene were seen as GC12, which were ordinate. The correlation analysis of GC3 and GC12 will be helpful to make scientific judgments on the main factors affecting codon preference[30]. If there is a significant correlation between the two data and the regression coefficient is close to 1, it means that the codon preference is mainly affected by mutation pressure. On the contrary, it indicates that base composition preference is mainly affected by selection pressure.
1.7 ENC-GC3s plot analysisThe effective number of codon (ENC) is a measure of the degree of species independent synonymous codon bias in genes. Its value ranges from 20 to 61, which is negatively correlated with the CUB[31].
ENC-plot analysis can intuitively judge gene codon preference factors. With the GC3s values as horizontal ordinate and ENC values as longitudinal coordinate, two-dimensional scatter plot was drawn[31]. Standard curve formula: ENCexp=2+GC3s+29/[GC3s2+ (1-GC3s)2][32]. This curve shows the functional relationship between ENC and GC3s only under mutation pressure conditions. The ENC ratio distribution can quantify the results obtained by ENC-plot and clarify how far away or close each gene point is from the curve. The expression∶ENC ratio=(ENC expected value-ENC actual value)/ENC expected value[33].
1.8 Analysis of PR2-bias plotBased on the analysis of the composition of the four bases (A, T, C, G) at the third position of the chloroplast genome of the P. kingianum, we used G3/(G3+C3) and A3/(A3+T3) as the horizontal and vertical coordinates for analysis. The PR2-bias plot, which analyzes the nucleotide compositions at the third position of codons, are usually used to estimate the effects of mutation pressure and natural selection by analysing the AT bias and GC bias[34]. Through the vector emitted by the center of the plane, we can judge the degree and direction of the 4 kinds of bias[35].
2 Results 2.1 Characteristics of complete chloroplast genome sequenceThe length of the complete chloroplast genome sequence of P. kingianum was 155 852 bp, with GC content of 37.7%. It contained a pair of inverted repeats (IRs, 26 347 bp each), a small single copy (SSC, 18 525 bp) and large single copy region (LSC, 84 633bp). The average GC content of IR, LSC and SSC regions were 43%, 31.6% and 35.7%, respectively. Annotation results showed that there were 132 genes in the chloroplast genome in P. kingianum, including 85 protein-coding genes, 38 tRNA genes and 8 ribosomal rRNA genes. The GC contents of these three types of genes were 38.1%, 53.2% and 55.3%, respectively. The coding region of the gene was 90 607 bp, accounting for 40.3% of the entire chloroplast genome (Fig. 1). The coding genes families of P. kingiantum chloroplast were involved in four aspects: photosynthesis, self-replication, biosynthesis and unknown function. Table 1 shows the gene functions and groups in the cp genome. Compared with the chloroplast genomes of other species in Polygonatum genus, such as P. zanlanscianense (155 609 bp)[36] and P. humile (156 082 bp)[37], all cp genomes shared the same gene order and structure, which displayed a high degree of similarity.
![]() |
Fig. 1 Gene map of the chloroplast genome of Polygonatum kingianum |
![]() |
Table 1 List of identified genes in cp genomes of the Polygonatum kingianum |
With MISA analysis, a total of 69 SSRs were identified in the chloroplast genome of P. kingianum. Examination of all SSR loci in the genome showed that the majority of the SSR loci were located in the LSC region, with a number of 50 (72.46%). There were 11 (15.94%) located within the SSC region and the least number of SSRs located in the IR region, with only 8 (11.59%) (Fig. 2). The types of SSRs differed greatly in the number of repeats. The number of mononucleotide repeats was the largest, with 43, and all repeating units were A/T. There were 15, 5, 10, and 2, dinucleotide, trinucleotide, tetranucleotide, and pentanucleotide repeats, respectively; while hexanucleotide repeats were not observed. In terms of SSR repeat unit types, tetranucleotide repeat units were the most common, followed by the dinucleotide, trinucleotide repeat units and, finally, the mononucleotide, pentanucleotide repeat units (Table 2).
![]() |
Fig. 2 Distribution of various types of SSRs in LSC, SSC and IR regions Polygonatum are clustered in a clade, and alternate phyllotaxis species are clustered in anther clade. |
![]() |
Table 2 Repeat type, number and frequency of SSRs in complete chloroplast genome of Polygonatum kingianum |
The phylogenetic graph revealed that most of species of Polygonatum were clustered into a monophyletic clade with a high bootstrap value (Fig. 3). It indicated that the resulting phylogenetic tree we constructed was relatively robust. The sequence of P. kingianum in this study and P. huanum (P. huanum is a synonym of P. kingianum) from GenBank clustered together with a support rate of 100%. It showed that the conclusion of morphological identification was supported by molecular evidence. These samples were clearly divided into three branches. Branch I included 8 species (P. urceolatum, P. punctatum, P. stewardtianum, P. oppositifolium, P. tessellatum, P. huanum, P. kingianum). Branch II composed 5 species (P. yunnanense, P. arisanense, P. humile, P. biflorum, P. cyrtonema). The other two species (Heteropolygonatum altelobatum and H. ginfushanicum) were classified into branche III. The verticillate leaf type of Polygonatum are clustered in a clade, and alternate phyllotaxis species are clustered in anther clade.
![]() |
Fig. 3 Phylogenetic tree constructed using the maximum likelihood method based on chloroplast genome sequence |
From the chloroplast genome of P. kingianum, 51 CDSs suitable for analysis of CUB were selected (Table 3). The average GC content (GCall) of these 51 CDs sequences was 38.28% (30.77%-45.08%). The GC con-tents at different positions of the codons varied, with the average GC contents at the first position (GC1) being 46.84% (31.79%-58.54%), the second (GC2) being 39.38% (26.61%-55.40%), and the third (GC3) being 28.61% (20.90%-37.23%), indicating that the codons in the chloroplast genome of P. kingianum preferred to end with A or U. The statistics of the ENC values of 51 CDs sequences (Table 2) showed that the average value of the ENC of all genes was 48.21 (range from 39.83 to 60.11), suggesting a weak codon preference.
![]() |
Table 3 ENC value and GC content in different positions of codons in chloroplast 51 CDS of Polygonatum kingianum |
Pearson correlation analysis showed that the average GC content (GCall) at all codon locations was significantly correlated with GC1, GC2 and GC3. GC1 was significantly correlated with GC2, but neither of them reached a significant level with GC3, showing that the composition of the first and second base of the codon was similar, and there was a great difference from the third base. The correlation of ENC with GC1 and GC2 did not reach significance, but it was highly significant with GC3, indicating that the third base composition can significantly influence codon usage bias (Table 4). The correlation coefficient between ENC and codon counts (CC) was 0.079, which did not reach a significant level, indicating that the CC has a very weak influence on ENC. That is, in this study, the effect of gene length on codon bias analysis was not significant.
![]() |
Table 4 Correlation analysis of each gene's related parameters of Polygonatum kingianum |
There were 30 codons (RSCU > 1) in the chloroplast genomic protein coding sequence of P. kingianum, of which only one codon ended with G or C, and the remaining 29 codons ended with A and U, indicating that the chloroplast genome of P. kingianum prefers to use Codons ending with A or U; there were 31 codons (RSCU < 1), of which 28 ended with G and C, and 3 ended with A and U (Table 5), manifesting that the frequency of codons ending in C and G was relatively low in the chloroplast genome of P. kingianum. According to ΔRSCU values of codes in high and low libraries, 26 highly-expressed superior codon were screened out from the chloroplast genome of P. kingianum (Table 6). Combining the high expression superior codons with the 30 high-frequency codons described above, nine common codon (UUU, CUU, UCA, CCA, CAU, CAA, AAU, GAU, GGA) were finally identified as the optimal codons for the genome of P. kingianum, with 4 codons ending with A, and the remaining 5 codons all end with U, whereas the chloroplast protein encoding genome of P. kingianum preferred the codons ending with A and U, especially the codons ending with U.
![]() |
Table 5 RSCU analysis of CDS in Polygonatum kingianum |
![]() |
Table 6 Optimal codons in chloroplast genome of Polygonatum kingianum |
To estimate the extent of mutation pressure as well as natural selection contributed to the CUB of P. kingianum, a neutrality plot was constructed based on the GC12 and GC3 (Fig. 4). The range was 0.328 1-0.557 6 and 0.209 0-0.372 3 for GC12, GC3, respectively. As shown in Fig. 4, most of the points represented by each gene were distributed above the diagonal of the neutrality plot. Only the ycf2 gene was close to the diagonal. The Pearson correlation coefficient between GC12 and GC3 was 0.142 (P=0.311 > 0.05), showing that the correlation between them was not significant. The regression coefficient observed was closer to zero (The slope of the regression line was 0.161 1), which inferred that the GC content in the chloroplast genome of P. kingianum was highly conserved. Natural selection played a remarkably important role in the CUB of P. kingianum. And the mutation pressure accounted for a minority of the affecting factors.
![]() |
Fig. 4 Neutrality plot analysis |
The standard curve in the ENC-plot reflected the relationship between ENC and GC3s only when the influence of selection pressure is excluded. Figure 5 showed that some genes in the chloroplast of P. kingianum were located near the standard curve. The actual ENC value of this part of the gene was close to the expected ENC value, indicating that the mutation effect was greater than natural selection; while the position of the other part of the gene farther from the standard curve represented natural selection factors were stronger than mutations. In order to quantify the closeness of genes to the standard curve, the ENC ratio was used to count the frequency of ENC ratios (Table 7). The results showed that there were only 21 genes with ratios in the range from -0.05 to 0.05, accounting for 41.18% of the total number of genes. That means, most genes were far away from the standard curve, and the codon preference was related to the difference of GC3s, indicating that the codon preference of the chloroplast genome of P. kingianum was more affected by selection than mutation.
![]() |
Fig. 5 Analysis of ENC-plot |
![]() |
Table 7 Distribution of ENC ratio |
PR2-plot analysis showed that the chloroplast genes of P. kingianum were scatteredly distributed in the four regions of the chart, and most of the genes were distributed in the lower left part of the chart (Fig. 6), indicating that the frequency of T base in the third codon was higher than that of A base, and that of C base was higher than that of G base, that is, the frequency of pyrimidine was higher than that of purine. If the codon usage pattern is completely caused by mutation, the usage frequency of the four bases should be equal. The biased usage of four bases indicated that the usage pattern of chloroplast codon in P. kingianum was not only influenced by mutation, but also by other factors, such as selection pressure.
![]() |
Fig. 6 Analysis of PR2-plot |
Polygonatum kingianum is one of medicine and food homologous plants announced by the National Health Commission, PRC. For thousands of years, its medicinal effect and edible value has been widely recognized by Chinese people. Up to date, it has integrated medicinal, edible, ornamental and health function which has extremely high economic value and social benefits. In summary, P. kingianum has good development and research prospects[2].
In this study, basic research on the chloroplast of P. kingianum was conducted. The assembly annotation results showed that the chloroplast genome of P. kingianum was 155 852 bp in length, including one large single-copy region, one small single-copy region and two inverted repeat regions, which is consistent with the typical tetrad structure of the chloroplast genome of most angiosperms[38]. The study found that the chloroplast genome of P. kingianum was not much different from the chloroplast genome of other species in the genus Polygonatum in sequence length. Simple sequence repeats (SSRs) are an important part of the plant chloroplast genome which play an indispensable role in gene expression, transcription regulation, chromosome construction, and physiological metabolism[39]. After statistical analysis of MISA online software, 69 SSR loci were detected. These SSR loci will be helpful for subsequent research on the population genetics of P. kingianum. Studies have shown that P. kingianum is a species with large morphological variation, and it is difficult to identify this species only from morphology. The phylogenetic tree results of this study provide reliable DNA molecular evidence support for the morphological identification of P. kingienum. Compared with other plants of Polygonatum genus, the relationship between P. kingianum and P. tessellatum is closer, and the geographic distribution of 2 species is basically the same.
Although CUB is affected by many factors, natural selection and mutation are key factors that affect codon usage preference[40]. The results of codon bias in the chloroplast genome of P. kingianum showed that the average effective codon ranged from 39.830 to 60.106, which suggested a weak codon preference among these chloroplast genes.
The results of related parameters analysis and neutrality plot showed that the third position of codon had low base composition similarity with the first and second position. The correlation between the first position and the other two positions of the codon was not significant, indicating that the codon bias was subject to a strong degree of selection.
The results of ENC-plot analysis also confirmed the above argument. The ENC-plot graph showed that only a small part of genes distributed near the standard curve, and the actual ENC value of these genes were basically consistent with the theoretical ENC values. It indicated that these codon preferences were greatly affected by mutations. The scattered points of most genes were far from the standard curve. The actual ENC value of this part of the gene was quite different from the theoretical ENC value, indicating that it was more easily affected by the selection.
The analysis result of PR2-plot concluded that at the third codon position of cp genes, pyrimidines (C and T) were used more frequently than purines (A and G), which verified that the codon usage pattern of the chloroplast genes of P. kingianum was more affected by selection factors.
Based on the results of the high-frequency and highly expressed codons, nine codons (UUU, CUU, UCA, CCA, CAU, CAA, AAU, GAU, GGA) were obtained as the optimal codons of P. kingianum chloroplast genes finally. This preference pattern is consistent with the results of codon bias analysis of chloroplast genes in Oryza[41], Oncidium gower[42], Panicum miliaceum[43], etc. These indicate that P. kingiantum prefers to use codons ending in AT like other monocotyledonous plants.
The chloroplast genome sequences are not only most valuable for understanding plant evolution and phylogeny but also have made some achievments in chloroplast transformation technology. However, chloroplast genome has rarely been used in evolution and phylogeny of the P. kingiantum. Only three relevant research have been repored[37, 44–45]. These reports mainly discussed the phylogenetic relationship and identification of Polygonatum genus. Up to now, chloroplast genetic transformation technology has been applied in a variety of plants. Degray, et al[46] transferred the antimicrobial peptide gene MSI-99 into the chloroplast genome of tobacco by chloroplast transformation technology, and the descendants of transgenic plants showed high antibacterial activity. Chakrabarti, et al[47] incorporated a truncated Bacillus thuringiensis cry9Aa2 gene in the plastid genome of tobacco to control of potato tuber moth (Phthorimaea operculella). But, it has not been found any studies on chloroplast genetic transformation of Polygonatum species by far.
In conclusion, we sequenced and analyzed the complete cp genome of P. kingiantum, which exhibits conserved structure. The phylogenetic relationship based on the plastid genome data shows that the chloroplast genome as ultra-barcoding has great potential in the identification of Polygonatum species. Comprehensive analysis found that the codon bias of the P. kingiantum chloroplast genome is weak, and the factors that affect the formation of codon bias in the chloroplast protein-coding genes of P. kingianum do not depend on a single factor, but are the result of mutations, selection and many other factors. According to the codon usage characteristics of its chloroplast genome, we screened out 9 optimal codons for the chloroplast genome of P. kingiantum. The current study provides a scientific reference for the identification of germplasm resources, genetic breeding as well as for prediction of unknown functional genes, discovery of new genes, improvement of foreign gene expression in P. kingianum.
[1] |
CHEN X Q, LIANG S Y, XU J M, et al. Liliaceae[M]// WU Z Y, RAREN P H. Flora of China, vol. 24. Beijing: Science Press & St. Louis: Missouri Botanical Garden Press, 2000: 223-232.
|
[2] |
LIU W, LIN M Y, LIU J J, et al. Progress in study of Polygonatum kingianum and research status of Polygonati rhizoma[J]. Chin J Exp Trad Med Formulae, 2017, 23(14): 226-234. (in Chinese). DOI:10.13422/j.cnki.syfjx.2017140226 |
[3] |
LI J N, LIU H P, XIONG X W, et al. Growth performance of Polygonatum kingianum in Carya illinoensis grove[J]. J W China For Sci, 2020, 49(5): 42-46. (in Chinese). DOI:10.16473/j.cnki.xblykx1972.2020.05.007 |
[4] |
Chinese Pharmacopoeia Commission. Chinese Pharmacopoeia of the People's Republic of China, Vol. 1[M]. Beijing: China Medical Science Press, 2020: 1-319. (in Chinese).
|
[5] |
ZHANG J, WANG Y Z, YANG W Z, et al. Research progress in chemical constituents in plants of Polygonatum and their pharmacological effects[J]. China J Chin Mat Med, 2019, 44(10): 1989-2008. (in Chinese). DOI:10.19540/j.cnki.cjcmm.20190222.006 |
[6] |
VERMA D, DANIELL H. Chloroplast vector systems for biotechnology applications[J]. Plant Physiol, 2007, 145(4): 1129-1143. DOI:10.1104/PP.107.106690 |
[7] |
JIANG W J, GUO M Y, PANG X H. Application of chloroplast genome in identification and phylogenetic analysis of medicinal plants[J]. World Chin Med, 2020, 15(5): 702-708. (in Chinese). DOI:10.3969/j.issn.1673-7202.2020.05.008 |
[8] |
BI Y F, WEN X, PAN Y H, et al. Application and research progress of chloroplast DNA barcoding in forest trees[J]. Mol Plant Breed, 2020, 18(16): 5444-5452. (in Chinese). DOI:10.13271/j.mpb.018.005444 |
[9] |
WU L W, CUI Y X, NIE L P, et al. The characteristics of complete chloroplast genome sequence and phylogenetic analysis of Dendrobium moniliforme[J]. Acta Pharm Sin, 2020, 55(5): 1056-1066. (in Chinese). DOI:10.16438/j.0513-4870.2019-0940 |
[10] |
LI P R, ZHANG S J, LI F, et al. A phylogenetic analysis of chloroplast genomes elucidates the relationships of the six economically important Brassica species comprising the triangle of U[J]. Front Plant Sci, 2017, 8: 111. DOI:10.3389/fpls.2017.00111 |
[11] |
ZHANG J W, JIANG Z M, SU H, et al. The complete chloroplast genome sequence of the endangered species Syringa pinnatifolia (Oleaceae)[J]. Nord J Bot, 2019, 37(5): 2201-2212. DOI:10.1111/njb.02201 |
[12] |
WANG M H, HU S J, YANG C H, et al. Chloroplast genome analysis of Liriope spicata var. prolifera, Ophiopogon japonicus in Sichuan and Zhejiang[J]. Chin J Exp Trad Med Formulae, 2020, 26(8): 182-191. (in Chinese). DOI:10.13422/j.cnki.syfjx.20200947 |
[13] |
ZHANG J F, LI Y, JIN J J, et al. Recent advances in tobacco chloroplast genetic engineering[J]. Tob Sci Technol, 2017, 50(6): 88-98. (in Chinese). DOI:10.16135/j.issn1002-0861.2017.0035.(inChinese) |
[14] |
WALL D P, HERBECK J T. Evolutionary patterns of codon usage in the chloroplast gene rbcL[J]. J Mol Evol, 2003, 56(6): 673-688. DOI:10.1007/s00239-002-2436-8 |
[15] |
QIN H, WU W B, COMERON J M, et al. Intragenic spatial patterns of codon usage bias in prokaryotic and eukaryotic genomes[J]. Genetics, 2004, 168(4): 2245-2260. DOI:10.1534/genetics.104.030866 |
[16] |
LI Y, DUO J C, XIONG H Y, et al. Codon bias analysis of barley homeodomain-leucine zipper gene family[J]. J Triticeae Crops, 2020, 40(2): 144-153. (in Chinese). DOI:10.7606/j.issn.1009-1041.2020.02.02 |
[17] |
YANG G F, SU K L, ZHAO Y R, et al. Analysis of codon usage in the chloroplast genome of Medicago truncatula[J]. Acta Pharm Sin, 2015, 24(12): 171-179. DOI:10.11686/cyxb2015016 |
[18] |
LIU F L, WANG F J, SHAO Z R. Characterization of codon usage in chloroplast genome of Ectocarpus siliculosus[J]. Adv Mar Sci, 2012, 30(4): 587-594. DOI:10.3969/j.issn.1671-6647.2012.04.015 |
[19] |
WANG J, WANG T Y, WANG L Y, et al. Assembling and analysis of the whole chloroplast genome sequence of Elaeagnus angustifolia and its codon usage bias[J]. Acta Bot Boreali-Occid Sin, 2019, 39(9): 1559-1572. DOI:10.7606/j.issn.1000-4025.2019.09.1559 |
[20] |
ZHOU C Z, ZHU C, LI X Z, et al. Codon usage bias analysis methods of Camellia sinensis and its research progress[J]. Mol Plant Breed, 2020, 18(5): 1480-1488. (in Chinese). DOI:10.13271/j.mpb.018.001480 |
[21] |
KEARSE M, MOIR R, WILSON A, et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data[J]. Bioinformatics, 2012, 28(12): 1647-1649. DOI:10.1093/bioinformatics/bts199 |
[22] |
LOHSE M, DRECHSEL O, BOCK R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes[J]. Curr Genet, 2007, 52(5/6): 267-274. DOI:10.1007/s00294-007-0161-y |
[23] |
ZHAO C Z, QIU J J, AGARWAL G, et al. Genome-wide discovery of microsatellite markers from diploid progenitor species, Arachis duranensis and A. ipaensis, and their application in cultivated peanut (A. hypogaea)[J]. Front Plant Sci, 2017, 8(1209). DOI:10.3389/fpls.2017.01209 |
[24] |
DENG P C, WANG M, FENG K W, et al. Genome-wide characterrization of microsatellites in Triticeae species: Abundance, distribution and evolution[J]. Sci Rep, 2016, 6(1): 32224. DOI:10.1038/srep32224 |
[25] |
KATOH K, STANDLEY D M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability[J]. Mol Biol Evol, 2013, 30(4): 772-780. DOI:10.1093/molbev/mst010 |
[26] |
STAMATAKIS A. RAxML Version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies[J]. Bioinformatics, 2014, 30(9): 1312-1313. DOI:10.1093/bioinformatics/btu033 |
[27] |
XIN Y X, DONG Z H, QU S H, et al. Analysis on codon usage bias of chloroplast genome in Pyrus betulifolia Bge[J]. J Agric Univ Hebei, 2020, 43(6): 51-95. (in Chinese). DOI:10.13320/j.cnki.jauh.2020.0112.(inChinese) |
[28] |
XU Y, JIA R Y, ZHANG Z L, et al. Analysis of synonymous codon usage pattern in duck circovirus[J]. Gene, 2015, 557(2): 138-145. DOI:10.1016/j.gene.2014.12.019 |
[29] |
QIN Z, ZHENG Y J, GUI L J, et al. Codon usage bias analysis of chloroplast genome of camphora tree (Cinnamomum camphora)[J]. Guihaia, 2018, 38(10): 1346-1355. (in Chinese). DOI:10.11931/guihaia.gxzw201805023.(inChinese) |
[30] |
YANG X, LUO X N, CAI X P. Analysis of codon usage pattern in Taenia saginata based on a transcriptome dataset[J]. Parasite Vector, 2014, 7(1): 527. DOI:10.1186/s13071-014-0527-1 |
[31] |
WRIGHT F. The 'effective number of codons' used in a gene[J]. Gene, 1990, 87(1): 23-29. DOI:10.1016/0378-1119(90)90491-9 |
[32] |
YUAN X L, LI Y Q, ZHANG J F, et al. Analysis of codon usage bias in the chloroplast genome of Dalbergia odorifera[J]. Guihaia, 2021, 41(4): 622-630. (in Chinese). DOI:10.11931/guihaia.gxzw201906012 |
[33] |
SUEOKA N. Near homogeneity of PR2-bias fingerprints in the human genome and their implications in phylogenetic analyses[J]. J Mol Evol, 2001, 53(4/5): 469-476. DOI:10.1007/s002390010237 |
[34] |
SUEOKA N. Intrastrand parity rules of DNA base composition and usage biases of synonymous codons[J]. J Mol Evol, 1995, 40(3): 318-325. DOI:10.1007/BF00163236 |
[35] |
SUEOKA N. Translation-coupled violation of parity rule 2 in human genes is not the cause of heterogeneity of the DNA G+C content of third codon position[J]. Gene, 1999, 238(1): 53-58. DOI:10.1016/S0378-1119(99)00320-0 |
[36] |
SHI N X, LI L, WANG S Y, et al. The complete chloroplast genome of Polygonatum zanlanscianense (Pampanini, 1915) (Asparagaceae), an adulterants of Polygonati rhizoma[J]. Mitochondrial DNA B, 2021, 6(8): 2420-2421. DOI:10.1080/23802359.2021.1945973 |
[37] |
LEE S Y, ZOU Y L, LIAO W B, et al. The complete chloroplast genome of a traditional medicinal and food plant, Polygonatum humile (Asparagaceae, Asparagales)[J]. Mitochondrial DNA B, 2019, 4(2): 3184-3185. DOI:10.1080/23802359.2019.1666044 |
[38] |
ZHANG Y J, LI D Z. Advances in phylogenomics based on complete chloroplast genomes[J]. Plant Div, 2011, 33(4): 365-375. (in Chinese). DOI:10.3724/SP.J.1143.2011.10202 |
[39] |
AI D Y. The meaning of repeat sequences[J]. Chem Life, 2008, 28(3): 343-345. (in Chinese). DOI:10.3969/j.issn.1000-1336.2008.03.031 |
[40] |
SHI Y B, WANG G B, YANG X M, et al. Analysis of codon usage bias of gene factors in Ginkgo biloba WRKY family[J]. Mol Plant Breed, 2019, 17(5): 1503-1511. (in Chinese). DOI:10.13271/j.mpb.017.001503 |
[41] |
CHAKRABORTY S, YENGKHOM S, UDDIN A. Analysis of codon usage bias of chloroplast genes in Oryza species[J]. Planta, 2020, 252(4): 67. DOI:10.1007/s00425-020-03470-7 |
[42] |
LI D M, LÜ F B, ZHU G F, et al. Analysis on codon usage of chloroplast genome of Oncidium gower Ramsey[J]. Guangdong Agric Sci, 2012, 39(10): 61-65. (in Chinese). DOI:10.16768/j.issn.1004-874x.2012.10.058 |
[43] |
LIU H, WANG M X, YUE W J, et al. Analysis of codon usage in the chloroplast genome of Broomcorn millet (Panicum miliaceum L.)[J]. Plant Sci J, 2017, 35(3): 362-371. (in Chinese). DOI:10.11913/PSJ.2095-0837.2017.30362 |
[44] |
JIN J, LAO J, ZHONG C, et al. Complete chloroplast genome of a medicinal species Polygonatum kingianum in China (Asparagaceae, Asparagales)[J]. Mitochondrial DNA B, 2020, 5(1): 959-960. DOI:10.1080/23802359.2020.1721373 |
[45] |
FLODEN A, SCHILLING E E. Using phylogenomics to reconstruct phylogenetic relationships within tribe Polygonateae (Asparagaceae), with a special focus on Polygonatum[J]. Mol Phylogenet Evol, 2018, 129: 202-213. DOI:10.1016/j.ympev.2018.08.017 |
[46] |
DEGRAY G, RAJASEKARAN K, SMITH F, et al. Expression of an antimicrobial peptide via the chloroplast genome to control phytopathogenic bacteria and fungi[J]. Plant Physiol, 2001, 127(3): 852-862. DOI:10.1104/pp.127.3.852 |
[47] |
CHAKRABARTI S K, LUTZ K A, LERTWIRIYAWONG B, et al. Expression of the cry9Aa2 B.t. gene in tobacco chloroplasts confers resistance to potato tuber moth[J]. Transgen Res, 2006, 15(4): 481. DOI:10.1007/s11248-006-0018-z |