张明英, 王西芳, 高静, 刘阿萍, 颜永刚, 杨新杰, 张岗. 美丽芍药叶绿体全基因组解析及系统发育分析J. 药学学报, 2020,55(1): 168-176. doi: 10.16438/j.0513-4870.2019-0654
引用本文: 张明英, 王西芳, 高静, 刘阿萍, 颜永刚, 杨新杰, 张岗. 美丽芍药叶绿体全基因组解析及系统发育分析J. 药学学报, 2020,55(1): 168-176. doi: 10.16438/j.0513-4870.2019-0654
ZHANG Ming-ying, WANG Xi-fang, GAO Jing, LIU A-ping, YAN Yong-gang, YANG Xin-jie, ZHANG Gang. Complete chloroplast genome of Paeonia mairei H. Lév.: characterization and phylogenyJ. Acta Pharmaceutica Sinica, 2020,55(1): 168-176. doi: 10.16438/j.0513-4870.2019-0654
Citation: ZHANG Ming-ying, WANG Xi-fang, GAO Jing, LIU A-ping, YAN Yong-gang, YANG Xin-jie, ZHANG Gang. Complete chloroplast genome of Paeonia mairei H. Lév.: characterization and phylogenyJ. Acta Pharmaceutica Sinica, 2020,55(1): 168-176. doi: 10.16438/j.0513-4870.2019-0654

美丽芍药叶绿体全基因组解析及系统发育分析

Complete chloroplast genome of Paeonia mairei H. Lév.: characterization and phylogeny

  • 摘要: 本研究利用Illumina HiSeq X Ten测序平台对芍药属药用植物美丽芍药进行了叶绿体全基因组测序,通过生物信息学分析方法进行序列组装、注释和特征解析,并将其与芍药属其他11种植物的叶绿体全基因组进行了比较基因组学和系统发育分析。结果显示,美丽芍药叶绿体基因组全长152 731 bp,GC含量为38.4%,具有被子植物叶绿体基因组典型的四分体结构,包括一个大单拷贝区(large single copy,LSC)、一个小单拷贝区(small single copy,SSC)和一对反向重复区(inverted repeat sequence,IRa/IRb),长度分别为84 402、16 969和25 680 bp;共注释得到136个基因,包括90个蛋白编码基因,38个tRNA基因和8个rRNA基因,其中7个蛋白编码基因、7个tRNA基因和4个rRNA基因分别在反向重复区发生了一次重复;此外,在美丽芍药叶绿体基因组中共检测出28个散在重复序列(dispersed repeats)、10个串联重复序列(tandem repeats)和64个简单重复序列(simple sequence repeats,SSRs)。芍药属12个物种的叶绿体基因组在大小、基因组成和排列顺序、GC含量等方面高度保守;同时,非编码区(包括基因间区和内含子)序列的种间变异高于蛋白编码基因序列,LSC区和SSC区序列变异高于IR区。系统发育分析结果以100%的支持率支持美丽芍药与芍药、草芍药和川赤芍共同构成一个单系分支,并与芍药亲缘关系最近。本研究首次报道了美丽芍药叶绿体全基因组,对其序列变异及结构特征进行了解析,并基于叶绿体全基因组序列构建系统发育树,明确了美丽芍药在芍药属内的系统发育位置,研究结果将为美丽芍药的保护遗传学和资源开发利用等研究提供理论基础。

     

    Abstract: The whole chloroplast genome of the medicinal plant Paeonia mairei H. Lév. was sequenced using the Illumina HiSeq X Ten platform and then assembled, annotated, and characterized by bioinformatic methods in this study. The complete chloroplast genome of P. mairei is 152 731 bp in length with the typical quadripartite structure, which consists of a large single copy-region (LSC, 84 402 bp), a small single copy-region (SSC, 16 969 bp), and a pair of inverted repeat regions (IRa and IRb, 25 680 bp), with an overall GC content of 38.4%. A total of 136 predicted genes, including 90 protein-coding genes, 38 tRNA genes and eight rRNA genes were identified. Among these, seven protein-coding genes, seven tRNA genes and four rRNA genes were found duplicated in the IR regions. In addition, 28 dispersed repeats, 10 tandem repeats, and 64 simple sequence repeats were detected within the whole chloroplast genome of P. mairei. Comparative analyses between 12 Peaonia species showed that the chloroplast genomes are highly conserved in length, gene content, gene order, and GC content. Meanwhile, the noncoding sequences (intergenic regions and introns) show a higher variation than the protein coding sequences, and sequences from the LSC region and SSC region are more variable than those from the IR regions. P. mairei was inferred forming in a distinct clade with P. lactiflora, P. obovate, and P. anomala subsp. veitchii with a 100% bootstrap value and is phylogenetically closest to P. lactiflora. These results may provide a basis for further genetic studies and the development and utilization of medicinal P. mairei.

     

/

返回文章
返回