Abstract
Chinese motherwort (Leonurus japonicus), a member of Lamiaceae family, is a commonly used medicinal herb for treating obstetrical and gynecological diseases, producing over 280 officinal natural products. Due to limited genomic resources, little progress has been made in deciphering the biosynthetic pathway of valuable natural products in L. japonicus. Here, we de novo assembled the L. japonicus genome using high-coverage ONT long reads and Hi-C reads. The chromosome-level genome assembly contained ten chromosomes representing 99.29% of 489.34 Mb genomic sequence with a contig and scaffold N50 of 7.27 Mb and 50.86 Mb, respectively. Genome validations revealed BUSCO and LAI score of 99.2% and 21.99, respectively, suggesting high quality of genome assembly. Using transcriptomic data from various tissues, 22,531 protein-coding genes were annotated. Phylogenomic analysis of 13 angiosperm plants suggested L. japonicus had 58 expanded gene families functionally enriched in specialized metabolism such as diterpenoid biosynthesis. The genome assembly, annotation, and sequencing data provide resources for the elucidation of biosynthetic pathways behind natural products of pharmaceutical applications in L. japonicus.
Similar content being viewed by others
Background & Summary
Leonurus japonicus Houtt., known as Chinese motherwort or “Yi Mu Cao”, is a medicinal herb widely used in Traditional Chinese Medicine for thousands of years in China, Japan, and Korea for the treatment of obstetrical and gynecological diseases1. The annual plant L. japonicus belongs to the Lamioideae subfamily within Lamiaceae (mint) family, a highly diverse family containing more than 7,000 species2. To date, genome resources of 32 members in Lamiaceae have been published including some economically important species (medicinal, culinary, and fragrance usage) such as Salvia officinalis (sage)3, Salvia bowleyana (southern danshen)4, Lavandula angustifolia (lavender)5 and Prunella vulgaris (self heal)6, while in Lamioideae subfamily two genomes Leonotis leonurus (Lion’s tail)6 and Pogostemon cablin (Patchouli)7 are available. Current research on L. japonicus mainly focused on phytochemistry, pharmacology, and clinical applications, while little genomic resource is available besides karyotyping (2n = 2x = 20)1,8,9,10,11,12. According to China Pharmacopoeia (1977 to 2015 A.D.), L. japonicus can be used to treat menstrual disturbance, dysmenorrhea, amenorrhea, postpartum hemorrhage, postpartum lochiorrhea, invigorating blood circulation, diuretics, and dispelling edema1,10,11,12. The medicinal properties of L. japonicus come from its various natural products, including terpenoids, alkaloids, steroids, and flavonoids12,13,14. About 280 natural products have been identified in L. japonicus, notably leonurine and stachydrine, which are key in treating obstetric and gynecological conditions1,10,15. However, their natural yield is low in the plant16, making the elucidation of their biosynthetic pathways crucial for bioengineering and industrial production. This research is hindered by limited genetic resources, with only one published genome available17, insufficient for studying the species’ genetic diversity. Therefore, decoding the L. japonicus genome is essential for understanding and enhancing the production of these medicinal compounds.
Here, we report a chromosome-scale genome assembly of L. japonicus plant (Fig. 1a) YMC-RGv1.0 using a combination of Oxford Nanopore Technology (ONT) long reads, Illumina short reads (NGS) and high-throughput chromatin conformation capture sequencing (Hi-C) reads. In total, 130.69 Gb paired-end NGS reads of 150 bp, 46.33 Gb ONT reads with N50 of 17.67 kb, and 321.46 Gb Hi-C reads for L. japonicus were obtained (Table 1). The k-mer (k = 19) frequency analysis revealed that L. japonicus had an estimated genome size of 488.46 Mb with a heterozygosity rate of 0.36% (Fig. 1b). Karyotyping of L. japonicus confirmed its karyotype of 2n = 2x = 20 (Fig. 1c-d) as reported previously8. The L. japonicus genome was assembled from ONT reads by NextDenovo and polished by NGS reads using NextPolish, producing a draft assembly of 173 contigs with a contig N50 of 7.27 Mb (Table 2). The contigs were then scaffolded and corrected using Hi-C data, unambiguously anchoring them onto 10 chromosomes (Fig. 2a) for L. japonicus with a scaffold N50 of 50.86 Mb (Table 2), representing 99.29% of total contig sequences (Table 2; Fig. 2b). The final genome (YMC-RGv1.0) was 489.34 Mb in length, close to the estimated L. japonicus genome size (Fig. 1b) but smaller than the 590 Mb L. leonurus genome, another recently published genome in Lamioideae6. Genome annotation of YMC-RGv1.0 predicted 22,531 protein-coding genes with an N50 of 1,536 bp using MAKER2 pipeline, among which 22,458 were located on chromosomes (Table 2; Fig. 2c).
In summary, the genomic resources presented in this study will be valuable to studying the biology of L. japonicus. The high-quality genome assembly and annotation will facilitate the dissection of biosynthesis pathways of medicinal natural products in L. japonicus. Also, the chromosome-scale genome of L. japonicus will help comparative genomic studies to illustrate the genome evolution of Lamiaceae family and beyond.
Methods
Sample preparation
L. japonicus cultivar YMC-01 was grown with a day/night cycle of 25 °C/14 h light and 18 °C/10 h dark, at a humidity of ~60% in the greenhouse of Peking University Institute of Advanced Agricultural Sciences. Young leaf tissues were sampled from a 7-week-old L. japonicus plant for high molecular weight (HMW) DNA extraction. Root, stem, leaf, and flower tissues at 1 day post anthesis were harvested for RNA extraction. The collected materials were frozen immediately in liquid nitrogen and stored at −80 °C. HMW DNA (N50 > 50 kb) was obtained by a modified cetyltrimethylammonium bromide (CTAB) method18. RNA extraction was conducted using Trizol reagent following manufacturer recommendation. The total RNA extracts with a RIN (RNA integrity number) value ≥ 7.0 were used for downstream library construction for transcriptome sequencing.
Barcoding analysis
The extracted DNA was then used as PCR templates to amplify rbcL DNA barcode employing a pair of universal primers19 (F: 5′-AGACCTWTTTGAAGAAGGTTCWGT-3′; R: 5′-TCGGTYAGAGCRGGCATRTGCCA-3′. The PCR reaction conditions were as follows: initial denaturation at 95 °C for 5 min; then 35 cycles of 50 s at 94 °C, 30 s at 60 °C, and 1 min at 72 °C. Finally, an additional of 7 min was continued at 72 °C to complete the reaction. All reactions were carried out in SureCycler 8800 Thermal Cycler (Agilent, USA). The PCR products were electrophorized on 1% agarose gel using 1 kb DNA marker (Bioline, UK) to confirm the amplification length. The product was next sent to Tsingke Biotechnology Co., Ltd. for sequencing, and BLAST comparison was performed at National Center for Biotechnology Information (NCBI) after sequencing. For species identification, the rbcL sequence was identified with Barcode of Life Database (BOLD) system20.
Karyotype analysis
Actively growing root tips from germinating L. japonicus seeds were treated in ice water at 0 °C for 24 h to accumulate metaphases before fixation in 3:1 (v/v) 100% ethanol:acetic acid. The root tips were digested with an enzyme mixture containing cellulase and pectolyase and squashed in a drop of 45% acetic acid. The chromosomes were counterstained with DAPI (2 mg·ml−1) and mounted in Vectashield mounting medium. Images were captured using an Olympus BX63 fluorescence microscope equipped with an Olympus DP80 CCD camera and were processed using cellSens Dimension 1.9 (Olympus Corporation). Fluorescent probes based on telomere and conservative repeated sequences of 5 S ribosomal DNA (rDNA) or 18 S rDNA are used for fluorescence in situ hybridization (FISH) on the samples to determine the chromosomal ploidy characteristics of the species.
Genome and transcriptome sequencing
The NGS library was constructed using NEB Next® Ultra™ DNA Library Prep Kit for Illumina (NEB, USA) following its standard protocol. The Hi-C library was prepared from cross-linked chromatins using a standard Hi-C protocol. Both the NGS and Hi-C libraries were sequenced on Illumina NovaSeq 6000 platform to generate 150 bp paired-end reads. The Nanopore DNA library was prepared following the Ligation Sequencing SQK-LSK109 Kit (Oxford Nanopore Technologies, Oxford, UK) protocol and sequenced using Oxford Nanopore GridION (20 kb) platform. The transcriptome sequencing libraries from different tissues (root, stem, leaf, flower) were prepared using Illumina True-seq transcriptome kit (Illumina, CA). The libraries were then sequenced on Illumina NovaSeq 6000 platform to generate 150 bp paired-end reads.
Genome assembly
The genome size and heterozygosity rate of L. japonicus were estimated by performing K-mer frequency analysis with Jellyfish v2.3.021 and GenomeScope v2.022 using NGS reads. De novo genome assembly using ONT long reads was carried out with NextDenovo v2.5.0 (https://github.com/Nextomics/NextDenovo). To correct sequencing errors, we polished the draft assembly for three rounds using Illumina short reads by NextPolish v1.3.123. Then Hi-C sequencing data were used to anchor all contigs using the pipeline of Juicer v1.524, 3D-DNA v18041925 and Juicebox v1.11.0826. Finally, we assessed the completeness of the genome assembly using Benchmarking Universal Single-Copy Orthologs (BUSCO) v4.0.627 with default parameters. The assembly continuity was also evaluated by calculating the Long Terminal Repeat Assembly Index (LAI) using LTR_retriever28 with default parameters.
Genome annotation
For the annotation of repetitive elements, we implemented a hybrid strategy that entailed both de novo prediction and homology-based searches. RepeatModeler2 v2.0.129 was used to build a de novo repeat library. Subsequently, we annotated and masked the assembled genome using RepeatMasker v4.10 software30. For gene annotation, transcriptomic reads from different tissues (root, leaf, stem and flower) were first assembled using StringTie31. The protein sequences used for homology-based prediction were from S. baicalensis32 and universal Swiss-Prot proteins33. Then, we combined transcriptomic assemblies, ab initio prediction, and homolog protein mapping with MAKER234 to predict gene structures. Finally, only the gene sets with Annotation Edit Distance (AED) lower than 0.5 were retained for further study. The BUSCO completeness of predicted gene models was assessed against eudicots_odb10 datasets under the protein mode. Gene functions were identified using NCBI protein database with an E-value threshold of 1e−10 for BLAST searches and annotated using eggNOG-mapper v235 database.
Gene family identification and Phylogenetic analysis
Orthologs and orthogroups among L. japonicus and 12 other genomes (Table 3, Fig. 3a) were identified using the software OrthoFinder v2.5.436 with default values setting and ‘-M msa’ activated. The longest predicted protein of each gene was used as the representative input for the OrthoFinder analysis. Then we removed poorly aligned regions of multiple protein sequence alignments with TrimAl v1.4.137. RAxML v8.2.1238 was used to a build Maximum Likelihood (ML) phylogenetic tree using the GAMMAJTT model, with A. trichopoda as an outgroup. The CodeML and MCMCTree programs in the PAML v4.939 were used to analyze amino acid substitution models and estimate divergence times. Five calibration points (S. indicum vs. O. europaea: 75.8~93.9 MYA, O. europaea vs. S. lycopersicum: 97.5~107.3 MYA, V. vinifera vs. A. thaliana: 109.0~123.5 MYA, A. thaliana vs. O. sativa: 143.0~174.8 MYA, and O. sativa vs. A. trichopoda: 179.9~205.0 MYA) derived from the TimeTree database40 were applied to constrain the divergence times of the nodes. Gene families that underwent expansion or contraction events were identified by CAFE541 software. The identified expanded gene families of L. japonicus were then subjected to further analysis of Gene Ontology (GO) term enrichment42 and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment43, and the p-value of significant enrichment was set as 0.05 in GO term and KEGG enrichment analysis. The gene distribution results showed that 184 gene families (573 genes) were unique to L. japonicus compared with other plants (Fig. 3b). L. japonicus genome had 58 expanded gene families (463 genes) that were enriched in pathways such as tryptophan metabolism, diterpenoid biosynthesis, and plant-pathogen interaction (Fig. 3a,c).
Synteny analysis
The syntenic analysis was performed by JCVI v1.1.1944. We identified synteny blocks by performing an all-against-all BLAST search and chaining the hits with a distance cutoff of 20 genes. Additionally, we required each synteny block to have at least five gene pairs. To estimate the timing of the WGD event in L. japonicus, Ks values of L. japonicus syntenic block genes were calculated using ParaAT v2.045. Our analysis indicated a significant WGD in L. japonicus, with a prominent Ks peak at 1.03 (Fig. 4a). Compared with Vitis vinifera, which lacks genome duplication post-γ event46, both L. japonicus and S. baicalensis exhibit a 2:1 collinearity relationship with grape, highlighting the WGD’s significance (Fig. 4b,c). The WGD, estimated to have occurred around 72 MYA based on the Ks distribution and mutation rate, appears widespread in Lamiales genomes (Fig. 3a), with exceptions like the Oleaceae family3,47.
Data Records
The sequencing dataset was deposited in public repositories. Illumina, Oxford Nanopore, Hi-C, and RNA-seq sequencing data used for genome assembly and annotation have been deposited in the NCBI Sequence Read Archive (SRA) database with accession numbers SRR2511088648, SRR2511088749, SRR2511088550, SRR2511088851, SRR2511088952, SRR2511089053 and SRR2645897554, respectively, under the BioProject accession number PRJNA989396. The chromosomal-level genome assembly was deposited in the GenBank database of NCBI with accession number GCA_030762865.155. Moreover, the genome annotation, multiple sequence alignment, and gene family expansion files in phylogenetic analysis have been deposited at the Figshare database56.
Technical Validation
Plant Identification
The ribulose 1,5-biphosphate carboxylase (rbcL) gene in the chloroplast genome has been explored as a DNA barcode for medicinal plant identification57,58. The size of rbcL amplicon by a universal primer pair was 895 bp. The partial sequence of rbcL was obtained and deposited into GenBank with the accession number OR790520. The BOLD system and NCBI were used to BLAST the sequence, and the results showed that the plant material was L. japonicus.
Genome assembly validation
The quality of the assembly YMC-RGv1.0 was evaluated using four approaches. Firstly, comparing the final genome assembly size with the estimated size, the results showed that they were both close to 489 Mb (Table 2; Fig. 1b). The Hi-C heatmap for our L. japonicus genome assembly revealed 10 chromosome models (Fig. 2a), consistent with our karyotype analysis (2n = 20) (Fig. 1c–e). Secondly, BUSCO analysis found 99.20% eudicot BUSCOs in the L. japonicus genome, and 94.70% eudicot BUSCOs in its predicted gene models (Table 2). Thirdly, LAI score (21.99) met the quality standard for reference genomes (LAI > 20) (Table 2). Finally, sequencing data were mapped to the genome using SAMtools v1.16.159 to verify the accuracy, which showed 98.48% genome coverage and mapping rates of 99.97% for ONT data, and 93.58% genome coverage and 99.27% for NGS data (Table 1). These results indicated that the L. japonicus assembly was of high accuracy and completeness.
Code availability
No custom code was used for this study. All data analyses were conducted using published bioinformatics software with default settings unless otherwise specified.
References
Miao, L. L., Zhou, Q. M., Peng, C., Liu, Z. H. & Xiong, L. Leonurus japonicus (Chinese motherwort), an excellent traditional medicine for obstetrical and gynecological diseases: A comprehensive overview. Biomed Pharmacother 117, 109060 (2019).
Schoch, C. L. et al. NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database (Oxford) 2020, baaa062 (2020).
Li, C. Y. et al. The sage genome provides insight into the evolutionary dynamics of diterpene biosynthesis gene cluster in plants. Cell Rep. 40, 111236 (2022).
Zheng, X. et al. Insights into salvianolic acid B biosynthesis from chromosome-scale assembly of the Salvia bowleyana genome. J Integr Plant Biol. 63, 1309–1323 (2021).
Li, J. et al. The chromosome-based lavender genome provides new insights into Lamiaceae evolution and terpenoid biosynthesis. Hortic Res. 8, 53 (2021).
Bryson, A. E. et al. Uncovering a miltiradiene biosynthetic gene cluster in the Lamiaceae reveals a dynamic evolutionary trajectory. Nat Commun. 14, 343 (2023).
Shen, Y. et al. Chromosome-level and haplotype-resolved genome provides insight into the tetraploid hybrid origin of patchouli. Nat Commun. 13, 3511 (2022).
Ma, Y., Hui, R., Cui, Y., Zhang, Q. & Liu, Y. Karyotype analysis based on physical mapping of the 45S rDNA and meiotic observations in Leonurus japonicus Houtt. Acta Horticulturae Sinica 38, 125–132 (2011).
Xiong, L. et al. Leonuketal, a spiroketal diterpenoid from Leonurus japonicus. Org Lett. 17, 6238–6241 (2015).
Li, Y. Y. et al. Leonurine: From gynecologic medicine to pleiotropic agent. Chin J Integr Med. 26, 152–160 (2020).
Wang, C., Lv, X., Liu, W., Liu, S. & Sun, Z. Uncovering the pharmacological mechanism of motherwort (Leonurus japonicus Houtt.) for treating menstrual disorders: A systems pharmacology approach. Comput Biol Chem. 89, 107384 (2020).
Shang, X., Pan, H., Wang, X., He, H. & Li, M. Leonurus japonicus Houtt.: ethnopharmacology, phytochemistry and pharmacology of an important traditional Chinese medicine. J Ethnopharmacol. 152, 14–32 (2014).
Zhou, Q. M. et al. New triterpenoids from Leonurus japonicus (Lamiaceae). Biochem Syst Ecol. 82, 27–30 (2019).
Liu, J. et al. Alkaloids and flavonoid glycosides from the aerial parts of Leonurus japonicus and their opposite effects on uterine smooth muscle. Phytochemistry 145, 128–136 (2018).
Cheng, F. et al. A review of pharmacological and pharmacokinetic properties of stachydrine. Pharmacol. Res. 155, 104755 (2020).
Li, Z., Chen, K., Rose, P. & Zhu, Y. Z. Natural products in drug discovery and development: synthesis and medicinal perspective of leonurine. Front Chem. 10, 1036329 (2022).
Li, P. et al. Multi-omics analyses of two Leonurus species illuminate Leonurine biosynthesis and its evolution. Mol Plant 23, https://doi.org/10.1016/j.molp.2023.11.003 (2023).
Murray, M. G. & Thompson, W. F. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 8, 4321–4326 (1980).
Dong, W. et al. Discriminating plants using the DNA barcode rbcLb: an appraisal based on a large data set. Mol Ecol Resour. 14, 336–343 (2014).
Ratnasingham, S. & Hebert, P. D. N. BOLD: The barcode of life data system (www.barcodinglife.org). Mol Ecol Notes 7, 355–364 (2007).
Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
Hu, J., Fan, J., Sun, Z. & Liu, S. NextPolish: a fast and efficient genome polishing tool for long-read assembly. Bioinformatics 36, 2253–2255 (2020).
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126–e126 (2018).
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. PNAS. 117, 9451–9457 (2020).
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinform. 25, 4.10.1–4.10.14 (2009).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 29, 644–U130 (2011).
Qing, Z. et al. The reference genome sequence of Scutellaria baicalensis provides insights into the evolution of wogonin biosynthesis. Mol Plant. 12, 935–950 (2019).
Bairoch, A. & Apweiler, R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28, 45–48 (2000).
Cantarel, B. L. et al. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18, 188 (2008).
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol. 34, 2115–2122 (2017).
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 1–14 (2019).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 24, 1586–1591 (2007).
Hedges, S. B., Marin, J., Suleski, M., Paymer, M. & Kumar, S. Tree of life reveals clock-like speciation and diversification. Mol Biol Evol. 32, 832–845 (2015).
Mendes, F. K., Vanderpool, D., Fulton, B. & Hahn, M. W. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics 36, 5516–5518 (2020).
Balakrishnan, R., Harris, M. A., Huntley, R., Van Auken, K. & Cherry, J. M. A guide to best practices for Gene Ontology (GO) manual annotation. Database (Oxford) 2013, bat054 (2013).
Kanehisa, M., Goto, S., Kawashima, S., Okuno, Y. & Hattori, M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 32, D277–80 (2004).
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).
Zhang, Z. et al. ParaAT: a parallel tool for constructing multiple protein-coding DNA alignments. Biochem Biophys Res Commun. 419, 779–781 (2012).
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
Julca, I., Marcet-Houben, M., Vargas, P. & Gabaldón, T. Phylogenomics of the olive tree (Olea europaea) reveals the relative contribution of ancient allo-and autopolyploidization events. BMC biology 16, 1–15 (2018).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25110886 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25110887 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25110885 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25110888 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR21358889 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR25110890 (2023).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRR26458975 (2023).
NCBI GenBank https://identifiers.org/ncbi/insdc:JAUZNL000000000 (2023).
Chen, W. K. et al. De novo chromosome-level genome assembly of Chinese motherwort (Leonurus japonicus). figshare https://doi.org/10.6084/m9.figshare.23632353.v7 (2023).
Ho, V. T., Tran, T. K. P., Vu, T. T. T. & Widiarsih, S. Comparison of matK and rbcL DNA barcodes for genetic classification of jewel orchid accessions in Vietnam. J Genet Eng Biotechnol. 19, 93 (2021).
Negi, R. K., Nautiyal, P., Bhatia, R. & Verma, R. rbcL, a potential candidate DNA barcode loci for aconites: conservation of himalayan aconites. Mol Biol Rep. 48, 6769–6777 (2021).
Li, H. et al. The sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Acknowledgements
This project was supported by the Natural Science Foundation for Distinguished Young Scholars of Shandong Province (ZR2023JQ010) and the Key R&D Program of Shandong Province (ZR202211070163). L.G. is also supported by Taishan Scholars Program of Shandong Province. We would like to thank the Bioinformatics Platform at Peking University Institute of Advanced Agricultural Sciences for providing the high-performance computing resources.
Author information
Authors and Affiliations
Contributions
L.G. and W.K.C. conceived the research project. W.K.C., G.Y. and D.M. prepared the samples. L.L.Z. conducted experiment. X.R.W., X.F.W., S.Y., T.M. and W.K.C. performed bioinformatic analysis, prepared figures and tables. X.R.W., L.L.Z. and L.G. wrote and revised the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, X., Zhang, L., Yao, G. et al. De novo chromosome-level genome assembly of Chinese motherwort (Leonurus japonicus). Sci Data 11, 55 (2024). https://doi.org/10.1038/s41597-023-02901-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02901-w