The Lupinus luteus database (LuluDB) gathers information about present research on non-coding and coding RNA. Based on their length, non-coding RNAs are classified into short (less than 200 nt) and long (over 200 nt) categories [1]. Short ncRNAs are represented by miRNA (micro RNAs) and phasiRNA (phased, secondary, small interfering RNAs originally designated as trans-acting small interfering RNAs or tasiRNAs) [2,3]. They are involved in post-transcriptional control of their target gene activity in the process of RNA interference. Short ncRNAs binding to specific mRNA on the principle of complementarity leads either to its cleavage within the bound sequence or inhibition of its translation [4,5]. Long ncRNAs were shown to be potent cis- and trans-regulators of gene transcription, and can act as: (i) scaffolds for chromatin-modifying complexes, (ii) decoy for splicing factors, or (iii) competitors for miRNA binding sites [1].
LuluDB was created basing on NGS sequencing analysis of following libraries: 32 sRNA, 18 transcriptome and 2 degradome obtained from flowers and pods of yellow lupine cv. Taper. It contains sequences of miRNAs and phased siRNAs identified in L. luteus, information about their expression in individual samples and their target sequences. LuluDB also contains identified lncRNAs and protein-coding RNA sequences with their organ expression and annotations to widely used databases like GO, KEGG, NCBI, Rfam, Pfam etc.
The database contains sequences of 456 known and 32 novel miRNAs, as well as 318 phased siRNAs (ta-siRNAs) identified in yellow lupine along with information about their expression and target transcripts. Each sRNA received an unique ID number. In case of miRNAs, known miRNAs (i.e. having identical hits in miRBase [6]) were assigned IDs from 1 to 456, and the numbering of novel miRNAs (identified with ShortStack [7]) started from 457 up.
LuluDB contains 267,349 protein-coding RNA sequences. Because the reference yellow lupine genome had not been released yet, the transcripts were assembled de novo. This assembly was carried out separately for libraries derived from flowers [8] and pods with Trinity toolkit, which assigned an ID for each transcript (e.g.TRINITY_DN10038_c0_g1_i1) within each batch separately. This created the risk that completely dissimilar transcripts in flowers and pods could have the same ID. We fixed this in three ways: (i) by providing each transcript with information about its origin (flowers or pods), (ii) by adding the “F” prefix to TRINITY ID in flower dataset (e.g. FTRINITY_DN53848_c2_g1_i5) and (iii) by assigning additional ID for the database (e.g. LI_transcript_534367). All of the assembled transcripts were clustered and, within each cluster, they were assigned "Gene name" (e.g. LI_gene_11901).
By building LuluDB we aimed to popularize the genetic research on this important crop plant and integrate our RNAseq data for yellow lupine protein-coding and non-coding RNA sequences in one place. This will also support the research on universal regulatory mechanisms of plant development. Thus, regardless of the model plant you are working with, you can search the LuluDB database for homological sequences by performing BLAST using your sequence as a query.
You are welcome to deposit your sequences in our database! We encourage you to contact us.
References:
Yellow lupine (Lupinus luteus L.) belongs to a legume family that benefits from symbiosis with nitrogen-fixing bacteria. Seeds of this species are rich in proteins that constitute up to 40% of their dry mass [1]. Additionally, years of research have led to the selection of alkaloid-free „sweet” cultivars. All these traits make lupine seeds a valuable food source for animals and humans primarily in climatic conditions unfavorable for soybean cultivation [2].
The main constraint on a large-scale cultivation of yellow lupine comes from its excessive shedding of generative organs, which contributes to significant yield losses. Therefore, current research focuses on the development of varieties of yellow lupine and cultivation conditions that would prevent massive flower and pod dropping, consequently stabilizing the yield in various environmental conditions [3]. Besides its practical significance, yellow lupine is also an excellent model plant for basic research on nodulation [4] or abscission of generative organs [5-7].
Contemporary advances in molecular biology e.g development of next generation sequencing (NGS) support the traditional methods for selection of new varieties of crops. Modern techniques also extend the range of possibilities of analysing the impact of various endogenous and environmental factors on the development of yellow lupine [5-8]. However, the knowledge about the mechanisms of regulation of its generative development is still fragmentary, especially concerning the involvement of short and long non-coding RNA in this process.
References
Paulina Glazińska (PhD)
Department of Plant Physiology and Biotechnology
Faculty of Biological and Veterinary Sciences
Nicolaus Copernicus University
Torun, Poland
paulina.glazinska [at] umk.pl
Glazinska Paulina (PhD)1, Glinkowski Wojciech (MSc)1, Kosinski Jan (MSc)2,4, Kulasek Milena (MSc)1, Szczesniak Michal (PhD)2,3, Wojciechowski Waldemar (PhD)1, Wysocka Marta (MSc)4
1 - Department of Plant Physiology and Biotechnology, Faculty of Biological and Veterinary Sciences, Nicolaus Copernicus University, Torun, PolandSupported by The National Science Centre SONATA grant No. 2015/19/D/NZ9/03601
If you make use of the data presented here, please cite the following article:
Glazinska P, Kulasek M, Glinkowski W, Wysocka M, Kosiński JG. LuluDB-The Database Created Based on Small RNA, Transcriptome, and Degradome Sequencing Shows the Wide Landscape of Non-coding and Coding RNA in Yellow Lupine (Lupinus luteus L.) Flowers and Pods. Front Genet. 2020;11:455. Published 2020 May 15. doi:10.3389/fgene.2020.00455