Genomic studies have revealed in details the expression dynamics and control of genes in several organisms, allowing a more accurate manipulation of genetic information, specially in agriculture. The first plant genome released was of the model plant Arabidopsis thaliana, due to intrinsec features of its biology. Subsequently, several other plant genomes began to be sequenced, as the sugarcane genome (Saccharum officinarum), a species with great role in the development of biofuels. In this context, the study of ncRNA and taxon-specific orphan genes can help to elucidate the fine tuning of genomes and then enable a more precise manipulation of it. Therefore, the understanding of origin and evolution of these transcript classes is indispensable. In this study, we will identify and characterize ncRNA and orphan genes sets of A. thaliana e S. officinarum, through comparisons between the sets constructed and genomic and transcriptomic data sets. The two sets of each species will be compared to elucidate the creation de novo of orphan genes from ncRNA and its integration in cellular networks, through the coexpression analysis of these transcripts. To accomplish this, we will build coding-non-coding coexpression networks, using RNA-Seq data sets, that will provide key information about the dynamics involved at the functions and evolution of ncRNAs and orphan genes.
News published in Agência FAPESP Newsletter about the scholarship: