In the past five years, high-throughput sequencing became the main tool for genetic studies. The exponential increase in the volume of sequencing, allied with technology cost reduction, have created new challenges for computing. High-performance computing proposes solutions that are able to process large amount of data in a short amount of time. The evolution of this area led to the development of new architectures, such as cloud computing and heterogeneous computing. Cloud computing takes advantage of connectivity to the Internet to provide low-cost services; whereas heterogeneous computing combines high-performance computers with specialized devices to quickly process large volumes of data. To get the benefits of these new solutions in bioinformatics, it is necessary to optimize the tools and protocols that already exist or create new solutions. Besides the ability to analyze larger amount of genetic data sets, it is necessary to develop new methods for integrating results from different studies. Thus, this project aims to develop tools and automated pipelines for processing next-generation sequencing data, covering epigenetics, transcriptomics and genomics studies. The results of these studies will be integrated with public data bases of genomic information. We believe that high-performance computing will be the standard solution in -omics studies in a few years. Therefore, we want to anticipate and contribute positively in this area of knowledge, by providing tools that will be routinely used in research. Our work will also assist in the development of high-performance computing, since it will make this technology to be used more frequently by the scientific community.
News published in Agência FAPESP Newsletter about the scholarship: