Genomics of Gene Expression


Speaker: Sonia Tarazona
Genomics Of Gene Expression Lab
Date: 27/06/2014 – 13h
Place: Salón de Actos CIPF


Recent developments in the omics field have resulted in the availability of a wide array of
high-throughput technologies that allow the study of cell biology at different levels of molecular
organization. In particular, the explosion in recent years of next-generation sequencing (NGS)
applications and their continuing drop in price make genome-wide, system-oriented
approaches in biomedical research increasingly affordable for many molecular biology labs.

However, there is still a gap between the available tools for statistical analysis of a single data
type versus the requirements of biomedical scientists who address their studies through
multiple omics approaches and are faced with the challenge of understanding the combined
results in an integrative fashion. Furthermore, researchers need guidance on how to design
such integrative analyses more efficiently, both when planning the experiments and when
collecting the data. These analytical challenges are critical for successfully translating
investments in omics experiments into significant advances in biomedical knowledge.

The STATegra project aims to fill this scientific and technological gap in current genomics
research by developing an appropriate and accurate analysis framework that will permit a
more efficient use of the omics data and a better understanding of the results. Collaboration
between statistical experts, biomedical researchers, data producers and software developers
ensures that statisticians are close to the needs of the experimentalists, understand the nature
of the data and create sound analytical solutions.

In order to design the most appropriate statistical procedures for pre-processing and analysis
of data in a real context, a complete set of omics experimental data has been produced within
the STATegra project. The goal of this benchmarking experiment is to study the role of the
transcription factor Ikaros in B cell development. To this end, experimental measurements for a time
course of B cell differentiation under control of Ikaros were generated and different data types
were collected: mRNA-seq, miRNA-seq, ChIP-seq, Methyl-seq, DNase-seq, Proteomics and

In this seminar, we present some of the issues that arise in the pre-processing and
analysis of omics data from an integrative point of view, together with possible solutions.