Doctoral Thesis
Transcriptomic Characterization Using RNA-Seq Data Analysis
Final Thesis 6.49 MB Final Thesis 306.44 kB Summary of Thesis 2.55 MBAuthor of thesis: Layal Abo Khayal, Ph.D.
Acad. year: 2017/2018
Supervisor: prof. Ing. Valentýna Provazník, Ph.D.
Reviewers: prof. Pharm.Dr. Petr Babula, Ph.D., Ing. Matej Lexa, Ph.D.
Abstract:The high-throughputs sequence technologies produce a massive amount of data, that can reveal new genes, identify splice variants, and quantify gene expression genome-wide. However, the volume and the complexity of data from RNA-seq experiments necessitate a scalable, and mathematical analysis based on a robust statistical model. Therefore, it is challenging to design integrated workflow, that incorporates the various analysis procedures. Particularly, the comparative transcriptome analysis is complicated due to several sources of measurement variability and poses numerous statistical challenges. In this research, we performed an integrated transcriptional profiling pipeline, which generates novel reproducible codes to obtain biologically interpretable results. Starting with the annotation of RNA-seq data and quality assessment, we provided a set of codes to serve the quality assessment visualization needed for establishing the RNA-Seq data analysis experiment. Additionally, we performed comprehensive differential gene expression analysis, presenting descriptive methods to interpret the RNA-Seq data. For implementing alternative splicing and differential exons usage analysis, we improved the performance of the Bioconductor package DEXSeq by defining the open reading frame of the exonic regions, which are differentially used between biological conditions due to the alternative splicing of the transcripts. Furthermore, we present a new methodology to analyze the differentially expressed long non-coding RNA, by finding the functional correlation of the long non-coding RNA with neighboring differential expressed protein coding genes. Thus, we obtain a clearer view of the regulation mechanism, and give a hypothesis about the role of long non-coding RNA in gene expression regulation.
RNA-Seq, Differential Gene Expression (DGE), Alternative splicing, Differential Exon Usage (DEU), long non-coding RNA (lncRNA).
Date of defence
02.05.2018
Result of the defence
Defended (thesis was successfully defended)
Process of defence
Komise považuje disertaci doktorandky za bezproblémovou. Největší přínos spatřuje v analýze Long RNA dat.
Language of thesis
English
Faculty
Department
Study programme
Electrical Engineering and Communication (EKT-PK)
Field of study
Biomedical Electronics and Biocybernetics (PP-BEB)
Composition of Committee
doc. Ing. Daniel Schwarz, Ph.D. (předseda)
prof. PharmDr. Petr Babula, Ph.D. - oponent (člen)
MUDr. Petr Džubák, Ph.D. (člen)
doc. Ing. Radim Kolář, Ph.D. (člen)
Ing. Matej Lexa, Ph.D. - oponent (člen)
Supervisor’s report
prof. Ing. Valentýna Provazník, Ph.D.
Reviewer’s report
prof. Pharm.Dr. Petr Babula, Ph.D.
File inserted by the reviewer | Size |
---|---|
Posudek oponenta [.pdf] | 988,95 kB |
Reviewer’s report
Ing. Matej Lexa, Ph.D.
File inserted by the reviewer | Size |
---|---|
Posudek oponenta [.pdf] | 654,40 kB |