Large-scale data analysis of “-omics” data in the biological sciences.Many experimental procedures such as the various “-omics” techniques routinely employed within biotechnology/biological research fields produce vast amounts of data. Therefore, the amount of available data in many biological disciplines is steadily increasing. Fundamental knowledge and skills of large-scale computing systems and analysis methods is required to make use of this wealth of information. The purpose of this course is to introduce the theory and practice of large-scale data analysis to students, which will allow them to perform and assess different types of ”-omics”-scale data procedures, specifically focusing on Transcriptomic data (RNAseq) and Metabolomic data (LC-MS).
Basic statistical understanding equivalent to a MSc from SCIENCE; Beginners level experience with R***Completion of the course will rely on the production and acceptance of a complete data analysis report in Rmarkdown.
Learning outcome
Knowledge:•The general principles of large-scale data analysis•Common pitfalls in large-scale data analysis•The basic concepts underlying clustering and visualization techniquesSkills:•How to efficiently keep, move, and analyse large amounts of data•How to structure and perform large-scale data analyses in a coding-based software environment, such as for example R•Handling and modifying large datasets•Visualization and dissemination of dataCompetences:•Analysing different types of large-scale biotechnology data•Critically evaluating the quality of different types of biotechnology data•Assessing and understanding results of large-scale data analyses
Lectures and computer exercises.Completion of the course will rely on the production and acceptance of a complete data analysis report in Rmarkdown.
RadEditor - please enable JavaScript to use the rich text editor.