Fundamentals in Computational Analysis of Large-Scale Datasets (Online)
Provider: Faculty of Health and Medical Sciences

Activity no.: 3918-21-00-00
There are no available seats
Enrollment deadline: 08/02/2021
Date and time: 08.03.2021 at 09:00 - 19.03.2021 at 14:00
Regular seats: 20
Course fee: 7,200.00 kr.
Lecturers: Martin Sikora
ECTS credits: 5.00
Contact person: Kirsten Wivel-Snejbjerg, e-mail address: kws@adm.ku.dk
Enrolment Handling/Course Organiser: PhD administration, e-mail address: phdkursus@sund.ku.dk

Aim and content
This is a generic course. This means that the course is reserved for PhD students at the Graduate School of Health and Medical Sciences at UCPH. Anyone can apply for the course, but if you are not a PhD student at the Graduate School, you will be placed on the waiting list until the enrolment deadline. After the enrolment deadline, any available seats will be allocated to applicants on the waiting list.
The course is free of charge for PhD students at Danish universities (except Copenhagen Business School), and for PhD students at graduate schools in the other Nordic countries. All other participants must pay the course fee.


Learning objectives
A student who has met the objectives of the course will be able to:
1. Manage large-scale datasets using the UNIX command line
2. Import, visualize, transform and summarize datasets using the R statistical programming language
3. Understand basic concepts in probability theory
4. Distinguish supervised and unsupervised statistical learning and their applications
5. Perform a comprehensive exploratory analysis on a given real-world dataset


Content
This course gives attendees a broad introduction to the fundamentals of modern computational data analysis. The aim is to equip attendees with the basic tools for “making sense of data”, from the fundamentals of working with large-scale datasets to introductory probability theory and statistics. The first half of the course is dedicated to practical aspects of computational data analysis using the UNIX shell and the R statistical programming language. Topics include an introduction to UNIX and R; data visualization and data wrangling in R using the tidyverse suite of packages; and reproducible computational workflows (snakemake). In the second half of the course, students are introduced to basic concepts in probability theory and statistics, with topics including a probability theory bootcamp; an introduction to supervised learning (linear regression); and an introduction to unsupervised learning (PCA). Students learn these topics through a combination of introductory lectures and hands-on analysis examples on real-world datasets.


Participants
PhD fellows in the “Life, Earth and Environmental Sciences” Programme (required course) or related fields.


Relevance to graduate programmes
The course is relevant to PhD students from the following graduate programmes at the Graduate School of Health and Medical Sciences, UCPH:
Life, Earth and Environmental Sciences
Biostatistics and Bioinformatics


Language
English


Form
Combination of lectures and practical computational exercises


Course director
Martin Sikora, Associate Professor, Globe Institute, University of Copenhagen.
martin.sikora@sund.ku.dk


Teachers
Martin Sikora, Associate Professor, Globe Institute, University of Copenhagen.
martin.sikora@sund.ku.dk

Fernando Racimo, Associate Professor, Globe Institute, University of Copenhagen.
fracimo@sund.ku.dk

Shyam Gopalakrishnan, Associate Professor, Globe Institute, University of Copenhagen.
shyam.gopalakrishnan@sund.ku.dk

Thorfinn Sand Korneliussen, Assistant Professor, Globe Institute, University of Copenhagen.
tskorneliussen@sund.ku.dk


Teaching assistants
Rasa Muktupavela, PhD Fellow, Globe Institute, University of Copenhagen.
Alba Refoyo Martinez, PhD Fellow, Globe Institute, University of Copenhagen.
Tharsika Vimalasuntharam, PhD Fellow, Globe Institute, University of Copenhagen.


Dates
8-19 March 2021 (Block 3, 2 weeks), Monday-Friday 9:00 – 14:00


Course location
Online


Registration
Please register before 8 February 2021.
Seats for PhD students from other Danish universities will be allocated on a first-come, first-served basis and according to the applicable rules.
Applications from other participants will be considered after the last day of enrolment.

Note: All applicants are asked to submit invoice details in case of no-show, late cancellation or obligation to pay the course fee (typically non-PhD students). If you are a PhD student, your participation in the course must be in agreement with your principal supervisor.
