Over the last 3 years, I have been involved in designing and creating material for many high-level RNA-seq and single-cell omics courses. All these courses are offered to PhD students Postdocs and high-level researchers in Sweden (NBIS), Finland (CSC) and Switzerland (SIB) via ELIXIR network, either as course leader, as scientific committee member and/or as lecturer. All courses are open source (links below) and also include recorded videos (i.e. Dimensionality Reduction, Trajectory inference lectures). All courses were composed of lectures and detailed hands-on exercise materials (both in R and python). Of relevance, I also implemented two unique courses using Project-Based Learning (PBL) methodology for single-cell RNA-seq data analysis.
Teaching summary:
Feedback from students:
Institutions I collaborate through teaching:
Spatial Omics Data analysis
Online, 32 students, 5 days, 40h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer
Single-cell RNA-seq data analysis
Online, 25 students, 5 days, 40h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer
Advanced Topics in Single Cell Analysis
Online, 32 students, 5 days, 40h
NBIS and SIB, Stockholm, Sweden
Role: Scientific Committee, Course Leader, Lecturer
Single-cell RNA-seq data analysis
Online, 25 students, 5 days, 40h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer | Online, 25 students
RNA-seq data analysis
Online, 25 students, 5 days, 40h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer
RNA Summer School (single-cell)
Presential and Online, 32 students, 5 days, 40h
SIB, Schwarzenberg, Switzerland
Role: Course Leader, Lecturer
Single-cell RNA-seq data analysis
Presential, 24 students, 4 days, 32h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer
Single-cell RNA-seq data analysis
Presential, 24 students, 4 days, 32h
NBIS, Stockholm, Sweden
Role: Course Leader, Lecturer
NBIS-SIB Single Cell School
Presential, 30 students, 5 days, 40h
NBIS and SIB, Leysin, Switzerland
Role: Lecturer
ELIXIR single-cell RNA-seq data analysis with R
Presential, 32 students, 4 days, 32h
CSC, Helsinki, Finland
Role: Lecturer
RNA-seq data analysis
Presential, 20 students, 4 days, 32h
NBIS, Uppsala, Sweden
Role: Course Leader, Lecturer
Dimensionality reduction (DR), 2023
Contents: introduction to DR, types of DR, linear DR using principal component analysis (PCA), introduction to graphs and graph-based DR, t-distributed stochastic Neighbourhood Embedding (tSNE), uniform manifold approximation and projection (UMAP).
Watch Lecture
Data integration and batch correction, 2023
Contents: introduction to data integration, types of integration, method overview for data integration, linear batch correction, graph-based batch correction, batch correction performance comparison, batch correction performance assessment
Watch Lecture
Gene Set Analysis (GSA), 2023
Contents: introduction to GSA, introduction to gene sets and databases, overrepresentation analysis, gene set enrichment analysis, technical consideration for GSA.
Watch Lecture
Trajectory inference analysis (TI), 2023
Contents: introduction to TI, overview of TI methods, independent component analysis (ICA), diffusion maps (DM), minimum spanning tree (MST), principal trees and graph abstraction, RNA velocity, differential gene expression for TI.
Watch Lecture
Principal Component Analysis (PCA), 2023
Contents: introduction to DR, types of DR, linear DR using principal component analysis (PCA), visual intuition on PCA.
Watch lecture
Clustering Analysis, 2023
Contents: introduction to clustering, distance measurements, K-means clustering, introduction to hierarchical clustering, clustering linkage measures, clustering robusteness.
Watch Lecture
Advanced differential gene expression (DGE, part 2), 2023
Contents: Introduction to general linear models (GLM), Poisson vs negative binomial distribution, how to use GLMs in complex interaction between experimental covariates and gene expression (categorial variables, continuous variables, variable intercepts and interaction terms).
Watch Lecture
Online Sequencing Repositories, 2023
Contents: Introduction to online repositories, ENSEMBL, GEO, SRA, ENA, EGA, referencing public datasets, depositing sequencing data into public repositories.
Watch Lecture
Reading Bioinformatics, 2023
Contents: Step-by-step reading of an article containing all the topics learned during the course in RNA-seq data analysis (PCA, DGE, GSA, clustering, visualization, data deposition, etc).
Watch Lecture
Introduction to Single cell data analysis
Contents: introduction to single cell data analysis, introduction to project-based learning (PBL), single cell analysis workflow, single cell sequencing technologies, pre-processing, quality control, dimensionality reduction, data integration, graph construction, clustering, differential expression, trajectory inference.
Introduction to advanced topics in Single cell omics (PBL)
Contents: introduction to single cell data analysis, introduction to project-based learning (PBL), single cell analysis workflow, RNA velocity, multi-omics integration, deep learning spatial transcriptomics.
Methods in microbiota research and potential pitfalls
Contents: Introduction to amplicon sequencing and metagenomics, microbiota analysis overview, designing and pitfalls in experimental research, recommendations for planning experiments to account for microbiota cofounders.
When preparing a course, I am using the constructive alignment approach. I begin by defining learning outcomes which are clear as well as measurable end-results the student should achieve after completing the course. For example, I like to think of learning outcomes as classes of certain problems the student should be able to tackle when the course is finished.
All courses I have thought were focused towards PhD and postdocs. However, most of the students do not possess a background in bioinformatics nor a deep knowledge in mathematical jargon which makes the teaching of computational analysis methods challenging. For this reason, all my lectures on advanced topics are break down to small incremental steps of complexity and are centred in explaining the fundamental concepts and detailed explanation on how the computational methods work in essence. I achieve this by creating animations on the step-by-step of the computational process and how changing some steps might impacts the analysis results.
Another practice I commonly use is by interacting with students to provide a meaningful connection between a concept and a real-life analogy. For example, when teaching shared k nearest neighbours (k-NN) graph concept: I illustrate it based on the physical position between me and the students in the classroom (aka., nearest neighbours) so that the 5 closest students to me are my 5 nearest neighbours, while the 5 closest students to another reference student in the back of the classroom are their 5 nearest neighbours. Therefore, once we identify the 5 nearest neighbours for all students in class (representing the data points), then we constructed the 5-NN graph. The lecture then continues on this real-life exemplification on how we use the 5-NN graph to understand the physical structure of the classroom (as an analogy for the “structure of the data” concept).
Some teaching decisions are not only based on the theoretical knowledge per se, but also involves how to make a real impact on the students’ future research career. As an advanced bioinformatics expert, my view is that data analysis should be done in reproducible and systematic manner so results can be replicated. Therefore, 2 years ago, I started providing the same course contents but using CONDA-environments instead, which is a tool commonly used by us experts to achieve code reproducibility. Therefore, by teaching how to use CONDA in all my courses, enforces good bioinformatics practices since the beginning in order to impact on quality of their research.
After each and every course, students are given course evaluation forms to be filled, containing multiple choice and open feedback questions about each section of the course as well as feedback on the course structure and format. Most questions are graded in a scale from 1-5, where 1 represents “I did not like it” and 5 represents “I liked it a lot”. This way, each lecture and exercise can be monitored for performance, since the areas where students usually have more difficulty also receive lower grades (although still with average score above 4 out of 5) followed by specific comments on what they think could be improved in open question section.
The evaluations are used by us, the course leaders, to plan the next course instance and identify these potential weak points that require further development. As course leaders and teachers, we take notes on the overall development of the course, areas of struggle and computational issues. Once the final evaluation forms are filled, the course leaders gather 1 week after the course end, so that the details and most important action points could be prioritised (see course summary attached). Examples include addition of new content, ways to improve the lecture flow, enhance didactics for complex concepts, facilitate exercise flows, clarify coding challenges and tasks as well as incorporate more details where needed. In many cases, the course analysis is more thorough than we can execute, so certain action points need to be prioritised for the next course instance.