High Throughput Sequencing¶
Level 1 - Beginner
Introduction to NeSI
Quality filtering Illumina data
Quality filtering Nanopore data
Annotating sequences with BLAST
Level 2 - Advanced
Working in the shell
De novo assembly of sequencing data
- Overview
- Short read assembly with SPAdes
- Long read assembly with Flye
- Assessing the quality of a genome
- Polishing of genome assemblies
Mapping reads to a reference
- Overview
- Illumina mapping with bowtie2
- Nanopore mapping with minimap2
- Filtering and sorting mapping files
- Summarising mapping statistics
Performing gene prediction and classification
Proficiency testing
Data used in training
This workshop provides a basic introduction to working with the slurm scheduling system, and begins working with Illumina MiSeq and Oxford Nanopore Technology sequence data. The data used in this workshop is mostly using simulated reads, produced using InSilicoSeq[^2] from the Mycoplasma bovis 8790 reference genome NZ_LAUS01000004.1. We also make use of publicly available sequencing data from the studies PRJNA813586, PRJEB38441, and PRJEB38523.
Additional teaching materials were sourced from:
- Genomics Aoteoroa Metagenomic Summer School workshop2.
- Long-Read, long reach Bioinformatics Tutorial3.
- Galaxy Training! sequence analysis resources4.
Citations
-
Erin Alison Becker, Anita Schürch, Tracy Teal, Sheldon John McKay, Jessica Elizabeth Mizzi, François Michonneau, et al. (2019, June). datacarpentry/shell-genomics: Data Carpentry: Introduction to the shell for genomics data, June 2019 (Version v2019.06.1). Zenodo. http://doi.org/10.5281/zenodo.3260560.
-
Hadrien Gourlé, Oskar Karlsson-Lindsjö, Juliette Hayer, Erik Bongcam-Rudloff (2019). Simulating Illumina metagenomic data with InSilicoSeq. Bioinformatics 35(3), 521-522.
-
Jian Sheng Boey, Dinindu Senanayake, Michael Hoggard et al. (2022). Metagenomics Summer School https://github.com/GenomicsAotearoa/metagenomics_summer_school.
-
Tim Kahlke (2021). Long-Read Data Analysis https://timkahlke.github.io/LongRead_tutorials/.
-
Joachim Wolff, Bérénice Batut, Helena Rasche (2023). Sequence Analysis (revision 96e0180). https://training.galaxyproject.org/training-material/topics/sequence-analysis/.