Bioinformatics for Environmental Sequencing (DNA metabarcoding)

These pages contain course material for BIO9905MERG1, Spring 2023.
For the course page on the University of Oslo webpage, click here.

Course content

High-throughput sequencing (HTS) of environmental DNA has become a powerful approach for mapping and exploring communities of both micro- and macroorganisms. One can either analyze the total DNA content to learn which genes are present (DNA metagenomics) or sequence a selected PCR-amplified marker (DNA metabarcoding) to obtain information about the taxonomic composition. This course focuses on the latter approach. Students will be introduced to important bioinformatics approaches, from the processing of raw sequence data to the construction of the OTU/sample matrix and the taxonomic identification of the sequences.
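As a small illustration of what an OTU/sample matrix is, the sketch below builds one from a list of read assignments. It is only a toy example: the sample names, OTU labels and the helper `build_otu_table` are hypothetical, and in the course this step is carried out with dedicated tools rather than hand-written code.

```python
from collections import Counter


def build_otu_table(assignments):
    """Build an OTU/sample count matrix from (sample, otu) pairs.

    Returns (otus, samples, matrix), where matrix[i][j] is the number
    of reads assigned to otus[i] in samples[j].
    """
    counts = Counter(assignments)  # keys are (sample, otu) pairs
    samples = sorted({sample for sample, _ in assignments})
    otus = sorted({otu for _, otu in assignments})
    matrix = [[counts[(sample, otu)] for sample in samples] for otu in otus]
    return otus, samples, matrix


# Hypothetical read assignments: one (sample, OTU) pair per read.
reads = [
    ("soil_A", "OTU_1"),
    ("soil_A", "OTU_1"),
    ("soil_A", "OTU_2"),
    ("soil_B", "OTU_2"),
    ("soil_B", "OTU_3"),
]

otus, samples, matrix = build_otu_table(reads)
for otu, row in zip(otus, matrix):
    print(otu, row)
```

Rows are OTUs and columns are samples here; the transposed layout is equally common, so it is worth checking which orientation a given downstream tool expects.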

Important themes will be (1) filtering and quality assessment of HTS data, (2) error correction and/or clustering of HTS data, and (3) taxonomic annotation of HTS data. We will also touch upon some further downstream analyses, including network analyses and evolutionary placement of HTS reads onto backbone phylogenies. A wide suite of tools will be presented, including VSEARCH and DADA2.
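To make theme (1) concrete, here is a minimal sketch of one common quality-filtering criterion, the expected number of errors in a read. It assumes Phred+33 encoded FASTQ quality strings; the function names and the threshold of 2.0 are illustrative assumptions, not settings prescribed by the course. Both DADA2 (`maxEE`) and VSEARCH (`--fastq_maxee`) offer a filter of this kind.

```python
def expected_errors(quality, offset=33):
    """Sum of per-base error probabilities for a FASTQ quality string.

    Each character encodes a Phred score Q = ord(char) - offset, and the
    corresponding error probability is 10 ** (-Q / 10).
    """
    return sum(10 ** (-(ord(char) - offset) / 10) for char in quality)


def passes_filter(quality, max_ee=2.0):
    """Keep a read only if its expected error count is at most max_ee.

    The default threshold is an assumption chosen for illustration.
    """
    return expected_errors(quality) <= max_ee


# 'I' encodes Q40 (error probability 0.0001); '!' encodes Q0 (probability 1).
print(passes_filter("IIII"))  # high-quality read
print(passes_filter("!!!"))   # very low-quality read
```

Unlike a mean-quality cutoff, the expected-error sum grows with read length and is dominated by the worst bases, which is why it is usually applied after trimming.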

The course will be a blend of presentations, guest lectures, discussions and a few hands-on sessions. All hands-on sessions will be run in R on your own laptop/computer. Hence, all participants should have R and selected R packages installed; see the Software section below.

Schedule

The course will run from 17 to 21 April, 9:00-17:00 (times may vary). For a detailed overview, see the Program section below.

Report

Those of you who attend the course through the research schools or UiO and want to obtain ECTS credits will have to hand in a report before June 1st. For the report, you should write a text of at least 4 pages about a fictitious research project in which you use DNA metabarcoding to explore the community composition and diversity of a certain habitat and/or ecological gradient. You are free to select the organismal group(s) and the habitat/gradient. In the text you should: (1) define the goal(s) of the study, (2) describe the sampling design, (3) describe the wet-lab work (briefly) and, most importantly, (4) describe the bioinformatics analyses. Describe not only how you plan to carry out the research, but also why you make your choices. On point (4), describe in detail how you plan to analyze your data, which bioinformatics approaches you will use, and why. Also mention what you expect to be problematic, which steps might introduce bias(es) to your results, and what type of bias(es). Concerning the format, you should use Times New Roman size 12, 1.5 line spacing and 2.5 cm margins.

The report should be sent to: [email protected]

Teachers

The main teachers will be Ramiro Logares, Anders K. Krabberød, Micah Dunthorn, Torbjørn Rognes, Frédéric Mahé and Håvard Kauserud (organizer), but other experts will provide guest lectures (see table).

Program

Day	Time (start)	Topic	Responsible
Monday	09:00	Introduction to DNA metabarcoding	Håvard Kauserud
	10:00	Introduction continued
	11:00	Introduction to sequencing techniques	Robert Lyle
	12:00	Lunch break
	13:00	Group work	Håvard Kauserud
	14:00	Introduction to Linux, Google Colab and R	Ramiro Logares
	15:00	Cutadapt and sequence cleaning
	16:00	Help with setup and installation of required packages	Anders K. Krabberød
	17:00	PIZZA

Tuesday	09:00	Introduction to DADA2	A. Krabberød / R. Logares
	10:00	DADA2 continued
	11:00	DADA2 continued
	12:00	Lunch break
	13:00	DADA2 continued
	14:00	DADA2 continued
	15:00	Community Ecology	R. Logares
	16:00	Community Ecology

Wednesday	09:00	Introduction to VSEARCH and Swarm	Torbjørn Rognes
	10:00	From FASTQ files to OTU tables	Frédéric Mahé
	11:00	continued
	12:00	Lunch break
	13:00	LULU and MUMU (C++ version of LULU)	Frédéric Mahé
	14:00	Abundance estimation in DNA metabarcoding	Douglas Yu (Zoom)
	15:00	Early end, social activities (TBA)
	18:00	Social activities (TBA)

Thursday	09:00	Phylogenetic placement/binning of HTS data	Lucas Czech
	10:00	Introduction to long-read DNA metabarcoding	Mahwash Jamy
	11:00	continued
	12:00	Lunch break
	13:00	OTUs, ASVs and phylospecies	Micah Dunthorn
	14:00	Contamination, library prep, aerial metabarcoding	Kristine Bohmann (Zoom)
	15:00	continued
	16:00	LULU and MUMU (C++ version of LULU)	Frédéric Mahé

Friday	09:00	Taxonomic Assignment	Marie Davey
	10:00	Case study (insect metabarcoding)	Marie Davey
	11:00	Metacoder	Ella Thoen
	12:00	Lunch break
	13:00	Downstream analyses: Networks	Ramiro and Anders
	14:00	Case study on DNA metabarcoding	Sundy Maurice (guest lecture)
	15:00	Summing up, Q&A

Software

We will use R (version 4.0.5 or later) and RStudio (version 1.4.1 or later) in this course. In addition, we will use Google Colab for programs that require a Linux/Unix environment.

Everybody should download and install R (https://www.r-project.org/), RStudio (https://www.rstudio.com/) and the required packages before the course starts.

For more information about the required packages click here.

Class of 2023

Supported by Digitalt Liv Norge, ForBio, and Norbis

All the keywords in this explanation, by the way, are totally misleading, due to the everyday quirks of language. Don DeLillo, Ratner's Star.
