IMPRS Comparative genomics 2013
(Eva Stukenbrock and Julien Dutheil)
Presentations
- Genetic variation. Here we talk about mutations, diversity, selection and demography.
- Alignment. Here we talk about homology and its inference.
- Phylogeny. Here we demonstrate how to reconstruct the history of sequences.
- Positive selection. Here we introduce the methodology for infering positive selection.
- Positive selection 2: codon models. Here we briefly introduce codon models of sequence evolution.
Practical course
Practical session 1: origin and maintenance of genetic variation
Genetic drift: http://darwin.eeb.uconn.edu/simulations/drift.html
Drift + selection: http://darwin.eeb.uconn.edu/simulations/selection-drift.html
The effect of selection: http://darwin.eeb.uconn.edu/simulations/selection.html
(Simulations are from the Holsinger lab).
The UCSC Genome Browser: http://genome.ucsc.edu/cgi-bin/hgGateway
Link toward population size estimates to paste in UCSC: http://kimura.univ-montp2.fr/~jdutheil/Gorilla/UCSCTracks/thetaHCPerAln.track.gz
Practical session 2: homology and alignment
Get SeaView: http://pbil.univ-lyon1.fr/software/seaview.html
Random sequences generator (R script)
A random sequence file (Fasta)
- Open the file with Seaview
- Align the sequences
- Compare Clustal and Muscle output
- Assess the quality of the alignment
- Filter the alignment using GBLocks
Some globin sequences (Proteins, Fasta)
Practical session 3: reconstructing the history of sequences
[Hemoglobin data set for session 2]
- Using Seaview, build a phylogenetic tree from the previously filtered alignment
- Compare Parsimony, Distance and ML methods
- Assess the confidence of the tree reconstruction
- Identify duplication events and date them
rRNA sequences aligment (Mase)
Full species names for rRNA alignment
Analyze the data! Which model best describes the data?
Practical session 4: inferring positive selection
LysM data and programs (windows exe) and protocol.