Sequence Alignment and Gene Prediction

Vic University, March 2013

Cedric Notredame





OBJECTIVES

This course is a 6-hour primer on sequence alignments. Its goal is to present an overview of the basic concepts of sequence alignment and some of their applications. The first two hours will be dedicated to molecular evolution. We will focus on the implications of molecular evolution for sequence variation, and we will use these concepts to define homology. We will then see how specific mathematical models (the substitution matrices) have been derived in order to quantify the evolutionary relationship between sequences. The next two hours will be used to introduce the Needleman and Wunsch algorithm (dynamic programming), a very basic algorithm that makes it possible to derive pairwise alignments from the sequences while using the substitution matrices. Over the following two hours, we will see how these pairwise alignment methods can be applied to database searches and we will develop the main concepts behind the BLAST algorithm. Finally, I will introduce the notion of multiple sequence alignment and show how a group of related sequences can be compared in order to infer common properties. We will then see the main principles behind two multiple sequence alignment packages: ClustalW and T-Coffee.
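As a rough illustration of the dynamic programming idea behind the Needleman and Wunsch algorithm, here is a minimal Python sketch of global pairwise alignment. It is not the course's own implementation: the function name `needleman_wunsch`, the simple match/mismatch scores and the linear gap penalty are all assumptions made for this example. In practice the match/mismatch scores would be replaced by a lookup in a substitution matrix.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of strings a and b (illustrative sketch).

    Scoring is an assumption for this example: a fixed match/mismatch
    score stands in for a substitution matrix, and gaps are linear.
    Returns (score, aligned_a, aligned_b).
    """
    n, m = len(a), len(b)

    def sub(x, y):
        # Stand-in for a substitution matrix lookup.
        return match if x == y else mismatch

    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    # Fill the table: each cell takes the best of three moves.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]),  # align two residues
                score[i - 1][j] + gap,                          # gap in b
                score[i][j - 1] + gap,                          # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("ACGT", "AGT"))
```

The table has (n+1) x (m+1) cells and each is filled in constant time, so both time and memory grow with the product of the two sequence lengths. When several alignments tie for the best score, this sketch reports only one of them, preferring the diagonal move during traceback.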



Send your questions to: cedric.notredame@crg.es



Date        Location  Session    Title                                                Links
13/03/2013  BCN       LECTURE    Evolution and Comparison Models                      L
13/03/2013  BCN       LECTURE    Blast and Database Searches                          L
13/03/2013  BCN       PRACTICAL  Database Searches                                    P
14/03/2013  BCN       LECTURE    Dynamic Programming Based Sequence Comparison        L
14/03/2013  BCN       PRACTICAL  Introduction to Dynamic Programming                  P
19/03/2013  BCN       LECTURE    Introduction to Multiple Sequence Alignments         L
19/03/2013  BCN       LECTURE    Introduction to Gene Prediction                      L
19/03/2013  BCN       PRACTICAL  Using T-Coffee to combine Sequences and Structures   P



REFERENCES

1. Claverie and Notredame, Bioinformatics for Dummies, 2007, Wiley

2. Durbin et al., Biological Sequence Analysis, 1998, Cambridge University Press

3. Patthy, Protein Evolution, 2007, Wiley



This entire course was automatically generated using BED, the Bioinformatics Exercise Database. BED is freeware, available on request from Cedric Notredame.