New York University

Computational Genomics




 The genome contained within a human cell is very large and complex. It holds all of the genetic information necessary for its creation and function encoded with a total of six feet of DNA. The goals of the Human Genome Initiative (HGI), as framed by the National Institutes of Health and the Department of Energy, are to generate a complete map, containing well-defined markers, and to sequence the entire human genome within the next seven, or less years. The sequencing aspects of this project will have to deal with approximately 3 billion base pairs. A large number of genes (30,000-40,000) will be identified and characterized in terms of biochemical, developmental, and clinical criteria. Additionally, the development of approaches to globally, and quantitatively, characterize message (RNA transcripts, which direct synthesis of specific proteins) will also play a major role in virtually every aspect of biological, pharmaceutical and clinical research.
The science of computational genomics and bio-informatics have been created out of this massive sea of sequence data and the need to establish functionality of genes largely based on similarities discerned at the level of the DNA code; bypassing the need for extensive biochemical characterization.

 This emerging subfield relies on some classical and many novel mathematical, statistical and algorithmic ideas that are essential to accomplish this task. This course deals with mainly these mathematical and computational approaches. The course is self contained, developing the biological, statistical, probabilistic and algorithmic tools and techniques along the way

Postscript viewer -> download 1 , download2