Course Syllabus
The syllabus for the course is spelled out in detail on the course home page, with a week by week breakdown. Briefly, we will cover the following:
- Sources of Bioinformatics Data (Week 1)
-
Next gen sequencing, and database filtering for big data searches (Week 2,3)
- Pigeonhole principle for database filtering
- Bloom filters
- Sketching
- Genome segmentation using HMMs (Week 4)
-
Data preprocessing, feature extraction, and abstraction for proteomics (Week 5)
- MS2 based peptide identification
- Peptide quantification
- A gentle introduction to linear algebra (Week 5)
- Unsupervised clustering approaches (Week 5,6)
- Supervised classification methods OR dimensionality reduction and visualization (Week 6)
-
Applications: (Week 7,8,9) (Dr. Zhong)
- Extracellular RNA
- Longitudinal data: linear models, mixture models, classification
- Single cell sequencing and time-course clustering
Grading policy:
- 30% assignments (3 assignments)
- A1
- A2
- A3
- 30% midterm
- 40% final project.
Late submission policy: 5% penalty for each day after the deadline for a maximum of 4 days. Regrade requests must be turned in within 7 days of receiving the grade.