Course Syllabus

CSE284 Spring 2021 Syllabus

Note: for Spring 2021, the entire course will be held remotely and all materials will be made available asynchronously. No in-person attendance is required.

Instructor: Dr. Melissa Gymrek, mgymrek@eng.ucsd.edu

TAs: Yuan Yuan, yuy188@ucsd.edu; Timofey Prodanov, tprodanov@eng.ucsd.edu

Lectures: MW 3:30-4:50pm (Zoom)

Office Hours:

  • Melissa 4:55-5:30pm MW (Zoom)
  • Yuan 10-11am T (Zoom)
  • Timofey 1-2pm F (Zoom)

Private appointments to discuss accommodations or special circumstances can be arranged by email.

Course Description

Genome-sequencing is quickly becoming a commodity, and more than a million people have already analyzed their own genomes through direct-to-consumer companies. This course provides an introduction to current bioinformatics techniques for analyzing and interpreting human genomes. We will learn how to interpret a single genome in the context of an entire population, based on the often quoted concept: interpreting one genome requires tens of thousands of genomes. Topics covered include an introduction to human medical and population genetics, human ancestry, finding and interpreting disease-causing variants, genome-wide association studies, genetic risk prediction, analyzing next generation sequencing data, and how to scale current genomics techniques to analyze hundreds of thousands of genomes. We will also discuss the social impact of the personal genomics revolution.

Prerequisites

There are no official prerequisites for this course, but it is assumed students have some programming experience with Python and are familiar with using the UNIX command line.

Course objectives

Through this course students will:

  • Gain basic bioinformatics skills needed to analyze a personal genome using the UNIX command line.
  • Gain the ability to critically read and interpret basic science and translational literature relevant to personal genomics.
  • Demonstrate knowledge and understanding of the social impacts of the personal genomics revolution.
  • Gain skills and experience necessary to carry out original research related to personal genomics.

Course structure

The course will consist of 5 modules (Intro, ancestry, trait prediction, NGS, special topics). Each module will have a corresponding problem set due at the end of the module. Problem sets will be completed on JupyterHub, and will consist of a mixture of programing assignments and short answer questions. We will have journal club discussions in class approximately every other week. For those discussions, you will be asked to submit some short answer questions before class. Finally, each student will work in a group to complete a final project, described below.

Deadlines are set up such that at most 1 assignment is due in a given week (problem set, journal club, or project item). (Note since Memorial Day falls in week 10 for this quarter, we had to break this rule during week 8 this year).

Course sessions

Lectures are held MW 3:30-4:50 by Zoom. Class meetings will consist of two sections separated by a short 5 minute break. The first section will usually consist of a lecture (~45 minutes). The second session (~30 minutes) will usually consist either of a journal club discussion or practical tutorials regarding different bioinformatics methods for genome analysis.

Genotyping FAQ

The goal of this course is to teach you how to analyze your genome. At this time, we are not able to provide genotyping or sequencing services to students, and instead homeworks will be performed on publicly available human genomes. You are welcome to analyze your own genome using a direct-to-consumer service such as 23andme or Ancestry.com. You will also get access to a range of fun tools on their website that will complement some of the analyses we do in class. Many homework assignments are built to analyze the type of data available from 23andme.

Please keep in mind the following:

  • It is NOT required that you analyze your own genome. Your grade is in no way influenced by whether or not you participate in 23andme.
  • To ensure the previous point, you do not need to tell me if you will be analyzing your own genome.
  • If you do 23andme, it is at your own risk and at your expense. We do not have funding to pay for the test.
  • While you are free to analyze your genome, you must still complete the homework assignments with the provided genomes as well.
  • 23andme can take up to six weeks, so sign up early if you want to use it during class.

Grading

Your grade will be based on:

  • Problem sets (50%)
  • Journal club participation + questions (20%)
  • Final projects (30%)

There will be one problem set for each of the first 4 modules that will mostly involve performing analyses on genomes using existing tools. PS1 is worth 5%. The other three problem sets are worth 15% each. Problem sets are graded out of 100 points. Late problem sets will lose five points for each day past the due date. We will cap this after 4 days. So if you submit after 4 days you can still get up to 80/100. 

We will have 4 journal club discussions throughout the quarter. Short answer questions will be worth 5% each and will be graded out of 10 points.

The final project will consist of a proposal (5%), paper (20%), and presentation (5%). 

Final grades will be out of 100 points. ≥70 points is passing. If you are taking the course for a letter grade, we will use the following scale: A+:97-100, A:93-96, A-:90-92, B+:87-89, B:83-86, B-:80-82, C+:77-79, C:73-76, C-:70-72, F:0-69.

Academic integrity

While working on assignments you are encouraged to chat with your classmates, look things up online, and ask instructors for help. You may use scientific literature, but must cite it in the text. DO NOT directly copy/paste code to/from other students in the class or websites. DO NOT post questions about assignments on sites like seqanswers or stack overflow (you can research existing threads on these sites, just don't start new ones for the coursework). 

Accommodations for students with disabilities

If you have a disability for which you are or may be requesting accommodations, please contact the Office for Students with Disabilities. You must have documentation from the the Office before accommodations can be granted.

Diversity and inclusion

We are committed to fostering a learning environment for this course that supports a diversity of thoughts, perspectives and experiences, and respects your identities (including race, ethnicity, heritage, gender, sex, class, sexuality, religion, ability, age, educational background, etc.). Our goal is to create a diverse and inclusive learning environment where all students feel comfortable and can thrive.

Our instructional staff will make a concerted effort to be welcoming and inclusive to the wide diversity of students in this course. If there is a way we can make you feel more included please let one of the course staff know. Our learning about diverse perspectives and identities is an ongoing process, and we welcome your perspectives and input.

We also expect that you, as a student in this course, will honor and respect your classmates, abiding by the UCSD Principles of Community (https://ucsd.edu/about/principles.html). Please understand that others’ backgrounds, perspectives and experiences may be different than your own, and help us to build an environment where everyone is respected and feels comfortable.

If you experience any sort of harassment or discrimination, please contact the instructor as soon as possible. If you prefer to speak with someone outside of the course, please contact the Office of Prevention of Harassment and Discrimination: https://ophd.ucsd.edu/.

Disclaimer

While we have every intention of following this syllabus, any information here is subject to change.

Course Summary:

Date Details Due