Team II Comparative Genomics Group

From Compgenomics 2020
Jump to navigation Jump to search

Team 2: Comparative Genomics

Team Members: Kara Keun Lee, Courtney Astore, Kristine Lacek, Ujani Hazra, Jayson Chao


Introduction

What is Comparative Genomics?

Objectives

  • Identify kinds of strains (outbreak vs. sporadic)​
  • Construct phylogeny demonstrating which isolates are related and which differ​
  • Determine source of outbreak​
  • Map virulence and antibiotic resistence features of outbreak isolates​
  • Compile recommendations for outbreak response and treatment

Overview of Techniques

When performing phylogenomics, there are many options by which one can classify similarities and differences across the genome. Our approach utilizes tools from three different techniques.

Hierarical Clustering

MLST

SNP-based

SNP stands for Single Nucleotide Polymorphism, meaning that certain alleles have two or three possibilities as to which base is at a given locus. As SNPs accumulate through de novo mutations and are passed down through generations, comparing a given isolate's SNPs to other isolates and a reference genome allow ascertainment of phylogenetic distance between samples. Tools have been developed to compare bases position by position (SNP-calling) and create matrices to compute relatedness between samples based on common SNPs. Generalized Algorithm Overview:

  • Pre-processing and read cleaning
  • Mapping
  • SNP calling against reference genome
  • Phylogeny generation based on SNP profiles