Team II Gene Prediction Group: Difference between revisions
Line 37: | Line 37: | ||
=='''Non-Coding RNA Methods'''== | =='''Non-Coding RNA Methods'''== | ||
* Non-coding RNA (ncRNA) - an RNA molecule that is not translated into a protein | |||
* transfer RNAs (tRNAs), ribosomal RNAs (rRNAs) and small RNAs(sRNAs) | |||
* Protein synthesis/Translation (tRNA and rRNA) & gene regulation (sRNA) | |||
* Related to antibiotic resistance | |||
=='''Final Pipeline'''== | =='''Final Pipeline'''== | ||
=='''References'''== | =='''References'''== |
Revision as of 14:53, 23 February 2020
Team 2: Gene Prediction
Team Members: Danielle Temples, Kara Lee, Paarth Parekh, Shuting Lin
Introduction
Our Project
Purpose: Investigate an unknown outbreak pathogen using raw genome sequence data from the Centers for Disease Control and Prevention (CDC) foodborne illness surveillance outbreak investigations
Overall Objective: Identify and characterize the pathogenic organism, make recommendations for the outbreak control, and build a public webserver that automates the computational steps
Our Objective: From assembled genomes, predict genes or features using different prediction methods and evaluate selected tools on their accuracy and performance
What is Gene Prediction?
Identification of the regions of genomic DNA that encode genes, which are fragments of DNA that encodes a functional molecule:
- Protein-coding genes
- RNA genes
- May also include other functional elements (i.e. regulatory regions)
Prokaryotic Genome
Prokaryotic Genomes have a high gene density and do not contain introns in their protein-coding regions. Genes are called Open Reading Frames or “ORFs” (include start & stop codon).
Prediction of prokaryotic genes tends to be relatively simpler with contiguous ORFs. However, overlapping ORFs and short genes can cause issues. Each gene is an ORF, but not every ORF is a gene.
Homology Methods
- Makes predictions via comparisons with sequences of previously known genes
- Based on extrinsic information
- Can be used to validate/support Ab Initio findings
- Limited by the use of no new knowledge
Ab Initio Methods
- Inspect the input sequence and searches for traces of gene presence
- Simplest method is to inspect ORFs
- Relies on probability models & specific DNA motifs (signals)
- Markov Models and Dynamic Programming
Tool Evaluation
Non-Coding RNA Methods
- Non-coding RNA (ncRNA) - an RNA molecule that is not translated into a protein
- transfer RNAs (tRNAs), ribosomal RNAs (rRNAs) and small RNAs(sRNAs)
- Protein synthesis/Translation (tRNA and rRNA) & gene regulation (sRNA)
- Related to antibiotic resistance