Team I Webserver Group: Difference between revisions
Jump to navigation
Jump to search
Line 33: | Line 33: | ||
*Visualisation: Quast output | *Visualisation: Quast output | ||
*For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Genome_Assembly_Group Team1_Genome_Assembly] | *For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Genome_Assembly_Group Team1_Genome_Assembly] | ||
===Gene Prediction=== | ===Gene Prediction=== | ||
*Gene finding in assembled isolates or provided '''FASTA fileTakes FastQ files''' as input | *Gene finding in assembled isolates or provided '''FASTA fileTakes FastQ files''' as input | ||
Line 46: | Line 45: | ||
*Outputs as '''*.gff file, *_cds.fna file, *_protein.faa file and *_rna.fna file''' | *Outputs as '''*.gff file, *_cds.fna file, *_protein.faa file and *_rna.fna file''' | ||
*For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Gene_Prediction_Group Team1_Gene_Prediction] | *For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Gene_Prediction_Group Team1_Gene_Prediction] | ||
===Functional Annotation=== | ===Functional Annotation=== | ||
*Obtain functional information about predicted genes | *Obtain functional information about predicted genes | ||
Line 62: | Line 59: | ||
*Output: '''*.tsv file''' | *Output: '''*.tsv file''' | ||
*For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Functional_Annotation_Group Team1_Functional_Annotation] | *For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Functional_Annotation_Group Team1_Functional_Annotation] | ||
===Comparative Genomics=== | ===Comparative Genomics=== | ||
*Comparison of genomic features of input files to identify outbreak cluster | *Comparison of genomic features of input files to identify outbreak cluster | ||
Line 77: | Line 72: | ||
*Output: '''.tsv file(for chewBBACA, MUMmer), .png(kSNP)''' | *Output: '''.tsv file(for chewBBACA, MUMmer), .png(kSNP)''' | ||
*Visualisation: '''Phylogenetic tree''' for identified SNP’s, phylogenetic tree for MLST, graph for epidemiological data visualisation | *Visualisation: '''Phylogenetic tree''' for identified SNP’s, phylogenetic tree for MLST, graph for epidemiological data visualisation | ||
*For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Comparative_Genomics_Group Team1_Comparative_Genomics] | *For more details to visit: [https://compgenomics2020.biosci.gatech.edu/Team_I_Comparative_Genomics_Group Team1_Comparative_Genomics] | ||
Revision as of 03:31, 18 April 2020
Members: Devishi Kesar, Shuheng Gan, Winnie Zheng, Priya Narayanan, Aaron Pfennig
Introduction
Background
Objective
- Provide a comprehensive, automated platform to analyze E.coli isolates in order to predict virulence factors and outbreak cluster
- Functionalities of the webserver:
- Identify virulence factors/microbial resistance and outbreak response for provided isolates
- Allow data upload at each step of outline pipeline
- Visualize findings in a comprehensible way
- Design
- Intuitive usage
- Provide only essential options
WebServer
- Structure
- Access to Webserver
Here is Link to access our webserver:
Functionalities
Genome Assembly
- Performs de-novo assembly with FastQ files as input
- Runs following tools by default:
- fastp: read pre-processing
- Unicycler: Genome assembly
- Options:
- Perform read preprocessing
- Kmer-size
- Spades as alternative assembly method
- The input FastQ files must be paired-end reads
- Outputs as FASTA file
- Visualisation: Quast output
- For more details to visit: Team1_Genome_Assembly
Gene Prediction
- Gene finding in assembled isolates or provided FASTA fileTakes FastQ files as input
- Runs following tools by default:
- CDS: Prodigal
- tRNA: Aragorn
- rRNA: barrnap
- Options:
- GeneMarkS-2 as alternative tool for CDS predictions
- tRNAscan-SE as alternative tool for tRNA predictions
- RNAmmer as alternative tool for rRNA predictions
- Outputs as *.gff file, *_cds.fna file, *_protein.faa file and *_rna.fna file
- For more details to visit: Team1_Gene_Prediction
Functional Annotation
- Obtain functional information about predicted genes
- Input: FASTA file
- Cluster Tool: usearch
- Output: centroid.fasta
- Homology Tools:
- General annotation: InterProScan, EggNOGmapper
- Antibiotic resistance gene: DeepARG
- Abinitio Tools:
- Signal Peptides: SignalP 5.0
- Transmembrane Proteins: TMHMM
- CRISPR Sites: PilerCR
- Output: *.tsv file
- For more details to visit: Team1_Functional_Annotation
Comparative Genomics
- Comparison of genomic features of input files to identify outbreak cluster
- Input: FASTA file, prodigal training file(chewBBACA)
- Tools used:
- MUMmer 4.0
- chewBBACA
- kSNP 3.0
- FigTree
- Options:
- Parsimony tree, maximum likelihood and neighbour joining trees as option for kSNP
- k-mer size option for kSNP
- Output: .tsv file(for chewBBACA, MUMmer), .png(kSNP)
- Visualisation: Phylogenetic tree for identified SNP’s, phylogenetic tree for MLST, graph for epidemiological data visualisation
- For more details to visit: Team1_Comparative_Genomics
Method
Webserver Demo
- Choice One: Running General Pipline
- Choice Two: Running each step separately