User Tools

Site Tools


archive:bioinformatic_tools

List of Tools

Here is a list of the tools we are considering using for assembling and annotating the Banana Slug genome. Each of these listed tools will have their own page describing what the tool is supposed to do, and what we were able to get it to do.

Pre-Processing tools

Tools used on reads before assembly to improve assembly quality

  • SeqPrep1) Trim adapters/primers from ends of illumina reads, and merges paired illumina reads into singles if there is a user defined minimum overlap.
  • JELLYFISH2) Do K-mer counting on reads
  • Quake3) Do K-mer correction on reads (Can use JELLYFISH results as input)

Assembly Tools

The initial list of assembly tools was created from the Sequence Assembly page. However the links to the programs have been double checked, and some of them are different from those listed on the wikipedia page. — John St. John 2010/03/29 20:02

The only de-novo assemblers that can use Solid colorspace are Velvet (DBG) and Shorty (Overlap Concensus). CLC bio’s de novo assembler uses Solid reads in a limited way for mate-paired resolution of contigs only.

Here is an analysis of assemblers that used many of the assembler we are looking into. It includes “recipes” for how they ran each assembler.

Probably Open Source Tools
Probably Proprietary Tools and Services

Mapping tools

Tools used to map reads or contigs onto contigs, scaffolds, or genomes.

Data Transfer Tools

  • UDT 35) TCP too slow? Transfer data fast with UDT via UDP.

Visualization Tools

  • Hawkeye36) Visualize and validate assemblies.
  • Artemis 37) Visualize and annotate genomic sequences
  • Consed 38) Visualize assemblies. Compatible with Newbler output

Utilities

  • pluck-scripts Python scripts written by Kevin Karplus.
  • rdb Perl scripts for handling tab-separated columns as if they were a relational database.
  • sff_extract Python script to extra sff files into standard component files (.fasta, .fasta.qual, .xml) (Overlaps in functionality with the Roche-provided sffinfo tool, which may do a better job of decoding flow-space to base space.)
  • illuminaToFastq C program to convert the illumina .txt files into fastq format, discarding pairs of reads that do not both pass illumina's quality filter.
  • SAM Tools A utility for manipulating alignments in SAM format.
You could leave a comment if you were logged in.
archive/bioinformatic_tools.txt · Last modified: 2015/09/04 14:24 by 68.180.228.52