This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
archive:bioinformatic_tools:velvet [2010/04/11 20:45] galt |
archive:bioinformatic_tools:velvet [2015/07/28 06:27] (current) ceisenhart ↷ Page moved from bioinformatic_tools:velvet to archive:bioinformatic_tools:velvet |
||
---|---|---|---|
Line 1: | Line 1: | ||
===== VELVET ===== | ===== VELVET ===== | ||
- | ====High Level Overview==== | ||
- | Velvet was developed by Ewan Birney and Daniel R. Zerbino for de-novo assembly of short-reads using de Bruijn graphs. | ||
- | Zerbino D, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. (2008) 18:821–829. | + | ==== Overview==== |
- | [[http://nar.oxfordjournals.org/cgi/ijlink?linkType=ABST&journalCode=genome&resid=18/5/821|Free full text]] | + | Velvet was developed by Daniel R. Zerbino and Ewan Birney. |
+ | |||
+ | **Velvet: algorithms for de novo short read assembly using de Bruijn graphs** | ||
+ | [(cite:velvet>Daniel R. Zerbino and Ewan Birney.\\ | ||
+ | Velvet: Algorithms for de novo short read assembly using de Bruijn graphs\\ | ||
+ | Genome Res. May 2008 18: 821-829; Published in Advance March 18, 2008, \\ | ||
+ | doi:[[http://dx.doi.org/10.1101/gr.074492.107|10.1101/gr.074492.107]] | ||
+ | )] | ||
Velvet may be downloaded free from [[http://www.ebi.ac.uk/~zerbino/velvet/|here]] (GPL license). | Velvet may be downloaded free from [[http://www.ebi.ac.uk/~zerbino/velvet/|here]] (GPL license). | ||
- | There is a [[http://en.wikipedia.org/wiki/Velvet_(software)|wiki article]] about velvet. | + | On wikipedia: [[wp>Velvet_(software)|Velvet]]. |
+ | [[http://www.ebi.ac.uk/training/ftp/PhDtheses/Daniel_Zerbino.pdf|Daniel Zerbino's PhD Thesis on Velvet]] | ||
Velvet has support for COLORSPACE, possibly the only de-novo short-read DBG assembler that does at this time. | Velvet has support for COLORSPACE, possibly the only de-novo short-read DBG assembler that does at this time. | ||
+ | The colorspace version of velvet (_de) expects all data to be double-encoded. Mixed-space not directly supported. | ||
Velvet has support for long-read data. | Velvet has support for long-read data. | ||
- | === Installing === | + | Velvet will accept sequence data from fastq input files, but does not use the quality information. |
- | ssh campusrocks.cse.ucsc.edu | + | ==== Color-Space ==== |
- | + | ||
- | cd /campusdata/BME235/programs | + | |
- | # but currently having problems with my group membership | + | |
- | #cd $HOME | + | |
- | # mkdir programs | + | |
- | # cd programs | + | |
- | # | + | |
- | wget http://www.ebi.ac.uk/~zerbino/velvet/velvet_0.7.62.tgz | + | |
- | tar xfz velvet_0.7.62.tgz | + | |
- | mv velvet_0.7.62 velvet | + | |
- | mv velvet_0.7.62.tgz velvet/ | + | |
- | cd velvet | + | |
- | make | + | |
- | make color | + | |
- | # color versions work with solid, have _de extension | + | |
- | # install to bin dir | + | |
- | cp velveth velvetg velveth_de velvetg_de /campusdata/BME235/bin/ | + | |
- | + | ||
- | + | ||
- | de-novo Tools for velvet from ABI for Solid. | + | |
- | http://solidsoftwaretools.com/gf/project/denovo/ | + | |
- | + | ||
- | === Color-Space === | + | |
- | == DE double-encoded == | + | === DE double-encoded === |
This is done by the pre-processor. | This is done by the pre-processor. | ||
The primer base from the colorspace read is | The primer base from the colorspace read is | ||
Line 54: | Line 38: | ||
and then re-encoded as if bases in base-space. | and then re-encoded as if bases in base-space. | ||
- | == colorspace programs == | + | === colorspace programs === |
denovo_preprocessor | denovo_preprocessor | ||
Line 61: | Line 45: | ||
velveth_de | velveth_de | ||
- | colorspace version of velveth hashes reads | + | colorspace version of velveth hashes reads. |
velvetg_de | velvetg_de | ||
Line 72: | Line 56: | ||
to base-space while reducing read errors in colorspace as much as possible. | to base-space while reducing read errors in colorspace as much as possible. | ||
+ | [[http://solidsoftwaretools.com/gf/project/denovo/|De-novo Tools for velvet from ABI for Solid]] | ||
+ | |||
+ | ==== Running ==== | ||
+ | |||
+ | Strategy: | ||
+ | - Find the right value for k. For short reads remember to keep k small for good kmer coverage. | ||
+ | - Find the right values for exp_cov and cov-cutoff. This is very important. | ||
+ | * velvet-estimate-exp_cov.pl out/stats.txt makes a useful graph. | ||
+ | - If you only have long reads, use them also as your short reads. | ||
+ | |||
+ | For 454 long reads, this was our best result: | ||
+ | velveth out 31 -short 454/?.TCA.454Reads.fna -long 454/?.TCA.454Reads.fna | ||
+ | velvetg out -exp_cov 60 -cov_cutoff 13 | ||
+ | Final graph has 1755 nodes and n50 of 41723, max 142286, total 2468925, using 778257/782604 reads | ||
+ | |||
+ | |||
+ | ==== Failures ==== | ||
+ | |||
+ | === VelvetOptimiser === | ||
+ | The contributed (velvet/contrib/) utility VelvetOptimiser is intended to help find | ||
+ | the critical parameters k, exp_cov, and cov_cutoff. However although it found k, | ||
+ | it got stuck on a local maximum on coverage and failed to produce anything useful. | ||
+ | |||
+ | === pseudoFlow === | ||
+ | Wondering if homopolymer errors in 454 data could cause trouble for the DBG, | ||
+ | I made a utility called pseudoFlow.c that takes all homopolymers longer than | ||
+ | 6 and shortens them to 6. We know that in the range 1 to 6, 454 is accurate. | ||
+ | In any case, the pseudoFlow version of the data did not perform better, | ||
+ | in fact it was a little worse. | ||
+ | |||
+ | ==== Installing ==== | ||
+ | |||
+ | ssh campusrocks.cse.ucsc.edu | ||
+ | | ||
+ | cd /campusdata/BME235/programs | ||
+ | wget http://www.ebi.ac.uk/~zerbino/velvet/velvet_0.7.62.tgz | ||
+ | tar xfz velvet_0.7.62.tgz | ||
+ | mv velvet_0.7.62 velvet | ||
+ | mv velvet_0.7.62.tgz velvet/ | ||
+ | cd velvet | ||
+ | make | ||
+ | make color | ||
+ | # color versions work with solid, have _de extension | ||
+ | # install to bin dir | ||
+ | cp velveth velvetg velveth_de velvetg_de /campusdata/BME235/bin/ | ||
+ | |||
+ | |||
+ | ==== Examples ==== | ||
+ | |||
+ | [[http://kevin-gattaca.blogspot.com/2009/12/de-novo-assembly-with-abi-solid-reads.html|example of using velvet with solid]] | ||
+ | |||
+ | ==== Website ==== | ||
+ | [[http://www.ebi.ac.uk/~zerbino/velvet/]] | ||
+ | |||
+ | ==== Source with Binaries and Documentation ==== | ||
+ | [[http://www.ebi.ac.uk/~zerbino/velvet/velvet_0.7.62.tgz]] | ||
+ | |||
+ | ===== References ===== | ||
+ | <refnotes>notes-separator: none</refnotes> | ||
+ | ~~REFNOTES cite~~ | ||