User Tools

Site Tools


lecture_notes:04-13-2011

This is an old revision of the document!


A PCRE internal error occured. This might be caused by a faulty plugin

====== Velvet, Dan Zerbino ====== ===== De Bruijn Graphs ===== [[bioinformatic_tools:velvet|Velvet]] uses de Bruijn graphs to condense reads and resolve common sequencing problems. Compared to overlap layout consensus, it simplifies multiple reads by resolving repeats into only one node, so that their counts only need to be stored on top. This reduces the amount of memory required to store the reads by the dimension of coverage. Specifically, the de Bruijn graph breaks each read into words and paths through the words, mapping new reads to form a graph structure. Velvet then simplifies this graph by removing unjoined tips and reducing parallel strands into the strand with the maximum coverage (this error corrects for mismatched bases). Velvet leaves loops unresolved in the final structure, as these represent repeat regions. [(cite:Velvet>Zerbino, D. and Birney, E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008. 18: 821-829. doi: [[http://dx.doi.org/10.1101/gr.074492.107|10.1101/gr.074492.107]])] ===== Extensions ===== ==== Shorty ==== Shorty uses the variance in paired read lengths to build larger contigs from small ones. It bears some resemblence to how [[bioinformatic_tools:PRICE|PRICE]] does contig extension. ==== Oasis ==== Oasis does splicing analysis on words with more than one connection, breaking them up into separate contigs. This resolves nodes in the de Bruijn graph. ==== Columbus ==== Columbus is a combined approach between mapping and de novo sequencing. It uses a reference sequence to organize contigs, but allows for novel structures within the contigs.

You could leave a comment if you were logged in.
lecture_notes/04-13-2011.1302895374.txt.gz · Last modified: 2011/04/15 19:22 by eyliaw