User Tools

Site Tools


archive:computer_resources:assemblies

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
archive:computer_resources:assemblies [2010/06/06 21:45]
galt adding velvet-assembly1a
archive:computer_resources:assemblies [2010/06/09 19:13]
svasili
Line 39: Line 39:
       * Final graph has 3602 nodes and n50 of 4851, max 94854, total 1767903, using 28785664/​61262410 reads       * Final graph has 3602 nodes and n50 of 4851, max 94854, total 1767903, using 28785664/​61262410 reads
   * SOAPdenovo   * SOAPdenovo
-    * SOAPdenovo-assembly1/​ Assembling Pog 454 long reads with SOAPdenovo. ​ After being simply unable to get any version of the program to read a FASTA file despite documentation examples, I finally found a utility sff2fastq that made it possible to run SOAPdenovo on Pog 454 fastq. ​ I have not had time to optimize parameters yet.  The largest contig made with default params was just 4k.  Later raised cutoff to 12 and got maxcontig of 70k.  Could not run the scaffold step because ​it crashedprobably because it was written for short 52bp solexa ​reads and the long 454 reads are messing it up.+    * SOAPdenovo-assembly1/​ Assembling Pog 454 long reads with SOAPdenovo. ​ After being simply unable to get any version of the program to read a FASTA file despite documentation examples, I finally found a utility sff2fastq that made it possible to run SOAPdenovo on Pog 454 fastq. ​ I have not had time to optimize parameters yet.  The largest contig made with default params was just 4k.  Later raised cutoff to 12 and got maxcontig of 70k.  Could not run the scaffold step because ​there are no paired libs in this data set. 
 +  * Ray 
 +    * Ray-assembly1/ ​    
 + 
 +                        Assembling Pog 454 long reads with Ray, 
 +                        a parallel implementation of the OpenAssembler. 
 +                        This software seems to be Canadian. 
 +                        It took 3 hours to run, and the output ​was 
 +                        not very good, max contig size being about 12k. 
 +                        Sadly there are no parameters to tweak. 
 + 
 +  * ABySS 
 +    * abyss-assembly1/ ​     
 + 
 +                        Assembling Pog454 long reads with ABySS. 
 +                        The best params found were kmer size 36 and coverage cutoff 15 
 +                        #ABYSS -k 36 -c 15 both.fq 
 +                        #Total size: mean 1844.8 sd 3479.7 min 36 (1179) max 32566 (556) median 204 
 + 
 +  * PCAP 
 +    * pcap-assembly1/​ 
 + 
 +                        Assembled Pog 454 long reads with pcap default parameters. Sanger reads are not included. 
 +                        It was necessary to increase the minimum depth coverage for repeats before we got anything good. 
 +                        Assembled Pog 454 long reads with minimum depth coverage for repeats set to 200, and rest of the parameters unmodified.  
 +                        faSize contigs.bases info :  
 +                        2506151 bases (8 N's 2506143 real 2506143 upper 0 lower) in 219 sequences in 1 files 
 +                        Total size: mean 11443.6 sd 65849.3 min 56 (Contig174.1) max 611479 (Contig0.1) median 195 
 +                        N count: mean 0.0 sd 0.2 
 +                        U count: mean 11443.6 sd 65849.3 
 +                        Using Kevin'​s makefile, the blat alignments showed large contigs that looked basically correct, except for contig 8. 
 +                        However many of them overlapped, unlike the Newbler output. ​ This may have been due to a 
 +                        difference in the way Newbler and PCAP tried to handle the mixed population in the sample where 
 +                        3 inverting regions ​are found with various frequencies. 
 +                        Also, a cutoff should probably be supplied somewhere after the 17th largest contig because 
 +                        most of the rest of the 219 was small contigs probably representing noise.
  
  
archive/computer_resources/assemblies.txt · Last modified: 2015/09/02 16:53 by 92.247.181.31