User Tools

Site Tools


archive:computer_resources:assemblies

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
archive:computer_resources:assemblies [2010/05/19 20:45]
galt Assembled Slug Genome from Illumina Paired with SOAPdenovo
archive:computer_resources:assemblies [2010/06/06 21:45]
galt adding velvet-assembly1a
Line 33: Line 33:
     * velvet-assembly1/​ Assembling Pog 454 long reads with velvet. ​ After very poor results with default settings, eventually started to get good results by getting the expected coverage (60) and cutoff (13) correct. ​ It took a long time try different parameter settings. Also using the long reads as both short and long reads gave substantially better results. ​ Because these were long reads, we could set k up to 31.  Also tried with specially compiled version of velvet that could use k > 31, but can not report any improvement yet.  Given that the average read is 370b, it should have been able to support longer k-values. ​ Best results so far:\\     * velvet-assembly1/​ Assembling Pog 454 long reads with velvet. ​ After very poor results with default settings, eventually started to get good results by getting the expected coverage (60) and cutoff (13) correct. ​ It took a long time try different parameter settings. Also using the long reads as both short and long reads gave substantially better results. ​ Because these were long reads, we could set k up to 31.  Also tried with specially compiled version of velvet that could use k > 31, but can not report any improvement yet.  Given that the average read is 370b, it should have been able to support longer k-values. ​ Best results so far:\\
       * Final graph has 2176 nodes {195 contigs over 62b) and n50 of 224364, max {contig size} 680241, total {genome size} 2481051, using 778249/​782604 reads       * Final graph has 2176 nodes {195 contigs over 62b) and n50 of 224364, max {contig size} 680241, total {genome size} 2481051, using 778249/​782604 reads
 +    * velvet-assembly1a/​ Assembling Pog 454 long reads with velvet. This is just re-running velvet-assembly1 with the best parameters found and also using kevin'​s makefile adapted to the velvet data.
     * velvet-assembly2/​ Assembling Pog Solid mate-paired 25b reads with velvet in double-encoded colorspace (24 DE base reads). Not had time to optimize parameters yet.  Did get up to a max-contig size of 95k.     * velvet-assembly2/​ Assembling Pog Solid mate-paired 25b reads with velvet in double-encoded colorspace (24 DE base reads). Not had time to optimize parameters yet.  Did get up to a max-contig size of 95k.
       * velveth_de out 21 -shortPaired /​campusdata/​BME235/​data/​Pog/​solid_run/​paired/​output/​doubleEncoded_input.de       * velveth_de out 21 -shortPaired /​campusdata/​BME235/​data/​Pog/​solid_run/​paired/​output/​doubleEncoded_input.de
Line 38: Line 39:
       * Final graph has 3602 nodes and n50 of 4851, max 94854, total 1767903, using 28785664/​61262410 reads       * Final graph has 3602 nodes and n50 of 4851, max 94854, total 1767903, using 28785664/​61262410 reads
   * SOAPdenovo   * SOAPdenovo
-    * SOAPdenovo-assembly1/​ Assembling Pog 454 long reads with SOAPdenovo. ​ After being simply unable to get any version of the program to read a FASTA file despite documentation examples, I finally found a utility sff2fastq that made it possible to run SOAPdenovo on Pog 454 fastq. ​ I have not had time to optimize parameters yet.  The largest contig made with default params was just 4k.+    * SOAPdenovo-assembly1/​ Assembling Pog 454 long reads with SOAPdenovo. ​ After being simply unable to get any version of the program to read a FASTA file despite documentation examples, I finally found a utility sff2fastq that made it possible to run SOAPdenovo on Pog 454 fastq. ​ I have not had time to optimize parameters yet.  The largest contig made with default params was just 4k.  Later raised cutoff to 12 and got maxcontig of 70k.  Could not run the scaffold step because it crashed, probably because it was written for short 52bp solexa reads and the long 454 reads are messing it up.
  
  
Line 60: Line 61:
     * It was used to assemble the Panda genome by BGI.     * It was used to assemble the Panda genome by BGI.
     * Used kolossus which has 1TB and 64cpus.     * Used kolossus which has 1TB and 64cpus.
-    * Ran with k=31 an k=23.  k=31 was better (9k maxcontig)+    * Ran with k=31 and k=23.  k=31 was better (9k maxcontig)
     * so ran with filling -R to get 12k maxcontig.     * so ran with filling -R to get 12k maxcontig.
     * Then ran the scaffolding steps with 200bp insert size.     * Then ran the scaffolding steps with 200bp insert size.
archive/computer_resources/assemblies.txt · Last modified: 2015/09/02 16:53 by 92.247.181.31