lecture_notes:05-27-2011

Differences

This shows you the differences between two versions of the page.

lecture_notes:05-27-2011 [2011/05/27 14:25]
eyliaw
lecture_notes:05-27-2011 [2015/09/14 11:40] (current)
68.180.230.228 ↷ Links adapted because of a move operation
Line 1: Line 1:
-====== To Do ====== +====== To Do (lecture notes) ======
-We went over some things that need updating in the wiki. +We went over some things that need updating in the wiki and plans for this weekend. 
-  * Document scripts added in [[computer_resources:bin|bin]] folder. +
-  * Find out which lanes in 454 run 3 are banana slug runs. +  * Document scripts added in [[archive:computer_resources:bin|bin]] folder.
 + 
 + 
 +  * Find out which lanes in 454 run 3 are banana slug runs. This turns out to be unnecessary: run "3" is not a separate run, just run 2 plus the non-banana-slug reads in other lanes.
      * Try mapping with Newbler or BWA.      * Try mapping with Newbler or BWA.
      * BLAST/BLAT it.      * BLAST/BLAT it.
 +  * Find insert lengths for the SeqPrep + Quake corrected Illumina data.
   * SOAPdenovo assembly try 1:   * SOAPdenovo assembly try 1:
-     * Only on Illumina data+     * Only on Illumina data.
   * try 2:   * try 2:
-     * Level 1: All Illumina data for contig building +     * Rank 1: All Illumina data as rank 1
-     * Level 2: 454 data for scaffolding. +     * Rank 2: 454 data as rank 2 (both for contig building and scaffolding)
-  * +  * try 3: 
 +     * Illumina + 454 data as rank 1. 
 +  * If the insert length is negative, don't treat the pair as PE reads.
 +    * This was an error in our assumptions about what SOAPdenovo wants. The number it wants is the total fragment length, which is what we are already estimating. We just need to look at the average length for the pairs (based on the histogram) after SeqPrep.
 +    * If Quake changes the distribution of reads (by trimming and by discarding uncorrectable reads), it may be important to remap the new set of pairs to the 454 reads to get an improved estimate of fragment length.
 +  * Newbler assemblies—no need for a new one, as there is no new 454 data. 
 +  * Take another look at Barcode of Life mapping: [[http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi#SG5|Invertebrate mitochondrial translation table]]
 +    * It turns out that blastall does not seem to have any documented way to tell tblastx to use a different genetic code, though the NCBI web server has the option.
 +    * The work Kevin did so far is now in assemblies/slug/barcode-of-life.
 +    * Next step is to use BWA to find all the SeqPrep+Quake treated Illumina reads that map to what has been found so far of the barcode. 
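The genetic-code issue in the barcode item above can be illustrated with a small sketch. It is not part of the original notes: it builds NCBI translation table 1 (standard) and table 5 (invertebrate mitochondrial) from their compact amino-acid strings and lists the codons where they disagree. The function names and the example sequence are hypothetical.

```python
# Sketch only: compare the standard genetic code (NCBI table 1) with the
# invertebrate mitochondrial code (NCBI table 5). Names here are made up
# for illustration and are not taken from any script in the course wiki.

BASES = "TCAG"  # NCBI ordering of bases for the compact table strings

# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (64 entries each).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
INVERT_MITO_AAS = "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSSSSVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its amino acid letter."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
INVERT_MITO = build_table(INVERT_MITO_AAS)


def translate(dna, table):
    """Translate frame 0 of a DNA string; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(table.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def differing_codons():
    """Codons whose translation differs between the two tables."""
    return sorted(c for c in STANDARD if STANDARD[c] != INVERT_MITO[c])


if __name__ == "__main__":
    print(differing_codons())
    example = "ATATGAAGA"  # hypothetical sequence, chosen to hit three differences
    print(translate(example, STANDARD), translate(example, INVERT_MITO))
```

Only four codons differ, but one of them (TGA) is a stop codon in the standard code, so a translated search using the wrong table can break a mitochondrial protein match into pieces.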
 + 
 + 
 +  * Ed is presenting a paper on Wednesday. Get the paper over the weekend.
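For the fragment-length item in the list above, here is a minimal sketch of the arithmetic, assuming the post-SeqPrep histogram is available as a mapping from fragment length to pair count. The histogram values, the function names, and the reading of "insert length" as fragment length minus both read lengths are assumptions for illustration, not something recorded in the notes. In SOAPdenovo config files the corresponding setting is usually called `avg_ins`, but check the config actually in use.

```python
# Sketch only: summarize a fragment-length histogram into the single average
# value a paired-end library setting needs. All names are hypothetical.


def summarize_histogram(hist):
    """hist maps fragment length (bp) -> number of pairs; returns (mean, mode)."""
    total = sum(hist.values())
    if total == 0:
        raise ValueError("empty histogram")
    mean = sum(length * count for length, count in hist.items()) / total
    mode = max(hist, key=hist.get)  # most common fragment length
    return mean, mode


def average_fragment_length(hist):
    """Rounded mean fragment length, i.e. the total end-to-end length."""
    mean, _ = summarize_histogram(hist)
    return round(mean)


def inner_distance(fragment_len, read_len):
    """Gap between the two reads of a pair (assumed definition).

    This goes negative when the reads overlap, which is one way a
    'negative insert length' can arise even though the fragment length
    itself is always positive.
    """
    return fragment_len - 2 * read_len


if __name__ == "__main__":
    hist = {150: 2, 200: 6, 250: 2}  # made-up counts, not real data
    print(summarize_histogram(hist), average_fragment_length(hist))
    print(inner_distance(150, 100))
```

If Quake trimming shifts the distribution, the same summary can be rerun on a histogram built from the remapped pairs.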
lecture_notes/05-27-2011.1306531518.txt.gz · Last modified: 2011/05/27 14:25 by eyliaw