archive:summer_2015

Differences

This shows you the differences between two versions of the page.

archive:summer_2015 [2015/06/09 01:05]
charles [Finished Work]
archive:summer_2015 [2015/06/12 18:11]
chkcole
Line 10: Line 10:
 | Emilio F | efeal@ucsc.edu | ABySS | | |
 | Charles M | cmarkell@ucsc.edu | SOAP | Haussler | |
 +| Natasha | ndudek@ucsc.edu | Discovar | Shapiro | I want to work on finishing the mitochondrion assembly |
 +| Chris Eisenhart | ceisenha@ucsc.edu | Discovar | Browser Group | Get it on the browser, finish the assembly with Sspace. I won't be at the meetings because I have to work; I will correspond over email. |
 +|Christopher Kan|chkan@ucsc.edu|SGA|Pogson|Will finish SGA Assembly|
 +| Nedda | nsaremi@ucsc.edu | SOAP | Green | |
 +| Josh | jolespin@ucsc.edu | SGA | Bernick | Once a good assembly is generated and we get a transcriptome, I can try to extract the exons, introns, and genes (5'UTR, CDS, 3'UTR). I wrote some scripts that can do this and confirm the results with exon-junction motifs. I can't meet weekly since I'll be in SD. |
 +|Robert|calef@soe.ucsc.edu|Discovar|Green|Will start looking into running SOAP gap closer on the Kolossus Discovar assembly, as well as tools for scaffolding with RNA-seq data|
 +|Charles| chkcole@ucsc.edu| Meraculous| Vollmers| I'm currently working on getting the RNA-Seq data. We have some preliminary stuff; I just need to process it and confirm the quality of the libraries. Also working on repetitive elements.|
  
  
Line 17: Line 24:
  
   * Meet in PSB 305 (same classroom)
 +  * once a week
 +  * Weds at 10:30am or 1:30pm
  
  
Line 24: Line 33:
 A list of unresolved tasks
  
-  * distribute T-shirts
-  * Decide on a final assembly from working assemblers.
-  * RNA Assembly
 +  * distribute T-shirts (These should be available in the afternoon of 9 June 2015.)
 +  * Apply SOAPdenovo scaffolding and gap filling to Discovar scaffolds
 +  * Merge assemblies
 +  * Decide on a final assembly from working assemblers: that's going to be a moving target as we add more data and analyses. We may have to freeze one for annotation work, but there's not much point in doing this before we've finished all the scaffolding we can do, as our scaffolds are still quite small (about 12k for the scaffold N50 as of 2015 June 8)
 +  * RNAseq:
 +    * Using RNAseq data to test existing scaffolding
 +    * Transcriptome assembly
 +    * Using RNAseq data for scaffolding
   * Annotations
-  * Heterozygosity
-  * Non-slug DNA in reads
-  * How to merge assemblies
 +  * Heterozygosity: assuming that Ed's initial result (which is not yet on the wiki) holds up, that the collected slug was almost purely homozygous, then we have the interesting question of whether the UCSC slug population is so inbred that it is nearly clonal.
 +  * Phylogeny: not clear what we can add here; there aren't enough mollusk genomes around for phylogenetic analysis to say much.  What meaningful questions might we answer?
 +  * Non-slug DNA in reads: part of annotation is going to be identifying and removing contaminants.
   * Repeats
-  * Mitochondria assembly
-  * Use of old data (2010 and 2011 classes)
 +    * Using RepARC (or building new tools) to construct highly repetitive contigs
 +    * Scaffolding and gap filling on highly repetitive contigs to build full repeat consensus (and variants)
 +    * Annotating repeats (viral and bacterial contaminants, mitochondrion, transposons, … )
 +  * Mitochondrion assembly
 +    * PCR to close gaps
 +    * studying graph of contig neighbors to try to determine order without PCR
 +  * Use of old data (2010 and 2011 classes): There is not really much data there, and the quality is not very high, so there may not be much point in trying to use it.  Perhaps mapping it could find some variation; we might like to know whether the UCSC slug population is nearly clonal.
  
 +
 +Longer term: we'll need to decide whether we need more data, particularly for scaffolding.  Will Ed's lab be continuing to develop a new mate-pair library procedure, which could give us new libraries to work with?  Will the MinION deliver higher-throughput long reads this year, and can we get a run of long DNA through the MinION?
  
 ====Finished Work====
  
-A list of work completed sense BME235 ended.
 +A list of work completed since BME235 ended:
  
   * Discovar assembly with all shotgun data, accomplished on Kolossus
   * SOAPdenovo assembly with all shotgun data and gap closed, accomplished on edser2
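
The unresolved-task list quotes a scaffold N50 of about 12k. For anyone re-checking that figure after further scaffolding or gap filling, here is a minimal Python sketch of the usual N50 definition: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the total assembly length. The function name `n50` and the toy lengths are illustrative assumptions, not taken from the class scripts.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    account for at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no lengths were supplied.
    raise ValueError("no sequence lengths given")


if __name__ == "__main__":
    # Toy scaffold lengths (hypothetical, not real slug data).
    toy_scaffolds = [10, 9, 8, 7, 6, 5, 4, 3, 2]
    print(n50(toy_scaffolds))
```

With a real assembly, the list of lengths would be read from the scaffold FASTA file; comparing the value before and after a scaffolding step gives a quick check on whether the step helped.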
archive/summer_2015.txt · Last modified: 2015/07/18 20:32 by ceisenhart