This shows you the differences between two versions of the page.
| | lecture_notes:04-27-2011 [2011/05/02 10:54] eyliaw created | | lecture_notes:04-27-2011 [2011/05/02 11:00] (current) eyliaw |
|---|---|---|---|
| Line 14: | Line 14: | ||
| * 37 GB per run (~28 GB of raw images) | * 37 GB per run (~28 GB of raw images) | ||
| ===== Software ===== | ===== Software ===== | ||
| - | * Amplicon variant analyzer | + | * Amplicon Variant Analyzer: for specific region analysis |
| - | * GS Assembler | + | * GS Assembler: de Novo or Reference Mapper |
| * GS Reporter | * GS Reporter | ||
| * GS RunProcessor - Image/signal processing | * GS RunProcessor - Image/signal processing | ||
| Line 51: | Line 51: | ||
| * <40 bases: the sequence is thrown away. | * <40 bases: the sequence is thrown away. | ||
| * (Even unfiltered, quality scores will reflect low quality areas) | * (Even unfiltered, quality scores will reflect low quality areas) | ||
| + | * SFFtools can also perform screen-trimming with a screening db for known contaminants. | ||
| A good run: expect a read length mode of ~500 and a mean >300. ~50% of reads should pass the filters. | A good run: expect a read length mode of ~500 and a mean >300. ~50% of reads should pass the filters. | ||
| + | ===== de Novo Assembly ===== | ||
| + | * All assemblers use .sff files. | ||
| + | * 3, 8 and 20 kb paired-end libraries. | ||
| + | * Can accept FASTA/QUAL reads. | ||
| + | * 15-25X coverage best. | ||
| + | * Low coverage: poor contig building | ||
| + | * High coverage: may cause contig breaks | ||
| + | * 4 bytes per __read__ base in RAM. | ||
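
The coverage and memory figures in the de Novo Assembly notes can be turned into a quick back-of-the-envelope check. The following is a minimal sketch, not part of the GS software: the function names, the example genome size and the example read-base count are hypothetical, and only the 15-25X range and the 4 bytes per read base come from the notes.

```python
# Rough planning helper for a de novo assembly (illustrative only).
# Figures taken from the notes: 15-25X coverage is best, and the
# assembler needs about 4 bytes of RAM per read base.

BYTES_PER_READ_BASE = 4        # from the notes
BEST_COVERAGE = (15.0, 25.0)   # from the notes


def coverage(total_read_bases: int, genome_size: int) -> float:
    """Average fold coverage: total read bases divided by genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_read_bases / genome_size


def assembly_ram_bytes(total_read_bases: int) -> int:
    """Estimated RAM for the reads, at 4 bytes per read base."""
    return total_read_bases * BYTES_PER_READ_BASE


def coverage_advice(cov: float) -> str:
    """Classify coverage against the 15-25X range from the notes."""
    low, high = BEST_COVERAGE
    if cov < low:
        return "low: expect poor contig building"
    if cov > high:
        return "high: may cause contig breaks"
    return "ok: within the 15-25X range"


if __name__ == "__main__":
    # Hypothetical example: 100 Mb of read bases on a 5 Mb genome.
    read_bases = 100_000_000
    genome = 5_000_000
    cov = coverage(read_bases, genome)
    ram_gb = assembly_ram_bytes(read_bases) / 1e9
    print(f"coverage: {cov:.1f}X ({coverage_advice(cov)})")
    print(f"estimated RAM for reads: {ram_gb:.2f} GB")
```

The estimate covers only the read bases themselves; any additional overhead of the assembler is not stated in the notes and is not modeled here.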