User Tools

Site Tools


lecture_notes:04-26-2015

This is an old revision of the document!


Team 4 Report: ABySS

ABySS stands for Assembly by short sequences.

Assembler Overview

  • Load kmers
  • Find adjacent kmers
  • Generate de Bruijn graphs

Workflow

  • Generate paths through the reads
  • Merge paths
  • Generate contigs
  • ParseAligns: Empirical fragment-size distribution
    1. Maximum Likelihood Estimator
    2. Use empirical paired-end size distribution

Running ABySS

Single processor version: straight forward step

  • Qsub
  • Embedded qsub
  • Exporting paths
  • Abuss-pe [parameters] parallel environment in campusrocks2

Parameters: Primary Name: name of assembly K: size of k-mer If 1 library of pe data: In = ‘reads1.fq reads2.fq’ Pipeline organized via makefile: abyss-pe Autogenerated assembly statistics Contig, scafold metrics

  • Does not necessarily clean up things that failed.
  • It is better to manually clean the file.

Using ABySS, the plan

  • Use all libraries, after processing, but no error correction
  • Run the seqprep
  • Run adapter trimming only
  • Run adapter trimming plus merging

Initil run

  • The initial run is located on Edser
  • K = 55, arbitrary
  • Did not work

For the future

  • Get parallel versions working
  • Finish data analysis (kmergenie, fastqc, etc)
  • Do assemblies
  • RNA-seq rescaffolding with TransABySS
  • Meta-assembly
You could leave a comment if you were logged in.
lecture_notes/04-26-2015.1430291926.txt.gz · Last modified: 2015/04/29 00:18 by gepoliano