Overview of Assembly

Kevin outlined the processes involved in assembling a genome.

Clean Up Reads

Data cleanup has two distinct parts: error correction and contaminant removal.

Error Correction

K-mer Counting
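
As a rough illustration of what k-mer counting means, the sketch below tallies every length-k substring across a set of reads. It is a naive in-memory version for small inputs, not a description of how Jellyfish works internally. The function name `count_kmers`, the toy reads, and the choice to skip k-mers containing `N` are all assumptions made for this example.

```python
from collections import Counter

def count_kmers(reads, k):
    """Count every length-k substring (k-mer) across all reads.

    Hypothetical helper for illustration; k-mers containing the
    ambiguous base 'N' are skipped (an assumption, not a rule
    taken from the notes).
    """
    counts = Counter()
    for read in reads:
        seq = read.upper()
        # A read of length n yields n - k + 1 k-mers (none if n < k).
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" in kmer:
                continue
            counts[kmer] += 1
    return counts

# Toy reads, made up for this example.
reads = ["ACGTAC", "CGTACG"]
counts = count_kmers(reads, 3)
for kmer, n in sorted(counts.items()):
    print(kmer, n)
```

Holding every distinct k-mer in a dictionary like this uses a lot of memory on real read sets, which is one reason a dedicated counting tool is worth learning.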

Contaminant Removal

Cluster Reads and Build Contigs

Order and Orient Contigs

Homework

Learn about the Jellyfish tool for K-mer counting. Try running it on the Pyrobaculum data. Use different parameters and monitor its memory usage. Fill in the wiki page for Jellyfish.