edser2:~> ./DiscovarDeNovo READS="frac:0.05,sample:18H::/scratch/bananaSlugBAMS/SW018_S1_L007_001.bam+frac:0.05,sample:19M::/scratch/bananaSlugBAMS/SW019_S1_L001_001.bam+frac:0.05,sample:19H::/scratch/bananaSlugBAMS/SW019_S2_L008_001.bam" OUT_DIR=/scratch/bananaSlugAssemblies

Performing re-exec to adjust stack size.


Mon Apr 27 19:16:20 2015 run on edser2, pid=21615 [Apr 10 2015 12:24:57 R52415 ]

DiscovarDeNovo \

             READS="frac:0.05,sample:18H::/scratch/bananaSlugBAMS/SW018_S1_L \
             007_001.bam+frac:0.05,sample:19M::/scratch/bananaSlugBAMS/SW019 \
             _S1_L001_001.bam+frac:0.05,sample:19H::/scratch/bananaSlugBAMS/ \
             SW019_S2_L008_001.bam" OUT_DIR=/scratch/bananaSlugAssemblies

SYSTEM INFO

- OS: Linux :: 3.5.0-54-generic :: #81~precise1-Ubuntu SMP Tue Jul 15 04:02:22 UTC 2014

- node name: edser2

- hardware type: x86_64

- cache size: 30720 KB

- cpu MHz: 2699.924

- cpu model name: Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz

- physical memory: 314.88 GB

Omitting memory check. If you run into problems with memory,

you might try rerunning with MEMORY_CHECK=True.

Mon Apr 27 19:16:20 2015: finding input files

Mon Apr 27 19:16:20 2015: reading 3 files (which may take a while)

Mon Apr 27 19:16:20 2015: processing /scratch/bananaSlugBAMS/SW018_S1_L007_001.bam.

Mon Apr 27 19:16:20 2015: memory in use = 0.01 GB, peak = 0.01 GB

Mon Apr 27 19:40:42 2015: there are 14,301,934 reads of mean length 100

Mon Apr 27 19:40:42 2015: memory in use = 42.71 GB, peak = 43.92 GB

Mon Apr 27 19:41:21 2015: reads sorted

Mon Apr 27 19:41:21 2015: memory in use = 43.78 GB, peak = 43.92 GB

Mon Apr 27 19:43:16 2015: data stashed in output structures

Mon Apr 27 19:43:16 2015: memory in use = 1.33 GB, peak = 43.92 GB

Mon Apr 27 19:43:16 2015: processing /scratch/bananaSlugBAMS/SW019_S1_L001_001.bam.

Mon Apr 27 19:43:16 2015: memory in use = 1.33 GB, peak = 43.92 GB

Mon Apr 27 20:02:13 2015: there are 2,726,872 reads of mean length 301

Mon Apr 27 20:02:13 2015: memory in use = 17.50 GB, peak = 43.92 GB

Mon Apr 27 20:02:19 2015: reads sorted

Mon Apr 27 20:02:20 2015: memory in use = 17.70 GB, peak = 43.92 GB

Mon Apr 27 20:02:59 2015: data stashed in output structures

Mon Apr 27 20:02:59 2015: memory in use = 1.94 GB, peak = 43.92 GB

Mon Apr 27 20:02:59 2015: processing /scratch/bananaSlugBAMS/SW019_S2_L008_001.bam.

Mon Apr 27 20:02:59 2015: memory in use = 1.94 GB, peak = 43.92 GB

Mon Apr 27 20:18:24 2015: there are 9,863,615 reads of mean length 100

Mon Apr 27 20:18:24 2015: memory in use = 31.60 GB, peak = 43.92 GB

Mon Apr 27 20:18:49 2015: reads sorted

Mon Apr 27 20:18:49 2015: memory in use = 32.34 GB, peak = 43.92 GB

Mon Apr 27 20:20:10 2015: data stashed in output structures

Mon Apr 27 20:20:10 2015: memory in use = 2.84 GB, peak = 43.92 GB

INPUT FILES:

[1,type=frag,sample=18H,lib=1,frac=0.05] /scratch/bananaSlugBAMS/SW018_S1_L007_001.bam

[2,type=frag,sample=19M,lib=1,frac=0.05] /scratch/bananaSlugBAMS/SW019_S1_L001_001.bam

[3,type=frag,sample=19H,lib=1,frac=0.05] /scratch/bananaSlugBAMS/SW019_S2_L008_001.bam

Mon Apr 27 20:20:10 2015: found 3 samples

Mon Apr 27 20:20:10 2015: starts = 0,14301934,17028806

Mon Apr 27 20:20:59 2015: using 26,892,422 reads

Mon Apr 27 20:20:59 2015: data extraction complete

1.08 hours used extracting reads

Mon Apr 27 20:21:00 2015: see total physical memory of 338,094,714,880 bytes

Mon Apr 27 20:21:00 2015: 104.30 bytes per read base, assuming max memory available

We need 1 passes.

Expect 1374449 keys per batch.

Provide 3054330 keys per batch.

We need 1 passes.

Expect 1374449 keys per batch.

Provide 3054330 keys per batch.

Mon Apr 27 20:31:03 2015: back from buildReadQGraph

memory in use = 5,837,869,056

checksum_60 = 228984253420209548

Mon Apr 27 20:31:25 2015: constructing places

Mon Apr 27 20:31:31 2015: sorting places

Mon Apr 27 20:32:32 2015: building all

Mon Apr 27 20:33:11 2015: calling LongReadsToPaths

Mon Apr 27 20:33:34 2015: writing

Mon Apr 27 20:33:37 2015: translating paths

Mon Apr 27 20:33:41 2015: final stage of path translation

Mon Apr 27 20:37:46 2015: writing paths

4.51 seconds used reloading assembly

Mon Apr 27 20:38:32 2015: start walking

memory in use = 3,881,275,392

Mon Apr 27 20:38:44 2015: start walking

memory in use = 3,905,556,480

21.1 seconds used cleaning 200-mer graph

18.1 minutes used in ReadQGrapher

5.01e-06 seconds used reloading reads

checksum_200 = 2464942371563220

1 peak mem usage = 78.10 GB

4.52 seconds used loading stuff

2 peak mem usage = 78.10 GB

launching gap assemblies, mem usage = 3,768,946,688

Mon Apr 27 20:39:15 2015: finding unsatisfieds

Mon Apr 27 20:39:16 2015: creating multiplicity map

Mon Apr 27 20:39:16 2015: economizing links

Mon Apr 27 20:39:16 2015: forming neighborhoods

Mon Apr 27 20:39:16 2015: forming initial clusters

Mon Apr 27 20:39:16 2015: start sort

0.0509 seconds used sorting

Mon Apr 27 20:39:16 2015: merging clusters

xs.size( ) = 33377

2.92 seconds used merging

xs.size( ) = 5156

Mon Apr 27 20:39:19 2015: start overlap-based merging

Mon Apr 27 20:39:20 2015: start overlap-based merging

LR.size( ) = 4784

LR.size( ) = 2404

Mon Apr 27 20:39:25 2015: now processing 2404 blobs

Mon Apr 27 20:39:25 2015: memory in use = 3.55 GB, peak = 78.10 GB

.......... .......... .......... .......... ..........

.......... .......... .......... .......... ..........

4.13 minutes spent in local assemblies, memory in use = 3.64 GB, peak = 78.10 GB

Mon Apr 27 20:43:33 2015: patch reserving space

Mon Apr 27 20:43:33 2015: memory in use = 3.64 GB

0.159 seconds used patching, peak mem usage = 78.10 GB

new_stuff.size( ) = 25139

Mon Apr 27 20:43:35 2015: building hb2

1.57 seconds used in new stuff 1 test

memory in use now = 3,913,150,464

Mon Apr 27 20:44:02 2015: back from buildBigKHBVFromReads

26.4 seconds used in new stuff 2 test

peak mem usage = 78.10 GB

4.07 seconds used in new stuff 5

Mon Apr 27 20:44:09 2015: finding interesting reads

Mon Apr 27 20:44:09 2015: memory in use = 3.61 GB, peak = 78.10 GB

Mon Apr 27 20:44:18 2015: building dictionary

Mon Apr 27 20:44:18 2015: memory in use = 3.61 GB, peak = 78.10 GB

Mon Apr 27 20:44:22 2015: reducing

Mon Apr 27 20:44:22 2015: memory in use = 5.10 GB, peak = 78.10 GB

We need 1 passes.

Expect 12921 keys per batch.

Provide 100000 keys per batch.

Mon Apr 27 20:44:28 2015: kmerizing

Mon Apr 27 20:44:28 2015: memory in use = 5.21 GB, peak = 78.10 GB

We need 1 passes.

Expect 25094 keys per batch.

Provide 100000 keys per batch.

Mon Apr 27 20:44:32 2015: cleaning

Mon Apr 27 20:44:32 2015: memory in use = 5.21 GB, peak = 78.10 GB

Mon Apr 27 20:44:35 2015: finding uniquely aligning edges

Mon Apr 27 20:44:35 2015: memory in use = 5.21 GB, peak = 78.10 GB

1.1 minutes used in new phase

hb.N( ) = 369790, hb.EdgeObjectCount( ) = 246052

945 paths improved by rerouting

Sum(invalid) = 375, npids = 13446211

153 edges tamped down

Mon Apr 27 20:45:13 2015: checking involution

Mon Apr 27 20:45:13 2015: done

WARNING: 125 suspicious read-paths.

Sum(invalid) = 261, npids = 13446211

84 edges tamped down

Mon Apr 27 20:45:47 2015: making paths index for pull apart

Mon Apr 27 20:45:50 2015: pulling apart repeats

0.0133 seconds used separating paths 1

1.08 seconds used in fixing mToLeft, mToRight, and mEdgeToPathIds

Mon Apr 27 20:45:52 2015: there were 88 repeats pulled apart.

Mon Apr 27 20:45:52 2015: there were 700 read paths removed during separation.

Mon Apr 27 20:45:53 2015: improving paths

Mon Apr 27 20:46:22 2015: done

21368 paths extended

Mon Apr 27 20:46:45 2015: start degloop

Mon Apr 27 20:46:45 2015: creating path index

Mon Apr 27 20:46:49 2015: starting loop

Mon Apr 27 20:46:51 2015: degloop complete

Mon Apr 27 20:46:58 2015: unwinding three-edge plasmids

Mon Apr 27 20:46:58 2015: removing small components

Mon Apr 27 20:47:14 2015: writing a.fin files

Mon Apr 27 20:47:37 2015: determining candidates

Mon Apr 27 20:47:37 2015: determining candidates

Mon Apr 27 20:47:37 2015: determining candidates

CN fraction good = 0.22

Mon Apr 27 20:47:41 2015: deleting 0 gaps and adding 0 gaps to force symmetry

Mon Apr 27 20:47:44 2015: done making gaps, time used = 4.16 seconds

Mon Apr 27 20:47:55 2015: determining candidates

Mon Apr 27 20:47:55 2015: determining candidates

Mon Apr 27 20:47:55 2015: determining candidates

0.0732 seconds using setting up final fasta

0.381 seconds using printing final fasta

assembly has 18224 edges of mean length 171.617

contig line N50: 2,067

scaffold line N50: 2,067

total bases in 1 kb+ scaffolds: 592,685

total bases in 10 kb+ scaffolds: 11,088

There are 26,892,422 reads of mean length 120.5 and mean base quality 35.4.

MPL1 = mean length of first read in pair up to first error = 2

(normal range is 175-225 for 250 base reads)

Estimated chimera rate in read pairs (including mismapping) = 1.43%.

genomic read coverage, using 1 kb+ scaffolds for genome size estimate: 5469.5

run started Mon Apr 27 19:16:20 2015, completed Mon Apr 27 20:48:14 2015

peak mem usage = 78.10 GB, total time = 1.53 hours

final checksum = 60623142131077

DiscovarDeNovo READS="frac:0.05,sample:18H::/scratch/bananaSlugBAMS/SW018_S1_L007_001.bam+frac:0.05,sample:19M::/scratch/bananaSlugBAMS/SW019_S1_L001_001.bam+frac:0.05,sample:19H::/scratch/bananaSlugBAMS/SW019_S2_L008_001.bam" OUT_DIR=/scratch/bananaSlugAssemblies

Mon Apr 27 20:48:14 2015: done
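
The reported coverage of 5469.5 looks implausible for a 5% subsample. It makes sense once you note that the log says the genome size estimate comes from the 1 kb+ scaffolds, which total only 592,685 bases. The sketch below is a rough check of that reading, not DiscovarDeNovo's documented method. It assumes coverage is total read bases divided by the genome size estimate; the function and variable names are made up for illustration, and the three figures are copied from the log above.

```python
# Figures copied from the run log above.
reads = 26_892_422            # "There are 26,892,422 reads ..."
mean_read_length = 120.5      # "... of mean length 120.5" (rounded in the log)
scaffold_bases_1kb = 592_685  # "total bases in 1 kb+ scaffolds: 592,685"


def estimated_coverage(n_reads, mean_len, genome_size):
    """Assumed formula: total sequenced bases / genome size estimate."""
    return n_reads * mean_len / genome_size


coverage = estimated_coverage(reads, mean_read_length, scaffold_bases_1kb)
print(f"approximate coverage: {coverage:.1f}x")
```

The result lands within a few units of the logged 5469.5. The small gap is consistent with the mean read length being rounded in the log. Under this reading, the very high coverage reflects how little of the genome assembled into 1 kb+ scaffolds, not unusually deep sequencing.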