$ sga overlap --help Usage: sga overlap [OPTION] ... READSFILE Compute pairwise overlap between all the sequences in READS --help display this help and exit -v, --verbose display verbose output -t, --threads=NUM use NUM worker threads to compute the overlaps (default: no threading) -e, --error-rate the maximum error rate allowed to consider two sequences aligned (default: exact matches only) -m, --min-overlap=LEN minimum overlap required between two reads (default: 45) -p, --prefix=PREFIX use PREFIX for the names of the index files (default: prefix of the input file) -f, --target-file=FILE perform the overlap queries against the reads in FILE -x, --exhaustive output all overlaps, including transitive edges --exact force the use of the exact-mode irreducible block algorithm. This is faster but requires that no substrings are present in the input set. -l, --seed-length=LEN force the seed length to be LEN. By default, the seed length in the overlap step is calculated to guarantee all overlaps with --error-rate differences are found. This option removes the guarantee but will be (much) faster. As SGA can tolerate some missing edges, this option may be preferable for some data sets. -s, --seed-stride=LEN force the seed stride to be LEN. This parameter will be ignored unless --seed-length is specified (see above). This parameter defaults to the same value as --seed-length -d, --sample-rate=N sample the symbol counts every N symbols in the FM-index. Higher values use significantly less memory at the cost of higher runtime. This value must be a power of 2 (default: 128) Report bugs to js18@sanger.ac.uk
Discussion
? Optimal: i) algorithm (-a)