This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
|
contributors:team_3:filter [2015/04/15 00:04] jolespin created |
contributors:team_3:filter [2015/09/02 16:24] (current) ceisenhart ↷ Page moved from contributors:2015:team_3:filter to contributors:team_3:filter |
||
|---|---|---|---|
| Line 1: | Line 1: | ||
| <code> | <code> | ||
| - | $ sga filter --help | + | $ sga filter --help |
| - | Usage: sga filter [OPTION] ... READSFILE | + | Usage: sga filter [OPTION] ... READSFILE |
| - | Remove reads from a data set. | + | Remove reads from a data set. |
| - | The currently available filters are removing exact-match duplicates | + | The currently available filters are removing exact-match duplicates |
| - | and removing reads with low-frequency k-mers. | + | and removing reads with low-frequency k-mers. |
| - | Automatically rebuilds the FM-index without the discarded reads. | + | Automatically rebuilds the FM-index without the discarded reads. |
| - | --help display this help and exit | + | --help display this help and exit |
| - | -v, --verbose display verbose output | + | -v, --verbose display verbose output |
| - | -p, --prefix=PREFIX use PREFIX for the names of the index files (default: prefix of the input file) | + | -p, --prefix=PREFIX use PREFIX for the names of the index files (default: prefix of the input file) |
| - | -o, --outfile=FILE write the qc-passed reads to FILE (default: READSFILE.filter.pass.fa) | + | -o, --outfile=FILE write the qc-passed reads to FILE (default: READSFILE.filter.pass.fa) |
| - | -t, --threads=NUM use NUM threads to compute the overlaps (default: 1) | + | -t, --threads=NUM use NUM threads to compute the overlaps (default: 1) |
| - | -d, --sample-rate=N use occurrence array sample rate of N in the FM-index. Higher values use significantly | + | -d, --sample-rate=N use occurrence array sample rate of N in the FM-index. Higher values use significantly |
| - | less memory at the cost of higher runtime. This value must be a power of 2 (default: 128) | + | less memory at the cost of higher runtime. This value must be a power of 2 (default: 128) |
| - | --no-duplicate-check turn off duplicate removal | + | --no-duplicate-check turn off duplicate removal |
| - | --substring-only when removing duplicates, only remove substring sequences, not full-length matches | + | --substring-only when removing duplicates, only remove substring sequences, not full-length matches |
| - | --no-kmer-check turn off the kmer check | + | --no-kmer-check turn off the kmer check |
| - | --kmer-both-strand mimimum kmer coverage is required for both strand | + | --kmer-both-strand mimimum kmer coverage is required for both strand |
| - | --homopolymer-check check reads for hompolymer run length sequencing errors | + | --homopolymer-check check reads for hompolymer run length sequencing errors |
| - | --low-complexity-check filter out low complexity reads | + | --low-complexity-check filter out low complexity reads |
| - | K-mer filter options: | + | K-mer filter options: |
| - | -k, --kmer-size=N The length of the kmer to use. (default: 27) | + | -k, --kmer-size=N The length of the kmer to use. (default: 27) |
| - | -x, --kmer-threshold=N Require at least N kmer coverage for each kmer in a read. (default: 3) | + | -x, --kmer-threshold=N Require at least N kmer coverage for each kmer in a read. (default: 3) |
| - | Report bugs to js18@sanger.ac.uk | + | Report bugs to js18@sanger.ac.uk |
| </code> | </code> | ||