Releases: shenwei356/kmcp
Releases · shenwei356/kmcp
KMCP v0.9.4
Changes
v0.9.4 - 2023-09-18 
kmcp merge:- trim tailing tab in merged result, down-stream results are not affected. #38
kmcp profile:- fix log of "xx% (yy/zz) reads matched", the numerator should be the number of matched reads instead of reads belonging to references.
- early quit when no target detected.
- slightly speedup.
Links
| OS | Arch | File, 中国镜像 | Download Count |
|---|---|---|---|
| Linux | 64-bit | kmcp_linux_amd64.tar.gz, 中国镜像 |
|
| Linux | arm64 | kmcp_linux_arm64.tar.gz, 中国镜像 |
|
| macOS | 64-bit | kmcp_darwin_amd64.tar.gz, 中国镜像 |
|
| macOS | arm64 | kmcp_darwin_arm64.tar.gz, 中国镜像 |
|
| Windows | 64-bit | kmcp_windows_amd64.exe.tar.gz, 中国镜像 |
Notes:
- please open an issue to request binaries for other platforms or compile from the source.
- run
kmcp versionto check update !!! - run
kmcp autocompletionto update shell autocompletion script !!!
KMCP v0.9.3
Changes
v0.9.3 - 2023-07-16 
-
kmcp compute/split-genomes:- fix a bug in chunk computation when splitting circular genomes (
--circular).
- fix a bug in chunk computation when splitting circular genomes (
-
kmcp search/merge:-
append simple stats to the search result as comment lines, including the number of input and matched queries. e.g.,
# input queries: 930254 # matched queries: 923820 # matched percentage: 99.3084%
-
-
kmcp profile:-
fix metaphlan out format. #34
-
show stats of the number of input and matched queries in log. It would be helpful to show whether the reference genomes cover all microorganisms in the sample.
20:00:55.295 [INFO] 99.3084% (923820/930254) reads matched 20:00:55.295 [INFO] 100.0000% (923820/923820) matched reads belong to the 2 references in the profile
-
-
new tutorials:
KMCP v0.9.2
v0.9.2 - 2023-05-16 
kmcp profile/cos2simi/filter/index-info/merge-regions/query-fpr:- rename/unify the long flag
--out-prefixto--out-file.
- rename/unify the long flag
kmcp profile:- fix the number of reads belonging to references in the profile when no matches are found, which should be 0 instead of 1.
- new command:
kmcp utils index-density: plotting the element density of bloom filters for an index file.
An audience was concerned about it, but the results showed the elements (1s) are uniformly distributed in all BFs.
KMCP v0.9.1
v0.9.1 - 2022-12-26 
kmcp search- faster speed for ARM architectures.
- fix compilation for ARM architectures.
KMCP v0.9.0
Changes
v0.9.0 - 2022-09-28 
compute:- smaller output files and faster speed.
- more even genome splitting.
index:- faster speed due to smaller input files.
search:- more accurate and smaller query FPR following Theorem 2 in SBT paper, instead of the Chernoff bound.
- change the default value of
-f/--max-fprfrom 0.05 to 0.01. - 10-20% speedup.
profile:- more accurate abundance estimation using EM algorithm.
- change the default value of
-f/--max-fprfrom 0.05 to 0.01. - mode 0: change the default value of
-H/--min-hic-ureads-qcovfrom 0.55 to 0.7. - increase float width of reference coverage in KMCP profile format from 2 to 6.
util query-fpr:- compute query FPR following Theorem 2 in SBT paper, instead of the Chernoff bound.
- new commands:
utils split-genomesfor splitting genomes into chunks.utils ref-infofor printing information of reference (chunks), including the number of k-mers
and the actual false-positive rate.
KMCP v0.8.3
Changes
v0.8.3 - 2022-08-15
kmcp: fix compiling from source for ARM architectures. #17search:- fix searching with paired-end reads where the read2 is shorter than the value of
--min-query-len. #10 - fix the log. #8
- a new flag
-f/--max-fpr: maximal false positive rate of a query (default 0.05). It reduces the unnecessary output when searching with a low minimal query coverage (-t/--min-query-cov).
- fix searching with paired-end reads where the read2 is shorter than the value of
profile:- recommend using the flag
--no-amb-corrto disable ambiguous reads correction when >= 1000 candidates are detected. - fix logging when using
--level strainand no taxonomy given.
- recommend using the flag
KMCP v0.8.3-alpha
KMCP v0.8.2
Changes
- v0.8.2 - 2022-03-26
search:- flag
-g/--query-whole-file:- fix panic for invalid input.
- add gaps of
k-1bp before concatatenating seqs.
- add warning for invalid input.
- flag
profile:- allow modifying parts of parameters in preset profiling modes. #5
- decrease thresholds of minimal reads and unique reads in preset profiling modes 1 and 2 for low coverage sequence data.
the profiling results generated with mode 3 in the manuscript are not affected.
KMCP v0.8.1
Changelog
- v0.8.1 - 2022-03-07
- update help message, show common usages, add examples, add notes to important options.
KMCP v0.8.0
v0.8.0 - 2022-02-24 
- commands:
- new command
utils cov2simi: Convert k-mer coverage to sequence similarity. - new command
utils query-fpr: Compute the maximal false positive rate of a query.
- new command
compute:- update doc.
- add flags compatibility check.
search:- output the false positive rate of each match, rather than the FPR upper bound of the query.
this could save some short queries with high similarity. - change default values of reads filter, because clinical data contain many short reads.
-c/--min-uniq-reads:30->10.-m/--min-query-len:70->30.
- update doc.
- output the false positive rate of each match, rather than the FPR upper bound of the query.
profile:- rename flags:
--keep-main-matches->--keep-main-matches.--keep-perfect-match->--keep-perfect-matches.
- change default values:
--max-qcov-gap:0.2->0.4.
- mode 0 (pathogen detection):
- switch on flag
--keep-main-matches - use
--max-qcov-gap 0.4
- switch on flag
- update doc.
- rename flags: