README 1.3 KB

123456789101112131415161718192021222324252627
  1. SeqKit - a cross-platform and ultrafast toolkit for FASTA/Q file
  2. manipulation
  3. FASTA and FASTQ are basic and ubiquitous formats for storing nucleotide
  4. and protein sequences. Common manipulations of FASTA/Q file include
  5. converting, searching, filtering, deduplication, splitting, shuffling,
  6. and sampling. Existing tools only implement some of these manipulations,
  7. and not particularly efficiently, and some are only available for
  8. certain operating systems. Furthermore, the complicated installation
  9. process of required packages and running environments can render these
  10. programs less user friendly.
  11. This project describes a cross-platform ultrafast comprehensive toolkit
  12. for FASTA/Q processing. SeqKit provides executable binary files for all
  13. major operating systems, including Windows, Linux, and Mac OS X, and can
  14. be directly used without any dependencies or pre-configurations. SeqKit
  15. demonstrates competitive performance in execution time and memory usage
  16. compared to similar tools. The efficiency and usability of SeqKit enable
  17. researchers to rapidly accomplish common FASTA/Q file manipulations.
  18. Note: This just repackages the binaries provided from upstream.
  19. Please cite:
  20. Wei Shen,Shuai Le,Yan Li ,Fuquan Hu. SeqKit: A Cross-Platform and
  21. Ultrafast Toolkit for FASTA/Q File Manipulation. October 5, 2016
  22. https://doi.org/10.1371/journal.pone.0163962