muscle5.1 3.5 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788899091929394
  1. .\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.48.5.
  2. .TH MUSCLE "1" "January 2022" "muscle 5.1" "User Commands"
  3. .SH NAME
  4. muscle \- Multiple alignment program of protein sequences
  5. .SH DESCRIPTION
  6. MUSCLE is a multiple alignment program for protein sequences. MUSCLE
  7. stands for multiple sequence comparison by log-expectation. In the
  8. authors tests, MUSCLE achieved the highest scores of all tested
  9. programs on several alignment accuracy benchmarks, and is also one of
  10. the fastest programs out there.
  11. .SH USAGE
  12. .SS "Align FASTA input, write aligned FASTA (AFA) output:"
  13. .IP
  14. muscle \fB\-align\fR input.fa \fB\-output\fR aln.afa
  15. .PP
  16. Align large input using Super5 algorithm if \fB\-align\fR is too expensive,
  17. typically needed with more than a few hundred sequences:
  18. .IP
  19. muscle \fB\-super5\fR input.fa \fB\-output\fR aln.afa
  20. .SS "Single replicate alignment:"
  21. .IP
  22. muscle \fB\-align\fR input.fa \fB\-perm\fR PERM \fB\-perturb\fR SEED \fB\-output\fR aln.afa
  23. muscle \fB\-super5\fR input.fa \fB\-perm\fR PERM \fB\-perturb\fR SEED \fB\-output\fR aln.afa
  24. .IP
  25. PERM is guide tree permutation none, abc, acb, bca (default none).
  26. SEED is perturbation seed 0, 1, 2... (default 0 = don't perturb).
  27. .PP
  28. Ensemble of replicate alignments, output in Ensemble FASTA (EFA) format,
  29. EFA has one aligned FASTA for each replicate with header line "<PERM.SEED":
  30. .IP
  31. muscle \fB\-align\fR input.fa \fB\-stratified\fR \fB\-output\fR stratified_ensemble.efa
  32. muscle \fB\-align\fR input.fa \fB\-diversified\fR \fB\-output\fR diversified_ensemble.afa
  33. .HP
  34. \fB\-replicates\fR N
  35. .IP
  36. Number of replicates, defaults 4, 100, 100 for stratified,
  37. .IP
  38. diversified, resampled. With \fB\-stratified\fR there is one
  39. replicate per guide tree permutation, total is 4 x N.
  40. .PP
  41. Generate resampled ensemble from existing ensemble by sampling columns
  42. with replacement:
  43. .IP
  44. muscle \fB\-resample\fR ensemble.efa \fB\-output\fR resampled.efa
  45. .HP
  46. \fB\-maxgapfract\fR F
  47. .IP
  48. Maximum fraction of gaps in a column (F=0..1, default 0.5).
  49. .HP
  50. \fB\-minconf\fR CC
  51. .IP
  52. Minimum column confidence (CC=0..1, default 0.5).
  53. .PP
  54. If ensemble output filename has @, then one FASTA file is generated
  55. for each replicate where @ is replaced by perm.s, otherwise all replicates
  56. are written to one EFA file.
  57. .SS "Calculate disperson of an ensemble:"
  58. .IP
  59. muscle \fB\-disperse\fR ensemble.efa
  60. .SS "Extract replicate with highest total CC (diversified input recommended):"
  61. .IP
  62. muscle \fB\-maxcc\fR ensemble.efa \fB\-output\fR maxcc.afa
  63. .SS "Extract aligned FASTA files from EFA file:"
  64. .IP
  65. muscle \fB\-efa_explode\fR ensemble.efa
  66. .SS "Convert FASTA to EFA, input has one filename per line:"
  67. .IP
  68. muscle \fB\-fa2efa\fR filenames.txt \fB\-output\fR ensemble.efa
  69. .PP
  70. Update ensemble by adding two sequences of digits to each replicate, digits
  71. are column confidence (CC) values, e.g. "73" means CC=0.73, "++" is CC=1.0:
  72. .IP
  73. muscle \fB\-addconfseqs\fR ensemble.efa \fB\-output\fR ensemble_cc.efa
  74. .PP
  75. Calculate letter confidence (LC) values, \fB\-ref\fR specifies the alignment to
  76. compare against the ensemble (e.g. from \fB\-maxcc\fR), output is in aligned
  77. FASTA format with LC values 0, 1 ... 9 instead of letters:
  78. .IP
  79. muscle \fB\-letterconf\fR ensemble.efa \fB\-ref\fR aln.afa \fB\-output\fR letterconf.afa
  80. .HP
  81. \fB\-html\fR aln.html
  82. .IP
  83. Alignment colored by LC in HTML format.
  84. .HP
  85. \fB\-jalview\fR aln.features
  86. .IP
  87. Jalview feature file with LC values and colors.
  88. .SS "More documentation at:"
  89. .IP
  90. https://drive5.com/muscle
  91. .SH AUTHOR
  92. This manpage was written by Andreas Tille for the Debian distribution and
  93. can be used for any other usage of the program.