README 2.8 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758
  1. HyPhy: Hypothesis testing using Phylogenies
  2. HyPhy is an open-source software package for the analysis of genetic
  3. sequences (in particular the inference of natural selection) using
  4. techniques in phylogenetics, molecular evolution, and machine learning.
  5. It features a rich scripting language for limitless customization of
  6. analyses. Additionally, HyPhy features support for parallel computing
  7. environments (via message passing interface).
  8. HyPhy was designed to allow the specification and fitting of a broad
  9. class of continuous-time discrete-space Markov models of sequence
  10. evolution. To implement these models, HyPhy provides its own scripting
  11. language - HBL, or HyPhy Batch Language, which can be used to develop
  12. custom analyses or modify existing ones. Importantly, it is not
  13. necessary to learn (or even be aware of) HBL in order to use HyPhy, as
  14. most common models and analyses have been implemented for user
  15. convenience. Once a model is defined, it can be fitted to data (using a
  16. fixed topology tree), its parameters can be constrained in user-defined
  17. ways to test various hypotheses (e.g. is rate1 > rate2), and simulate
  18. data from. HyPhy primarily implements maximum likelihood methods, but
  19. it can also be used to perform some forms of Bayesian inference (e.g.
  20. FUBAR), fit Bayesian graphical models to data, run genetic algorithms to
  21. perform complex model selection.
  22. Features
  23. - Support for arbitrary sequence data, including nucleotide, amino-acid,
  24. codon, binary, count (microsattelite) data, including multiple
  25. partitions mixing differen data types.
  26. - Complex models of rate variation, including site-to-site, branch-to-
  27. branch, hidden markov model (autocorrelated rates), between/within
  28. partitions, and co-varion type models.
  29. - Fast numerical fitting routines, supporting parallel and distributed
  30. execution.
  31. - A broad collection of pre-defined evolutionary models.
  32. - The ability to specify flexible constraints on model parameters and
  33. estimate confidence intervals on MLEs.
  34. - Ancestral sequence reconstruction and sampling.
  35. - Simulate data from any model that can be defined and fitted in the
  36. language.
  37. - Apply unique (for this domain) machine learning methods to discover
  38. patterns in the data, e.g. genetic algorithms, stochastic context free
  39. grammars, Bayesian graphical models.
  40. - Script analyses completely in HBL including flow control, I/O,
  41. parallelization, etc.
  42. Registration
  43. you are highly advised to fill the registration form found at:
  44. https://veg.github.io/hyphy-site/register/
  45. NOTE!
  46. The script builds two executables: HYPHYMP, which uses pthreads to do
  47. multiprocessing and HYPHYMPI, which uses openmpi (hence the dependency).
  48. Citing
  49. Sergei L. Kosakovsky Pond, Simon D. W. Frost and Spencer V. Muse (2005)
  50. HyPhy: hypothesis testing using phylogenies.
  51. Bioinformatics 21(5): 676-679