12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758 |
- HyPhy: Hypothesis testing using Phylogenies
- HyPhy is an open-source software package for the analysis of genetic
- sequences (in particular the inference of natural selection) using
- techniques in phylogenetics, molecular evolution, and machine learning.
- It features a rich scripting language for limitless customization of
- analyses. Additionally, HyPhy features support for parallel computing
- environments (via message passing interface).
- HyPhy was designed to allow the specification and fitting of a broad
- class of continuous-time discrete-space Markov models of sequence
- evolution. To implement these models, HyPhy provides its own scripting
- language - HBL, or HyPhy Batch Language, which can be used to develop
- custom analyses or modify existing ones. Importantly, it is not
- necessary to learn (or even be aware of) HBL in order to use HyPhy, as
- most common models and analyses have been implemented for user
- convenience. Once a model is defined, it can be fitted to data (using a
- fixed topology tree), its parameters can be constrained in user-defined
- ways to test various hypotheses (e.g. is rate1 > rate2), and simulate
- data from. HyPhy primarily implements maximum likelihood methods, but
- it can also be used to perform some forms of Bayesian inference (e.g.
- FUBAR), fit Bayesian graphical models to data, run genetic algorithms to
- perform complex model selection.
- Features
- - Support for arbitrary sequence data, including nucleotide, amino-acid,
- codon, binary, count (microsattelite) data, including multiple
- partitions mixing differen data types.
- - Complex models of rate variation, including site-to-site, branch-to-
- branch, hidden markov model (autocorrelated rates), between/within
- partitions, and co-varion type models.
- - Fast numerical fitting routines, supporting parallel and distributed
- execution.
- - A broad collection of pre-defined evolutionary models.
- - The ability to specify flexible constraints on model parameters and
- estimate confidence intervals on MLEs.
- - Ancestral sequence reconstruction and sampling.
- - Simulate data from any model that can be defined and fitted in the
- language.
- - Apply unique (for this domain) machine learning methods to discover
- patterns in the data, e.g. genetic algorithms, stochastic context free
- grammars, Bayesian graphical models.
- - Script analyses completely in HBL including flow control, I/O,
- parallelization, etc.
- Registration
- you are highly advised to fill the registration form found at:
- https://veg.github.io/hyphy-site/register/
- NOTE!
- The script builds two executables: HYPHYMP, which uses pthreads to do
- multiprocessing and HYPHYMPI, which uses openmpi (hence the dependency).
- Citing
- Sergei L. Kosakovsky Pond, Simon D. W. Frost and Spencer V. Muse (2005)
- HyPhy: hypothesis testing using phylogenies.
- Bioinformatics 21(5): 676-679
|