XRATE is a program for prototyping phylogenetic hidden Markov models and stochastic context-free grammars.[1][2] It is used to discover patterns of evolutionary conservation in sequence alignments. The program can estimate the parameters of such models from "training" alignment data, or apply a parameterized model to annotate new alignments. It allows the user to specify a variety of models of DNA sequence evolution, which can be combined in arbitrary ways using formal grammars.

XRATE
Developer(s): Ian Holmes (UC Berkeley)
Stable release: 1
Operating system: UNIX, Linux, Mac, Cygwin on Windows XP
Type: Bioinformatics tool
Licence: Open source
Website: XRate homepage

As an example of how XRATE is used, consider a protein-coding gene consisting of exons interspersed with introns. The exons contain triplets of nucleotides (codons) that are translated by ribosomes according to the genetic code, and are consequently under selection pressure, since a mutation may change the encoded amino acid sequence. In contrast, the introns are under fewer selective constraints and tend to evolve faster. These differing pressures show up clearly in multiple alignments. The sequential layout of introns and exons can be described using formal grammar theory (originally from linguistics), and each of their distinct evolutionary signatures can be modeled as a continuous-time Markov process. XRATE allows the user to specify such models in a configuration file and to estimate their parameters (evolutionary rates, length distributions of exons and introns, etc.) directly from alignment data, using the expectation-maximization algorithm.[3]
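The contrast between slowly and quickly evolving regions can be illustrated with a small numerical sketch. The Python code below is not XRATE code and does not use XRATE's configuration format; it assumes the simple Jukes-Cantor (JC69) continuous-time Markov model, and the function names, rates, and branch length are hypothetical values chosen only for illustration.

```python
import math

def jc69_prob(same: bool, rate: float, t: float) -> float:
    """Transition probability under the Jukes-Cantor (JC69) model.

    rate is the total substitution rate per site and t is the elapsed time.
    Returns P(X_t = j | X_0 = i) for j == i (same=True) or for one
    specific j != i (same=False).
    """
    decay = math.exp(-4.0 * rate * t / 3.0)
    if same:
        return 0.25 + 0.75 * decay
    return 0.25 - 0.25 * decay

def prob_site_differs(rate: float, t: float) -> float:
    """Probability that a site shows a different nucleotide after time t."""
    return 1.0 - jc69_prob(True, rate, t)

# Hypothetical rates: a constrained (exon-like) and a less constrained
# (intron-like) region observed over the same branch length.
EXON_RATE = 0.2
INTRON_RATE = 1.0
BRANCH_LENGTH = 0.5

exon_diff = prob_site_differs(EXON_RATE, BRANCH_LENGTH)
intron_diff = prob_site_differs(INTRON_RATE, BRANCH_LENGTH)
print(f"exon-like sites expected to differ:   {exon_diff:.3f}")
print(f"intron-like sites expected to differ: {intron_diff:.3f}")
```

Under these assumptions the faster process leaves a larger expected fraction of differing alignment columns, which is the kind of signal a parameter-estimation procedure can use to tell the two region types apart.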

XRATE can be downloaded as part of the DART software package. It accepts input files in Stockholm format.
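For readers unfamiliar with Stockholm format, the following Python sketch reads the sequence rows of a small alignment. It is a simplified illustration rather than the parser used by XRATE or DART: the function name `parse_stockholm` and the example alignment are hypothetical, and annotation lines (those beginning with `#`) are skipped rather than interpreted.

```python
def parse_stockholm(text: str) -> dict:
    """Return {sequence name: aligned sequence} from Stockholm-format text.

    Simplified sketch: markup lines starting with '#' (including the
    '# STOCKHOLM 1.0' header and '#=GF'/'#=GC' annotations) are ignored,
    and parsing stops at the '//' terminator.
    """
    sequences = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # blank line or annotation/header line
        if line == "//":
            break  # end of the alignment
        name, chunk = line.split(None, 1)
        # Interleaved files repeat a name in later blocks; append each chunk.
        sequences[name] = sequences.get(name, "") + chunk.replace(" ", "")
    return sequences

# Hypothetical two-sequence alignment, for illustration only.
EXAMPLE = """# STOCKHOLM 1.0
#=GF ID example
human  ACGTACGT
mouse  ACGTTCGT
#=GC SS_cons ........
//
"""

alignment = parse_stockholm(EXAMPLE)
for name, seq in alignment.items():
    print(name, seq)
```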

References

  1. ^ Westesson, O.; Holmes, I. (2012). "Developing and applying heterogeneous phylogenetic models with XRate". PLOS ONE. 7 (6): e36898. arXiv:1202.3834. Bibcode:2012PLoSO...736898W. doi:10.1371/journal.pone.0036898. PMC 3367922. PMID 22693624.
  2. ^ Klosterman, P. S.; Uzilov, A. V.; Bendaña, Y. R.; Bradley, R. K.; Chao, S.; Kosiol, C.; Goldman, N.; Holmes, I. (2006). "XRate: A fast prototyping, training and annotation tool for phylo-grammars". BMC Bioinformatics. 7: 428. doi:10.1186/1471-2105-7-428. PMC 1622757. PMID 17018148.
  3. ^ Holmes, I.; Rubin, G. M. (2002). "An expectation maximization algorithm for training hidden substitution models". Journal of Molecular Biology. 317 (5): 753–764. doi:10.1006/jmbi.2002.5405. PMID 11955022.