Accounting for Solvent Accessibility and Secondary Structure in Protein Phylogenetics is Highly Beneficial.

Le S.Q., Gascuel O.
Systematic Biology, 59(3): 277-287, 2010

Example

example

Command line

./PhyML-structure -m EX_EHO -M CONF/MIX -i Ord0300_2hhi.STR
   
Note: PhyML-structure automatically extracts structure information if structure information are put at the end of alignment files. If both alignment files and '-y' option files contain structure information, then PhyML-structure uses the structure in '-y' option.

CONF/MIX

./PhyML-structure -m EX_EHO -M CONF/MIX -i Ord0300_2hhi.STR

Output file:
. Sequence file : Ord0300_2hhi.STR
. Data set : #1
. Initial tree : user tree (Ord0300_2hhi.STR.tree)
. Model of amino acids substitution: EX_EHO
. Mode: CONF/MIX [Ord0300_2hhi.STR]
. Number of taxa : 10
. Log-likelihood : -2341.10455
. Discrete gamma model : Yes
  - Number of categories : 4
  - Gamma shape parameter : 3.538
. Confidence    0.592200

------ mixture model in detail -----
Model   Proportion      Pinvar          Rate
E-bur   0.076740        0.000000        0.761817
H-bur   0.009645        0.000000        0.744426
O-bur   0.479179        0.000000        0.532458                    
E-exp   0.058938        0.000000        1.563939
H-exp   0.000299        0.000000        2.064034
O-exp   0.375200        0.000000        1.433368
. Time used 0h0m17s
   

CONF/LG

./PhyML-structure -m EX_EHO -M CONF/LG -i Ord0300_2hhi.STR

Output file:
. Sequence file : Ord0300_2hhi.STR
. Data set : #1
. Initial tree : user tree (Ord0300_2hhi.STR.tree)
. Model of amino acids substitution: EX_EHO
. Mode: CONF/LG  [Ord0300_2hhi.STR]
. Number of taxa : 10
. Log-likelihood : -2344.08101
. Discrete gamma model : Yes
  - Number of categories : 4
  - Gamma shape parameter : 2.572
. Confidence    0.598922
. Time used 0h1m29s
. -> 89 seconds
   

PART

./PhyML-structure -m EX_EHO -M PART -i Ord0300_2hhi.STR

Output file:
. Sequence file : Ord0300_2hhi.STR
. Data set : #1
. Initial tree : user tree (Ord0300_2hhi.STR.tree)
. Model of amino acids substitution: EX_EHO
. Mode: PART     [Ord0300_2hhi.STR]
. Number of taxa : 10
. Log-likelihood : -2359.35756
. Discrete gamma model : Yes
  - Number of categories : 4
  - Gamma shape parameter : 1.882
. Time used 0h0m11s
   

MIX

./PhyML-structure -m EX_EHO -M MIX -i Ord0300_2hhi.STR

Output file:
. Sequence file : Ord0300_2hhi.STR
. Data set : #1
. Initial tree : user tree (Ord0300_2hhi.STR.tree)
. Model of amino acids substitution: EX_EHO
. Mode: MIX
. Number of taxa : 10
. Log-likelihood : -2360.83788
. Discrete gamma model : Yes
  - Number of categories : 4
  - Gamma shape parameter : 4.060

------ mixture model in detail -----
Model   Proportion      Pinvar          Rate
E-bur   0.160239        0.000000        0.761817
H-bur   0.036653        0.000000        0.744426
O-bur   0.182865        0.000000        0.532458
E-exp   0.146034        0.000000        1.563939
H-exp   0.000429        0.000000        2.064034
O-exp   0.473781        0.000000        1.433368
. Time used 0h0m17s