Use simulation/simulate_all_data.sh. Need to specify your own path to ms and seqgen in simulation/model.py The simulation process is described below:
- generate homogeneous gene trees and markers
- generate heterogeneous gene trees and markers
- infer gene trees using iqtree and obtain estimated gene tree errors
- randomly resolve short branches
Use command Infernetwork_ML in Phylonet to infer network prepare_ML_nex.sh, run_ML.sh are two scripts may help
Use summarize_result.sh
You can use CalGTProb Command in PhyloNet for each gene tree topology
- write gt probs to a csv: summarize_probs.py
- compute test statistics- approximate: summarize_stats_5taxa.py
- plot figures with summarize_plot.py
- summarize_triplet_probs.py
- summarize_triplet_stats.py
- summarize_triplet_multitest.py
- plot figures with summarize_triplet_plot.py
The scripts to run D3 are: D3_test/run_D3.sh and D3_test/run_D3_gt.sh
- Use run_quartet_probs.sh to prepare input files.
- Put mydata.jl and run_mydata.sh in to directory of QuartetGoodnessFit and run
- summarize results with summarize_pval.py
This is implemented in the code base of PhyloNet.
Download: https://github.com/NakhlehLab/PhyloNet/releases
Nexus line:
MCMC_SEQ -cl 20000000 -bl 10000000 -sf 5000 -sd 12345 -pl 24 -tm <C:C_0;L:L_0;R:R_0;Q:Q_0>;
MCMC_SEQ -cl 20000000 -bl 10000000 -sf 5000 -sd 12345 -pl 24 -tm <C:C_0;L:L_0;R:R_0;Q:Q_0> -murate;
The -murate switch is to enable the sampling of varying substitution rates.