We use Hudson's MS simulator. This program simulates data sets with a pure neutral model of evolution, under various demographic models. The 'ms' program takes 2 arguments: - A number of individuals (samples) to simulate - A number of data sets (replicates) to simulate
In addition, it takes some switch to specify the demography model. The simplest one is the -t switch, which provides a value of \(\theta = 4 \cdot Ne \cdot u\). The following command generates one sample of 10 individuals with \(\theta = 4\).
ms 10 1 -t 4
A value of theta = 4 corresponds, for instance, to a mutation rate of 1e-6 and a population size of 1e6.