Mapping sample distribution in a collection of alignments

To map the distribution of your samples across your dataset, add the --map flag to the command for finding unique IDs:

segul id -i [input-path] --map

The command generates two files. The first is the list of unique IDs in your dataset (named id.txt by default). This file is similar to the output of finding unique IDs. The second is a CSV file (named id_map.txt by default) that records the distribution of your samples across your alignments as TRUE/FALSE values. Its content looks like this:

Alignments    sequence_1    sequence_2    sequence_3
locus_1       TRUE          FALSE         TRUE
locus_2       TRUE          TRUE          TRUE
locus_3       FALSE         FALSE         TRUE
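
Once you have the map file, you may want to summarize it, for example by counting how many alignments contain each sample. The Python sketch below is one way to do that and is not part of segul. It assumes the file is comma-delimited, that the first row holds the sample IDs, and that the first column holds the alignment names, as in the example above. The function name count_loci_per_sample is hypothetical, and the inline data stands in for a real file.

```python
import csv
import io


def count_loci_per_sample(csv_file):
    """Count, for each sample, the alignments marked TRUE.

    Assumes a comma-delimited file whose first row is
    'Alignments,<sample IDs...>' and whose remaining rows are
    '<alignment name>,<TRUE/FALSE...>'.
    """
    reader = csv.reader(csv_file)
    header = next(reader)
    samples = header[1:]  # skip the alignment-name column
    counts = {sample: 0 for sample in samples}
    for row in reader:
        for sample, value in zip(samples, row[1:]):
            if value.strip().upper() == "TRUE":
                counts[sample] += 1
    return counts


# Inline data mirroring the example above (not real output).
example = """Alignments,sequence_1,sequence_2,sequence_3
locus_1,TRUE,FALSE,TRUE
locus_2,TRUE,TRUE,TRUE
locus_3,FALSE,FALSE,TRUE
"""

counts = count_loci_per_sample(io.StringIO(example))
for sample, n_loci in counts.items():
    print(f"{sample}: {n_loci}")
```

To run it on your own data, replace the io.StringIO object with a file opened using open(path, newline=""), where path points to the map file in your output directory.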