Feature 3: Dissimilarity of distribution of two pairs of prolines and lysines in positions 17-20. Close inspection of the AKL domain visualised with Sequence Bundles exposes the dissimilar nature of the distribution of two pairs of residues in the Gram-positives (black) sub-dataset, which is difficult to be observed with the use of existing visualisation methods or in the general consensus sequence (in positions 17-20 it is: ...PPKK...). The Sequence Bundles method preserves continuity of sequences by visualising them as uninterrupted lines which reveals that while the majority of sequences in positions 17-18 display a consecutive pair of prolines (indicated by a thick horizontal 'bridge' between P-P in panel A), one part of the Gram-positive sequences display a lysine in position 19, while another part display a lysine in position 20. Note that very few black lines bridge the gap between K-K in the Sequence Bundles visualisation (panel B)--the majority of sequences include only one of the lysines. This data feature remains hidden in the Sequence Logo, as well as in the general consensus sequence itself. In fact, only 23 sequences display the exact ...PPKK... motif fully consistent with the general consensus sequence. The reason for this dissimilarity of residue distribution in the MSA remains to be explained and interpreted.