Two options:
- Encode nucleotides with two bits (four states = four nucleotides)
- Encode with four bits for presence/absence of each nucleotide, which enables us to encode mixtures (preferred).
Computing number of differences should be attainable by a fast bitwise operator.