I'd like to add a lesson somewhere where the distribution is over 8 sequences of length 3.
Each sequence is (triangle, square, pentagon).
But each shape in the sequence could be either solid or hollow.
Features are "solid triangle," "solid square," "solid pentagon",
"triangle matches square", "square matches pentagon".
This is analogous to CRF POS tagging. Maybe we can even bring out the connection via discussion or questions in the instructions.