I was wondering whether this library uses the bit parallel speed up tricks from: https://www.win.tue.nl/~jfg/educ/bit.mat.pdf https://users.dcc.uchile.cl/~gnavarro/ps/jea06.pdf They seem very worthwhile.