once features and bugs have settled, consider a more performant implementation in C. try leveraging existing C/C++ DAWG implementations.