Nominally, [NSIMD](https://github.com/MouseLightProject/mltk-bary/blob/master/src/barycentric.avx.c#L247) could be 16 on a machine with AVX512, but it crashes.
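
For context, 16 is what you get when a 512-bit AVX512 register is divided into 32-bit elements. Below is a minimal sketch of that arithmetic, assuming 32-bit elements (which is what 16 lanes in 512 bits implies). `simd_lanes` and `EXAMPLE_REGISTER_BITS` are hypothetical names used only for illustration and do not come from the linked source. It does not attempt to explain the crash.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of the linked source): number of
 * elements that fit in one SIMD register of the given width. */
static size_t simd_lanes(size_t register_bits, size_t element_bits) {
    /* The element size must be nonzero and divide the register evenly. */
    assert(element_bits > 0 && register_bits % element_bits == 0);
    return register_bits / element_bits;
}

/* Compile-time register width, based on the macros that GCC and Clang
 * predefine when the matching instruction set is enabled.
 * Assumption: fall back to 128-bit (SSE-sized) registers otherwise. */
#if defined(__AVX512F__)
#define EXAMPLE_REGISTER_BITS 512
#elif defined(__AVX__)
#define EXAMPLE_REGISTER_BITS 256
#else
#define EXAMPLE_REGISTER_BITS 128
#endif
```

With these definitions, `simd_lanes(EXAMPLE_REGISTER_BITS, 32)` evaluates to 16 only when the code is compiled with AVX512F enabled, and to 8 with plain AVX.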