For example academia appears as ăcădēmī^a.

I noticed this running a script to test if the key matches the first word of the entry (after stripping accents, etc.). It should be possible to write a script that fixes this; I don't think there should be (m)any false positives.