I tried running the demo on 11s audio and it took 4.2s to process. ``` extractor = PhonemeTimestampAligner( preset="en-us", # Automatically selects best English model duration_max=12, device='cpu' ) ``` I have macbook m1 pro.