Maybe we could use upstream `candle`, which upgraded cudarc here: https://github.com/huggingface/candle/pull/3078