Hi,
I wanted to confirm whether AQLM’s quantization methods can be applied to Qwen models, given that the Qwen architecture is closely related to LLaMA’s. Are there any known limitations or additional steps required when using AQLM with Qwen checkpoints?
Thanks!