Implement I-quants #67

Open
EricLBuehler wants to merge 30 commits into main from iq_quants

@EricLBuehler
Owner

No description provided.

@EricLBuehler
Owner Author

@bgergely0 I saw your PR to candle upstream, and I've integrated it here for testing!

@bgergely0
Thanks @EricLBuehler, let me know if you have any questions about it.

Btw, since I did this patch, llama.cpp also got support for i8mm-based matmuls.

2 participants