Skip to content
This repository was archived by the owner on Dec 23, 2025. It is now read-only.
This repository was archived by the owner on Dec 23, 2025. It is now read-only.

[FEATURE] - EfficientQAT? Supposedly allows for a 123b to be 35% of the size, with 4% accuracy loss.  #5

@SabinStargem

Description

@SabinStargem

Apparently it is a new method for doing quantization? Here is the reddit and Github, so that you can see whether it is worth rolling into AutoGGUF.

Quantize 123b to 35%

EfficientQAT Github

Thank you for AutoGGUF, I am looking forward to handling quantizations without being an acolyte of the command-line. :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions