Release v0.2.7
The juiciest bits 🚀
feat: add janus support for quantization+torch.compile combo(s) by @llcnt in #145
You can now decrease the memory impact and the latency of the autoregressive Image Generation model janus(pro-7b) model by quantizing and compiling it.
feat: modular pruning by @begumcig in #154
Pruning is now agnostic to submodules! This recent update enables safe, module-level pruning by finding each target module’s interior, boundary, and exterior, pruning only the interior while auto-patching surrounding shapes.
feat: extend accelerate compatibility by @johannaSommer in #234
We are rolling out more support for base models distributed with accelerate. Several cachers, factorizers and more quantizers are now compatible!
feat: enhance model saving functionality with pro support by @davidberenstein1957 in #200
Saving your smashed models to Huggingface Hub just became easier - we added additional support to better distinguish models that were smashed with pruna and pruna_pro
feat: simplify the setup (tests + dev = dev) by @GreatBahram in #210
@GreatBahram was at it again this release, simplifying the installation and setup for contributors by removing dependency groups - a simple uv sync --extra dev does the job and you can start contributing!
build: reduce core dependencies in pyproject.toml by @ParagEkbote in #227
@ParagEkbote made pruna more lightweight and reduced the package dependencies by 20%!
Pruning some bugs 🐞 and maintenance 🧑🌾
- ci: add mission permission set to
package_build.yamlby @johannaSommer in #229 - tests: simplify diffusers fixture construction by @johannaSommer in #159
- build: reduce core dependencies in
pyproject.tomlby @ParagEkbote in #227 - feat: Refactor CI to use shared setup-uv-project action by @GreatBahram in #211
- fix:
device_mapspecification foraccelerate-compatible quantizers by @johannaSommer in #226 - chore: add github documentation on privately reporting vulnerabilities by @SaboniAmine in #217
- test: add pytest and code coverage configuration in
pyproject.tomlby @ParagEkbote in #230 - build: fix MacOS installation issues with bnb and
uvindex resolution by @johannaSommer in #235 - chore: enhance CI workflows with linting and concurrency controls by @davidberenstein1957 in #237
- ci: update algorithm generation workflow by @johannaSommer in #241
- ci: cleanup workflows 🧹 by @GreatBahram in #212
- build: pin
datasetsversion by @johannaSommer in #251 - tests: add
durationsargument to pytest config by @johannaSommer in #252 - fix: change janus import into automodel import by @llcnt in #243
- fix:
gptqandllmcompressortests by @johannaSommer in #231 - docs: review documentation failures before release by @sdiazlor in #238
- ci: add nightly workflow to mark PRs as stale by @johannaSommer in #242
- fix: installation test for gliner dependency by @begumcig in #255
New Contributors
- @ParagEkbote made their first contribution in #227
- @sdiazlor made their first contribution in #238
Full Changelog: v0.2.6...v0.2.7