
Release v0.2.7


@johannaSommer released this 14 Jul 09:46 · 172 commits to main since this release · 87eae01

The juiciest bits 🚀

feat: add janus support for quantization+torch.compile combo(s) by @llcnt in #145

You can now decrease the memory footprint and latency of the autoregressive image generation model Janus (Pro-7B) by quantizing and compiling it.
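To see why quantization cuts memory, here is a minimal, self-contained sketch of affine int8 quantization in plain Python. It is illustrative only: it does not use pruna's actual quantizers or `torch.compile`, and all names here are made up for the example. The core idea is that float32 weights (4 bytes each) are mapped to int8 (1 byte each) plus a shared scale and zero point, for roughly a 4x reduction in weight memory.

```python
# Illustrative affine int8 quantization (not pruna's implementation).
# Each float weight w is stored as round(w / scale) + zero_point,
# clamped to the int8 range [-128, 127].

def quantize_int8(weights):
    """Affine-quantize a list of floats to int8 values plus scale/zero-point."""
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / 255 or 1.0  # avoid zero scale for constant weights
    zero_point = round(-w_min / scale) - 128
    q = [max(-128, min(127, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point

def dequantize_int8(q, scale, zero_point):
    """Recover approximate float weights from the int8 representation."""
    return [(qi - zero_point) * scale for qi in q]

weights = [0.25, -1.5, 0.0, 3.2, -0.7]
q, scale, zp = quantize_int8(weights)
restored = dequantize_int8(q, scale, zp)
max_err = max(abs(w, ) if False else abs(w - r) for w, r in zip(weights, restored))
print(q, scale, max_err)
```

The reconstruction error stays within one quantization step (`scale`), which is why 8-bit quantization usually preserves model quality while shrinking memory.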

feat: modular pruning by @begumcig in #154

Pruning is now agnostic to submodules! This update enables safe, module-level pruning by identifying each target module's interior, boundary, and exterior, pruning only the interior while automatically patching the shapes of surrounding layers.
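The interior/boundary/exterior split can be sketched in a few lines of plain Python. This is a hedged illustration, not pruna's implementation: the layer names, the edge-list graph, and `classify_layers` are all hypothetical. The idea is that a target layer touching any layer outside the module is "boundary" (its shapes are visible to the rest of the network), while layers whose producers and consumers are all inside the module are "interior" and can be pruned freely.

```python
# Hypothetical sketch of splitting a dataflow graph around a target module.
# pruna's real implementation operates on torch modules, not string names.

def classify_layers(edges, target):
    """Split layers into the interior, boundary, and exterior of `target`.

    edges: list of (src, dst) dataflow edges between layer names.
    target: set of layer names belonging to the module being pruned.
    """
    boundary = set()
    for src, dst in edges:
        if src in target and dst not in target:
            boundary.add(src)  # produces a tensor consumed outside the module
        if dst in target and src not in target:
            boundary.add(dst)  # consumes a tensor produced outside the module
    interior = target - boundary      # safe to prune without shape patching
    all_layers = {layer for edge in edges for layer in edge}
    exterior = all_layers - target    # untouched by pruning
    return interior, boundary, exterior

edges = [("embed", "block1.in"), ("block1.in", "block1.mid"),
         ("block1.mid", "block1.out"), ("block1.out", "head")]
target = {"block1.in", "block1.mid", "block1.out"}
interior, boundary, exterior = classify_layers(edges, target)
print(sorted(interior), sorted(boundary), sorted(exterior))
```

Here only `block1.mid` is interior, so only its channels can change shape; the boundary layers `block1.in` and `block1.out` must keep (or patch) the shapes that `embed` and `head` expect.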

feat: extend accelerate compatibility by @johannaSommer in #234

We are rolling out more support for base models distributed with accelerate. Several cachers, factorizers, and additional quantizers are now compatible!

feat: enhance model saving functionality with pro support by @davidberenstein1957 in #200

Saving your smashed models to the Hugging Face Hub just became easier: we added support to better distinguish models smashed with pruna from those smashed with pruna_pro.

feat: simplify the setup (tests + dev = dev) by @GreatBahram in #210

@GreatBahram was at it again this release, simplifying installation and setup for contributors by removing dependency groups. A simple `uv sync --extra dev` does the job and you can start contributing!
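For contributors unfamiliar with extras, the consolidated setup might look roughly like this in `pyproject.toml`. The group contents below are illustrative placeholders, not pruna's actual dependency list:

```toml
[project.optional-dependencies]
# one combined "dev" extra instead of separate test and dev groups,
# installed with: uv sync --extra dev
dev = [
    "pytest",
    "ruff",
]
```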

build: reduce core dependencies in pyproject.toml by @ParagEkbote in #227

@ParagEkbote made pruna more lightweight and reduced the package dependencies by 20%!

Pruning some bugs 🐞 and maintenance 🧑‍🌾

New Contributors

Full Changelog: v0.2.6...v0.2.7