Skip to content

Add support for downloading deployed models #795

@bryceamacker

Description

@bryceamacker

We are using open source huggingface models, but are applying dynamic quantization using eland's --quantize flag. This creates a unique, dynamically quantized version of the model before uploading to Elastic.

As part of our HA/DR infrastructure, we would like to have a static backup of this model so that we can be sure the exact same binary can be saved offline and deployed across all of our multi-region ES clusters. There doesn't seem to be anyway to download or access the models once they have been deployed to Elastic, and the quantization process happens in memory while you are deploying.

Metadata

Metadata

Assignees

No one assigned

    Labels

    topic:NLPIssue or PR about NLP model support and eland_import_hub_model

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions