Model loaded twice? #2

@mvsoom

Why is the model loaded twice? I'm curious.

Once in 8-bit:

model = _load_model(checkpoint_path, device, precision, use_tp)

Once in bfloat16:

_model = PaliGemmaForConditionalGeneration.from_pretrained(
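
For context on why a second load might matter, here is a rough back-of-the-envelope sketch of the weight memory needed if both copies are resident at the same time. The parameter count is a hypothetical placeholder, not a value taken from this repository, and the estimate ignores quantization scales, activations, and any other overhead.

```python
# Rough estimate of weight memory when the same model is held twice,
# once in int8 and once in bfloat16. Weights only; no scales, buffers,
# or activations are counted.
BYTES_PER_PARAM = {"int8": 1, "bfloat16": 2}


def weights_gib(num_params: int, dtype: str) -> float:
    """Return the size in GiB of num_params weights stored as dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# Hypothetical parameter count, chosen only for illustration.
NUM_PARAMS = 3_000_000_000

int8_copy = weights_gib(NUM_PARAMS, "int8")
bf16_copy = weights_gib(NUM_PARAMS, "bfloat16")
print(f"int8 copy:     {int8_copy:.2f} GiB")
print(f"bfloat16 copy: {bf16_copy:.2f} GiB")
print(f"both resident: {int8_copy + bf16_copy:.2f} GiB")
```

If the two loads really do coexist, the bfloat16 copy alone would take twice the space of the 8-bit one, which is why I'm asking whether the second load is intentional.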
