
Unable to join Discord + question about fine-tuning PaliGemma for fixed-prompt success detection #6

@YukiSato-cmd


Hi, thank you for your great work on AutoEval.

I have two questions:

  1. Discord invite
    I tried to join the Discord community, but the invite link does not work for me.
    Is the current link still valid, or is there an updated invitation link?

  2. Motivation for using PaliGemma for success detection
    After trying to implement the method myself, I had a question about the use of PaliGemma in AutoEval.

From my understanding, AutoEval uses PaliGemma for success detection by fine-tuning it with fixed prompts and fixed answer spaces, such as yes/no or sink/basket. In other words, it seems that the model is trained to respond to a fixed question format with a constrained set of outputs.
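
To make concrete what I mean by "fixed prompt, fixed answer space", here is a minimal sketch of my understanding. It is not AutoEval's code: the prompt text, `score_fn`, and `dummy_score_fn` are hypothetical placeholders, where `score_fn` stands in for whatever returns the fine-tuned model's score (for example a log-likelihood) for a candidate answer given an image and the prompt.

```python
import math
from typing import Any, Callable, Dict, Sequence, Tuple

# Hypothetical fixed prompt and answer space (assumptions for illustration only).
FIXED_PROMPT = "is the drawer open? answer yes or no"
ANSWER_SPACE = ("yes", "no")


def constrained_answer(
    score_fn: Callable[[Any, str, str], float],
    image: Any,
    prompt: str = FIXED_PROMPT,
    answers: Sequence[str] = ANSWER_SPACE,
) -> Tuple[str, Dict[str, float]]:
    """Score only the allowed answers and return the best one with
    probabilities normalized over that fixed set (a softmax)."""
    scores = [score_fn(image, prompt, a) for a in answers]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    probs = {a: e / total for a, e in zip(answers, exps)}
    best = max(probs, key=probs.get)
    return best, probs


def dummy_score_fn(image: Any, prompt: str, answer: str) -> float:
    # Stand-in for a model call; returns made-up scores, not real outputs.
    return {"yes": 2.0, "no": 0.0}[answer]


best, probs = constrained_answer(dummy_score_fn, image=None)
```

Written this way, the model behaves like a two-class classifier whose classes happen to be spelled as text, which is what prompted my question below.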

While implementing this, I started wondering about the motivation for using PaliGemma in this setting. One strength of a vision-language model is that it can handle diverse question forms and produce flexible outputs, yet AutoEval uses it in a much more constrained way, where both the prompt format and the answer space are effectively fixed through fine-tuning.

In this setting, what would you say is the main advantage of using PaliGemma over a simpler task-specific classifier?
Is the main benefit flexibility across tasks, easier specification of success criteria in natural language, better robustness/generalization, or something else?

Thank you very much.
