Unable to join Discord + question about fine-tuning PaliGemma for fixed-prompt success detection

Hi, thank you for your great work on AutoEval.

I have two questions:

1. Discord invite
I tried to join the Discord community, but the invite link does not seem to work for me.
Could you let me know whether the Discord link is still valid, or if there is an updated invitation link?

2. Motivation for using PaliGemma for success detection
After trying to implement the method myself, I had a question about the use of PaliGemma in AutoEval.

From my understanding, AutoEval uses PaliGemma for success detection by fine-tuning it with fixed prompts and fixed answer spaces, such as yes/no or sink/basket. In other words, it seems that the model is trained to respond to a fixed question format with a constrained set of outputs.

While implementing this, I started wondering about the motivation for using PaliGemma in this setting.

Since one of the strengths of a vision-language model is that it can handle diverse question forms and produce flexible outputs, I was curious why AutoEval uses it in a much more constrained way, where both the prompt format and the answer space are effectively fixed through fine-tuning.

In this setting, what would you say is the main advantage of using PaliGemma over a simpler task-specific classifier?
Is the main benefit flexibility across tasks, easier specification of success criteria in natural language, better robustness/generalization, or something else?

Thank you very much.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to join Discord + question about fine-tuning PaliGemma for fixed-prompt success detection #6

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Unable to join Discord + question about fine-tuning PaliGemma for fixed-prompt success detection #6

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions