
Question on GLM-4.5V Evaluation Dataset for Chartmuseum #239

@Super-Shen

Hello,

I am reviewing the evaluation results of GLM-4.5V on the ChartMuseum benchmark. Could you please clarify whether the reported results were obtained on the dev set or the test set?

Thank you for your time and contribution!
