Thanks for your excellent work!
I noticed that Evaluation/srcs/LLM_evaluator.py and Evaluation/srcs/LMM_evaluator.py appear to be identical. Is this intentional?
I would expect LMM_evaluator.py to take the rendered chart images as input and compare them visually, rather than evaluating the generated code. Could you clarify whether an image-based evaluator is planned, or whether the wrong file was uploaded by mistake?
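For anyone who wants to confirm that the two files really are byte-for-byte identical, here is a minimal sketch using Python's standard `filecmp` module. The helper name `files_identical` is my own and not part of the repository, and the paths assume the script is run from the repository root.

```python
import filecmp
from pathlib import Path


def files_identical(path_a, path_b):
    """Return True if both paths are regular files with equal contents.

    Hypothetical helper for illustration; not part of the repository.
    """
    a, b = Path(path_a), Path(path_b)
    if not (a.is_file() and b.is_file()):
        return False
    # shallow=False compares file contents instead of relying on
    # os.stat() metadata (size, modification time) alone.
    return filecmp.cmp(a, b, shallow=False)


if __name__ == "__main__":
    # Paths taken from this issue; assumes the current directory is the repo root.
    same = files_identical(
        "Evaluation/srcs/LLM_evaluator.py",
        "Evaluation/srcs/LMM_evaluator.py",
    )
    print("identical" if same else "different (or a file is missing)")
```

If this prints "identical", the LMM evaluator is currently just a copy of the LLM one.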
Thanks!