Add evaluation harness for IMO-AnswerBench#11
Open
yurekami wants to merge 4 commits intogoogle-deepmind:mainfrom
Open
Add evaluation harness for IMO-AnswerBench#11yurekami wants to merge 4 commits intogoogle-deepmind:mainfrom
yurekami wants to merge 4 commits intogoogle-deepmind:mainfrom