Thanks for your excellent work!
In the paper there are two types of visual long-thought data, one is generated by QVQ and another is generated by self distillation.
I'm curious which type of visual long-thought data was released as "RUC-AIBOX/Virgo-Visual-Long-Thought-Dataset". Could you clarify it?
Thanks for your excellent work!
In the paper there are two types of visual long-thought data, one is generated by QVQ and another is generated by self distillation.
I'm curious which type of visual long-thought data was released as "RUC-AIBOX/Virgo-Visual-Long-Thought-Dataset". Could you clarify it?