Hi,
In your paper I found the following statement:
"Our data not only contains text but also math symbols and equations."
Could you tell me more about it? Did you compare your solution with other approaches that focus on creating formula embeddings (such as Tangent-CFT or Approach0)?
What formula format does mathBERT require in order to compare two formulas? Could you please provide a simple code snippet?