Open
Dear authors,
Thank you for your great work! From my understanding, the training pipeline involves first pre-training the TA-Tok tokenizer and de-tokenizer, followed by jointly training the full MLLM. I've found the code for the MLLM training using the released TA-Tok checkpoint. Would it be possible to also release the code for pre-training the TA-Tok components themselves?
Btw, I'm also wondering how many GPU hours it takes to train TA-Tok itself?