Replication of finetuning code

Hello, I want to try finetuning your model with own data but I have two questions:

1. I am trying to replicat eyour finetuning code but if I try finetuning the larger version of FLAN-T5 I run into memory capacity issues. I am just using the wordnet dataset from huggingface, training one epoch with a batch size of 1 and reduced lengths. It appears to not run on multiple nodes. How could I solve this?
2. How should I format my data in order to use it for further finetuning? 

Thank you for any assistance here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replication of finetuning code #6

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Replication of finetuning code #6

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions