Replication of finetuning code #6

@VilhelmHovland

Hello, I want to try finetuning your model with my own data, but I have two questions:

  1. I am trying to replicate your finetuning code, but when I try to finetune the larger version of FLAN-T5 I run out of memory. I am using only the WordNet dataset from Hugging Face, training for one epoch with a batch size of 1 and reduced sequence lengths. The training does not appear to run across multiple nodes. How could I solve this?
  2. How should I format my data so that I can use it for further finetuning?

Thank you for any assistance.

Labels: question (Further information is requested)