fsdp Error report #232
Open
Description
Thanks for this wonderful project. I used the following script to train the model.
torchrun --nnodes=1 --nproc_per_node=2 /home/share/yongqi/project/AutoregressiveImageRetrieval/code/open_flamingo/open_flamingo/train/finetuning.py \
--lm_path anas-awadalla/mpt-1b-redpajama-200b \
--tokenizer_path anas-awadalla/mpt-1b-redpajama-200b \
--cross_attn_every_n_layers 1 \
--dataset_resampled \
--batch_size_mmc4 1 \
--train_num_samples_mmc4 150000 \
--workers=2 \
--run_name OpenFlamingo-3B-vitl-mpt1b \
--num_epochs 2 \
--warmup_steps 2000 \
--mmc4_textsim_threshold 0.01 \
--laion_shards \
--mmc4_shards \
--logging_steps 1 \
--mmc4_max_num_images 1 \
--precision fp16 \
--fsdp
However, when I set the --fsdp flag, training fails with the following error:
AttributeError: 'MosaicGPT' object has no attribute 'set_output_embeddings'
Location: flamingo.py, line 294.
If I remove this flag, there is no error. Do you have any idea about this?
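The error message suggests that the FSDP code path calls set_output_embeddings on the language model and that MosaicGPT does not define it. Below is a minimal sketch of how one might check which Hugging Face-style embedding accessors a model object lacks before enabling --fsdp. The stand-in class and the helper name are hypothetical and are not part of open_flamingo or the MPT code; in practice the helper would be called on the loaded language model instead.

```python
# Hypothetical stand-in for a language model that does not define the
# Hugging Face-style embedding accessors (as the traceback suggests for MosaicGPT).
class MosaicGPTStandIn:
    def __init__(self):
        self.wte = object()  # placeholder for a token-embedding module


def missing_embedding_accessors(model):
    """Return the names of embedding accessors that `model` does not provide."""
    required = (
        "get_input_embeddings",
        "set_input_embeddings",
        "get_output_embeddings",
        "set_output_embeddings",
    )
    return [name for name in required if not callable(getattr(model, name, None))]


# Replace the stand-in with the actual loaded language model to run the check.
missing = missing_embedding_accessors(MosaicGPTStandIn())
print("Missing accessors:", missing)
```

If set_output_embeddings shows up in the list, the AttributeError above is expected whenever that method is called, regardless of the other training flags.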