hello! Congratulations on your work !I noticed that in your white-box attack code, the combined image was normalized again, as follows:
normalized_images = normalize(combined_images).to(torch.float16)
Isn't this equivalent to normalizing the original image twice?
data_dict = item_process_func(question, target_words, image_file, tokenizer, data_args)
This code has already normalized the image once
hello! Congratulations on your work !I noticed that in your white-box attack code, the combined image was normalized again, as follows:
normalized_images = normalize(combined_images).to(torch.float16)
Isn't this equivalent to normalizing the original image twice?
data_dict = item_process_func(question, target_words, image_file, tokenizer, data_args)
This code has already normalized the image once