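With thresholding, an even division of pixels will not split the work evenly, because pixels excluded by the threshold no longer contribute an equal share of the work. Thresholding must therefore be performed before load balancing, which means it will need to be decoupled from data loading in the code.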
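
As a rough illustration of balancing on the thresholded result rather than on raw pixel count, the sketch below picks contiguous slice boundaries so that each rank receives a near-equal number of pixels that pass the threshold. The function name `balanced_bounds` and the boolean `mask` input are hypothetical, not taken from the existing code, and the sketch assumes that only pixels passing the threshold cost meaningful work.

```python
from bisect import bisect_left
from itertools import accumulate


def balanced_bounds(mask, n_ranks):
    """Return n_ranks + 1 slice boundaries over `mask`.

    Rank r would own pixels bounds[r]:bounds[r + 1]. Boundaries are chosen so
    that each slice holds a near-equal count of True (active) pixels.
    Hypothetical helper: `mask` is any sequence of truthy/falsy values.
    """
    # prefix[i] = number of active pixels in mask[:i]
    prefix = [0] + list(accumulate(1 if m else 0 for m in mask))
    total = prefix[-1]
    # For each rank, find the first cut point whose prefix count reaches the
    # rank's share of the active pixels. prefix rises in steps of at most 1,
    # so the target count is always hit exactly.
    bounds = [bisect_left(prefix, (total * r) // n_ranks) for r in range(n_ranks)]
    bounds.append(len(mask))
    return bounds


# Six active pixels followed by six inactive ones, shared between two ranks.
mask = [True] * 6 + [False] * 6
bounds = balanced_bounds(mask, 2)
active_per_rank = [sum(mask[a:b]) for a, b in zip(bounds, bounds[1:])]
```

In this example an even split at pixel 6 would give one rank all six active pixels and the other none, while the boundaries above give each rank three. Trailing inactive pixels fall to the last rank in this sketch.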
The threshold mask could be calculated on one rank and then distributed via MPI, but that adds communication and exposes the program to more MPI-related failure modes. It may be more reliable to simply duplicate the calculation on every rank.
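
A minimal sketch of the duplicated approach, written without any MPI calls: every rank runs the same deterministic function on the same input and keeps only its own share, so no mask needs to be sent between ranks. The name `local_active_indices`, the flat `values` list, and the `>=` comparison are assumptions for illustration. In a real run, `rank` and `n_ranks` would come from the MPI communicator.

```python
def local_active_indices(values, threshold, rank, n_ranks):
    """Return the indices of active pixels that `rank` should process.

    Hypothetical helper. Every rank computes the full list of active pixels
    itself (the duplicated calculation), then takes an equal share of it.
    Assumes all ranks see identical `values` and `threshold`.
    """
    # Identical on every rank, so all ranks agree without communicating.
    active = [i for i, v in enumerate(values) if v >= threshold]
    start = (len(active) * rank) // n_ranks
    stop = (len(active) * (rank + 1)) // n_ranks
    return active[start:stop]


# Simulate three ranks in a single process to check the shares fit together.
values = [0.1, 0.9, 0.8, 0.2, 0.7, 0.95, 0.05, 0.6]
shares = [local_active_indices(values, 0.5, r, 3) for r in range(3)]
```

The cost is that each rank repeats the thresholding pass over the full data. The scheme also only holds if the input and the comparison are identical on every rank, since any disagreement would leave pixels unassigned or assigned twice.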