We have had a few instances of an experiment with multiple reactors running that was interrupted because one reactor (each time a different one) caused a "Failed to recover multiplexer on device" error, which, after 20 tries in the space of a couple of seconds leads to the whole app crashing and restarting. When the experiment is finally restarted the offending reactor works fine.
I don't have enough facility with the way the code handles the communication with this "multiplexer", but it seems to me that there could be ways to "revive" the connection, or at the very least decide to drop one reactor while the others continue to collect data?
Would you happen to have some suggestions here about what to do to solve or mitigate the issue?
Thanks a lot!