-
Notifications
You must be signed in to change notification settings - Fork 28
Description
Hi, I'm trying out clustermq on a Slurm cluster. Using a simple example, the job launches, but then nothing happens.
AFAIU, it is ZeroMQ that fails to set up a working communication with the main R session and the worker(s) launched on the cluster's compute node. BTW, I've verified that ZeroMQ works on the localhost using the rzmq main-worker example from https://cran.r-project.org/web/packages/rzmq/readme/README.html. As soon as I attempt to use the same example running across two hosts, it blocks.
I suspect I need to request to have the ZeroMQ protocol opened up on the cluster for clustermq to be able to run on this cluster. I have near-zero experience with ZeroMQ - is this a matter of opening up TCP ports in the firewall? If so, is there a standard range that ZeroMQ uses?