Skip to content

ttaaefs_peerip causes silent node failure when set with an FQDN #12

@TM553432

Description

@TM553432

Discovered when trying to set up both realtime and fullsync NextGenRepl between 2 fully functional clusters of identical node count where all nodes were running KV 3.2.4. As part of setting up fullsync, the ttaaefs_peerip setting in the source cluster has to point to the fullsync peer in the sink cluster. As an FQDN had been set up for each node, we put the FQDN in instead of the IP address in the ttaaefs_peerip setting on the source cluster. This caused Riak to fall over silently with no log of cause. This was remedied by changing it to a hard-coded IPv4 address.
When using an environment where IP addresses can change, this seems to contradict Riak's ability to use an FQDN in the nodename.
To replicate:

  1. Create two clusters
  2. Set up NextGenRepl as one would normally
  3. On a source node, set ttaaefs_peerip to an FQDN that points to a sink node
  4. Restart Riak on the source node you updated
  5. Check whether Riak started.
  6. On the same source node, change ttaaefs_peerip to the IPv4 address of the same sink node.
  7. Restart Riak on the source node you updated.
  8. Check whether Riak started.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions