Move Non-Read/Write Operations to Dedicated Reactor

Found a deadlock issue in SH test (more details in [SH issue#88](https://docs.google.com/document/d/16DMqv9J-JuNs5c25IBuX01QXe5WbxPm4B4hAfz4JNZM/edit?tab=t.0#heading=h.ndldtqs2m9q)):
1. Thread 1 (nuraft-reconfigure): During the `replace_member` process, after removing the old member, `nuraft` [acquires the `nuraft` lock ](https://github.com/eBay/NuRaft/blob/bfc91b86b9d5a7d79d51c07230922c10ea1ede6c/src/handle_commit.cxx#L478)to trigger a reconfiguration and clears the `snapshot_sync_ctx`. The cleanup operation requires the current `user_snp_ctx` to stop, which in turn depends on [all pending prefetch blobs being read](https://github.com/eBay/HomeObject/blob/60152e0b0680ee87ff834c1407d266571620f624/src/lib/homestore_backend/pg_blob_iterator.cpp#L475-L478). However, this operation is blocked, waiting for an I/O reactor to handle the read.

2. Thread 2 (IO reactor worker 1): This thread calls `monitor_replace_member_replication_status`, detects that the replace member task is completed, and attempts to reset the quorum size. However, it is blocked waiting for the `nuraft` lock, which is held by Thread 1. At the same time, [Thread 2 holds the `m_rd_map_mtx` mutex](https://github.com/eBay/HomeStore/blob/4f4a98f25da515eeb6bcbe30c0000afcfbcd57d6/src/lib/replication/service/raft_repl_service.cpp#L752).

3. Thread 3 (IO reactor worker 2): This thread calls `gc_repl_reqs`, which [attempts to acquire the `m_rd_map_mtx` mutex](https://github.com/eBay/HomeStore/blob/4f4a98f25da515eeb6bcbe30c0000afcfbcd57d6/src/lib/replication/service/raft_repl_service.cpp#L692) held by Thread 2. As a result, Thread 3 is blocked.

Since both I/O reactor threads (Thread 2 and Thread 3) are blocked, no I/O operations can proceed. This prevents Thread 1 from completing the read operation required to release the `nuraft` lock, leading to a deadlock.

Since `monitor_replace_member_replication_status` and `gc_repl_reqs` are not typical write/read operations, should we consider isolating them from the default IOMgr workers? Below are the timers currently using default IOMgr workers:
```
m_rdev_gc_timer_hdl: Triggers gc_repl_reqs and gc_repl_devs every minute.
m_rdev_fetch_timer_hdl: Triggers fetch_pending_data every second.
m_flush_durable_commit_timer_hdl: Triggers flush_durable_commit_lsn every 500ms.
m_replace_member_sync_check_timer_hdl: Triggers monitor_replace_member_replication_status every minute.
m_res_audit_timer_hdl: Triggers trigger_truncate every 2 minutes.
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move Non-Read/Write Operations to Dedicated Reactor #847

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Move Non-Read/Write Operations to Dedicated Reactor #847

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions