This article provides step-by-step instructions to replace and re-add a failed node (Analytics or Search) in a Versa cluster.
Prerequisites
Before starting, ensure the following for the replacement node:
Same Versa software version as other nodes in the cluster.
Same number of network interfaces.
Similar CPU, memory, and hyper-threading profile.
IP addresses and routes are identical to the failed node.
Hostname and DNS entries are correctly set in
/etc/hosts
.
Step 1: Prepare the Node
Make sure the following files are the same from an existing node of the same personality (Analytics or Search):
/etc/network/interfaces
/etc/hosts
/opt/versa/scripts/van-scripts/vansetup.conf
Install certificates SSL certificates if any. Refer below KB.
Perform Manual Hardening for Versa Analytics - Versa Networks
Make sure the below on
vansetup.conf
on the new node:Update the
rpc address
andlistener address
fields.zookeeper parameter should be set correctly.
Replication factor set as per your cluster configuration.
For Analytics nodes only, set the
seeds
parameter to the listener IP of an existing Analytics node:seeds="a.b.c.d"
Step 2: Perform Pre-Setup Based on Node Type
Analytics Node:
Run the following on any existing Analytics node to remove the old node's host ID:
# nodetool status # Note the host-id of the failed
node
# nodetool removenode <host-id>
Search Node:
Ensure the Zookeeper cluster is up and has a leader before running the setup. Run below command on node running zookeeper.
# vsh dbstatus
Step 3: Run vansetup.py
Execute the setup script on the new node:
# cd /opt/versa/scripts/van-scripts
# sudo ./vansetup.py
Step 4: Sync Certificates from the Director node
# sudo su versa
# cd /opt/versa/vnms/scripts/
# ./vnms-cert-sync.sh --sync
# ./vd-van-cert-upgrade.sh --pull
When prompted for "postpone restart", select y.
Step 5: Restart Directors (HA Setup Only)
On Secondary Director:
vsh stop
On Primary Director:
vsh restart
On Secondary Director:
vsh start
Verify HA sync between directors
Post-Checks
For Analytics Node
Run
nodetool status
on both the new and existing Analytics nodes.New node will initially show
UJ
(Joining). It should change toUN
(Up/Normal) once data sync completes.
For Search Node
Check search DB status:
# vsh dbstatus
Ensure
live-nodes
count is correct.collections
should be healthy
Check Solr cluster health:
# sudo /opt/versa/scripts/van-install
/cluster-install.sh solr cluster_status
All replicas should show as active.
Confirm the new node is reachable via the Director UI.
Troubleshooting
If the node fails to join properly or services remain inactive:
Review
nodetool status
,vsh dbstatus
, and Solr status.
If issues persist, open a ticket with Versa TAC and include:
Node type and personality
Output of post-check commands
Capture the tech-support and shell session logs