Overview
You may need to reset an RTR node; for example, after running a Health Check and finding it has been up and running for a long time (e.g., more than a year), which is not recommended. This article details the steps to reboot an RTR node.
Prerequisites
- Access to the nodes through SSH.
- In case you don't have SSH access, please follow instead Rebooting a Lithium Node via iLO3
Process
1. Check System Health on affected nodes
a. syswatch
# su –
# syswatch
Ensure there are no health issues noticed.
b. Ilo check
- Check from ILO if all hardware components are fine
- Check for firmware version and compare to other nodes
2. Check network interfaces to compare with other servers
# ifconfig –a
# ethtool <interface name>
# ethtool -i <interface name>
# ethtool -t <interface name>
# ethtool -S <interface name>
- If any differences in driver versions etc are noticed, check step 7.
- If no differences between this server and other servers, then proceed to the next step
3. Stop textpass /rtr processes
# su - textpass
# su – textpass -c "tp_stop"
# su – textpass -c "tp_status"
4. Reboot the server
# su - root
# shutdown –r now
Wait for the reboot to complete and the server restarts. Then, proceed to the next step.
5. Start textpass process
# tp_start --textpass
# tp_status
The next actions depend on the status of starting the textpass process. If everything is OK, go to the next step.
6. Start All processes
# tp_start
# tp_status
Ensure All processes are started and operating normally
7. Network Interfaces
If any issues/differences are found in the network interfaces of affected servers when compared to the working server, the following actions could be attempted depending on the outcome
- Update Drivers
- Remove Ethernet configuration
- Re-Configure Network interfaces
Timelines and Impact
Expected Time | Expected Impact |
30 minutes per server (in total) | No traffic on this server. However, other servers in the network should be able to cover all traffic |