Overview
You may need to verify the system's health, typically after a hardware replacement.
This article provides the steps to perform a basic Cobalt system health check and verify that the Cobalt server is operating correctly.
Process
Perform the following steps as the smsadm user:
- Verify that the SMSC status is Online:
sms_status
- Check the start times of all processes and compare them with the SMSC start time:
ps -ef | grep smsadm
- Verify that all major SMPP clients are in the Connected state (SMPP Client status):
display-scthruput:;
- Verify the number of messages, status, and violations in the current hour (SMPP Server status):
display-ssthruput:;
- Ensure that there are no violations:
display-sviolation:;
- Verify the license utilization in SMServer:
sm_lmd -l
- Check the status for all processes:
ebs_ps
- Verify all the error codes:
You may need to execute this command several times.
sms_operator->query_db
- Check for the important traffic counters from the recent traffic files:
For example, counters like Submit SM-Rcvd, Delivered SM, and SS7-MO-Rcvd indicate that the messages are coming to the SMSC, and the SMSC is delivering the messages.
egrep "Time_Stamp|Submit SM-Rcvd|Delivered SM|SS7-MO-Rcvd" TRF.<timestamp>
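When several traffic files exist, it can help to pick the newest one automatically. The following is a minimal sketch, not part of the product: show_traffic_counters is a hypothetical helper name, and it assumes that the traffic files are named TRF.<timestamp> and sit in the directory you pass in. It uses grep -E, which is equivalent to egrep.

```shell
# Print the counter lines from the newest traffic file in a directory.
# Assumption: traffic files are named TRF.<timestamp> and live in the
# directory passed as $1 (default: current directory).
show_traffic_counters() {
    dir="${1:-.}"
    # ls -t sorts by modification time, newest first.
    latest=$(ls -t "$dir"/TRF.* 2>/dev/null | head -n 1)
    if [ -z "$latest" ]; then
        echo "no TRF.* file found in $dir" >&2
        return 1
    fi
    echo "file: $latest"
    # Same counters as the egrep command in the step above.
    grep -E "Time_Stamp|Submit SM-Rcvd|Delivered SM|SS7-MO-Rcvd" "$latest"
}
# Usage: show_traffic_counters /path/to/traffic/files
```

Running it a few minutes apart and comparing the counter values is one way to confirm that traffic is still flowing.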
- Observe and confirm that there is no core file:
find $EBSHOME/* -name "*core.*"
- If you observe a core file, follow the next two steps as the root user.
- Note down the timestamp for the core file by running the following command:
ls -l core.*
- Note down the process that generated the core file by running the following command:
file core.*
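The core file steps above can be combined into one pass. This is a sketch under stated assumptions: report_core_files is a hypothetical helper name, the search root is whatever you pass in (the article uses $EBSHOME), and -type f is added here to skip directories whose names happen to match.

```shell
# List every core file under a directory with its timestamp and origin.
# Assumption: the search root is passed as $1; the article uses $EBSHOME.
report_core_files() {
    root="$1"
    find "$root" -type f -name "*core.*" | while IFS= read -r core; do
        ls -l "$core"    # modification time shows when the core was written
        file "$core"     # for a real core dump, reports the originating program
    done
}
# Usage (as the root user): report_core_files "$EBSHOME"
```

No output means that no core file was found under the given directory.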
- Ensure that the logs show no process that exited with FAIL or KILL:
grep -i "exited (FAIL)" $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*
grep -i "exited (KILL)" $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*
- Ensure that the logs contain no IPC failure messages:
(A match for the command below indicates that a process went to the blocked state.)
grep -i "IPC message failed" $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*
- Ensure that the logs contain no shared memory errors:
(A match for the command below indicates that the shared memory is exhausted.)
grep -i "shared memory capacity exceeded" $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*
- Ensure that no KILL entries are logged (the count should be 0):
grep -i KILL $EBSHOME/SMS/$VERSION/RUN/mlog/MLog* | wc -l
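The log checks above can also be run as one loop that prints a count per pattern. This is only a sketch: scan_mlogs is a hypothetical helper name, the patterns are the ones used in the grep commands above, and the log files are passed as arguments.

```shell
# Count log lines matching each failure pattern from the article.
# Arguments: one or more log files (the article uses
# $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*).
scan_mlogs() {
    status=0
    for pattern in "exited (FAIL)" "exited (KILL)" \
                   "IPC message failed" "shared memory capacity exceeded"; do
        # cat + grep -c gives one total across all files; -F matches the
        # pattern literally and -i ignores case, as in the original commands.
        count=$(cat "$@" 2>/dev/null | grep -icF "$pattern" || true)
        printf '%-35s %s\n' "$pattern" "$count"
        [ "$count" -eq 0 ] || status=1
    done
    # Non-zero return means at least one pattern was found.
    return "$status"
}
# Usage: scan_mlogs $EBSHOME/SMS/$VERSION/RUN/mlog/MLog*
```

A healthy system should show 0 for every pattern. For any non-zero count, rerun the matching grep command to see the actual log lines.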
- Observe the connection status of the SMSC:
netstat -na | grep 6200
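If the netstat output is long, a summary per connection state is easier to read. The sketch below makes some assumptions: summarize_port_states is a hypothetical helper name, the state is taken to be the last field of each line (as in typical Linux and Solaris TCP listings), and the port match is stricter than the plain grep 6200 above so that ports such as 62001 are not counted.

```shell
# Summarize connection states for one port from `netstat -na` output.
# Reads netstat output on stdin so it can also be run on saved output.
# Assumption: the connection state is the last field of each line.
summarize_port_states() {
    port="$1"
    # Match ":<port> " (Linux style) or ".<port> " (Solaris style).
    grep "[.:]$port " | awk '{ print $NF }' | sort | uniq -c
}
# Usage: netstat -na | summarize_port_states 6200
```

Each output line is a count followed by a state, for example the number of ESTABLISHED connections.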