hive-hs2-cms-gc-time
This runbook shows steps to take when CMS garbage collection time is excessive.
Alert Name: HS2 CMS GC time
Alert Message: “HS2 GC time over 7 seconds”
Alert Explanation: The alert indicates that CMS garbage collection time is greater than seven seconds.
Resolution:
Check the dashboards HS2 Memory Usage and HS2 GC Time. Look for the hive.hs2.memory.pools_CMS-Perm-Gen_usage trend along with heap trends. If this alert fires repeatedly, the JVM is spending most of its time on garbage collection and is still unable to free memory.
Based on these trends, adjust your workloads accordingly with the help of the Dev teams.
Restart the HiveServer2 process if failures are not contained.
Logs:
HS2 logs are available on the coordinator node:
/media/ephemeral0/logs/hive2.1.1/hive.log
Look for any evident errors (do a basic grep and count of errors).
Look at the dashboards named above (the title is the name of the dashboard).
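The basic grep and count of errors mentioned above could be sketched roughly as follows. This is only an illustration: the function name count_log_errors is hypothetical, the default search string ERROR assumes the log marks errors with that word, and the default path is the one given in this runbook, so it may differ for other Hive versions.

```shell
# Hypothetical helper: count the lines in an HS2 log that contain a fixed string.
# Arg 1: log file (defaults to the path given in this runbook).
# Arg 2: string to search for (defaults to ERROR, an assumption about the log format).
count_log_errors() {
  local log_file="${1:-/media/ephemeral0/logs/hive2.1.1/hive.log}"
  local pattern="${2:-ERROR}"
  # -c prints the number of matching lines, -F treats the pattern as a
  # literal string. grep exits non-zero when nothing matches, so "|| true"
  # keeps the helper usable in scripts that run with "set -e".
  grep -c -F -- "$pattern" "$log_file" || true
}
```

Called with no arguments, the helper counts ERROR lines in the default log. Passing a second argument such as java.lang.OutOfMemoryError narrows the count to memory-related failures, which may be the more relevant signal for this alert.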
Restart of Process:
sudo monit summary to check the status of the process.
sudo monit stop hs2 to stop the process.
sudo monit start hs2 to start the process.
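The restart commands above can be chained into one small helper, sketched below under a few assumptions. The function name restart_hs2 and the MONIT and HS2_RESTART_WAIT variables are hypothetical and not part of monit or Hive. The pause between stop and start is a guess at how long the process needs to shut down, so confirm with the summary output rather than relying on it.

```shell
# Hypothetical helper: status check, stop, pause, start, status check.
# MONIT            - command used to invoke monit (default: "sudo monit");
#                    set it to "echo" for a dry run.
# HS2_RESTART_WAIT - seconds to wait between stop and start (default: 10,
#                    an assumed value, not taken from the runbook).
restart_hs2() {
  local monit_cmd="${MONIT:-sudo monit}"
  local wait_seconds="${HS2_RESTART_WAIT:-10}"

  # $monit_cmd is intentionally unquoted so "sudo monit" splits into two words.
  $monit_cmd summary || return 1      # status before the restart
  $monit_cmd stop hs2 || return 1     # stop HiveServer2
  sleep "$wait_seconds"               # give the process time to exit
  $monit_cmd start hs2 || return 1    # start HiveServer2 again
  $monit_cmd summary                  # status after the restart
}
```

Running it with MONIT=echo prints the four monit invocations without executing them, which is a safe way to review the sequence before using it on the coordinator node.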