Service Processor System Monitoring - SurveillanceSurveillance is a function in which the service processor monitors the system, and thesystem monitors the service processor. This monitoring is accomplished by periodicsamplings called heartbeats.Surveillance is available during two phases:v System firmware bring-up (automatic)v Operating system run time (optional)System Firmware SurveillanceSystem firmware surveillance provides the service processor with a means to detectboot failures while the system firmware is running.System firmware surveillance is automatically enabled during system power-on. Itcannot be disabled by the user.If the service processor detects no heartbeats during system boot (for 7 minutes), itcycles the system power to attempt a reboot. The maximum number of retries is setfrom the service processor menus. If the failure condition repeats, the service processorleaves the machine powered on, logs an error, and displays menus to the user. Ifcall-out is enabled, the service processor calls to report the failure and displays theoperating-system surveillance failure code on the operator panel.Operating System SurveillanceThe operating system surveillance provides the service processor with a means todetect hang conditions, as well as hardware or software failures, while the operatingsystem is running. It also provides the operating system with a means to detect serviceprocessor failure caused by the lack of a return heartbeat.Operating system surveillance is enabled by default, allowing the user to run operatingsystems that do not support this service processor option.You can also use service processor menus and AIX service aid to enable or disableoperating system surveillance.For operating system surveillance to work correctly, you must set the followingparameters:v Surveillance enable/disablev Surveillance intervalThis is the maximum time (in minutes) the service processor should wait for aheartbeat from the operating system before timeout.v Surveillance delayThis is the length of time (in minutes) for the service processor to wait from when theoperating system is started to when the first heartbeat is expected.Chapter 7. Using the Service Processor 179