CASCS1117I Connection to GLM_APPLID (sysid GLM_SYSID) lost (protocol P2P)where:
CASCS3032S Connection to ES Cluster manager ESCLMGR (sysid MST1) is disabled, verify and release global locks on ES cluster manager.
Prior to any GLM restart, it is vital that you identify and correct the reason for the failure. If necessary, you should collect CAS data for further failure analysis.
On an Enterprise Server Cluster client, active jobs can continue to execute, until they attempt to DEQUEUE their global locks. If the GLM is restarted before the DEQUEUE is requested, the DEQUEUE will execute successfully. Otherwise, the DEQUEUE will fail but the failure will be ignored and the local DEQUEUE will execute successfully. The JCL can therefore execute successfully, and no client-side actions are required.
Following a catastrophic GLM failure, or in the event of an Enterprise Server Cluster client failure, or some other requirement to take an Enterprise Server region out of a cluster, some action is required.
The NOWORK state is displayed in the GLM.
On the Server Information page of ESMAC (CASRDO5), the following state information is displayed:
When all Enterprise Server Cluster clients marked as ACTIVE in CASGLM.LCK have reconnected, the following will be displayed:
CASKC6008S No reply received for lock request from ESCLSLV2. GLM work halted until reply on ESMAC control page is provided.
ESCL1 CASCS3036E GLM ESCLMGR (sysid MST1) is in "NOWORK" state, waiting for all ES Cluster clients to send their locks. Check message KC6008S on the GLM. 10:36:43 ESCL1 JCLCM0188I JOB02312 LCKSLEEP JOB STARTED 10:36:43 ESCL1 JCLCM2000E JOB02312 LCKSLEEP Unable to acquire global lock for job LCKSLEEP. 10:36:43 ESCL1 CLCM0181S JOB02312 LCKSLEEP JOB ABENDED - COND CODE S922 10:36:43
To allow the GLM to resume lock processing, a reply is expected on the CONTROL page of the GLM's ESMAC screen (CASRDO11):
To remove the Enterprise Server Cluster client ESCLSLV2 from the cluster, uncheck the checkbox.
In the event of a permanent connection loss to the GLM, both client and GLM actions are required.
The following scenario illustrates a situation in which global lock removal is necessary.
If during the connection failure of ESCLCLT1, JCL1 attempts to DEQUEUE its locks, the Enterprise Server Cluster client layer will retry for the duration set by ES_GLM_TIMEOUT. When this value is reached, the Enterprise Server Cluster layer will mark the connection to the GLM as disabled and all global ENQUEUE/DEQUEUE requests will be rejected. The JCL will terminate successfully, the locks - including the global locks on the Enterprise Server Cluster client - will be released, but the global locks will still be active on the GLM.
The next section describes the use of the caslock command and its equivalent ESMAC page (CASRDO33). Both tools provide the capability to browse and remove locks, together with the ability to take a cluster offline.