vsathyan | Centurion | Joined: 10 Mar 2014 | Posts: 121
Posted: Tue Jun 02, 2015 7:33 pm    Post subject: shmmni 100% used - Linux

				Hi all, 
 
We are facing a strange issue in production, without any change to the infrastructure, which was deployed almost 4 months ago.

The System V shared memory identifier limit (shmmni) reaches 100% usage and queue manager performance degrades; finally the queue manager stops accepting new connections, and existing connections fail with MQRC 2009/2059.

There are only 3 server-connection channels in this queue manager, with a total of 34 connections across all three. The server-connection channels have DISCINT = 0 and SHARECNV = 0. Does this create a problem?

There are other queue managers in the network which are far more heavily loaded than this one, but they are using only around 45 of 6400 shmmni sets.

The queue manager is running with only 16 processes under the 'mqm' user account:

ps -ef | grep mqm

The operating system is Oracle Enterprise Linux 6.5, running WebSphere MQ 7.5.0.2.

For a temporary fix we increased shmmni to 8192, but we have to identify and apply a permanent fix for this issue.
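For reference, a change like this is typically checked and applied through the standard sysctl interface (a sketch, assuming the usual /proc and /etc/sysctl.conf mechanism on a box like this OEL 6.5 one; the write commands are shown commented out because they need root):

```shell
# Check the current System V shared memory identifier limit:
cat /proc/sys/kernel/shmmni

# Raise it at runtime (needs root), then persist across reboots:
# sysctl -w kernel.shmmni=8192
# echo 'kernel.shmmni = 8192' >> /etc/sysctl.conf && sysctl -p
```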
 
 
Below are the command outputs:

----------------------------------------------------------

/opt/mqm/bin/mqconfig

System V Shared Memory
  shmmax   68719476736 bytes                      IBM>=268435456   PASS
  shmmni   6400 of 8192 sets              (78%)   IBM>=4096        WARN
  shmall   417061616 of 4294967296 pages   (9%)   IBM>=2097152     PASS
 
 
 
[mqm@server ~]$ free
             total       used       free     shared    buffers     cached
Mem:      16330176   16168032     162144   10594096     103840   14658264
-/+ buffers/cache:    1405928   14924248
Swap:      2097144          0    2097144

-------------------------------------------------------------------
 
 
Also, in the MQ error logs we observed errors logged with MQRC 2071 (MQRC_STORAGE_NOT_AVAILABLE). When we checked the NFS mount, the usage is only around 6%:
 
 
nfsserver:/mq_prodnfs/mq_prodnfs
 
                       50G  2.7G   47G   6% /mqdata
 
 
Out of 50GB, only 2.7GB is used and 47GB free. 
 
We googled MQRC 2071, and as for the causes indicated in a couple of links: the app is hosted on Windows, but it is not putting blank messages either.

The setup had been running fine for nearly 4 months, and we suddenly started facing this issue last Friday.

Your inputs are much appreciated. Thanks in advance for your advice.
		
		
exerk | Jedi Council | Joined: 02 Nov 2006 | Posts: 6339
Posted: Wed Jun 03, 2015 1:13 am

As a starting point, I suggest checking all Change Management records applied on that date to see if any were applied specifically to your server; if so, investigate that change.
_________________
It's puzzling, I don't think I've ever seen anything quite like this before...and it's hard to soar like an eagle when you're surrounded by turkeys.
		
		
fjb_saper | Grand High Poobah | Joined: 18 Nov 2003 | Posts: 20768 | Location: LI, NY
Posted: Wed Jun 03, 2015 2:52 am

		  
		    
			  
Also, for storage not available: check not only the mqdata file system but also the mqlogs file system.
_________________
MQ & Broker admin
		
		
tczielke | Guardian | Joined: 08 Jul 2010 | Posts: 943 | Location: Illinois, USA
Posted: Wed Jun 03, 2015 4:54 am

		  
		    
			  
When you get to the shmmni 100% full condition, have you confirmed that it is MQ taking up the shared memory segments? Have you checked with something like ipcs -m?

Also, you probably want to look into moving to the latest fix pack, which is 7.5.0.5.
_________________
Working with MQ since 2010.
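To make that check concrete, here is one way to count the System V shared memory segments per owning user and compare against the kernel limit (a sketch, assuming the usual util-linux `ipcs -m` column layout where the owner is the third column):

```shell
# Count shared memory segments per owner; data rows of `ipcs -m` have
# at least 6 fields (key shmid owner perms bytes nattch [status]) and
# the header row is skipped by excluding the literal "owner" column title.
ipcs -m | awk 'NF >= 6 && $3 != "owner" { count[$3]++ }
               END { for (u in count) print u, count[u] }'

# Kernel-wide limit on segment identifiers, for comparison:
cat /proc/sys/kernel/shmmni
```

If the count attributed to mqm is approaching the shmmni value, that points at MQ (or an application connected to it) as the consumer.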
		
		
vsathyan | Centurion | Joined: 10 Mar 2014 | Posts: 121
Posted: Wed Jun 03, 2015 7:59 am

		  
		    
			  
@exerk,

There were no changes applied on that date, but a couple of months ago there was a forced Linux patching. That should not affect the queue manager after 2 months, though.

@fjb_saper,

Unfortunately, data and logs are on the same share/mount (we are in the process of moving the logs to a different mount). There is space available and the usage is only 6%.

@tczielke,

ipcs -m listed active MQ shared memory segments. There were no segments marked for destruction.

On another note, we have identified a damaged queue object, used by the Nastel monitoring agent. The agent process may have been repeatedly trying to access this object, finally creating the problem.

Currently we have stopped the monitoring agent and restarted the queue manager, and as of now shmmni usage is consistently around 34 of 8192 sets for the past 10 hours. We are monitoring it.

Will update you once we have more information.

Thanks all for your valuable time and inputs.

Cheers!
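For that kind of watch, a small snapshot script can be run from cron and appended to a log to trend the usage over time (a hypothetical helper, not part of MQ; it assumes segment rows in `ipcs -m` output start with a 0x key):

```shell
# Snapshot the System V shared memory segment count against the kernel
# shmmni limit. Segment data rows in `ipcs -m` begin with a hex key (0x...).
limit=$(cat /proc/sys/kernel/shmmni)
used=$(ipcs -m | awk '/^0x/ { n++ } END { print n + 0 }')
echo "$(date '+%F %T') shm segments in use: $used of $limit"
```

Redirecting the output to a file once a minute gives a simple timeline to correlate against queue manager events.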
		
		
rammer | Partisan | Joined: 02 May 2002 | Posts: 359 | Location: England
Posted: Wed Jun 03, 2015 2:20 pm

		  
		    
			  
				
   
vsathyan wrote:
> On another note, we have identified a damaged queue object, used by the monitoring agent. The agent process may have been trying to access this object, finally creating the problem.
>
> Currently we have stopped the monitoring agent and restarted the queue manager, and as of now shmmni usage is consistently around 34 of 8192 sets for the past 10 hours. We are monitoring it.

Sounds like a good spot.
		
		
vsathyan | Centurion | Joined: 10 Mar 2014 | Posts: 121
Posted: Tue Jul 14, 2015 8:17 am

		  
		    
			  
Update:

MQ 7.5.0.2 has a memory leak problem, confirmed by IBM and fixed in 7.5.0.5.

We tested before applying the maintenance pack: we reproduced the issue on 7.5.0.2, applied 7.5.0.5, and then tried to reproduce it with the same steps. The memory leak did not occur.

Hope this helps someone who is still using MQ 7.5 :p
_________________
Custom WebSphere MQ Tools Development C# & Java
WebSphere MQ Solution Architect Since 2011
WebSphere MQ Admin Since 2004
		
		
tczielke | Guardian | Joined: 08 Jul 2010 | Posts: 943 | Location: Illinois, USA
Posted: Tue Jul 14, 2015 8:23 am

		  
		    
			  
Thanks for sharing this. Do you have the APAR number that corrects this issue in 7.5.0.5?
_________________
Working with MQ since 2010.
		
		
vsathyan | Centurion | Joined: 10 Mar 2014 | Posts: 121
Posted: Tue Jul 14, 2015 9:47 am

		
		
tczielke | Guardian | Joined: 08 Jul 2010 | Posts: 943 | Location: Illinois, USA
Posted: Tue Jul 14, 2015 9:54 am

		  
		    
			  
Thanks. It looks like that APAR was corrected in 7.5.0.3, too.
_________________
Working with MQ since 2010.
		
		
vsathyan | Centurion | Joined: 10 Mar 2014 | Posts: 121
Posted: Tue Jul 14, 2015 9:58 am

		  
		    
			  
Yeah, it was corrected in 7.5.0.3.

7.5.0.5 has a bunch of fixes applied. When we tested it in our environment there were no side effects, and we can also sustain it for a year or so.

Hence we deployed 7.5.0.5 in our prod environment. The environment is very stable now.

Thanks & Regards,
vsathyan
_________________
Custom WebSphere MQ Tools Development C# & Java
WebSphere MQ Solution Architect Since 2011
WebSphere MQ Admin Since 2004
		
		