ASG
IBM
Zystems
Cressida
Icon
Netflexity
 
  MQSeries.net
Search  Search       Tech Exchange      Education      Certifications      Library      Info Center      SupportPacs      LinkedIn  Search  Search                                                                   FAQ  FAQ   Usergroups  Usergroups
 
Register  ::  Log in Log in to check your private messages
 
RSS Feed - WebSphere MQ Support RSS Feed - Message Broker Support

MQSeries.net Forum Index » General IBM MQ Support » Recurring QM Outage that Gens Probid = XC130003 and ZX005022

Post new topic  Reply to topic Goto page Previous  1, 2
 Recurring QM Outage that Gens Probid = XC130003 and ZX005022 « View previous topic :: View next topic » 
Author Message
belchman
PostPosted: Tue Apr 11, 2006 4:48 am    Post subject: Wow... Thanks for all the input folks. Reply with quote

Partisan

Joined: 31 Mar 2006
Posts: 386
Location: Ohio, USA

Sorry, I am not getting email notify's of your posts. Haven't been here in a few days.

Let me see if I can respond to all of this in an orderly fashion.

mvic asks "What do your network admins say about this?",

NW guys say it is an app issue. They do not see NW probs. We have had sniffer on line between both nodes.

fjb_saper asks "Could it be that you run out of bandwidth at those times ?"
Sorry jefflowrey , for misinformation. My "constantly" usage is grossly over-exaggerated. It does appear that the log spamming correlates to a batch application that fires off a bunch of accumulated datagrams. I can do some research to ID volumes.

fjb_saper asks "How about faulty switch... ?"

I have to assume that NW sniffer would have caught that.

Reminder:

The node spamming the logs is the least of my worries. The real problem is that we frequently experience and outage on the QM that last 5 - 15 secs. All client conns over SVRCONNS are booted and forced to reconnect.

This spamming node is a suspect simply because it is spamming the logs. It could be a false lead. But IMHO, if it is spamming it is causing problems. My "possibly hairbrained" theory , is that this node is acting up and MQ is trying to be patient with it. MQ maintains its patience until it blows its stack and boots everyone out of the playground (closes all conns) so it can meditate and heal itself.

Here is another piece of info that could support my theory. The memory usage of the amqpcsea constantly increases over time until our Sunday bounce of the MQ host. The curve for memory utilization is a straight line with a 30 - 40 degree slope. Today at 7:40 hrs, mem usage was 46M. At 8:40, it was 46.8M. So we have a .8M/hr increase constantly all week until bounce.

If this batch process is putting to clustered queues, could this be causing what looks like a memory leak in amqpcsea? And could amqpcsea situation be causing client disconnect issue?

Once again! I appreciate all of your input into this discussion. I welcome any other "hairbrained theories" about root cause.
Back to top
View user's profile Send private message
mvic
PostPosted: Tue Apr 11, 2006 5:11 am    Post subject: Re: Wow... Thanks for all the input folks. Reply with quote

Jedi

Joined: 09 Mar 2004
Posts: 2080

belchman wrote:
NW guys say it is an app issue. They do not see NW probs. We have had sniffer on line between both nodes.

Review again the following posts
http://www.mqseries.net/phpBB2/viewtopic.php?p=131121#131121
http://www.mqseries.net/phpBB2/viewtopic.php?p=131137#131137

What do the network guys say is causing the ECONNRESET every 11 seconds? This is really the key to understanding how to stop it happening.

(I don't understand how the conclusion is reached that it is an "app issue").

Apologies if I have focussed on the wrong part of the problem.
Back to top
View user's profile Send private message
belchman
PostPosted: Tue Apr 11, 2006 6:15 am    Post subject: Reply with quote

Partisan

Joined: 31 Mar 2006
Posts: 386
Location: Ohio, USA

mvic: When I say NW guys say it is an App issue, they mean the MQ Client app that is hosted on spamming node; Not MQ server.
Back to top
View user's profile Send private message
Display posts from previous:   
Post new topic  Reply to topic Goto page Previous  1, 2 Page 2 of 2

MQSeries.net Forum Index » General IBM MQ Support » Recurring QM Outage that Gens Probid = XC130003 and ZX005022
Jump to:  



You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
Protected by Anti-Spam ACP
 
 


Theme by Dustin Baccetti
Powered by phpBB © 2001, 2002 phpBB Group

Copyright © MQSeries.net. All rights reserved.