| Author | Message | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Thu Mar 18, 2021 2:53 am    Post subject: RDQM Heartbeat timeout |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 | 
			  
				| Hello, 
 Is there any way to increase the heartbeat timeout of RDQM?
 For example, even if the server goes down, only detect this after 15 seconds.
 |  | 
		
		
		  |  | 
		
		  | exerk | 
			  
				|  Posted: Thu Mar 18, 2021 3:13 am    Post subject: |   |  | 
		
		  |  Jedi Council
 
 
 Joined: 02 Nov 2006
 Posts: 6339
 
 
 | 
			  
				| What is the technical/business reason for wanting to increase it?
 _________________
 It's puzzling, I don't think I've ever seen anything quite like this before...and it's hard to soar like an eagle when you're surrounded by turkeys.
 |  | 
		
		
		  |  | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Thu Mar 18, 2021 6:00 am    Post subject: |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 |  | 
		
		
		  |  | 
		
		  | exerk | 
			  
				|  Posted: Thu Mar 18, 2021 6:18 am    Post subject: |   |  | 
		
		  |  Jedi Council
 
 
 Joined: 02 Nov 2006
 Posts: 6339
 
 
 | 
			  
				| 
   
	| mrk.for.dev wrote: |  
	| mini network cut |  I assume from that you mean that part of the network between one or more of the nodes will be temporarily affected, for no more than 15 seconds?
 |  | 
		
		
		  |  | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Thu Mar 18, 2021 7:47 am    Post subject: |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 | 
			  
				| Sometimes the network on the primary server is lost for a few seconds (~3s), and during this time the QMs switch to a secondary server. The goal is to increase this timeout, to 10 seconds for example, and keep the QMs on the primary server even if it is unreachable for less than 10s. |  | 
		
		
		  |  | 
		
		  | bruce2359 | 
			  
				|  Posted: Thu Mar 18, 2021 7:50 am    Post subject: |   |  | 
		
		  |  Poobah
 
 
 Joined: 05 Jan 2008
 Posts: 9486
 Location: US: west coast, almost. Otherwise, enroute.
 
 | 
			  
				| Why the brief network outages?  What do your network gurus say about this?
 _________________
 I like deadlines. I like to wave as they pass by.
 ב''ה
 Lex Orandi, Lex Credendi, Lex Vivendi. As we Worship, So we Believe, So we Live.
 |  | 
		
		
		  |  | 
		
		  | exerk | 
			  
				|  Posted: Thu Mar 18, 2021 8:13 am    Post subject: |   |  | 
		
		  |  Jedi Council
 
 
 Joined: 02 Nov 2006
 Posts: 6339
 
 
 | 
			  
				|   
 What he said; fix the network issue rather than mitigate it.
 |  | 
		
		
		  |  | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Thu Mar 18, 2021 8:35 am    Post subject: |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 | 
			  
				| Nothing special. Difficult to identify the reason. |  | 
		
		
		  |  | 
		
		  | bruce2359 | 
			  
				|  Posted: Thu Mar 18, 2021 9:31 am    Post subject: |   |  | 
		
		  |  Poobah
 
 
 Joined: 05 Jan 2008
 Posts: 9486
 Location: US: west coast, almost. Otherwise, enroute.
 
 | 
			  
				| 
   
	| mrk.for.dev wrote: |  
	| Nothing special. Difficult to identify the reason. |  Huh?  Network failures are nothing special and difficult to identify?  You need a new network team.
 |  | 
		
		
		  |  | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Fri Mar 19, 2021 2:20 am    Post subject: |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 | 
			  
				| So there is no way to increase the RDQM timeout? |  | 
		
		
		  |  | 
		
		  | exerk | 
			  
				|  Posted: Fri Mar 19, 2021 2:30 am    Post subject: |   |  | 
		
		  |  Jedi Council
 
 
 Joined: 02 Nov 2006
 Posts: 6339
 
 
 | 
			  
				| 
   
	| mrk.for.dev wrote: |  
	| So there is no way to increase the RDQM timeout? |  Possibly, but it's not something I have researched. Irrespective of that, try not to use MQ to mitigate problems in other areas; have those areas fix their issues.
 |  | 
		
		
		  |  | 
		
		  | bruce2359 | 
			  
				|  Posted: Fri Mar 19, 2021 5:17 am    Post subject: |   |  | 
		
		  |  Poobah
 
 
 Joined: 05 Jan 2008
 Posts: 9486
 Location: US: west coast, almost. Otherwise, enroute.
 
 | 
			  
				| I googled 'rdqm timeout' and the first hit was https://www.ibm.com/support/knowledgecenter/en/SSFKSJ_9.0.0/com.ibm.mq.tro.doc/q133450_.htm 
 I searched this document for 'timeout' and the first hit was 'Corosync timeout'.

 I then did a Google search for 'Corosync timeout' and found the token needed to set that value.
 |  | 
		
		
		  |  | 
		
		  | mrk.for.dev | 
			  
				|  Posted: Fri Mar 19, 2021 5:45 am    Post subject: |   |  | 
		
		  | Novice
 
 
 Joined: 11 Jan 2021
 Posts: 23
 
 
 | 
			  
				| That's exactly what I did. I changed the token value in the totem section of /etc/corosync/corosync.conf, but I have the impression that it is not taken into account. I set it to 10000 (10 seconds). When the primary server is disconnected, the secondary server becomes primary in only 2 seconds. |  | 
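A quick way to rule out a typo or a misplaced line in the edit described above is to read the value back from the file. The sketch below is only an illustration: `read_totem_token` and the `SAMPLE` text are hypothetical names and data, and the parser assumes the usual corosync.conf layout of `section { key: value }` with `#` comments. The token value is in milliseconds. Note that this only shows what the file contains; it does not show whether the running corosync has picked the value up, or whether RDQM failover depends on other timers as well.

```python
from typing import Optional

def read_totem_token(conf_text: str) -> Optional[int]:
    """Return the 'token' value (ms) set directly in the totem section, or None."""
    stack = []  # names of the sections we are currently nested inside
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if line.endswith("{"):
            stack.append(line[:-1].strip())  # entering a section
        elif line == "}":
            if stack:
                stack.pop()  # leaving a section
        elif stack == ["totem"] and ":" in line:
            key, value = (part.strip() for part in line.split(":", 1))
            if key == "token":
                return int(value)
    return None

# Hypothetical sample, not taken from a real RDQM node.
SAMPLE = """
totem {
    version: 2
    token: 10000   # milliseconds
    interface {
        ringnumber: 0
    }
}
"""

token_ms = read_totem_token(SAMPLE)
print("token:", token_ms, "ms")
```

To check a real node, the same function could be given the contents of /etc/corosync/corosync.conf read with `open(...).read()`.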
		
		
		  |  | 
		
		  | bruce2359 | 
			  
				|  Posted: Fri Mar 19, 2021 7:19 am    Post subject: |   |  | 
		
		  |  Poobah
 
 
 Joined: 05 Jan 2008
 Posts: 9486
 Location: US: west coast, almost. Otherwise, enroute.
 
 | 
			  
				| Spend more time reading up on corosync configuration generally, and timeout values specifically. 
 RDQM is not my specialty.  But as I read it, the timeout token is like MQ's heartbeat interval, and not a delay feature.  Someone should be along shortly.
 |  | 
		
		
		  |  | 
		
		  |  |