[omniORB] Help needed to interpret COMM_FAILURE

Wernke zur Borg wernke.zur.borg at vega.de
Mon Nov 28 09:02:36 GMT 2005


Thanks for your comment.

In our case the problem occurs with messages of max. 10 KBytes. I really
suspect a bad link quality. At least I have no better explanation.

Regards, Wernke


> -----Original Message-----
> From: omniorb-list-bounces at omniorb-support.com 
> [mailto:omniorb-list-bounces at omniorb-support.com] On Behalf 
> Of Renzo Tomaselli
> Sent: 25 November 2005 22:06
> To: Omniorb list
> Subject: Re: [omniORB] Help needed to interpret COMM_FAILURE
> 
> Hi all,
>     this sound interesting. We too had an apparently random problem, 
> appearing as a COMM_FAILURE_MarshalArguments instead.
> This occurs while the involved client manages to send very large 
> messages (e.g. several hundred megabytes) to the server, 
> co-located on 
> the same Win host.
> We initialize maxMsgSize to be 1 gigabyte, but we have never seen 
> MARSHAL_MessageSizeExceedLimitOnClient as a minor, as one 
> would expect 
> in case of overflow failures.
> But from Duncan's answer, it seems that connection might be dropped 
> after overcoming max. message size, thus resulting in a 
> shadowing comm. 
> failure. After looking at sources, I saw that this comm. failure can 
> only come along the chain notifyCommFailure<errorOnSend<Send, then 
> transport stuff.
> What is granted for sure, if we reduce message size a bit then this 
> problem disappears.
> Any comment is highly appreciated.
> Thanks,
> 
> Renzo Tomaselli
> 
> 
> Duncan Grisby wrote:
> 
> >On Friday 25 November, "Wernke zur Borg" wrote:
> >
> >  
> >
> >>>>giopStream::CommFailure from
> >>>>        
> >>>>
> >>giopStream.cc:834(0,NO,COMM_FAILURE_UnMarshalArguments)
> >>
> >>I am trying to interpret the source code at that line but I 
> am not 100% sure
> >>if the exception is thrown due to a timeout or to a remote 
> disconnect. If
> >>this is a timeout condition, I would like to know which 
> config parameter I
> >>can use to increase it. It is known that the line between 
> server and client
> >>may sometimes become unstable, so to avoid these 
> disonnections I would need
> >>to increase the stability with larger timeout values.
> >>    
> >>
> >
> >It's not a timeout. If it was, the exception minor code would be
> >TRANSIENT_CallTimedOut.
> >
> >The COMM_FAILURE means the TCP connection was unexpectedly closed.
> >Possibly the network broke. Another alternative is that the 
> server tried
> >to send something larger than the maximum message size 
> (which is 2MB by
> >default), and therefore dropped the connection. You might learn
> >something by turning traceLevel 25 on on the server.
> >
> >Cheers,
> >
> >Duncan.
> >




More information about the omniORB-list mailing list