[omniORB] giopRendezvouser exit on error

Vinouse, Jean-Pierre (Jean-Pierre) jvinouse at alcatel-lucent.com
Thu Feb 21 13:15:13 GMT 2008


Duncan,

An issue with omniORB 4.0.7, which I reported at the end of last year
http://www.omniorb-support.com/pipermail/omniorb-list/2007-December/029059.html

now appears more frequently, since a different client is connected to our
server application.

The good news: we managed to record omniORB traces at level 30 (see the
short file attached).
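
To help read the hex dumps in the attached trace, here is a small sketch of how the fixed 12-byte GIOP header can be decoded. It is not taken from omniORB; the names GiopHeader and parse_giop_header are made up for illustration, and the layout assumed is the GIOP 1.1/1.2 one (magic, version, flags, message type, body size).

```cpp
#include <cstddef>
#include <cstdint>

// Fields of the fixed 12-byte GIOP 1.1/1.2 message header.
// Hypothetical helper type, not part of omniORB.
struct GiopHeader {
    std::uint8_t  major;
    std::uint8_t  minor;
    bool          little_endian;   // flags bit 0
    bool          more_fragments;  // flags bit 1
    std::uint8_t  msg_type;        // 0=Request, 1=Reply, 5=CloseConnection, 7=Fragment
    std::uint32_t body_size;       // number of bytes following the header
};

// Decodes the header at the start of buf.
// Returns false if fewer than 12 bytes are given or the magic is not "GIOP".
bool parse_giop_header(const unsigned char* buf, std::size_t len, GiopHeader& out) {
    if (len < 12) return false;
    if (buf[0] != 'G' || buf[1] != 'I' || buf[2] != 'O' || buf[3] != 'P') return false;

    out.major          = buf[4];
    out.minor          = buf[5];
    out.little_endian  = (buf[6] & 0x01) != 0;
    out.more_fragments = (buf[6] & 0x02) != 0;
    out.msg_type       = buf[7];

    // The size field uses the byte order announced in the flags octet.
    const unsigned char* s = buf + 8;
    if (out.little_endian) {
        out.body_size = static_cast<std::uint32_t>(s[0])
                      | (static_cast<std::uint32_t>(s[1]) << 8)
                      | (static_cast<std::uint32_t>(s[2]) << 16)
                      | (static_cast<std::uint32_t>(s[3]) << 24);
    } else {
        out.body_size = (static_cast<std::uint32_t>(s[0]) << 24)
                      | (static_cast<std::uint32_t>(s[1]) << 16)
                      | (static_cast<std::uint32_t>(s[2]) << 8)
                      |  static_cast<std::uint32_t>(s[3]);
    }
    return true;
}
```

Applied to the first dump in the trace (4749 4f50 0102 0000 0000 0093), this gives a GIOP 1.2 Request with a 147-byte body, i.e. 159 bytes in total, which matches the inputMessage line. The reply header (4749 4f50 0102 0201 0001 020c) decodes to a Reply with the more-fragments flag set and a 66060-byte body, and the last 22-byte chunk is a Fragment message with a 10-byte body.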

The client behaves differently from the previous one: it does not use
the GIOP 1.2 CloseConnection message to release the connection.
Instead, as an Ethereal trace shows, the TCP connection is released
"hard" at TCP level (FIN packet) a few msecs after the GIOP reply has
been sent out. I presume this causes the giopStream to throw the
exception COMM_FAILURE_UnMarshalArguments. It also seems that in some
circumstances (a large message sent, here 66 kbytes; network perhaps
degraded or slow) SocketCollection::select() may return an error of
type "invalid file descriptor". Following that, the giopRendezvouser
terminates and no further incoming connections can be accepted. Our
application then triggers a reboot.

Is this a known issue? If a fix exists for version 4.0.7, we could
test it in our environment.

Regards

Jean-Pierre

Alcatel-Lucent 
UMTS dev 
Nuremberg
Germany

-------------- next part --------------
UMC1 [01-01 01:57:34Z] 0x1efeb94 omniT1  SW=06000398       3342 ?I OAM_CORBA_AGENT omniORB: Server accepted connection from giop:tcp:135.120.154.182:47432



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3343 ?I OAM_CORBA_AGENT omniORB: giopWorker task execute.



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3344 ?I OAM_CORBA_AGENT omniORB: Accepted connection from giop:tcp:135.120.154.182:47432 because of this rule: "* unix,ssl,tcp"



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3345 ?I OAM_CORBA_AGENT omniORB: inputMessage: from giop:tcp:135.120.154.182:47432 159 bytes



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3346 ?I OAM_CORBA_AGENT omniORB: 128 bytes out of 159

4749 4f50 0102 0000 0000 0093 0000 0000 GIOP............

0300 0000 0000 0000 0000 000e fe00 0000 

UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3347 ?I OAM_CORBA_AGENT omniORB: Receive codeset service context and set TCS to (ISO-8859-1,UTF-16)



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3348 ?I OAM_CORBA_AGENT omniORB: Dispatching remote call 'getAttributes' to: root<3> (active)



UMC1 [01-01 01:57:34Z] 0x1efeb94 t33  SW=06000398       3349 ?I OAM_CORBA_AGENT Incoming ItfB request from <OMCU=1>: <getAttributes> for MO <OneBTSEquipment=2>



..... cut ......Application handles the request.....


UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3579 ?I OAM_CORBA_AGENT ItfB request <getAttributes> for MO <OneBTSEquipment=2> finished successfully

UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3580 ?I OAM_CORBA_AGENT omniORB: sendChunk: to giop:tcp:135.120.154.182:47432 28 bytes



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3581 ?I OAM_CORBA_AGENT omniORB: 

4749 4f50 0102 0201 0001 020c 0000 0000 GIOP............

0000 0000 0000 0000 0001 0202           ............



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3582 ?I OAM_CORBA_AGENT omniORB: sendCopyChunk: to giop:tcp:135.120.154.182:47432 66044 bytes



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3583 ?I OAM_CORBA_AGENT omniORB: 128 bytes out of 66044

3c4d 4f49 4c69 7374 3e3c 4d4f 4920 6664 <MOIList><MOI fd

6e3d 224f 6e65 4254 5345 7175 6970 6d6

UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3584 ?I OAM_CORBA_AGENT omniORB: sendChunk: to giop:tcp:135.120.154.182:47432 22 bytes



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3585 ?I OAM_CORBA_AGENT omniORB: 

4749 4f50 0102 0007 0000 000a 0000 0000 GIOP............

4c69 7374 3e00                          List>.



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3586 ?I OAM_CORBA_AGENT omniORB: throw giopStream::CommFailure from giopStream.cc:835(0,NO,COMM_FAILURE_UnMarshalArguments)



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3587 ?I OAM_CORBA_AGENT omniORB: Server connection refcount = 1



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3588 ?I OAM_CORBA_AGENT omniORB: Server connection refcount = 0



UMC1 [01-01 01:57:35Z] 0x1efeb94 t33  SW=06000398       3589 ?I OAM_CORBA_AGENT omniORB: Server close connection from giop:tcp:135.120.154.182:47432



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3590 ?I OAM_CORBA_AGENT omniORB: select() returned socket error ERRNO=851971



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3591 ?I OAM_CORBA_AGENT omniORB: giopRendezvouser for endpoint giop:tcp:135.120.176.153:15678 exit.



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3592 ?I OAM_CORBA_AGENT omniORB: Unrecoverable error for this endpoint: giop:tcp:135.120.176.153:15678, it will no longer be serviced.



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3593 ?I OAM_CORBA_AGENT omniORB: TCP endpoint shut down.



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3594 !E PANIC giopRendezvouser::execute exit on error ERRNO=851971



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3595 !E PANIC UBM F:/vobs/omniORB/build/src/src/lib/omniORB/orbcore/giopRendezvouser.cc L:128 ***

UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1          38 ?P PANIC:giopRendezvouser::execute exit on error ERRNO=851971



UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1          39 ?P PANIC:UBM F:/vobs/omniORB/build/src/src/lib/omniORB/orbcore/giopRendezvouser.cc L:128 ***

UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1          40 ?P PANIC:********** UBM TRACE BACK **********

UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3596 !W UBM 0x01dc50dc  0x01dc6454  0x01ef4140  0x0276de3c  0x01e6b2a8  

UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1  SW=06000398       3597 !W UBM *************** UBM TRACE BACK start *********************************************************

UMC1 [01-01 01:57:35Z] 0x1efeb94 omniT1          41 ?P PANIC:0x01dc50e4  0x01dc6454  0x01ef4140  0x0276de3c  0x01e6b2a8  

UMC1 [01-01 01:57:45Z] 0x1efeb94 t1  SW=06000398       3598 ?I OAM_CORBA_AGENT omniORB: Scan for idle connections (7065,30000000)



UMC1 [01-01 01:57:45Z] 0x1efeb94 t1  SW=06000398       3599 ?I OAM_CORBA_AGENT omniORB: Scavenger reduce idle count for strand 0x90fa830 to 28



UMC1 [01-01 01:57:45Z] 0x1efeb94 t1  SW=06000398       3600 ?I OAM_CORBA_AGENT omniORB: Scavenger reduce idle count for strand 0x90faf10 to 28



UMC1 [01-01 01:57:45Z] 0x1efeb94 t1  SW=06000398       3601 ?I OAM_CORBA_AGENT omniORB: Scan for idle connections done (7065,30000000).



UMC1 [01-01 01:57:45Z] 0x1efeb94 t33  SW=06000398       3602 ?I OAM_CORBA_AGENT omniORB: AsyncInvoker: thread id = 34 has exited. Total threads = 3





comUtil.c: 284:   Request for reboot: giopRendezvouser::execute exit on error ERRNO=851971

 (reason_id=128)




