[AG-TECH] Bridge Registry Timeout

John I. Quebedeaux, Jr johnq at lsu.edu
Wed Jun 9 10:48:25 CDT 2010


Ahhhhh... I can at least report I'm seeing the same with the LSU bridge - FC
12, 3.2beta. It started happening after we upgraded to FC12 and 3.2beta.

I haven't been able to identify what's different, I've been comparing logs
between the old and new...

-John



From: Matthew Leszczenski <mxl9499 at rit.edu>
Date: Wed, 09 Jun 2010 11:44:04 -0400
To: <ag-tech at lists.mcs.anl.gov>
Subject: [AG-TECH] Bridge Registry Timeout

Hello all,

My apologies if this has been covered by other people in the past, but I
have spent considerable time searching the archives without finding
instructions on how to fix this issue (or even exactly where it stems from).

Here at RIT I have been working on setting up a Unicast Bridge, but I have
run into a snag. The bridge is up and working: two of our own nodes stay
connected through it whenever they are up. The problem is that the bridge
appears in the nodes' registry list for only about 5 minutes. After that, it
disappears from the list if a registry purge is run, and a node that logs
into AG after the registry timeout does not see it at all. A node that
fetched the list during those 5 minutes, and has not purged it, can still
connect to and disconnect from the bridge server without a problem, so the
bridge itself is still up and working.
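
The symptom described above is consistent with (though not proof of) a
registry that drops entries which are not refreshed within a time limit. The
toy model below only illustrates that behaviour. It is not AccessGrid's
registry code: the class, its methods, and the injectable clock are all
hypothetical, and the 300-second limit is just the "about 5 minutes"
observed above.

```python
import time


class ToyRegistry:
    """Hypothetical time-limited registry; NOT the AccessGrid implementation."""

    def __init__(self, ttl_seconds, clock=time.time):
        self.ttl = ttl_seconds
        self.clock = clock          # injectable so the model can be tested
        self.last_seen = {}         # bridge name -> time of last registration

    def register(self, name):
        # A bridge must call this again before the TTL elapses to stay listed.
        self.last_seen[name] = self.clock()

    def purge(self):
        # Drop every entry whose last registration is older than the TTL.
        now = self.clock()
        expired = [n for n, t in self.last_seen.items() if now - t > self.ttl]
        for name in expired:
            del self.last_seen[name]

    def list_bridges(self):
        # A client fetching the list after a purge sees only live entries.
        self.purge()
        return sorted(self.last_seen)
```

In this model a bridge that registers once and never refreshes vanishes from
new listings after the limit, while a client holding an older copy of the
list still knows its address and can keep connecting, which matches what is
described above.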

For details, I am running Fedora 12 and using the Bridge Python script that
is installed with AG3.2 (it has a creation date of 2005/12/06, in case it
has since been updated). I run the script with the following command:

./Bridge -n "RIT Brooklyn" -l RIT

I have been watching the log file to which I direct all the output, and at
the beginning I found an interesting entry, but it appears only when there
are no clients connected:

reached inactivity timeout and have no clients; exiting
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/AccessGrid3/AccessGrid/AGXMLRPCServer.py", line 63, in run
    self.handle_request()
  File "/usr/lib/python2.6/SocketServer.py", line 262, in handle_request
    fd_sets = select.select([self], [], [], timeout)
error: (4, 'Interrupted system call')
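
For reference, errno 4 in the traceback above is EINTR: the select() call
was interrupted by a signal before it returned. On Python 2.6 this surfaces
as select.error and is not retried automatically. A minimal sketch of the
usual workaround follows; the helper name retry_on_eintr is made up for
illustration and is not part of AccessGrid or the standard library.

```python
import errno
import select


def retry_on_eintr(func, *args, **kwargs):
    """Call func, retrying for as long as it fails with EINTR (errno 4)."""
    while True:
        try:
            return func(*args, **kwargs)
        except (OSError, select.error) as exc:
            # Python 3's OSError exposes the code as .errno; Python 2's
            # select.error only carries it as the first element of .args.
            code = getattr(exc, "errno", None)
            if code is None and exc.args:
                code = exc.args[0]
            if code != errno.EINTR:
                raise  # any other error is a real failure
```

Usage would look like retry_on_eintr(select.select, [sock], [], [], timeout),
where sock and timeout are whatever the caller already has. Python 3.5 and
later retry interrupted system calls on their own (PEP 475), so this only
matters on older interpreters such as the 2.6 shown here. Since the
traceback directly follows the "exiting" message, it may simply be noise
from shutdown rather than the cause of the registry problem.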


However, when there are clients connected, it just prints out the connection
information every so often, as follows:

max_unicast_mem is 32
myhostname=brooklyn
myhostipaddress=129.21.x.x

using multicast 
ucport [data]=51390   ucport [rtcp]=51391
mcport [data]=56384  mcport [rtcp]=56385
making multicast port [0]
making multicast port [1]
No bridge.acl file found, no ACL set

If anyone has information that could help me track down where this problem
is, it would be a great help.

Thank you in advance,
   Matthew Leszczenski
-Collaborations Technology Specialist @ RIT Research Computing Department

