[AG-TECH] The vanishing bridge...

Philippe d'Anfray Philippe.d-Anfray at cea.fr
Tue Mar 16 04:59:32 CDT 2010


Bonjour,

We posted a while ago about this strange bridge behaviour...
when we launch the bridge it is only visible a few minutes in the registry.
has someone noticed the same problem? any idea ???

Cordialement

Philippe d'Anfray

The machine running the bridge  also runs Ubuntu 9.10

*** 1) today 10h09 am: bridge is launched

Bridge3.py -p 20000 -r 50000 52000 -n ARISTOTE -l FRANCE -u
http://www.accessgrid.org/registry/peers.txt

*** 2) 10h12 a.m.  from another machine, "purge bridge cache ",  Yes !
the ARISTOTE bridge shows up (picture 1)

*** 3) 10h20 a.m.  from the same machine, "purge bridge cache ",
ARISTOTE bridge has already vanished  from the list (picture 2)

*** 4) 10h22 a.m. here are the logs...

03/16/10 10:09:26 139917634770672 Toolkit     Config.py:674 DEBUG System
hostname of aristote-venues.polytechnique.fr is valid
03/16/10 10:09:26 139917634770672 Toolkit     Toolkit.py:116 DEBUG
Initializing AG Toolkit version 3.2 beta
03/16/10 10:09:26 139917634770672 Toolkit     Toolkit.py:117 INFO 
Command and arguments: ['/usr/bin/Bridge3.py', '-p', '20000', '-r',
'50000', '52000', '-n', 'ARISTOTE', '-l', 'FRANC
E', '-u', 'http://www.accessgrid.org/registry/peers.txt']
03/16/10 10:09:26 139917634770672 VenueClient     Preferences.py:206
DEBUG Preferences.LoadPreferences: open file
03/16/10 10:09:27 139917634770672 Toolkit     Toolkit.py:166 INFO 
Logfile Name: Bridge.log
03/16/10 10:09:27 139917634770672 Toolkit     Toolkit.py:692 INFO 
Service init: have profile None
03/16/10 10:09:27 139917634770672 Toolkit     Toolkit.py:717 INFO 
Service Initialization Complete.
03/16/10 10:09:27 139917634770672 Toolkit     BridgeFactory.py:81 INFO 
Allocator using port range: (50000, 52000)
03/16/10 10:09:27 139917634770672 Toolkit     BridgeFactory.py:89 INFO 
BridgeFactory.SetPortMin 50000
03/16/10 10:09:27 139917634770672 Toolkit     BridgeFactory.py:93 INFO 
BridgeFactory.SetPortMax 52000
03/16/10 10:09:27 139917634770672 VenueClient     Preferences.py:206
DEBUG Preferences.LoadPreferences: open file
03/16/10 10:09:27 139917634770672 VenueClient     Preferences.py:206
DEBUG Preferences.LoadPreferences: open file
03/16/10 10:09:27 139917634770672 VenueClient     Preferences.py:206
DEBUG Preferences.LoadPreferences: open file
03/16/10 10:09:28 139917634770672 VenueClient     Preferences.py:206
DEBUG Preferences.LoadPreferences: open file
03/16/10 10:10:06 139917419489552 Toolkit     Bridge3.py:59 INFO  Bridge
request: mcast 233.4.200.18 10002
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:108 INFO 
Method CreateBridge called
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:114 INFO 
Allocated port = 51168
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:134 INFO 
- creating new bridge
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:35 INFO 
Method Bridge.Start called
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:38 INFO 
Starting bridge:
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:40
INFO    [maddr,mport,mttl] = 233.4.200.18 10002 1
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:42
INFO    [uaddr,uport] = aristote-venues.polytechnique.fr 51168
03/16/10 10:10:06 139917419489552 Toolkit     BridgeFactory.py:51 INFO 
Starting bridge: /usr/bin/QuickBridge ['-g', '233.4.200.18', '-m',
'10002', '-u', '51168', '-i', '600']
03/16/10 10:20:07 139917634770672 ProcessManager    
ProcessManager.py:44 DEBUG OnSigChild: pid = 26018
03/16/10 10:20:07 139917634770672 Toolkit     BridgeFactory.py:187 INFO 
Removing bridge following death of bridge process (233.4.200.18/10002)
03/16/10 10:20:07 139917634770672 Toolkit     BridgeFactory.py:191 INFO 
Removing bridge following death of bridge process (233.4.200.18/10002)
03/16/10 10:20:07 139917634770672 ProcessManager    
ProcessManager.py:39 ERROR Exception in waitpid; breaking
Traceback (most recent call last):
  File
"/usr/lib/pymodules/python2.6/AccessGrid3/AccessGrid/Platform/unix/ProcessManager.py",
line 37, in OnSigChild
    ret = os.waitpid(-1,os.WNOHANG)
OSError: [Errno 10] No child processes

*** 5) Later..  ps shows that  the bridge process is still somewhere on
our machine...

4 S aristote 26008     1  0  80   0 - 60122 poll_s 10:09 ?       
00:00:00 /usr/bin/python /usr/bin/Bridge3.py -p 20000 -r 50000 52000 -n
ARISTOTE -l FRANCE -u http://www.accessgrid.org/registry/peers.txt


-------------- next part --------------
A non-text attachment was scrubbed...
Name: bridge-1.jpg
Type: image/jpeg
Size: 53329 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/ag-tech/attachments/20100316/7aa2df3b/attachment-0002.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bridge-2.jpg
Type: image/jpeg
Size: 48269 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/ag-tech/attachments/20100316/7aa2df3b/attachment-0003.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Philippe_d-Anfray.vcf
Type: text/x-vcard
Size: 363 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/ag-tech/attachments/20100316/7aa2df3b/attachment-0001.vcf>


More information about the ag-tech mailing list