[Swift-devel] Re: Problem with incorrect host cert DN in coaster GSI authentication

Yi Zhu yizhu at cs.uchicago.edu
Fri Apr 30 01:33:40 CDT 2010


On 4/29/2010 6:18 PM, Mihael Hategan wrote:
> On Thu, 2010-04-29 at 17:34 -0500, Yi Zhu wrote:
>    
>> HI,
>>
>>   I've tried it with "gt2:pbs", and got a "qsub not found" error, for
>> further investigation, I  pulled the env used by globus,and  found
>> that there is no "/opt/torque-2.3.6/bin/qsub" under the PATH= ,I think
>> that's why cause  "qsub not found" problem.
>>
>> Any suggested solution ?
>>      
> Two actually.
> 1. This is for the qsub problem: you can add the relevant environment
> variables (for Torque) in sites.xml.
>    
I've tried to add
<profile namespace="env" key="PATH">/opt/torque-2.3.6/bin</profile>

to the sites.xml, but still get the same error;" qsub is not found".

make a link from /opt/torque-2.3.6/bin/qsub to /usr/bin seems works, but 
I get another error:

-bash-3.2$
-bash-3.2$ swift -tc.file tc.test.data -sites.file sshpbscoast.xml 
first.swift
Swift svn swift-r3262 cog-r2729 (cog modified locally)

RunID: 20100430-0105-nzzk6xxd
Progress:
Progress:  Stage in:1
Progress:  Submitted:1
Progress:  Active:1
Failed to transfer wrapper log from first-20100430-0105-nzzk6xxd/info/x 
on ec2
Progress:  Failed:1
Execution failed:
         Exception in echo:
Arguments: [Hello, world!]
Host: ec2
Directory: first-20100430-0105-nzzk6xxd/jobs/x/echo-xvom1arj
stderr.txt:

stdout.txt:

----

Caused by:
         No status file was found. Check the shared filesystem on ec2
Cleaning up...
Shutting down service at https://10.251.214.179:48615
Got channel MetaChannel: 1317572826 -> GSSSChannel-11921994068(1)
+ Done

and the coaster-bootstrap log:

[torqueuser at ip-10-251-214-179 ~]$
[torqueuser at ip-10-251-214-179 ~]$ cat coaster-bootstrap-11921994068.log
using plain mode
BS: http://tp-login2.ci.uchicago.edu:57278
which: no gmd5sum in 
(/opt/vdt-1.10.1/gums/scripts:/opt/vdt-1.10.1/prima/bin:/opt/vdt-1.10.1/cert-scripts/bin:/opt/vdt-1.10.1/glite/sbin:/opt/vdt-1.10.1/glite/bin:/opt/vdt-1.10.1/jdk1.5/bin:/opt/vdt-1.10.1/edg/sbin:/opt/vdt-1.10.1/gip/bin:/opt/vdt-1.10.1/gpt/sbin:/opt/vdt-1.10.1/globus/bin:/opt/vdt-1.10.1/globus/sbin:/opt/vdt-1.10.1/wget/bin:/opt/vdt-1.10.1/logrotate/sbin:/opt/vdt-1.10.1/perl/bin:/opt/pacman-3.26/bin:/opt/vdt-1.10.1/vdt/sbin:/opt/vdt-1.10.1/vdt/bin:/opt/vdt-1.10.1/gums/scripts:/opt/vdt-1.10.1/prima/bin:/opt/vdt-1.10.1/cert-scripts/bin:/opt/vdt-1.10.1/glite/sbin:/opt/vdt-1.10.1/glite/bin:/opt/vdt-1.10.1/jdk1.5/bin:/opt/vdt-1.10.1/edg/sbin:/opt/vdt-1.10.1/gip/bin:/opt/vdt-1.10.1/gpt/sbin:/opt/vdt-1.10.1/wget/bin:/opt/vdt-1.10.1/logrotate/sbin:/opt/vdt-1.10.1/perl/bin:/opt/pacman-3.26/bin:/opt/vdt-1.10.1/vdt/sbin:/opt/vdt-1.10.1/vdt/bin:/sbin:/usr/sbin:/bin:/usr/bin:/usr/X11R6/bin)
Expected checksum: 9017a89a3a700d9866592187fdb27b5b
Computed checksum: 9017a89a3a700d9866592187fdb27b5b
JAVA=/opt/vdt-1.10.1/jdk1.5/bin/java
plain /opt/vdt-1.10.1/jdk1.5/bin/java 
-Djava=/opt/vdt-1.10.1/jdk1.5/bin/java -DGLOBUS_TCP_PORT_RANGE= 
-DX509_USER_PROXY=/home/torqueuser/.globus/job/ec2-204-236-204-71.compute-1.amazonaws.com/31355.1272607512/x509_up 
-DX509_CERT_DIR=/etc/grid-security/certificates 
-DGLOBUS_HOSTNAME=ec2-204-236-204-71.compute-1.amazonaws.com -jar 
/tmp/bootstrap.t31454 http://tp-login2.ci.uchicago.edu:57278 
https://128.135.125.117:54201 11921994068
Canceling job 28.ip-10-251-214-179.ec2.internal

EC: 0
[torqueuser at ip-10-251-214-179 ~]$
[torqueuser at ip-10-251-214-179 ~]$


> 2. This is for the DN issue with gt2:gt2:pbs: Edit /etc/hosts and make
> sure that the expected DN is the first entry for the internal IP passed
> to the coaster service. If the entry is not in there at all, add it.
> This is a way to impersonate a Globus service and possibly do a
> man-in-the-middle thing, but it may also work to fix the DN mismatch
> problem.
>
> Mihael
>
>
>    
by modify the entry in /etc/hosts to the expect DN address, so solve the 
DNS mismatch problem, but still get an  " No status file was found. 
Check the shared filesystem on ec2" error As same as the one mentioned 
above.

-Yi Zhu

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20100430/24ddd20b/attachment.html>


More information about the Swift-devel mailing list