<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
On 4/29/2010 6:18 PM, Mihael Hategan wrote:
<blockquote cite="mid:1272583084.7581.9.camel@localhost" type="cite">
<pre wrap="">On Thu, 2010-04-29 at 17:34 -0500, Yi Zhu wrote:
</pre>
<blockquote type="cite">
<pre wrap="">Hi,
I've tried it with "gt2:pbs" and got a "qsub not found" error. To
investigate further, I dumped the environment used by Globus and found
that "/opt/torque-2.3.6/bin", where qsub lives, is not in PATH. I think
that is what causes the "qsub not found" problem.
Any suggested solution?
</pre>
</blockquote>
<pre wrap="">
Two actually.
1. This is for the qsub problem: you can add the relevant environment
variables (for Torque) in sites.xml.
</pre>
</blockquote>
I tried adding<br>
&lt;profile namespace="env"
key="PATH"&gt;/opt/torque-2.3.6/bin&lt;/profile&gt;<br>
<br>
to sites.xml, but I still get the same error: "qsub not found".<br>
<br>
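For reference, a minimal sketch of where such an entry would sit in a
sites.xml pool definition. The pool handle "ec2" is taken from the host
name in the run output; the PATH value with the extra system directories
is an assumption, on the theory that an env profile may replace PATH
rather than extend it. Whether this variable reaches the process that
actually invokes qsub is not confirmed here.<br>
<pre wrap="">&lt;pool handle="ec2"&gt;
  &lt;!-- execution and filesystem elements omitted --&gt;
  &lt;!-- assumption: Torque bin directory plus the usual system directories --&gt;
  &lt;profile namespace="env" key="PATH"&gt;/opt/torque-2.3.6/bin:/usr/bin:/bin&lt;/profile&gt;
&lt;/pool&gt;
</pre>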
Making a symlink from /opt/torque-2.3.6/bin/qsub into /usr/bin seems to
work, but then I get another error:<br>
<br>
<small><small>-bash-3.2$ <br>
-bash-3.2$ swift -tc.file tc.test.data -sites.file sshpbscoast.xml
first.swift<br>
Swift svn swift-r3262 cog-r2729 (cog modified locally)<br>
<br>
RunID: 20100430-0105-nzzk6xxd<br>
Progress:<br>
Progress: Stage in:1<br>
Progress: Submitted:1<br>
Progress: Active:1<br>
Failed to transfer wrapper log from first-20100430-0105-nzzk6xxd/info/x
on ec2<br>
Progress: Failed:1<br>
Execution failed:<br>
Exception in echo:<br>
Arguments: [Hello, world!]<br>
Host: ec2<br>
Directory: first-20100430-0105-nzzk6xxd/jobs/x/echo-xvom1arj<br>
stderr.txt: <br>
<br>
stdout.txt: <br>
<br>
----<br>
<br>
Caused by:<br>
No status file was found. Check the shared filesystem on ec2<br>
Cleaning up...<br>
Shutting down service at <a class="moz-txt-link-freetext" href="https://10.251.214.179:48615">https://10.251.214.179:48615</a><br>
Got channel MetaChannel: 1317572826 -> GSSSChannel-11921994068(1)<br>
+ Done</small></small><br>
<br>
and the coaster-bootstrap log:<br>
<br>
<small><small>[torqueuser@ip-10-251-214-179 ~]$ <br>
[torqueuser@ip-10-251-214-179 ~]$ cat coaster-bootstrap-11921994068.log
<br>
using plain mode<br>
BS: <a class="moz-txt-link-freetext" href="http://tp-login2.ci.uchicago.edu:57278">http://tp-login2.ci.uchicago.edu:57278</a><br>
which: no gmd5sum in
(/opt/vdt-1.10.1/gums/scripts:/opt/vdt-1.10.1/prima/bin:/opt/vdt-1.10.1/cert-scripts/bin:/opt/vdt-1.10.1/glite/sbin:/opt/vdt-1.10.1/glite/bin:/opt/vdt-1.10.1/jdk1.5/bin:/opt/vdt-1.10.1/edg/sbin:/opt/vdt-1.10.1/gip/bin:/opt/vdt-1.10.1/gpt/sbin:/opt/vdt-1.10.1/globus/bin:/opt/vdt-1.10.1/globus/sbin:/opt/vdt-1.10.1/wget/bin:/opt/vdt-1.10.1/logrotate/sbin:/opt/vdt-1.10.1/perl/bin:/opt/pacman-3.26/bin:/opt/vdt-1.10.1/vdt/sbin:/opt/vdt-1.10.1/vdt/bin:/opt/vdt-1.10.1/gums/scripts:/opt/vdt-1.10.1/prima/bin:/opt/vdt-1.10.1/cert-scripts/bin:/opt/vdt-1.10.1/glite/sbin:/opt/vdt-1.10.1/glite/bin:/opt/vdt-1.10.1/jdk1.5/bin:/opt/vdt-1.10.1/edg/sbin:/opt/vdt-1.10.1/gip/bin:/opt/vdt-1.10.1/gpt/sbin:/opt/vdt-1.10.1/wget/bin:/opt/vdt-1.10.1/logrotate/sbin:/opt/vdt-1.10.1/perl/bin:/opt/pacman-3.26/bin:/opt/vdt-1.10.1/vdt/sbin:/opt/vdt-1.10.1/vdt/bin:/sbin:/usr/sbin:/bin:/usr/bin:/usr/X11R6/bin)<br>
Expected checksum: 9017a89a3a700d9866592187fdb27b5b<br>
Computed checksum: 9017a89a3a700d9866592187fdb27b5b<br>
JAVA=/opt/vdt-1.10.1/jdk1.5/bin/java<br>
plain /opt/vdt-1.10.1/jdk1.5/bin/java
-Djava=/opt/vdt-1.10.1/jdk1.5/bin/java -DGLOBUS_TCP_PORT_RANGE=
-DX509_USER_PROXY=/home/torqueuser/.globus/job/ec2-204-236-204-71.compute-1.amazonaws.com/31355.1272607512/x509_up
-DX509_CERT_DIR=/etc/grid-security/certificates
-DGLOBUS_HOSTNAME=ec2-204-236-204-71.compute-1.amazonaws.com -jar
/tmp/bootstrap.t31454 <a class="moz-txt-link-freetext" href="http://tp-login2.ci.uchicago.edu:57278">http://tp-login2.ci.uchicago.edu:57278</a>
<a class="moz-txt-link-freetext" href="https://128.135.125.117:54201">https://128.135.125.117:54201</a> 11921994068<br>
Canceling job 28.ip-10-251-214-179.ec2.internal<br>
<br>
EC: 0<br>
[torqueuser@ip-10-251-214-179 ~]$ <br>
[torqueuser@ip-10-251-214-179 ~]$ <br>
</small></small><br>
<br>
<blockquote cite="mid:1272583084.7581.9.camel@localhost" type="cite">
<pre wrap="">2. This is for the DN issue with gt2:gt2:pbs: Edit /etc/hosts and make
sure that the expected DN is the first entry for the internal IP passed
to the coaster service. If the entry is not in there at all, add it.
This is a way to impersonate a Globus service and possibly do a
man-in-the-middle thing, but it may also work to fix the DN mismatch
problem.
Mihael
</pre>
</blockquote>
Modifying the entry in /etc/hosts to match the expected DN solved the
DN mismatch problem, but I still get the "<small><small>No status
file was found. Check the shared filesystem on ec2</small></small>" error,
the same one mentioned above.<br>
<br>
-Yi Zhu<br>
<br>
</body>
</html>