[Swift-devel] condor-g test on ff-grid site
Zhao Zhang
zhaozhang at uchicago.edu
Thu Jun 18 14:29:07 CDT 2009
Dear all,
I am trying to run a workflow on the ff-grid site using Condor-G. My
submit host is tp-osg.ci.uchicago.edu.
I have a question about the remote site requirements: does the remote
site need a Condor jobmanager in order for us to run Swift with Condor-G
there? I ask because ff-grid only has a PBS jobmanager. Here is my
sites.xml:
[zzhang@tp-grid1 sites]$ cat condor-g_new/ff-grid.xml
<config>
<!-- ff-grid -->
<pool handle="ff-grid" >
<gridftp url="gsiftp://ff-grid.unl.edu/" />
<execution provider="condor" />
<workdirectory >/mnt/panasas/CMS/grid_users/osg/</workdirectory>
<profile namespace="globus" key="jobType">grid</profile>
<profile namespace="globus" key="gridResource">gt2 ff-grid.unl.edu/jobmanager-pbs</profile>
</pool>
</config>
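To double-check what the entry above actually declares, the pool can be parsed as in the following sketch. It only reads the XML with Python's standard library and is not how Swift itself loads sites.xml; the function name `summarize_pool` and the returned keys are made up for illustration, and the `gridResource` value is assumed to have the form "type host/jobmanager" as in the file above.

```python
import xml.etree.ElementTree as ET

# Copy of the pool entry from ff-grid.xml above, with the gridResource
# value written on a single line.
SITES_XML = """<config>
  <pool handle="ff-grid">
    <gridftp url="gsiftp://ff-grid.unl.edu/" />
    <execution provider="condor" />
    <workdirectory>/mnt/panasas/CMS/grid_users/osg/</workdirectory>
    <profile namespace="globus" key="jobType">grid</profile>
    <profile namespace="globus" key="gridResource">gt2 ff-grid.unl.edu/jobmanager-pbs</profile>
  </pool>
</config>"""


def summarize_pool(xml_text):
    """Return the settings of the first <pool> as a plain dict (hypothetical helper)."""
    pool = ET.fromstring(xml_text).find("pool")

    # Collect <profile key="...">value</profile> pairs, collapsing whitespace
    # so a value wrapped across lines still reads as one string.
    profiles = {
        p.get("key"): " ".join((p.text or "").split())
        for p in pool.findall("profile")
    }

    # Assumed layout of gridResource: "<grid type> <host>/<jobmanager>".
    grid_type, _, contact = profiles.get("gridResource", "").partition(" ")
    host, _, jobmanager = contact.partition("/")

    return {
        "handle": pool.get("handle"),
        "provider": pool.find("execution").get("provider"),
        "workdirectory": (pool.findtext("workdirectory") or "").strip(),
        "job_type": profiles.get("jobType"),
        "grid_type": grid_type,
        "host": host,
        "jobmanager": jobmanager,
    }


summary = summarize_pool(SITES_XML)
for key, value in summary.items():
    print(f"{key}: {value}")
```

This makes it easy to see side by side that the execution provider is condor while the jobmanager named in gridResource is jobmanager-pbs.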
I am asking because my test failed on the ff-grid site.
All related logs are on the CI network at
/home/zzhang/swift_coaster/cog/modules/swift/tests/sites/logs/ff-grid/
Execution failed:
Exception in cat:
Arguments: [061-cattwo.1.in, 061-cattwo.2.in]
Host: ff-grid
Directory: 061-cattwo-20090618-1407-gfg03g57/jobs/v/cat-v66x3gcj
stderr.txt:
stdout.txt:
----
Caused by:
No status file was found. Check the shared filesystem on ff-grid
SWIFT RETURN CODE NON-ZERO - test 061-cattwo
On the remote site, the shared dir was created, but the jobs dir wasn't.
[zzhang@tp-grid1 ~]$ globus-job-run ff-grid.unl.edu /bin/ls 061-cattwo-20090618-1407-gfg03g57/
info
kickstart
shared
status
Any ideas about the job failure? Also, to make sure the test workflow
itself is not the problem, I ran exactly the same suite
on the GLOW site.
Best,
Zhao