[Swift-devel] condor-g test on ff-grid site

Zhao Zhang zhaozhang at uchicago.edu
Thu Jun 18 14:29:07 CDT 2009


Dear All

I am trying to run a workflow on ff-grid site with condor-g feature. My 
submit host is tp-osg.ci.uchicago.edu.
I have a question about the remote site requirements. Does remote site 
require a condor jobmanger in order
for us to run swift with condor-g on there? cuz ff-grid only has pbs job 
manager. Here is my sites.xml
[zzhang at tp-grid1 sites]$ cat condor-g_new/ff-grid.xml
<config>

  <!-- ff-grid -->
  <pool handle="ff-grid" >
    <gridftp  url="gsiftp://ff-grid.unl.edu/" />
    <execution  provider="condor" />
    <workdirectory >/mnt/panasas/CMS/grid_users/osg/</workdirectory>
    <profile namespace="globus" key="jobType">grid</profile>
    <profile namespace="globus" key="gridResource">gt2 
ff-grid.unl.edu/jobmanager-pbs</profile>
  </pool>

</config>


The reason I am asking this is because my test failed on ff-grid site. 
All related logs are at CI network
/home/zzhang/swift_coaster/cog/modules/swift/tests/sites/logs/ff-grid/

Execution failed:
        Exception in cat:
Arguments: [061-cattwo.1.in, 061-cattwo.2.in]
Host: ff-grid
Directory: 061-cattwo-20090618-1407-gfg03g57/jobs/v/cat-v66x3gcj
stderr.txt:

stdout.txt:

----

Caused by:
        No status file was found. Check the shared filesystem on ff-grid
SWIFT RETURN CODE NON-ZERO - test 061-cattwo

On the remote site, the shared dir was created, but the jobs dir wasn't.
[zzhang at tp-grid1 ~]$ globus-job-run ff-grid.unl.edu /bin/ls 
061-cattwo-20090618-
1407-gfg03g57/
info
kickstart
shared
status

Any idea on the job failure? Also, to make sure it is not the test 
workflow's problem, I tested exactly the same suite
on the GLOW site.

best
zhao



More information about the Swift-devel mailing list