[Swift-devel] Can't run condor-g on TeraPort
Mihael Hategan
hategan at mcs.anl.gov
Thu Jun 18 14:25:16 CDT 2009
Send mail to Ti to restart the daemon (or fix whatever configuration
problems prevent it from starting).
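
Before mailing, it may be worth confirming that the daemon really is down. The following is only a rough sketch: check_daemon is a helper name made up for this example, it assumes pgrep is available on the login node, and it assumes condor_config_val is on the PATH after sourcing the VDT setup script.

```shell
#!/bin/sh
# Hypothetical helper (not part of Condor): report whether a process with
# exactly the given name is running. Defaults to condor_schedd, the daemon
# that condor_q needs to contact.
check_daemon() {
    name="${1:-condor_schedd}"
    if pgrep -x "$name" >/dev/null 2>&1; then
        echo "$name is running"
        return 0
    else
        echo "$name is not running"
        return 1
    fi
}

# Check the schedd; if it is down, an administrator has to restart it.
check_daemon condor_schedd || echo "Ask the administrator to restart it."

# If the Condor tools are installed, show where the schedd is expected to
# write its address file (the setting named in the condor_q error text).
if command -v condor_config_val >/dev/null 2>&1; then
    condor_config_val SCHEDD_ADDRESS_FILE || true
fi
```

If the first check reports that condor_schedd is not running, the condor_q error and the condor_submit exit code of 1 are both consistent with that, and only a restart (or a configuration fix) on the submit host will help.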
On Thu, 2009-06-18 at 14:22 -0500, Michael Wilde wrote:
> As far as I can tell, the condor client code is broken on TeraPort.
>
> I've tried this on tp-login and tp-osg; I am using +osg-client and @osg
> in my .soft. I source $VDT_LOCATION/setup.sh.
>
> Zhao, Glen, can you cross-check and see if you are now seeing the same
> thing?
>
> My suspicion is that the condor client config broke in the last month,
> through OSG changes, CI Support work, etc.
>
> - Mike
>
>
> I get this from condor_q:
>
> tp$ condor_q
> Error:
>
> Extra Info: You probably saw this error because the condor_schedd is not
> running on the machine you are trying to query. If the condor_schedd is not
> running, the Condor system will not be able to find an address and port to
> connect to and satisfy this request. Please make sure the Condor daemons are
> running and try again.
>
> Extra Info: If the condor_schedd is running on the machine you are trying to
> query and you still see the error, the most likely cause is that you have
> setup a personal Condor, you have not defined SCHEDD_NAME in your
> condor_config file, and something is wrong with your SCHEDD_ADDRESS_FILE
> setting. You must define either or both of those settings in your config
> file, or you must use the -name option to condor_q. Please see the Condor
> manual for details on SCHEDD_NAME and SCHEDD_ADDRESS_FILE.
> tp$
>
> and this from swift:
>
> tp-grid1$ swift -tc.file tc.data -sites.file sites.condorg.xml cat.swift
> Swift svn swift-r2890 cog-r2392
>
> RunID: 20090618-1404-mo0thjj4
> Progress:
> Progress: Stage in:1
> Progress: Submitted:1
> Failed to transfer wrapper log from cat-20090618-1404-mo0thjj4/info/h on firefly
> Progress: Failed:1
> Execution failed:
> Exception in cat:
> Arguments: [data.txt]
> Host: firefly
> Directory: cat-20090618-1404-mo0thjj4/jobs/h/cat-hv5s3gcj
> stderr.txt:
>
> stdout.txt:
>
> ----
>
> Caused by:
> Cannot submit job: Could not submit job (condor_submit reported an exit code of 1). no error output
> tp-grid1$ ls
>
> --
>
> Using this sites file:
>
> <config>
> <pool handle="firefly" >
> <gridftp url="gsiftp://ff-grid.unl.edu" />
> <execution provider="condor" />
> <profile namespace="globus" key="jobType">grid</profile>
> <profile namespace="globus" key="gridResource">gt2 ff-grid.unl.edu/jobmanager-pbs</profile>
> <workdirectory>/panfs/panasas/CMS/data/oops/wilde/swiftwork</workdirectory>
> </pool>
> </config>