[Swift-user] using swift on a cluster
Hodgess, Erin
HodgessE at uhd.edu
Wed Oct 21 03:17:00 CDT 2009
Aha!
I needed the universe=vanilla line.
Erin M. Hodgess, PhD
Associate Professor
Department of Computer and Mathematical Sciences
University of Houston - Downtown
mailto: hodgesse at uhd.edu
-----Original Message-----
From: swift-user-bounces at ci.uchicago.edu on behalf of Hodgess, Erin
Sent: Wed 10/21/2009 3:07 AM
To: Michael Wilde
Cc: swift-user at ci.uchicago.edu
Subject: RE: [Swift-user] using swift on a cluster
Hello!
We are indeed using condor.
I wanted to try a small test run, but am running into trouble:
[hodgess at grid bin]$ cat myjob.submit
executable=/usr/bin/id
output=results.output
error=results.error
log=results.log
queue
[hodgess at grid bin]$ condor_submit myjob.submit
Submitting job(s).
Logging submit event(s).
1 job(s) submitted to cluster 15.
[hodgess at grid bin]$ ls results*
results.error results.log results.output
You have new mail in /var/spool/mail/hodgess
[hodgess at grid bin]$ cat results.log
000 (015.000.000) 10/21 03:06:03 Job submitted from host: <192.168.1.11:46274>
...
001 (015.000.000) 10/21 03:06:05 Job executing on host: <10.1.255.244:44508>
...
002 (015.000.000) 10/21 03:06:05 (1) Job not properly linked for Condor.
...
009 (015.000.000) 10/21 03:06:05 Job was aborted by the user.
...
[hodgess at grid bin]$
I'm not sure why the job is not linked.
Any suggestions would be much appreciated.
Thanks,
Erin
Erin M. Hodgess, PhD
Associate Professor
Department of Computer and Mathematical Sciences
University of Houston - Downtown
mailto: hodgesse at uhd.edu
-----Original Message-----
From: Michael Wilde [mailto:wilde at mcs.anl.gov]
Sent: Tue 10/20/2009 10:49 PM
To: Hodgess, Erin
Cc: swift-user at ci.uchicago.edu
Subject: Re: [Swift-user] using swift on a cluster
Hi Erin,
I'm assuming you meant "use Swift to run jobs on the compute nodes of
the cluster"?
If so, you first need to find out what scheduler (also called "batch
system" or "local resource manager") the cluster is running.
Thats typical one of these: PBS, Condor, or SGE.
Either ask your system administrator, or see if the "man" command or
similar probes give you a clue:
Condor: condor_q -version
condor_q -version
$CondorVersion: 7.2.4 Jun 16 2009 BuildID: 159529 $
$CondorPlatform: I386-LINUX_RHEL5 $
PBS: man qstat:
qstat(1B) PBS
SGE: man qstat:
QSTAT(1) Sun Grid Engine User Commands
If its PBS or Condor, then the Swift user guide gives the sites.xml
entries to use.
Tell us what you find, then try following the instructions in the user
guide, and follow up with questions as needed.
- Mike
On 10/20/09 9:41 PM, Hodgess, Erin wrote:
> Hi Swift Users:
>
> I'm on a cluster and would like to use swift on the different sites on
> the cluster.
>
> How would I do that, please?
>
> Thanks,
> Erin
>
>
> Erin M. Hodgess, PhD
> Associate Professor
> Department of Computer and Mathematical Sciences
> University of Houston - Downtown
> mailto: hodgesse at uhd.edu
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Swift-user mailing list
> Swift-user at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20091021/f3a4d729/attachment.html>
More information about the Swift-user
mailing list