[Swift-devel] swift pbs/beagle broken

Michael Wilde wilde at mcs.anl.gov
Sun Nov 13 00:54:56 CST 2011


OK, I don't need these; I can reproduce the problem as well.

For some reason, the coaster worker is exiting immediately.

I see a few possibilities:

- Beagle networking may have changed, making it no longer possible to reach the coaster service from the compute nodes using the previous IP address ranges.

- The worker.pl script is not being created in $HOME/.globus/coasters.
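
Both possibilities above can be checked by hand from a compute node. A minimal sketch, assuming Python is available there: the COASTER_HOST and COASTER_PORT environment variables and the default port are placeholders I made up for illustration, and the check matches any *.pl file because the exact script name may differ between Swift versions.

```python
import glob
import os
import socket


def find_worker_scripts(home=None):
    """Return any Perl scripts found under <home>/.globus/coasters."""
    home = home or os.path.expanduser("~")
    coaster_dir = os.path.join(home, ".globus", "coasters")
    # Match any *.pl file, since the exact file name may vary by Swift version.
    return sorted(glob.glob(os.path.join(coaster_dir, "*.pl")))


def can_reach(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable networks.
        return False


if __name__ == "__main__":
    # Hypothetical settings: substitute the address and port that the
    # coaster service actually advertises (neither comes from this thread).
    service_host = os.environ.get("COASTER_HOST", "127.0.0.1")
    service_port = int(os.environ.get("COASTER_PORT", "50000"))

    scripts = find_worker_scripts()
    print("worker scripts:", scripts if scripts else "none found")
    print("service reachable:", can_reach(service_host, service_port))
```

If no script is listed, the second possibility is the likelier one; if the script is there but the connection fails, the networking change is the more likely cause.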

Mike


----- Original Message -----
> From: "Michael Wilde" <wilde at mcs.anl.gov>
> To: "Ketan Maheshwari" <ketancmaheshwari at gmail.com>
> Cc: "Swift Devel" <swift-devel at ci.uchicago.edu>
> Sent: Saturday, November 12, 2011 8:39:36 PM
> Subject: Re: [Swift-devel] swift pbs/beagle broken
> Ketan, can you post the submit script and site file?
> 
> On 11/12/11, Ketan Maheshwari <ketancmaheshwari at gmail.com> wrote:
> > Hi,
> >
> > It seems the pbs-coaster provider (local:pbs) is broken for swift. I
> > tried swift trunk, the 0.93 svn branch, 0.93RC3 and 0.93RC4, but I get
> > the same response each time:
> >
> > Swift svn swift-r5205 cog-r3293
> >
> > RunID: 20111113-0216-1d35h7eb
> > Progress: time: Sun, 13 Nov 2011 02:16:54 +0000
> > site setting workersPerNode has been replaced with jobsPerNode!
> > Progress: time: Sun, 13 Nov 2011 02:17:05 +0000 Active:1
> > Failed to transfer wrapper log for job cat-1hg8aoik
> > Exception in cat:
> > Arguments: [data.txt]
> > Host: pbs
> > Directory: catsn-20111113-0216-1d35h7eb/jobs/1/cat-1hg8aoik
> > stderr.txt:
> >
> > stdout.txt:
> >
> > ----
> >
> > Caused by: Task failed: 1113-160254-000000 Block task ended prematurely
> >
> >
> > Final status: time: Sun, 13 Nov 2011 02:17:05 +0000 Failed:1
> > The following errors have occurred:
> > 1. Task failed: 1113-160254-000000 Block task ended prematurely
> >
> >
> >
> > Running the submit script outside of swift does not work either.
> > The script gets submitted to the queue and exits immediately without
> > writing anything to stdout or stderr.
> >
> > Were there any recent changes that could have affected this?
> >
> > I remember trying this successfully in the last week of last
> > month.
> >
> > Regards,
> > --
> > Ketan
> >
> 
> --
> Sent from my mobile device

-- 
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory



