[Swift-devel] Problems running coaster
Michael Wilde
wilde at mcs.anl.gov
Mon Jul 28 13:13:56 CDT 2008
I tried NCSA-HG instead (to avoid the issues of the UC two-architecture
site).
The coaster provider works for me there, using gt2:gt2:pbs, as you
reported previously. Same Swift script that fails on abe.
So it looks like something in the job specs that is launching coaster
for gt2:pbs is not being accepted by abe.
I also see that these log messages which I mentioned earlier do not
occur on the successful mercury coaster run:
--
2008-07-28 08:22:58,853-0500 INFO ServiceManager Service task
Task(type=JOB_SUBMISSION, identity=urn:cog-1217251364678) terminated.
Removing service.
2008-07-28 08:22:58,853-0500 INFO ServiceManager Service does not
appear to be registered with this manager
--
If someone can point me closer to where the swift job that launches the
bootstrap script is run from, and how, I can try to reproduce the
problem with globus-job-run or globus-run. In the meantime I will hunt
for that.
- Mike
On 7/28/08 11:52 AM, Michael Wilde wrote:
>
>
> On 7/28/08 9:31 AM, Ben Clifford wrote:
>> On Sun, 27 Jul 2008, Michael Wilde wrote:
>>
>>> Yes. Just *before* I tried abe with coasters, I did a simple
>>> globus-job-run to
>>> its pbs jobmanager. That worked fine.
>>
>> try submitting with the following changes to sites.xml:
>>
>> 1. not using coasters, instead using the gt2 provider
>> 2. not specifying tg allocation in sites.xml
>>
>
> Using globus-job-run I see that only 1 account is valid.
>
> globus-job-run works to jobmanager-pbs, both with and without a -p
> option specifying the valid account.
>
> Swift using the gt2 provider also works with and without the valid
> account specified in sites.xml as a globus profile.
>
> Swift and the coaster provider with gt2:gt2:pbs fails both without the
> valid account (it gets the same "This node is in dedicated user mode"
> message)
>
> In terms of sites.xml:
>
> This fails (using coaster provider):
>
> <config>
> <pool handle="abe" >
> <execution provider="coaster" url="grid-abe.ncsa.teragrid.org"
> jobManager="gt2:gt2:pbs" />
> <gridftp url="gsiftp://gridftp-abe.ncsa.teragrid.org"/>
> <workdirectory>/u/ac/wilde/swiftwork</workdirectory>
> </pool>
> </config>
>
> This works (using gt2 provider):
>
> <config>
> <pool handle="abe">
> <jobmanager universe="vanilla"
> url="grid-abe.ncsa.teragrid.org/jobmanager-pbs" major="2"/>
> <gridftp url="gsiftp://gridftp-abe.ncsa.teragrid.org"/>
> <workdirectory>/u/ac/wilde/swiftwork</workdirectory>
> </pool>
> </config>
>
> I will try the same to a different TG site (starting with UC).
>
> - Mike
>
>
More information about the Swift-devel
mailing list