[Swift-user] Exception in getFile

Jing Tie tiejing at gmail.com
Wed Aug 22 15:48:52 CDT 2007


Hi,

I have tried globusrun. It succeed.
globusrun -b -r antaeus.hpcc.ttu.edu/jobmanager-lsf -f HelloRSL

HelloRSL:
&
(executable = /mnt/lustre/antaeus/apps/test_files/myscript.sh)
(stdout = /mnt/lustre/antaeus/apps/test_files/HelloRSL.output)
(stderr = /mnt/lustre/antaeus/apps/test_files/HelloRSL.output)

myscript.sh:
#! /bin/bash
echo "I'm process id $$ on" `hostname`
date
echo "Running as binary $0" "$@"
echo "Done."

output:
Successfully completed.
Resource usage summary:
    CPU time   :      0.18 sec.
    Max Memory :         2 MB
    Max Swap   :        11 MB

    Max Processes  :         1
    Max Threads    :         1
The output (if any) follows:
I'm process id 4538 on compute-10-16.local
Wed Aug 22 15:33:30 CDT 2007
Running as binary /mnt/lustre/antaeus/apps/test_files/myscript.sh
Done.

Thanks,
Jing

On 8/21/07, Mihael Hategan <hategan at mcs.anl.gov> wrote:
> It doesn't look like the application runs.
>
> Can you try a globusrun or cog-job-submit with some dummy sleep job and
> see if it works (i.e. if you can see it with bjobs/bhist)?
>
> On Tue, 2007-08-21 at 16:42 -0500, Jing Tie wrote:
> > Hi,
> >
> > I tried SID application on TTU-ANTAEUS site. It works fine with
> > jobmanager, but has "exception in getFile" problem with
> > jobmanager-lsf. btw: globus-job-run
> > antaeus.hpcc.ttu.edu/jobmanager-lsf /bin/hostname works fine.
> >
> > site: TTU-ANTAEUS
> > gatekeeper: antaeus.hpcc.ttu.edu
> > app_dir: /mnt/lustre/antaeus/apps
> > data_dir: /mnt/hep/osg
> > lsf_dir: /opt/lsfhpc/6.2/linux2.6-glibc2.3-x86_64/bin
> > R_dir: /mnt/lustre/antaeus/apps/R-2.5.1/bin
> >
> > --------------
> > output:
> > cwtsmall failed
> > Provenance graph saved in sid-wf1-lyk35d4m9l2y0.dot
> > The following errors have occurred:
> > 1. Application "cwtsmall" failed (Exception in getFile
> > Caused by:
> >         Server refused performing the request. Custom message:  (error
> > code 1) [Nested exception message:  Custom message: Unexpected reply:
> > 500-Command failed. :
> > globus_gridftp_server_file.c:globus_l_gfs_file_send:2190:
> > 500-globus_l_gfs_file_open failed.
> > 500-globus_gridftp_server_file.c:globus_l_gfs_file_open:1694:
> > 500-globus_xio_register_open failed.
> > 500-globus_xio_file_driver.c:globus_l_xio_file_open:438:
> > 500-Unable to open
> > file /mnt/hep/osg/sid-wf1-lyk35d4m9l2y0/shared//101-FBchannel15_cwt-
> > avgResults.Rdata
> > 500-globus_xio_file_driver.c:globus_l_xio_file_open:381:
> > 500-System error in open: No such file or directory
> > 500-globus_xio: A system call failed: No such file or directory
> > 500 End.])
> >         Arguments: "scripts/runWaveletsAvg.R, 101, FB"
> >         Host: TTU-ANTAEUS
> >         Directory: sid-wf1-lyk35d4m9l2y0/cwtsmall-91u714gi
> >         STDERR:
> >         STDOUT:
> > ----------------------------
> >
> > But there is only one directory under
> > $data_dir/sid-wf1-lyk35d4m9l2y0/, i.e. shared, and no output files are
> > found.
> >
> > Do I miss some special configurations for LSF?
> >
> > Thanks a lot,
> > Jing
> >
> >
> > _______________________________________________
> > Swift-user mailing list
> > Swift-user at ci.uchicago.edu
> > http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
>
>



More information about the Swift-user mailing list