[Swift-user] hung submission
Altaweel, Mark
m.altaweel at ucl.ac.uk
Sun May 3 15:18:10 CDT 2015
If I run qsub on the script, I get the same error message:
job_number: 6597054
exec_file: job_scripts/6597054
submission_time: Sun May 3 21:15:23 2015
owner: tcrnma3
uid: 147447
group: users
gid: 1002
sge_o_home: /home/tcrnma3/
sge_o_log_name: tcrnma3
sge_o_path: /shared/ucl/apps/mrxvt/0.5.4/bin:/shared/ucl/apps/nedit/5.6/bin:/shared/ucl/apps/gerun/i:/usr/mpi/qlogic//sbin:/usr/mpi/qlogic//bin:/usr/lib64/qt-3.3/bin:/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin:/sbin:/usr/sbin:/shared/ucl/apps/bin:/cm/shared/apps/intel/toolkit/Compiler/11.1/072//bin:/cm/shared/apps/intel/toolkit/Compiler/11.1/072//bin/intel64:/cm/shared/apps/sge/6.2u3/bin/lx26-amd64:/home/tcrnma3//bin
sge_o_shell: /bin/bash
sge_o_workdir: /imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts
sge_o_host: login06
account: ucl_jsv4h;S=0;T=1.0;W=1.0;X=1.0;Y=1.0;V=0;Z=1.0;U=1.0
stderr_path_list: NONE:NONE:/imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts/SGE7948718974736431209.submit.stderr
hard resource_list: batch=true,bonus=0,h_rt=540,jcs=0,jct=1,jcu=1,jcv=0,jcw=1,jcx=1,jcy=1,jcz=1,maxversion=2,memory=1M,penalty=604801,s_rt=530
mail_list: tcrnma3 at login06.data.legion.ucl.ac.uk
notify: FALSE
job_name: B0503-3707460-0
stdout_path_list: NONE:NONE:/imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts/SGE7948718974736431209.submit.stdout
jobshare: 0
restart: n
shell_list: NONE:/bin/ksh
env_list: WORKER_LOGGING_LEVEL=NONE,XAUTHORITY=/scratch/scratch/tcrnma3/.Xauthority,PAID=0,GPU=0,OMP_NUM_THREADS=1,MICCOUNT=0,SCRATCH_SPACE=10737418240,MEMPERSLOT=1048576,SGE_SHARENODE=1,IFS=
script_file: SGE7948718974736431209.submit
project: AllUsers
error reason 1: 05/03/2015 21:15:57 [147447:18805]: error: can't open output file "/imports/home1/tcrnma3/Scratch/Ur
scheduling info: (Collecting of scheduler job information is turned off)
Mark
On May 3, 2015, at 9:06 PM, Mihael Hategan <hategan at mcs.anl.gov> wrote:
It seems more likely that the error message gets truncated
rather than the path itself. After all, stdout_path_list does contain
what seems to be the correct path.
There should be a
script: /imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts/SGE7948718974736431209.submit
(or similar) that should be available while a swift run is in progress.
I think one way to troubleshoot things would be to copy that script and
submit it manually.
Mihael
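A minimal sketch of the troubleshooting step suggested above, assuming the generated submit script only exists while the Swift run is in progress and so needs to be copied somewhere else before it can be resubmitted. The helper name keep_submit_script and the destination directory are hypothetical and are not part of Swift or SGE.

```shell
# Hypothetical helper (not part of Swift or SGE): copy a generated submit
# script to a separate directory so it can be inspected and resubmitted by hand.
keep_submit_script() {
    src="$1"
    dest_dir="$2"
    # Refuse to continue if the script is already gone.
    [ -f "$src" ] || { echo "no such script: $src" >&2; return 1; }
    mkdir -p "$dest_dir" || return 1
    cp -p "$src" "$dest_dir/" || return 1
    # Print the location of the copy so the caller can pass it to qsub.
    echo "$dest_dir/$(basename "$src")"
}

# Example use with the script path quoted in this thread (the destination
# directory is an assumption), run while the Swift run is still active:
# copy=$(keep_submit_script /imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts/SGE7948718974736431209.submit "$HOME/submit-debug")
# qsub "$copy"
```

Submitting the copy by hand should make it possible to see whether the same "can't open output file" error appears outside of Swift.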
On Sun, 2015-05-03 at 19:20 +0000, Altaweel, Mark wrote:
Yes, I do import Swift in the shell script that gets distributed. However, the outcome seems to be the same. I don't understand why it truncates the path, unless the full path is there and only a certain number of characters are written.
This is added to the script:
export PATH=$PATH:~/Scratch/swift-0.96-sge-mod/bin
module load java/1.7.0_45
So Java is included. If I remove it, the same thing happens though.
Mark
On May 3, 2015, at 8:07 PM, Mihael Hategan <hategan at mcs.anl.gov> wrote:
On Sun, 2015-05-03 at 18:43 +0000, Altaweel, Mark wrote:
error reason 1: 05/03/2015 19:38:15 [147447:22761]: error: can't open output file "/imports/home1/tcrnma3/Scratch/Ur
... aaand my PE suggestion had little to do with the problem.
Is /imports mounted on compute nodes?
Mihael
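One way to answer the mount question could be a small check like the one below, run once on the login node and once from inside a job on a compute node, then compared. This is only a sketch: check_output_dir is a hypothetical helper, and the example path is the working directory quoted in this thread.

```shell
# Hypothetical helper (not part of Swift or SGE): report whether a directory
# exists and is writable from the machine where this runs.
check_output_dir() {
    dir="$1"
    if [ ! -d "$dir" ]; then
        echo "missing: $dir"
        return 1
    fi
    if [ ! -w "$dir" ]; then
        echo "not writable: $dir"
        return 2
    fi
    echo "ok: $dir"
}

# Example with the working directory quoted in this thread:
# check_output_dir /imports/home1/tcrnma3/Scratch/UrbanModel/run005/scripts
```

If the directory shows as "ok" on the login node but "missing" inside a job, that would point to /imports not being visible on the compute nodes. In that case the job's own output would need to go to a location that is known to be reachable from the compute nodes.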
_______________________________________________
Swift-user mailing list
Swift-user at ci.uchicago.edu
https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user