[Swift-devel] Lammps on BGQ: task completes but status shows active

Ketan Maheshwari ketan at mcs.anl.gov
Mon Dec 8 10:29:08 CST 2014


Hi Mihael, All,

Could you help debug this issue?

I am running LAMMPS on BG/Q (Cetus) with the coaster provider.

The symptom is that the LAMMPS task completes, but Swift still thinks it is
running and continues to show "Active" status. The worker logs also show the
task as running, and the _wrapperlog is stalled in the EXECUTE stage.

The script (bg.sh), which runs in qsub "script" mode, invokes runjob, and it
seems that the line after runjob is never reached, meaning that runjob does
not return.

The same configuration (qsub in script mode and runjob with the same
parameters) seems to work when run outside of Swift (i.e., the script
exits on completion).
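
One way to confirm that runjob is the call that does not return would be to
bracket it with timestamped log lines. The sketch below is hypothetical and is
not the contents of the attached bg.sh: the names run_traced and TRACE_LOG are
made up for illustration. It also redirects stdin from /dev/null, on the
unconfirmed assumption that a stdin handle inherited from the worker could
keep the command waiting; that is something to try, not a diagnosed cause.

```shell
#!/bin/bash
# Hypothetical debugging helper; NOT the contents of the attached bg.sh.
# TRACE_LOG is an assumed log location, overridable from the environment.
TRACE_LOG="${TRACE_LOG:-./runjob_trace.log}"

run_traced() {
    # Record when the command starts.
    echo "$(date '+%Y-%m-%d %H:%M:%S') start: $*" >> "$TRACE_LOG"
    # Assumption: detach stdin so the command cannot block on an
    # inherited pipe that never reaches end-of-file.
    "$@" < /dev/null
    local rc=$?
    # This line is only written if the command actually returns.
    echo "$(date '+%Y-%m-%d %H:%M:%S') exit=$rc: $*" >> "$TRACE_LOG"
    return "$rc"
}

# Intended use inside the job script (arguments left as a placeholder):
#   run_traced runjob <same parameters as in bg.sh>
```

If the "start" line appears in the log without a matching "exit=" line, the
command itself has not returned; if both appear, the stall is somewhere after
it.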

I am attaching bg.sh and a tarball containing the Swift run directory and
the worker log in DEBUG mode.

Thanks for any help in debugging this further.

Best,
Ketan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: lammps_debug.tgz
Type: application/x-gzip
Size: 210178 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20141208/1e12bb37/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bg.sh
Type: application/x-sh
Size: 1887 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20141208/1e12bb37/attachment.sh>

