[Swift-devel] problem with max num jobs, array entries?
Glen Hocky
hockyg at uchicago.edu
Mon Mar 7 11:31:41 CST 2011
hey Mike, devs
i was wondering if you could help me track something down. i may not have
noticed this before because of the way I was running my jobs but i'm having
a problem running more than ~100 jobs w/ my swift script (with pbs or
pbs+coasters). it just hangs with
"Progress: "
"Progress: "
"Progress: "
in the swift log it just stalls at this point
...
2011-03-07 10:26:53,280-0600 INFO SetFieldValue Set: force=FALSE
2011-03-07 10:26:53,284-0600 INFO VDLFunction FUNCTION: arg()
2011-03-07 10:26:53,285-0600 INFO VDLFunction FUNCTION: toint()
2011-03-07 10:26:53,285-0600 INFO SetFieldValue Set: printfreq=500
2011-03-07 10:26:53,285-0600 INFO VDLFunction FUNCTION: arg()
2011-03-07 10:26:53,285-0600 INFO VDLFunction FUNCTION: toint()
2011-03-07 10:26:53,285-0600 INFO SetFieldValue Set: nmodels=5
2011-03-07 10:26:53,286-0600 INFO VDLFunction FUNCTION: arg()
2011-03-07 10:26:53,286-0600 INFO VDLFunction FUNCTION: toint()
2011-03-07 10:26:53,286-0600 INFO SetFieldValue Set: nsub=20
whereas when i decrease the number of total jobs it goes to
...
2011-03-07 11:04:34,001-0600 INFO SetFieldValue Set: nmodels=4
2011-03-07 11:04:34,001-0600 INFO SetFieldValue Set: temperature=0.9
2011-03-07 11:04:34,002-0600 INFO SetFieldValue Set:
rundir=/home/hockyg/reichman/glassy_dynamics/code/runs/overlaps/replica_exchange/code/swift/run_beagle
2011-03-07 11:04:34,001-0600 INFO SetFieldValue Set: label=1
2011-03-07 11:04:34,001-0600 INFO SetFieldValue Set: radii=unnamed
SwiftScript value.$[]/1
2011-03-07 11:04:34,002-0600 INFO SetFieldValue Set: nsub=24
2011-03-07 11:04:39,581-0600 INFO AbstractDataNode Found data
modelIn.$[]/1.[0][3][19].inputstructure
2011-03-07 11:04:39,581-0600 INFO AbstractDataNode Found data
modelIn.$[]/1.[0][4][17].inputstructure
2011-03-07 11:04:39,582-0600 INFO AbstractDataNode Found data
modelIn.$[]/1.[0][4][18].inputstructure
2011-03-07 11:04:39,582-0600 INFO AbstractDataNode Found data
modelIn.$[]/1.[0][4][19].inputstructure
2011-03-07 11:04:39,582-0600 INFO AbstractDataNode Found data
modelIn.$[]/1.[0][0][20].inputstructure
any ideas of where to look to troubleshoot this?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20110307/a5eb6de1/attachment.html>
More information about the Swift-devel
mailing list