[Swift-devel] Swift did not make progress with high throtteling rate
Justin M Wozniak
wozniak at mcs.anl.gov
Tue Apr 10 14:22:29 CDT 2012
Hi Emalayan
Are you saying that this case does run with the default throttles
and fails with jobThrottle=1000?
I just took a look at the log file. It looks like the jobs do get
scheduled. Are there any -info files to look at?
Justin
On Tue, 10 Apr 2012, Emalayan Vairavanathan wrote:
> Hi All,
>
> I tired to run my pipeline-swift benchmark on GPFS+PVFS with 128 compute
> nodes (Surveyor), JOB_THROTTLE = 1000 and JOBS_PER_NODE = 4.
>
> I used GPFS as the central storage and PVFS as the intermediate storage.
> The benchmark did not make any progress and I found the following
> messages in the log file. (This happened even with MosaStore)
>
>
> 2012-04-10 18:18:36,710+0000 WARN HangChecke No events in 10s.
> 2012-04-10 18:18:36,717+0000 WARN HangChecker
> Registered futures:
> file stage_2_output - F/stage_2_output[95]:file - Open
> file stage_1_output - F/stage_1_output[85]:file - Open
> file stage_3_output - F/stage_3_output[62]:file - Open
> file stage_3_output - F/stage_3_output[44]:file - Open
> file stage_1_output - F/stage_1_output[4]:file - Open
> file stage_2_output - F/stage_2_output[3]:file - Open
> file input_data - F/input_data[121]:file - Open
> file stage_1_output - F/stage_1_output[113]:file - Open
> file stage_1_output - F/stage_1_output[98]:file - Open
>
> I am using the swift version that I took from Justin's home directory 3 weeks before.
>
>
> Do you have any idea ? Does swift has problem with high throttling rate / jobs-per-node ? I have attached swift log file and the benchmark with this mail. I highly appreciate your suggestions.
>
>
> Thank you
> Emalayan
--
Justin M Wozniak
More information about the Swift-devel
mailing list