[mpich-discuss] ROMIO with SGI MPI MPI_TYPE_MAX

burlen burlen.loring at gmail.com
Sat Feb 27 13:30:07 CST 2010


Apparently it's still an issue, the cluster in question is an SGI Altix 
ICE system only a couple of years old. The version of SGI MPI is 1.23. 
Does anyone remember details about the problem?

Rajeev Thakur wrote:
> That blurb has been in the user guide since 1997! 
>
> Rajeev
>
>   
>> -----Original Message-----
>> From: mpich-discuss-bounces at mcs.anl.gov 
>> [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of burlen
>> Sent: Friday, February 26, 2010 3:46 PM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: [mpich-discuss] ROMIO with SGI MPI MPI_TYPE_MAX
>>
>> Hi All,
>>
>> I have been running into some issues when using SGI MPI and 
>> collective 
>> IO. My program always crashes with the error
>>
>>     MPI has run out of internal datatype entries.
>>
>> It's quite quick with the default 8192, upping the value does prolong 
>> the life of the program but it will still crash after about 
>> 300 ish type 
>> commit/read/type free operations. Which to me doesn't seem 
>> like a very 
>> large number for an HPC app.
>>
>> Also I have found a way to reproduce this after only two 
>> explicit type 
>> commit in my application. There are some point to point 
>> communication as 
>> well but none of them commit any types. And we're talking 
>> about only a 
>> handful of MPI calls issued. This reproduction depends on how many 
>> process per node are scheduled , eg 1 per core. I have been 
>> careful to 
>> see that I'm not leaking types, so my conclusion is that SGI MPI is 
>> leaking types internally.
>>
>> I was surprised to see a blurb about just this issue in ROMIO 
>> user manual.
>>
>> Can anyone out there shed some light on this?
>>
>> Thanks
>> Burlen
>>
>> . When using ROMIO with SGI MPI, you may sometimes get an 
>> error message 
>> from SGI
>> MPI: "MPI has run out of internal datatype entries. Please set the 
>> environment variable
>> MPI TYPE MAX for additional space." If you get this error 
>> message, add 
>> the following line to
>> your .cshrc ?le:
>> setenv MPI TYPE MAX 65536
>> Use a larger number if you still get the error message.
>>
>> _______________________________________________
>> mpich-discuss mailing list
>> mpich-discuss at mcs.anl.gov
>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>>
>>     
>
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>   



More information about the mpich-discuss mailing list