[MPICH] slow IOR when using fileview

Wei-keng Liao wkliao at ece.northwestern.edu
Sun Jul 1 22:56:23 CDT 2007


I tried 3 cases: passing the info to open only, to setview only, and to 
both open and setview. None of them made any difference.

Wei-keng
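
A minimal sketch of the three variants above, assuming a hypothetical file 
name and filetype and using the cb_buffer_size hint mentioned later in this 
thread (the actual IOR-derived test code is attached to the original 
message and not reproduced here):

    #include <mpi.h>

    /* Sketch only: "testfile" and `filetype` are placeholders. */
    static void open_with_hints(MPI_Comm comm, MPI_Datatype filetype,
                                MPI_File *fh)
    {
        MPI_Info info;
        MPI_Info_create(&info);
        MPI_Info_set(info, "cb_buffer_size", "10485760");

        /* "open only":    pass info here, MPI_INFO_NULL to set_view  */
        /* "setview only": pass MPI_INFO_NULL here, info to set_view  */
        /* "both":         pass the same info to both calls, as below */
        MPI_File_open(comm, "testfile", MPI_MODE_CREATE | MPI_MODE_WRONLY,
                      info, fh);
        MPI_File_set_view(*fh, 0, MPI_BYTE, filetype, "native", info);

        MPI_Info_free(&info);
    }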


On Sun, 1 Jul 2007, Rajeev Thakur wrote:

> Did you pass the info object to MPI_File_set_view, not just MPI_File_open?
>
> Rajeev
>
>> -----Original Message-----
>> From: owner-mpich-discuss at mcs.anl.gov
>> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Wei-keng Liao
>> Sent: Sunday, July 01, 2007 7:55 PM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: RE: [MPICH] slow IOR when using fileview
>>
>>
>> I just tried the cb_buffer_size 10485760 hint and got similar results.
>>
>> I agree that the cost of using a datatype should be negligible,
>> especially for this I/O pattern. Note that both cases call MPI
>> collective writes, but since the access patterns are not interleaved,
>> ROMIO will internally call the independent subroutines to write to the
>> file. Still, I could not see any reason for such a big performance
>> difference.
>>
>> Also, I tested the code on a different machine, and there both cases
>> had about the same timing. I tried MPICH 1.2.7, MPICH2 1.0.2, and
>> MPICH2 1.0.5. I wonder if this only happens on the Cray. I will report
>> this to the system people to see what they think.
>>
>> Wei-keng
>>
>>
>> On Sun, 1 Jul 2007, Rajeev Thakur wrote:
>>
>>> In these other cases, the code assumes that the datatype represents a
>>> noncontiguous access pattern and goes through the motions of
>>> collective I/O when it actually is not necessary. That costs more,
>>> but it should not be this much more.
>>>
>>> Rajeev
>>>
>>>> -----Original Message-----
>>>> From: owner-mpich-discuss at mcs.anl.gov
>>>> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Wei-keng Liao
>>>> Sent: Sunday, July 01, 2007 3:25 PM
>>>> To: mpich-discuss at mcs.anl.gov
>>>> Subject: RE: [MPICH] slow IOR when using fileview
>>>>
>>>>
>>>> I tried different datatype constructors. Compared to the case
>>>> without using MPI_File_set_view(), the results are:
>>>>    MPI_Type_create_subarray()  ---- much slower
>>>>    MPI_Type_indexed()  ---- much slower
>>>>    MPI_Type_vector() with explicit offset in setview  ---- much slower
>>>>    MPI_Type_contiguous() with explicit offset in setview  ---- same
>>>>    MPI_BYTE with explicit offset in setview  ---- same
>>>>
>>>> The MPICH there is MPICH2 1.0.2; I cannot test newer versions on
>>>> that machine.
>>>>
>>>> Wei-keng
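
A rough sketch of how the "much slower" and "same" variants listed above 
can describe the same access pattern (each rank writing one contiguous 
10 MB block at offset rank * 10 MB); the sizes and names are illustrative 
assumptions, not the attached test code:

    #include <mpi.h>

    #define LEN (10*1024*1024)          /* 10 MB per process */

    /* "same" variant: contiguous filetype, offset given in set_view */
    static void view_contiguous(MPI_File fh, int rank)
    {
        MPI_Datatype ftype;
        MPI_Type_contiguous(LEN, MPI_BYTE, &ftype);
        MPI_Type_commit(&ftype);
        MPI_File_set_view(fh, (MPI_Offset)rank * LEN, MPI_BYTE, ftype,
                          "native", MPI_INFO_NULL);
        MPI_Type_free(&ftype);
    }

    /* "much slower" variant: subarray filetype, displacement 0       */
    /* (the int arguments limit this sketch to small nprocs * LEN)    */
    static void view_subarray(MPI_File fh, int rank, int nprocs)
    {
        MPI_Datatype ftype;
        int sizes[1]    = { nprocs * LEN };
        int subsizes[1] = { LEN };
        int starts[1]   = { rank * LEN };
        MPI_Type_create_subarray(1, sizes, subsizes, starts, MPI_ORDER_C,
                                 MPI_BYTE, &ftype);
        MPI_Type_commit(&ftype);
        MPI_File_set_view(fh, 0, MPI_BYTE, ftype, "native",
                          MPI_INFO_NULL);
        MPI_Type_free(&ftype);
    }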
>>>>
>>>>
>>>> On Sun, 1 Jul 2007, Rajeev Thakur wrote:
>>>>
>>>>> Can you see what happens if you use type_indexed instead of
>>>>> type_subarray?
>>>>>
>>>>> Rajeev
>>>>>
>>>>>> -----Original Message-----
>>>>>> From: owner-mpich-discuss at mcs.anl.gov
>>>>>> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Wei-keng Liao
>>>>>> Sent: Saturday, June 30, 2007 2:03 AM
>>>>>> To: mpich-discuss at mcs.anl.gov
>>>>>> Subject: [MPICH] slow IOR when using fileview
>>>>>>
>>>>>>
>>>>>> I am experiencing slow IOR performance on a Cray XT3 when using the
>>>>>> fileview option. I extracted the code into a simpler version
>>>>>> (attached). The code compares two collective writes:
>>>>>> MPI_File_write_all and MPI_File_write_at_all. The former uses an
>>>>>> MPI fileview and the latter uses an explicit file offset. In both
>>>>>> cases, each process writes 10 MB to a shared file: contiguous,
>>>>>> non-overlapping, and non-interleaved. On the Cray XT3 with the
>>>>>> Lustre file system, the former is dramatically slower than the
>>>>>> latter. Here is the output for a run with 8 processes:
>>>>>>
>>>>>> 2: MPI_File_write_all() time = 4.72 sec
>>>>>> 3: MPI_File_write_all() time = 4.74 sec
>>>>>> 6: MPI_File_write_all() time = 4.77 sec
>>>>>> 1: MPI_File_write_all() time = 4.79 sec
>>>>>> 7: MPI_File_write_all() time = 4.81 sec
>>>>>> 0: MPI_File_write_all() time = 4.83 sec
>>>>>> 5: MPI_File_write_all() time = 4.85 sec
>>>>>> 4: MPI_File_write_all() time = 4.89 sec
>>>>>> 2: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 1: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 3: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 0: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 6: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 4: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 7: MPI_File_write_at_all() time = 0.02 sec
>>>>>> 5: MPI_File_write_at_all() time = 0.02 sec
>>>>>>
>>>>>> I tried the same code on other machines and different file systems
>>>>>> (e.g., PVFS), and the timings for the two cases were very close to
>>>>>> each other. If anyone has access to a Cray XT3 machine, could you
>>>>>> please try it and let me know?
>>>>>> Thanks.
>>>>>>
>>>>>> Wei-keng
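
For readers without the attachment, a rough sketch of the two calls being 
compared (variable names, buffer setup, and error handling are assumptions, 
not the attached code):

    #include <mpi.h>

    /* `filetype` is assumed to already describe this rank's 10 MB block. */
    static void write_both_ways(MPI_File fh, MPI_Datatype filetype,
                                char *buf, int len, int rank)
    {
        MPI_Status status;

        /* Case 1: install a fileview, then collective write through it */
        MPI_File_set_view(fh, 0, MPI_BYTE, filetype, "native",
                          MPI_INFO_NULL);
        MPI_File_write_all(fh, buf, len, MPI_BYTE, &status);

        /* Case 2: keep the default byte view, pass the offset explicitly */
        MPI_File_set_view(fh, 0, MPI_BYTE, MPI_BYTE, "native",
                          MPI_INFO_NULL);
        MPI_File_write_at_all(fh, (MPI_Offset)rank * len, buf, len,
                              MPI_BYTE, &status);
    }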
>>>>>>
>>>>>
>>>>
>>>>
>>>
>>
>>
>



