[mpich-discuss] reading Ascii files using parallel I/o
Brock Palen
brockp at umich.edu
Mon May 23 10:03:43 CDT 2011
Hadoop is an interesting option, but it requires a lot of infrastructure.
Sreenivas,
If you want to chat about what you are doing, send an email to the CAC support address I sent you in our internal ticket. We can set up a time to chat about options at Michigan.
Brock Palen
www.umich.edu/~brockp
Center for Advanced Computing
brockp at umich.edu
(734)936-1985
On May 23, 2011, at 10:12 AM, bob ilgner wrote:
> Have you considered using Hadoop ?
>
> http://hadoop.apache.org/
>
> Depending on how much you want to scale, this may be worth your while.
> Otherwise I agree with the mail below.
>
>
>
> On Mon, May 23, 2011 at 3:55 PM, Brock Palen <brockp at umich.edu> wrote:
>> Sreenivas,
>>
>> Note that binary IO tends to be faster than ASCII, since it takes less space for the same amount of data.
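
To make the size difference concrete, here is a small illustrative comparison in plain Python. It is not from the thread: the sample values and the choice of 17 significant digits for the text form are assumptions. A double takes a fixed 8 bytes in native binary form, while a decimal text form with enough digits to round-trip exactly is usually longer and also has to be parsed on read. For data such as small integers the text form can be the shorter one, so this is a tendency rather than a rule.

```python
import struct

# Hypothetical sample data: three double-precision values.
values = [3.141592653589793, 2.718281828459045, 1.4142135623730951]

# Native binary: a fixed 8 bytes per double.
binary = struct.pack(f"{len(values)}d", *values)

# ASCII: 17 significant digits are enough to round-trip a double;
# one value per line.
text = "".join(f"{v:.17g}\n" for v in values).encode("ascii")

# Reading the text back requires parsing every number.
parsed = [float(tok) for tok in text.split()]

print(len(binary), len(text))
```

Both encodings recover the same values; the binary buffer is smaller and needs no parsing step.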
>>
>> You may also want to look at the hdf5/*-parallel modules on Nyx/Flux on campus. I find parallel HDF5 IO simpler and more interoperable (the files can be read and modified in MATLAB/IDL with little effort) than invoking MPI-IO directly.
>>
>> Just my own $0.02; others with more parallel IO experience may have other input.
>>
>>
>> Brock Palen
>> www.umich.edu/~brockp
>> Center for Advanced Computing
>> brockp at umich.edu
>> (734)936-1985
>>
>>
>>
>> On May 23, 2011, at 1:00 AM, Rajeev Thakur wrote:
>>
>>> MPI-IO doesn't support formatted I/O, i.e., the equivalent of fprintf and fscanf. However, you can read/write unformatted character buffers.
>>>
>>> Rajeev
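
One way to act on this, sketched under assumptions the thread does not state: each rank reads a contiguous byte range of the file as a raw character buffer (in an MPI program, for example with MPI_File_read_at), keeps only the lines that start inside its range, and parses them itself. The plain-Python sketch below has no MPI dependency and shows only the offset arithmetic and newline alignment. The names `byte_range` and `lines_for_rank` are hypothetical, and an in-memory bytes object stands in for the file.

```python
def byte_range(size, nranks, rank):
    """Split `size` bytes into `nranks` nearly equal contiguous ranges."""
    base, extra = divmod(size, nranks)
    start = rank * base + min(rank, extra)
    end = start + base + (1 if rank < extra else 0)
    return start, end


def lines_for_rank(data, nranks, rank):
    """Return the complete lines whose first byte lies in this rank's range."""
    start, end = byte_range(len(data), nranks, rank)
    # A line that began in the previous rank's range belongs to that rank:
    # skip forward to the first line start at or after `start`.
    if start > 0 and data[start - 1:start] != b"\n":
        nl = data.find(b"\n", start)
        start = len(data) if nl == -1 else nl + 1
    # Finish the last owned line even if it spills past `end`.
    if 0 < end < len(data) and data[end - 1:end] != b"\n":
        nl = data.find(b"\n", end)
        end = len(data) if nl == -1 else nl + 1
    if start >= end:
        return []
    return data[start:end].splitlines()


# Stand-in for the ASCII input file: two numbers per line.
sample = b"1.5 2.5\n3.0 4.0\n5.25 6.75\n7 8\n"

for rank in range(3):
    rows = [[float(tok) for tok in line.split()]
            for line in lines_for_rank(sample, 3, rank)]
    print(rank, rows)
```

Because the start and end of every range are aligned by the same rule, each line is owned by exactly one rank. In a real program a rank would have to read past its nominal end offset to finish its last line. Writing ASCII output in parallel would follow the reverse pattern: each rank formats its own text buffer, and the ranks agree on file offsets from the buffer lengths before writing.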
>>>
>>>
>>>> Date: Sat, 21 May 2011 23:48:57 -0400
>>>> From: <vvvnew at umich.edu>
>>>> Subject: [mpich-discuss] reading Ascii files using parallel I/o
>>>> To: mpich-discuss at mcs.anl.gov
>>>> Message-ID: <7f8c0ee756294fb644ab193ff0338e1c at umich.edu>
>>>> Content-Type: text/plain; charset="UTF-8"
>>>>
>>>>
>>>> Hi,
>>>> I want to know if there's a general way to read and write ASCII data
>>>> using parallel input/output. I know there are only three MPI data
>>>> representations, 'native', 'internal', and 'external32', none of which
>>>> supports ASCII data. Since all my input data is in ASCII, and I also need
>>>> the output to be in ASCII for interoperability with other software, can
>>>> someone suggest an efficient way around this problem? Right now I am
>>>> sending all the data in and out through one node, which makes the scaling
>>>> pretty bad for large datasets, so parallel I/O with ASCII would make my
>>>> life a whole lot easier.
>>>>
>>>> Thanks
>>>>
>>>> Sreenivas
>>>
>>> _______________________________________________
>>> mpich-discuss mailing list
>>> mpich-discuss at mcs.anl.gov
>>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>>>
>>>