Houston, we have a problem

John Tannahill tannahill1 at llnl.gov
Thu Jul 31 18:48:01 CDT 2003


Jianwei,

You may be running a bit better, but I think you are still having
problems.  The values in the netCDF output file should not be 0, and
diff, delmax, and delmin should be 0.  The read/write timings look
suspicious, and you get a couple of INFs (so do I).  My screen output
is about the same.  The main issue, I think, is that the netCDF data
does not appear to be written out.
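
To illustrate why the timings look suspicious, here is a minimal sketch
of the arithmetic. It assumes the test reports bandwidth as file size
divided by elapsed time; the "eff." figures in the quoted output are
consistent with that, but the actual formula in the test code is not
shown here. The function name and the exact file size constant are
hypothetical, chosen only for illustration.

```python
import math


def bandwidth_mb_per_s(size_mb: float, seconds: float) -> float:
    """Return size/time, or infinity when the measured time is zero.

    Python raises on float division by zero, so the zero case is handled
    explicitly to mimic the INF that the quoted output prints.
    """
    if seconds == 0.0:
        return math.inf
    return size_mb / seconds


# Assumption: 2**27 bytes, which rounds to the printed 1.342E+02 MB.
FILE_SIZE_MB = 134.217728

print(f"{bandwidth_mb_per_s(FILE_SIZE_MB, 0.125):.3f}")   # 1073.742
print(f"{bandwidth_mb_per_s(FILE_SIZE_MB, 0.0625):.3f}")  # 2147.484
print(bandwidth_mb_per_s(FILE_SIZE_MB, 0.0))              # inf
```

Every time in the quoted output is a multiple of 0.0625 s, and the
second column is often exactly 0, which is what yields INF under this
assumption.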

John


Jianwei Li wrote:
> John,
> 
> I did a quick run of your attached Fortran code using pnetcdf 0.8.8
> on SDSC's IBM SP (called bluehorizon). The code ran pretty well
> and generated these outputs:
> 
> #######################################################################
> standard output:
> 
> mype  pe_coords    totsiz_3d         locsiz_3d       kstart,jstart,istart
>   0    0  0  0   256  256  256      16  256  256        0      0      0
>   1    1  0  0   256  256  256      16  256  256       16      0      0
>  13   13  0  0   256  256  256      16  256  256      208      0      0
>   2    2  0  0   256  256  256      16  256  256       32      0      0
>   8    8  0  0   256  256  256      16  256  256      128      0      0
>   5    5  0  0   256  256  256      16  256  256       80      0      0
>   9    9  0  0   256  256  256      16  256  256      144      0      0
>   6    6  0  0   256  256  256      16  256  256       96      0      0
>  10   10  0  0   256  256  256      16  256  256      160      0      0
>   4    4  0  0   256  256  256      16  256  256       64      0      0
>  11   11  0  0   256  256  256      16  256  256      176      0      0
>  12   12  0  0   256  256  256      16  256  256      192      0      0
>  14   14  0  0   256  256  256      16  256  256      224      0      0
>  15   15  0  0   256  256  256      16  256  256      240      0      0
>   3    3  0  0   256  256  256      16  256  256       48      0      0
>   7    7  0  0   256  256  256      16  256  256      112      0      0
> write 1: 2.500E-01 6.250E-02
> write 2: 1.250E-01 0.000E+00
> write 3: 1.250E-01 6.250E-02
> write 4: 1.875E-01 0.000E+00
> write 5: 1.250E-01 0.000E+00
>  read 1: 6.250E-02 0.000E+00
> diff, delmax, delmin = 1.009E+00 1.738E+00 1.701E-02
>  read 2: 6.250E-02 0.000E+00
>  read 3: 6.250E-02 0.000E+00
>  read 4: 6.250E-02 0.000E+00
>  read 5: 6.250E-02 0.000E+00
> File size:  1.342E+02 MB
>     Write:       INF MB/s  (eff.,  1073.742 MB/s)
>     Read :       INF MB/s  (eff.,  2147.484 MB/s)
> Total number PEs:   16
>   1.250E-01  0.000E+00 1073.742  6.250E-02  0.000E+00 2147.484
> 
> ##########################################################################
> netcdf file <pnf_test.nc>:
> ncdump pnf_test.nc | more
> netcdf pnf_test {
> dimensions:
>         level = 256 ;
>         latitude = 256 ;
>         longitude = 256 ;
> variables:
>         float tt(level, latitude, longitude) ;
> data:
> 
>  tt =
>   0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
>   0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
>     0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
> 0,
> ...
> 
> I think it's a successful run, right?
> 
> So, is it a Fortran binding problem specifically related to the Frost
> platform, or something else?
> 
> By the way, I built my pnetcdf lib as shown below; you may want to try this:
> 
> setenv CC xlc
> setenv FC xlf
> setenv F90 xlf90
> setenv CXX xlC
> setenv FFLAGS '-d -O2'
> setenv MPICC mpcc_r
> setenv MPIF77 mpxlf_r
> 
> #make
> #make install
> 
> //what else can I do?:)
> 
> Jianwei
> 
> On Thu, 31 Jul 2003, John Tannahill wrote:
> 
> 
>>Rob,
>>
>>I am hoping that I can catch you before you leave, so that you can
>>pass this on to someone, but if you are already gone, can anyone
>>else take a look at this?
>>
>>I have graduated up to my original bigger test case and the C version
>>works, but the Fortran version doesn't.  It's certainly possible that
>>I have screwed up the translation from C to Fortran and I will be
>>looking at that, but I wanted to pass this back to you folks, so that
>>you can take a look at it too.
>>
>>I am using 0.8.8.  Attached are two tar files that should be pretty
>>self-explanatory, but let me know if you have questions.
>>
>>Regards,
>>John
>>
>>--
>>============================
>>John R. Tannahill
>>Lawrence Livermore Nat. Lab.
>>P.O. Box 808, M/S L-103
>>Livermore, CA  94551
>>925-423-3514
>>Fax:  925-423-4908
>>============================
>>
> 
> 
> 


-- 
============================
John R. Tannahill
Lawrence Livermore Nat. Lab.
P.O. Box 808, M/S L-103
Livermore, CA  94551
925-423-3514
Fax:  925-423-4908
============================



