Unsure about some entries in log_summary for 2B DoF problem

Hong Zhang hzhang at mcs.anl.gov
Mon May 4 17:14:31 CDT 2009


Richard,

Ah, I almost forget what I did last year. ;-)
See PetscLogPrintSummaryToPy() which dumps some data from
-log_sumamry into a python file.

The subroutine was only used in the example
/tutorials/multiphysicsNonlinear/mp1.c

Hong

On Mon, 4 May 2009, Barry Smith wrote:

>
> On May 4, 2009, at 1:51 PM, Richard Tran Mills wrote:
>
>> Barry,
>> 
>> Is there a way to dump the log data for all processes so that I can do 
>> things like look at the the time spent in each stage by each process?  I 
>> thought you or someone else had mentioned adding a capability to do this in 
>> PETSc to a file format that would be easy to manipulate using something 
>> like Python... but perhaps I recall incorrectly.
>
> Hong has worked a little on this, you are welcome to mess with it if you 
> want (Hong, have you pushed this?).
>
>  Barry
>
>> 
>> 
>> It would be very nice to have this to calculate some more detailed 
>> statistics.
>> 
>> --Richard
>> 
>> Barry Smith wrote:
>>>  Matt is completely correct. What this means is that though some processes 
>>> wait a long time for the dots, MOST processes don't wait much at all.
>>> In other words, the dot causes very little idle time integrated over the 
>>> whole machine.
>>> Meanwhile for flow (where the percentage is large) the dots cause a LARGE 
>>> amount of idle time integrated over the machine.
>>> Why it is high for one and not the other I do not know.
>>>  Barry
>>> On May 4, 2009, at 1:09 PM, Matthew Knepley wrote:
>>>> I believe that the time reported there is collective sum of times divided 
>>>> by the collective sum
>>>> of the stage times. If you look at the time imbalance, it is a staggering 
>>>> 9.7, which either means
>>>> 
>>>> 1) The partition is really crap (which we know isn't true)
>>>> 
>>>> 2) Some procs spend a lot of time waiting
>>>> 
>>>> We can get at this waiting time with the split VecDot() events.
>>>> 
>>>> Matt
>>>> 
>>>> On Mon, May 4, 2009 at 12:58 PM, Richard Tran Mills 
>>>> <rmills at climate.ornl.gov> wrote:
>>>> PETSc folks,
>>>> 
>>>> I was looking over the log summary data for the 2 billion degrees of 
>>>> freedom transport problem, and I'm a bit puzzled by some of the things 
>>>> I'm seeing.  (I sent a tarball of this to the pflotran-dev list on April 
>>>> 30.)  For instance, looking at the run at 32768 cores, I see that the 
>>>> total time for the "transport" phase is 3.2139e+02 seconds.  But if I 
>>>> look at the VecDot line for the transport stage, I see
>>>> 
>>>> Event                Count      Time (sec)     Flops   --- Global --- 
>>>> --- Stage ---   Total
>>>>                 Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len 
>>>> Reduct  %T %F %M %L %R  %T %F %M %L %R Mflop/s
>>>> VecDot              1306 1.0 4.1529e+01 9.7 1.76e+08 1.1 0.0e+00 0.0e+00 
>>>> 1.3e+03  1  0  0  0  1   3  0  0  0 24 128305
>>>> 
>>>> It's hard to read this the way my email client will wrap it, but it's 
>>>> saying that 3% of the time in the stage was spent on VecDot()s.  But the 
>>>> max time in VecDot is 4.1529e+01, close to thirteen percent.  Does the 
>>>> "%T" for the stage mean something other than what I think it does?
>>>> 
>>>> --Richard
>>>> 
>>>> -- 
>>>> Richard Tran Mills, Ph.D.            |   E-mail: rmills at climate.ornl.gov
>>>> Computational Scientist              |   Phone:  (865) 241-3198
>>>> Computational Earth Sciences Group   |   Fax:    (865) 574-0405
>>>> Oak Ridge National Laboratory        |   http://climate.ornl.gov/~rmills
>>>> 
>>>> 
>>>> 
>>>> -- 
>>>> What most experimenters take for granted before they begin their 
>>>> experiments is infinitely more interesting than any results to which 
>>>> their experiments lead.
>>>> -- Norbert Wiener
>> 
>> 
>> -- 
>> Richard Tran Mills, Ph.D.            |   E-mail: rmills at climate.ornl.gov
>> Computational Scientist              |   Phone:  (865) 241-3198
>> Computational Earth Sciences Group   |   Fax:    (865) 574-0405
>> Oak Ridge National Laboratory        |   http://climate.ornl.gov/~rmills
>



More information about the petsc-dev mailing list