[petsc-users] Very poor speed up performance

Jed Brown jed at 59A2.org
Wed Dec 22 20:32:00 CST 2010


I disagree, there is easily a factor of two in flop/s between a naive
ordering (e.g. hierarchical by node type in a finite element method) and a
good low-bandwidth ordering.

This is in the FUN3D papers and still true today, in my experience.

Incomplete factorization is also very order dependent, as you note.

Jed

On Dec 22, 2010 5:03 PM, "Matthew Knepley" <knepley at gmail.com> wrote:

On Wed, Dec 22, 2010 at 10:11 AM, Yongjun Chen <yjxd.chen at gmail.com> wrote:

>
>
>
> On Wed, Dec 22, 2010 at 6:53 PM, Satish Balay <balay at mcs.anl.gov> wrote:
>>
>> On Wed, 22 De...
1) To see a large gain, the ordering you start with would have to be very
bad. Maybe it is. These
    orderings try to minimize bandwidth, which means minimize communication
in the MatMult.

2) If you use incomplete facotrization, the ordering can have a large effect
on conditioning, so
    number of iterations, which does not improve scalability. This would
impact scalability if you
   use a parallel IC, however all those packages reorder your matrix
already.

In short, I suspect this will not help a lot, except maybe with
conditioning, which is what I was refering to in the quote.

    Matt



-- 
What most experimenters take for granted before they begin their experiments
is infinitely more...
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20101223/89beccb3/attachment.htm>


More information about the petsc-users mailing list