[petsc-users] Tuning the parallel performance of a 3D FEM CFD code
Barry Smith
bsmith at mcs.anl.gov
Wed May 18 12:46:34 CDT 2011
On May 18, 2011, at 10:08 AM, Henning Sauerland wrote:
>
> On 15.05.2011, at 15:59, Jed Brown wrote:
>
>> This is a problem, but you can still order unknowns in the vector so that velocity and pressure all come together, then AIJ/Inode will treat the components together which will tend to make ILU stronger.
>
> The effect of the interlaced ordering is more pronounced than expected. The number of iterations increased significantly with ILU(0) (~430 vs. ~170 per Picard iteration on 4 PEs), which makes the solve 4-5 times slower.
So interlacing the variables makes ILU() much worse in both iteration count and time?
Barry
> However, the iteration count almost stays constant when moving to more processors! ILU(2) again reduces the iteration count (~100) while being slower compared to ILU(0).
>
>
> I really appreciate your suggestions so far!
>
> Henning
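
For readers following the ordering discussion: a minimal sketch, in plain Python rather than PETSc code, of what "interlaced" means in terms of global indices. The function names are hypothetical, and the sketch assumes every node carries the same number of unknowns (here 4, e.g. three velocity components plus pressure), which may not hold for the discretization in this thread.

```python
def segregated_index(node, comp, n_nodes):
    """Global index when each field is stored as one contiguous block
    (all of component 0, then all of component 1, ...)."""
    return comp * n_nodes + node


def interlaced_index(node, comp, n_comp):
    """Global index when all unknowns of a node are stored together
    (node 0: comp 0..n_comp-1, then node 1, ...)."""
    return node * n_comp + comp


def segregated_to_interlaced(n_nodes, n_comp):
    """Return perm such that perm[i] is the interlaced index of the
    unknown stored at segregated index i."""
    perm = [0] * (n_nodes * n_comp)
    for comp in range(n_comp):
        for node in range(n_nodes):
            i = segregated_index(node, comp, n_nodes)
            perm[i] = interlaced_index(node, comp, n_comp)
    return perm


# Assumed toy problem: 3 nodes, 4 unknowns per node.
perm = segregated_to_interlaced(n_nodes=3, n_comp=4)
print(perm)
```

With the interlaced numbering, the rows belonging to one node are adjacent in the matrix, which is the layout Jed describes AIJ/Inode as being able to treat together.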