[petsc-users] speedup for TS solver using DMDA

Matthew Knepley knepley at gmail.com
Mon Sep 15 13:47:27 CDT 2014


On Mon, Sep 15, 2014 at 1:42 PM, Katy Ghantous <katyghantous at gmail.com>
wrote:

> Matt, thanks! i will look into that and find other ways to make the
> computation faster.
>
> Barry, the benchmark reports up to 2 speedup, but says 1 node in the end.
> but either way i was expecting a higher speedup.. 2 is the limit for two
> cpus despite the multiple cores?
>

This is a bit of a scam on the part of chip makers. The speed of your
computations, say VecAXPY and simple stencil operations,
is not determined by the flop rate, but by the memory bandwidth. They sell
you a computer with a great flop rate, but not much
bandwidth at all. This is much like a car dealer who sells you a car with
an incredible amount of torque, just loads of torque, enough
torque to tear down a building, but that is not going to make you go faster.

   Matt


> please let me know if the file attached is what you are asking for.
> Thank you!
>
>
> On Mon, Sep 15, 2014 at 8:23 PM, Barry Smith <bsmith at mcs.anl.gov> wrote:
>
>>
>>    Please send the output from running
>>
>>     make steams NPMAX=32
>>
>>     in the PETSc root directory.
>>
>>
>>    Barry
>>
>>   My guess is that it reports “one node” is just because it uses the
>> “hostname” to distinguish nodes and though your machine has two CPUs, from
>> the point of view of the OS it has only a single hostname and hence reports
>> just one “node”.
>>
>>
>> On Sep 15, 2014, at 12:45 PM, Katy Ghantous <katyghantous at gmail.com>
>> wrote:
>>
>> > Hi,
>> > I am using DMDA to run in parallel TS to solves a set of N equations. I
>> am using DMDAGetCorners in the RHSfunction with setting the stencil size at
>> 2 to solve a set of coupled ODEs on 30 cores.
>> > The machine has 32 cores (2 physical CPUs with 2x8 core each with speed
>> of 3.4Ghz per core).
>> > However, mpiexec with more than one core is showing no speedup.
>> > Also at the configuring/testing stage for petsc on that machine, there
>> was no speedup and it only reported one node.
>> > Is there somehting wrong with how i configured petsc or is the approach
>> inappropriate for the machine?
>> > I am not sure what files (or sections of the code) you would need to be
>> able to answer my question.
>> >
>> > Thank you!
>>
>>
>


-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20140915/12a434e8/attachment.html>


More information about the petsc-users mailing list