[petsc-users] generate entries on 'wrong' process

Wen Jiang jiangwen84 at gmail.com
Fri Jan 20 11:31:59 CST 2012


Hi Matt,

The serial job is running without any problems and never stalls. Actually
the parallel jobs also running successfully on distributed-memory desktop
or on single node of cluster. It will get stuck if it is running on more
than one compute node(now it is running on two nodes). Both the serial job
and parallel job (running on distributed or cluster) I mentioned before
have the same size(dofs). But If I ran a smaller job on cluster with two
nodes, it might not get stuck and work fine.

As you said before, I add MAT_ASSEMBLY_FLUSH after every element stiffness
matrix is inserted. I got the output like below, and it gets stuck too.

[0] MatStashScatterBegin_Private()
: No of messages: 1
[0] MatStashScatterBegin_Private(): Mesg_to: 1: size: 24584
[0] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[3] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[2] MatAssemblyBegin_MPIAIJ(): Stash has 4096 entries, uses 0 mallocs.
[7] MatAssemblyBegin_MPIAIJ(): Stash has 0 entries, uses 0 mallocs.
[5] MatAssemblyBegin_MPIAIJ(): Stash has 2048 entries, uses 0 mallocs.
[4] MatAssemblyBegin_MPIAIJ(): Stash has 2048 entries, uses 0 mallocs.
[6] MatAssemblyBegin_MPIAIJ(): Stash has 1024 entries, uses 0 mallocs.
[1] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[0] MatStashScatterBegin_Private(): No of messages: 1
[0] MatStashScatterBegin_Private(): Mesg_to: 1: size: 24584
[0] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[3] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[2] MatAssemblyBegin_MPIAIJ(): Stash has 4096 entries, uses 0 mallocs.
[7] MatAssemblyBegin_MPIAIJ(): Stash has 0 entries, uses 0 mallocs.
[4] MatAssemblyBegin_MPIAIJ(): Stash has 2048 entries, uses 0 mallocs.
[5] MatAssemblyBegin_MPIAIJ(): Stash has 2048 entries, uses 0 mallocs.
[6] MatAssemblyBegin_MPIAIJ(): Stash has 1024 entries, uses 0 mallocs.
[1] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[0] MatStashScatterBegin_Private(): No of messages: 1
[0] MatStashScatterBegin_Private(): Mesg_to: 1: size: 24584
[0] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[3] MatAssemblyBegin_MPIAIJ(): Stash has 3072 entries, uses 0 mallocs.
[2] MatAssemblyBegin_MPIAIJ(): Stash has 4096 entries, uses 0 mallocs.


On Fri, Jan 20, 2012 at 10:21 AM, Wen Jiang <jiangwen84 at gmail.com> wrote:

> Hi, Matt
>
> Could you tell me some more details about how to get a stack trace there?
> I know little about it. The job is submitted on head node and running on
> compute nodes.
>

1) Always run serial problems until you understand what is happening

2) Run with -start_in_debugger, and type 'cont' in the debugger (read about
gdb)

3) When it stalls, Ctrl-C and then type 'where'

 Matt
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20120120/3537ad97/attachment.htm>


More information about the petsc-users mailing list