SBAIJ issue

Hong Zhang hzhang at mcs.anl.gov
Tue Oct 13 08:56:27 CDT 2009


It seems the code hangs in MatView_MPISBAIJ_ASCIIorDraworSocket(), a
routine that is not well tested.
Can you send us your matrix in PETSc binary format?
We can test it and find a fix.

My understanding is that the code hangs when run with the option
-ksp_view_binary on np=4.
What happens without -ksp_view_binary?
If the hang occurs only with that option, it is not related to MUMPS.

Hong

On Oct 13, 2009, at 7:44 AM, Andreas Grassl wrote:

> Hong Zhang wrote:
>> I would suggest you run your code with debugger,
>> e.g., '-start_in_debugger'.
>> When it hangs, type Control-C,
>> and type 'where' to check where it hangs.
>
> Below is the debugger output of the running process when given the option
> -ksp_view_binary with mumps_cholesky. It hangs after solving.
>
> Explanations and suggestions?
>
> Cheers,
>
> ando
>
> Program received signal SIGINT, Interrupt.
> [Switching to Thread 46912507935936 (LWP 11965)]
> 0x00002aaaab00eb8a in __intel_new_memset ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> (gdb) bt
> #0  0x00002aaaab00eb8a in __intel_new_memset ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> #1  0x00002aaaaafecb66 in _intel_fast_memset.J ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> #2  0x0000000000aefefe in PetscMemzero (a=0x14b206c0, n=8051408) at memc.c:205
> #3  0x0000000000ab41d0 in PetscTrFreeDefault (aa=0x14b206c0, line=89,
>    function=0x1260b10 "MatSeqXAIJFreeAIJ",
>    file=0x1260840 "/home/lux/csae1801/petsc/petsc-3.0.0-p8/include/../src/mat/impls/aij/seq/aij.h",
>    dir=0x1260ad4 "src/mat/impls/sbaij/mpi/") at mtr.c:318
> #4  0x00000000008f606e in MatSeqXAIJFreeAIJ (AA=0x143112a0, a=0x143124c8, j=0x143124b8,
>    i=0x143124b0)
>    at /home/lux/csae1801/petsc/petsc-3.0.0-p8/include/../src/mat/impls/aij/seq/aij.h:89
> #5  0x00000000008f773e in MatSetValues_MPISBAIJ (mat=0x133aaf30, m=1, im=0x15cc2f70, n=43,
>    in=0x15f6fa10, v=0x16cfb938, addv=NOT_SET_VALUES) at mpisbaij.c:202
> #6  0x00000000008fc0a9 in MatAssemblyEnd_MPISBAIJ (mat=0x133aaf30, mode=MAT_FINAL_ASSEMBLY)
>    at mpisbaij.c:539
> #7  0x0000000000633e5e in MatAssemblyEnd (mat=0x133aaf30, type=MAT_FINAL_ASSEMBLY)
>    at matrix.c:4561
> #8  0x00000000008fe302 in MatView_MPISBAIJ_ASCIIorDraworSocket (mat=0x11df00e0,
>    viewer=0x133a4070) at mpisbaij.c:704
> #9  0x00000000008fe95c in MatView_MPISBAIJ (mat=0x11df00e0, viewer=0x133a4070)
>    at mpisbaij.c:733
> #10 0x0000000000603570 in MatView (mat=0x11df00e0, viewer=0x133a4070) at matrix.c:643
> #11 0x00000000004c9962 in KSPSolve (ksp=0x11f3ed80, b=0x116650a0, x=0x115fe9b0)
>    at itfunc.c:328
> #12 0x000000000040a5ff in main (argc=1, argv=0x7fff3baade68) at externalsolver.c:590
> (gdb) c
> Continuing.
>
> Program received signal SIGINT, Interrupt.
> 0x00002aaaab00adf3 in __intel_new_memcpy ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> (gdb) bt
> #0  0x00002aaaab00adf3 in __intel_new_memcpy ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> #1  0x00002aaaaafecb16 in _intel_fast_memcpy.J ()
>   from /opt/intel/Compiler/11.0/074/lib/intel64/libirc.so
> #2  0x0000000000aef6f5 in PetscMemcpy (a=0x14b2abc0, b=0x156bed20, n=3721504)
>    at memc.c:102
> #3  0x00000000008f74b7 in MatSetValues_MPISBAIJ (mat=0x133aaf30, m=1, im=0x15cc7964, n=44,
>    in=0x15f74404, v=0x16d04d20, addv=NOT_SET_VALUES) at mpisbaij.c:202
> #4  0x00000000008fc0a9 in MatAssemblyEnd_MPISBAIJ (mat=0x133aaf30, mode=MAT_FINAL_ASSEMBLY)
>    at mpisbaij.c:539
> #5  0x0000000000633e5e in MatAssemblyEnd (mat=0x133aaf30, type=MAT_FINAL_ASSEMBLY)
>    at matrix.c:4561
> #6  0x00000000008fe302 in MatView_MPISBAIJ_ASCIIorDraworSocket (mat=0x11df00e0,
>    viewer=0x133a4070) at mpisbaij.c:704
> #7  0x00000000008fe95c in MatView_MPISBAIJ (mat=0x11df00e0, viewer=0x133a4070)
>    at mpisbaij.c:733
> #8  0x0000000000603570 in MatView (mat=0x11df00e0, viewer=0x133a4070) at matrix.c:643
> #9  0x00000000004c9962 in KSPSolve (ksp=0x11f3ed80, b=0x116650a0, x=0x115fe9b0)
>    at itfunc.c:328
> #10 0x000000000040a5ff in main (argc=1, argv=0x7fff3baade68) at externalsolver.c:590
> (gdb)
>
> -- 
> /"\                               Grassl Andreas
> \ /    ASCII Ribbon Campaign      Uni Innsbruck Institut f. Mathematik
>  X      against HTML email        Technikerstr. 13 Zi 709
> / \                               +43 (0)512 507 6091


