bombing out writing large scratch files

Randall Mackie randy at geosystem.us
Sat May 27 18:58:44 CDT 2006


I can't seem to get the debugger to pop up on my screen.

When I'm logged into the cluster I'm working on, I can
type xterm &, and an xterm pops up on my display. So I know
I can get something from the remote cluster.

Now, when I try this using PETSc, I'm getting the following error
message, for example:

------------------------------------------------------------------------
[17]PETSC ERROR: PETSC: Attaching gdb to /home/randy/d3inv/PETSC_V3.3/d3inv_3_3_petsc of pid 3628 on display 24.5.142.138:0.0 on machine compute-0-23.local
------------------------------------------------------------------------

I'm using this in my command file:

source ~/.bashrc
time /opt/mpich/intel/bin/mpirun -np 20 -nolocal -machinefile machines \
          /home/randy/d3inv/PETSC_V3.3/d3inv_3_3_petsc \
          -start_in_debugger \
          -debugger_node 1 \
          -display 24.5.142.138:0.0 \
          -em_ksp_type bcgs \
          -em_sub_pc_type ilu \
          -em_sub_pc_factor_levels 8 \
          -em_sub_pc_factor_fill 4 \
          -em_sub_pc_factor_reuse_ordering \
          -em_sub_pc_factor_reuse_fill \
          -em_sub_pc_factor_mat_ordering_type rcm \
          -divh_ksp_type cr \
          -divh_sub_pc_type icc \
          -ppc_sub_pc_type ilu \
<< EOF
...


Randy
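
A minimal sketch of one way to narrow this down, not something taken from the thread: the xterm test above was run on the login node, while the PETSc message names compute-0-23.local, so it may be worth launching an xterm from that node against the same display given to -display. The sketch assumes passwordless ssh from the login node to the compute nodes (not stated above). RUN_CHECK is a hypothetical switch introduced here so that the command is only printed by default.

```shell
# Node and display are copied from the PETSc message and the -display option.
node="compute-0-23.local"
display="24.5.142.138:0.0"

# The command a compute node would need to succeed at: open an xterm
# directly on the remote display, without going through the ssh tunnel.
check_cmd="ssh ${node} xterm -display ${display}"

# RUN_CHECK is a hypothetical switch: by default only print the command.
if [ "${RUN_CHECK:-0}" = "1" ]; then
    $check_cmd
else
    echo "would run: ${check_cmd}"
fi
```

If no window appears, one possible explanation is that the compute node cannot reach the X server directly: an address such as 24.5.142.138:0.0 is contacted over the network rather than through the ssh tunnel, so the X server has to be reachable and willing to accept the connection.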


Matthew Knepley wrote:
> 1) Make sure ssh is forwarding X (-Y I think)
> 
> 2) -start_in_debugger
> 
> 3) -display <your machine>:0.0
> 
> should do it.
> 
>    Matt
> 
> On 5/27/06, *Randall Mackie* <randy at geosystem.us> wrote:
> 
>     This is a stupid question, but how do I start in the debugger if I'm
>     running on a cluster half-way around the world and I'm working on that
>     cluster via ssh?
> 
>     Randy
> 
> 
>     Matthew Knepley wrote:
>      > The best thing to do here is get a stack trace from the debugger.
>      > From the description, it is hard to tell what statement is trying
>      > to access which illegal memory.
>      >
>      >    Matt
>      >
>      > On 5/27/06, *Randall Mackie* <randy at geosystem.us> wrote:
>      >
>      >     In my PETSc based modeling code, I write out intermediate
>      >     results to a scratch file, and then read them back later.
>      >     This has worked fine up until today, when for a large model,
>      >     this seems to be causing my program to crash with errors like:
>      >
>      >     ------------------------------------------------------------------------
>      >     [9]PETSC ERROR: Caught signal number 11 SEGV: Segmentation
>      >     Violation, probably memory access out of range
>      >
>      >
>      >     I've tracked down the offending code to:
>      >
>      >                IF (rank == 0) THEN
>      >                  irec=(iper-1)*2+ipol
>      >                  write(7,rec=irec) (xvec(i),i=1,np)
>      >                END IF
>      >
>      >     It writes out xvec for the first record, but then on the second
>      >     record my program is crashing.
>      >
>      >     The record length (from an inquire statement) is recl = 22626552.
>      >
>      >     The size of the scratch file when my program crashes is 98M.
>      >
>      >     PETSc is compiled using the intel compilers ( v9.0 for fortran),
>      >     and the users manual says that you can have record lengths of
>      >     up to 2 billion bytes.
>      >
>      >     I'm kind of stuck as to what might be the cause. Any ideas from
>      >     anyone would be greatly appreciated.
>      >
>      >     Randy Mackie
>      >
>      >     ps. I've tried both the optimized and debugging versions of the
>      >     PETSc libraries, with the same result.
>      >
>      >
>      >     --
>      >     Randall Mackie
>      >     GSY-USA, Inc.
>      >     PMB# 643
>      >     2261 Market St.,
>      >     San Francisco, CA 94114-1600
>      >     Tel (415) 469-8649
>      >     Fax (415) 469-5044
>      >
>      >     California Registered Geophysicist
>      >     License No. GP 1034
>      >
>      >
>      >
>      >
>      > --
>      > "Failure has a thousand explanations. Success doesn't need one"
>      > -- Sir Alec Guinness
> 

-- 
Randall Mackie
GSY-USA, Inc.
PMB# 643
2261 Market St.,
San Francisco, CA 94114-1600
Tel (415) 469-8649
Fax (415) 469-5044

California Registered Geophysicist
License No. GP 1034
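
A quick arithmetic check on the numbers quoted above, offered as a sketch rather than a diagnosis. Intel Fortran by default measures RECL for unformatted direct-access files in 4-byte units and switches to bytes with -assume byterecl. Which setting was used here is not stated, so both cases are computed. Reading the 98M file size as 98 MiB is also an assumption.

```shell
recl=22626552                       # value reported by the inquire statement
file_bytes=$((98 * 1024 * 1024))    # assumption: "98M" means 98 MiB

# Bytes per record under each interpretation of the RECL units.
record_bytes_byte_units=$recl           # if RECL counts bytes
record_bytes_word_units=$((recl * 4))   # if RECL counts 4-byte units

# How many complete records would fit in a file of that size.
full_records_byte_units=$((file_bytes / record_bytes_byte_units))
full_records_word_units=$((file_bytes / record_bytes_word_units))

echo "RECL in bytes:        ${record_bytes_byte_units} bytes/record, ${full_records_byte_units} complete records"
echo "RECL in 4-byte units: ${record_bytes_word_units} bytes/record, ${full_records_word_units} complete records"
```

Under the 4-byte reading the file would hold one complete record plus part of a second, which would be consistent with the crash on the second record, but it does not by itself identify the cause.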



