[petsc-users] Install PETSc with option `--with-shared-libraries=1` failed on MacOS
Satish Balay
balay at mcs.anl.gov
Mon Mar 18 18:31:54 CDT 2024
On Mon, 18 Mar 2024, Pierre Jolivet wrote:
>
> And here we go: https://urldefense.us/v3/__https://gitlab.com/petsc/petsc/-/jobs/6420606887__;!!G_uCfscf7eWS!alfBlmyFQ5JJUYKxxFdETav6xjHOl5W54BPrmJEyXdSakVXnj8eYIRZdknOI-FK4uiaPdL4zSdJlD2zrcw$
> 20 minutes in, and still in the dm_* tests with timeouts right, left, and center.
> For reference, this prior job https://urldefense.us/v3/__https://gitlab.com/petsc/petsc/-/jobs/6418468279__;!!G_uCfscf7eWS!alfBlmyFQ5JJUYKxxFdETav6xjHOl5W54BPrmJEyXdSakVXnj8eYIRZdknOI-FK4uiaPdL4zSdJj83LENQ$ completed in 3 minutes (OK, maybe add a couple of minutes to rebuild the packages to have a fair comparison).
> What did they do to OpenBLAS? Add a sleep() in their axpy?
(gdb) r
Starting program: /home/petsc/petsc/src/dm/dt/tests/ex13
^C
Program received signal SIGINT, Interrupt.
0x0000fffff331ad10 in dgemm_otcopy (m=m at entry=8, n=n at entry=7, a=a at entry=0x58f150, lda=lda at entry=15, b=b at entry=0xffffefae0000) at ../kernel/arm64/../generic/gemm_tcopy_2.c:69
69 *(b_offset1 + 3) = *(a_offset2 + 1);
(gdb) where
#0 0x0000fffff331ad10 in dgemm_otcopy (m=m at entry=8, n=n at entry=7, a=a at entry=0x58f150, lda=lda at entry=15, b=b at entry=0xffffefae0000) at ../kernel/arm64/../generic/gemm_tcopy_2.c:69
#1 0x0000fffff3342e68 in dgetrf_single (args=args at entry=0xffffffffe9d8, range_m=range_m at entry=0x0, range_n=range_n at entry=0x0, sa=sa at entry=0xffffefae0000, sb=<optimized out>, myid=myid at entry=0) at getrf_single.c:157
#2 0x0000fffff3255ec4 in dgetrf_ (M=<optimized out>, N=<optimized out>, a=<optimized out>, ldA=<optimized out>, ipiv=<optimized out>, Info=0xffffffffeaa8) at lapack/getrf.c:110
#3 0x0000fffff50b8dd8 in MatLUFactor_SeqDense (A=0x598360, row=0x0, col=0x0, minfo=0xffffffffeba8) at /home/petsc/petsc/src/mat/impls/dense/seq/dense.c:801
#4 0x0000fffff559b8b4 in MatLUFactor (mat=0x598360, row=0x0, col=0x0, info=0xffffffffeba8) at /home/petsc/petsc/src/mat/interface/matrix.c:3087
#5 0x00000000004149e0 in test (dim=2, deg=3, form=-1, jetDegree=3, cond=PETSC_FALSE) at ex13.c:141
#6 0x0000000000418f20 in main (argc=1, argv=0xfffffffff158) at ex13.c:303
(gdb)
It appears to get stuck in a loop here.
This test runs fine - if I remove "--download-openblas-make-options=TARGET=GENERIC" option.
Ok - trying out "git bisect"
ea6c5f3cf553a23f8e2e787307805e7874e1f9c6 is the first bad commit
commit ea6c5f3cf553a23f8e2e787307805e7874e1f9c6
Author: Martin Kroeker <martin at ruby.chemie.uni-freiburg.de>
Date: Sun Oct 30 12:55:23 2022 +0100
Add option RELAPACK_REPLACE
Makefile.rule | 5 ++++-
Makefile.system | 4 ++++
2 files changed, 8 insertions(+), 1 deletion(-)
Don't really understand why this change is triggering this hang. Or the correct way to build latest openblas [do we need "BUILD_RELAPACK=1"?]
Satish
More information about the petsc-users
mailing list