[mpich-discuss] mpich2 hangs on Ubuntu beowulf cluster(with NFS) with patch

Konstantinos Varotsos kvarotso at gmail.com
Mon Jan 23 13:03:47 CST 2012


ok it seems that the segmenation error was a stack size issue

after reseting it
ulimit -a gives

core file size          (blocks, -c) unlimited
data seg size           (kbytes, -d) unlimited
scheduling priority             (-e) 20
file size               (blocks, -f) unlimited
pending signals                 (-i) 16382
max locked memory       (kbytes, -l) 64
max memory size         (kbytes, -m) unlimited
open files                      (-n) 1024
pipe size            (512 bytes, -p) 8
POSIX message queues     (bytes, -q) 819200
real-time priority              (-r) 0
stack size              (kbytes, -s) unlimited
cpu time               (seconds, -t) unlimited
max user processes              (-u) unlimited
virtual memory          (kbytes, -v) unlimited
file locks                      (-x) unlimited




mpiexec with -l gives


[4] Fatal error in PMPI_Bcast: Other MPI error, error stack:
[4] PMPI_Bcast(1483)......: MPI_Bcast(buf=0x1f9fb60, count=1, MPI_CHAR, 
root=0, comm=0x84000004) failed
[4] MPIR_Bcast_impl(1326).:
[4] MPIR_Bcast_intra(1124):
[4] MPIR_SMP_Bcast(1041)..: Failure during collective
[0] Fatal error in PMPI_Barrier: Other MPI error, error stack:
[0] PMPI_Barrier(425)...........: MPI_Barrier(comm=0x84000004) failed
[0] MPIR_Barrier_impl(306)......:
[0] MPIR_Bcast_impl(1326).......:
[0] MPIR_Bcast_intra(1160)......:
[0] MPIR_Bcast_binomial(213)....: Failure during collective
[0] MPIR_Barrier_impl(292)......:
[0] MPIR_Barrier_or_coll_fn(121):
[0] MPIR_Barrier_intra(83)......:
[0] dequeue_and_set_error(596)..: Communication error with rank 4


thanx Kwstas







More information about the mpich-discuss mailing list