[Nek5000-users] mpich2? (fwd)

nek5000-users at lists.mcs.anl.gov nek5000-users at lists.mcs.anl.gov
Thu May 27 15:19:08 CDT 2010



Hi Stefan,

Any ideas?  Thanks!

Paul


---------- Forwarded message ----------
Date: Thu, 27 May 2010 16:15:14 -0400 (EDT)
From: Tamay Ozgokmen <tozgokmen at rsmas.miami.edu>
To: Zongjun Hu <zhu at med.miami.edu>
Cc: fischer at mcs.anl.gov
Subject: mpich2?


Hi Paul,

I installed latest code on one of UM's linux clusters
but have trouble to get p4c running; it gets stuck after
the prelims. See below.
Any ideas...?
(I will be out of touch for a few hours, unfortunately, a
later meeting...)
Tamay



gs_setup: 379187 unique labels shared
    pairwise times (avg, min, max): 0.0235193 0.0224905 0.043084
    crystal router                : 0.00944349 0.00717771 0.029574
    used all_to_all method: crystal router
    setupds time 2.5930E+00 seconds   3  4     2545856      131072
    setvert3d:   6     8149312    16537920     8149312     8149312
  call usrsetvert
  done :: usrsetvert

gs_setup: 1062197 unique labels shared
    pairwise times (avg, min, max): 0.0277191 0.0266448 0.0471066
    crystal router                : 0.0466603 0.0462395 0.0471185
    used all_to_all method: pairwise
    setupds time 5.2800E+00 seconds   4  6     8149312      131072
  setup h1 coarse grid, nx_crs=           2
  call usrsetvert
  done :: usrsetvert

gs_setup: 40385 unique labels shared
    pairwise times (avg, min, max): 0.000733222 0.00068059 0.000779104
    crystal router                : 0.00111822 0.00108392 0.00117099
    all reduce                    : 0.0248416 0.0245946 0.0251968
    used all_to_all method: pairwise
gs_setup: 40385 unique labels shared
    pairwise times (avg, min, max): 0.000738662 0.000688004 0.000781512
    crystal router                : 0.00110533 0.00107729 0.00113969
    all reduce                    : 0.0281804 0.0244667 0.0450669
    used all_to_all method: pairwise
rank 87 in job 1  n0422_55429   caused collective abort of all ranks
   exit status of rank 87: killed by signal 9
rank 88 in job 1  n0422_55429   caused collective abort of all ranks
   exit status of rank 88: killed by signal 9
rank 12 in job 1  n0422_55429   caused collective abort of all ranks
   exit status of rank 12: killed by signal 9
rank 50 in job 1  n0422_55429   caused collective abort of all ranks
   exit status of rank 50: killed by signal 9
rank 16 in job 1  n0422_55429   caused collective abort of all ranks
   exit status of rank 16: killed by signal 9
Job  /share/apps/lsf/7.0/linux2.6-glibc2.3-x86_64/bin/mpich2_wrapper ./nek5000





More information about the Nek5000-users mailing list