[mpich-discuss] Need some help getting mpich to work
Hung-Hsun Su
su at hcs.ufl.edu
Tue Nov 24 12:50:37 CST 2009
Unfortunately, the latest release did not solve the issue. It actually
introduces new bug. After installation, mpdboot cannot setup the
environment correctly. It freezes when I make the following call.
[su at alpha ~]$ which mpdboot
/home/su/software/mpich2-1.2.1/bin/mpdboot
[su at alpha ~]$ mpdboot -n 32 --ncpus=1 -f /home/su/mpd.hosts
After I killed the process and found out that only 4/31 of the compute
nodes were setup
[su at alpha ~]$ mpdtrace
alpha
compute-0-3
compute-0-1
compute-0-0
compute-0-2
I then tried setting up using the 1.2 version and it works fine.
[su at alpha ~]$ /home/su/software/mpich2-1.2/bin/mpdboot -n 32 --ncpus=1
-f /home/su/mpd.hosts
[su at alpha ~]$ mpdtrace
alpha
compute-0-3
compute-0-11
compute-0-10
compute-0-9
compute-0-8
compute-0-1
compute-0-15
compute-0-14
compute-0-13
compute-0-12
compute-0-0
compute-0-19
compute-0-27
compute-0-26
compute-0-25
compute-0-24
compute-0-18
compute-0-30
compute-0-29
compute-0-28
compute-0-17
compute-0-16
compute-0-2
compute-0-7
compute-0-6
compute-0-5
compute-0-4
compute-0-23
compute-0-22
compute-0-21
compute-0-20
I then tried ran make testing and got even more error. Anyone has an
idea of what is going on?
Hung-Hsun
PS. I've attached the output files from various steps
c.txt - configuration
m.txt - make
mi.txt - make install
mpd.hosts - my machine file
mtest.txt - make testing
summary.xml - output in test/mpi directory from make testing
> Can you try the latest release 1.2.1?
>
> Rajeev
>
>
>> -----Original Message-----
>> From: mpich-discuss-bounces at mcs.anl.gov
>> [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Hung-Hsun Su
>> Sent: Monday, November 23, 2009 11:16 AM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: [mpich-discuss] Need some help getting mpich to work
>>
>> Hi,
>>
>> I was wondering if anyone can help me figure out why my MPICH2
>> installation isn't working correctly. I've downloaded the
>> v1.2 version,
>> configured using "configure
>> --prefix=/home/su/software/mpich2-1.2", make
>> and make install and everything seemed fine (I've attached the 3 txt
>> output from configure, make and make install which shows no
>> error). I
>> then tried to see if my installation is working correctly by
>> running the
>> mpich-test suite (result given in summary.xml) and some of the tests
>> failed (collective). Does anyone know what might be the cause of my
>> problem? Thanks.
>>
>> System spec:
>> 32 nodes Quad-core Xeon cluster
>> Linux version 2.6.9-55.0.2.ELsmp (mockbuild at builder6.centos.org) (gcc
>> version 3.4.6 20060404 (Red Hat 3.4.6-8)) #1 SMP Tue Jun 26
>> 14:14:47 EDT
>> 2007
>>
>> Hung-Hsun
>>
>> --
>>
>> --------------------------------------------------------------
>> ---------------------------------------------
>> Sincerely,
>>
>> Hung-Hsun Su
>>
>> Ph.D. Student, UPC Group Leader, Research Assistant, Teaching
>> Assistant
>> High-performance Computing and Simulation (HCS) Research Laboratory
>> Dept. of Electrical and Computer Engineering , University of Florida,
>> Gainesville, FL 32611-6200
>> Email: su at hcs.ufl.edu, hunghsun at ufl.edu
>> --------------------------------------------------------------
>> ----------------------------------------------
>>
>>
>>
>
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>
--
-----------------------------------------------------------------------------------------------------------
Sincerely,
Hung-Hsun Su
Ph.D. Student, UPC Group Leader, Research Assistant, Teaching Assistant
High-performance Computing and Simulation (HCS) Research Laboratory
Dept. of Electrical and Computer Engineering , University of Florida,
Gainesville, FL 32611-6200
Email: su at hcs.ufl.edu, hunghsun at ufl.edu
------------------------------------------------------------------------------------------------------------
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mi.txt
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0004.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: c.txt
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0005.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: m.txt
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0006.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mpd.hosts
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0001.diff>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: summary.xml
Type: text/xml
Size: 73904 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0001.bin>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mtest.txt
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20091124/57c23265/attachment-0007.txt>
More information about the mpich-discuss
mailing list