<table cellspacing="0" cellpadding="0" border="0" ><tr><td valign="top" style="font: inherit;">Hello,<div><br></div><div>Thanks very much. '-m32' works! Now I can run /examples/cpi with two machines. I will try my own programs later tonight; hopefully they will work like the cpi program.</div><div>Thanks again ^^.</div><div><br></div><div>Cheers,</div><div>Gra<br><br>--- On <b>Sat, 7/18/09, Pavan Balaji <i>&lt;balaji@mcs.anl.gov&gt;</i></b> wrote:<br><blockquote style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px;"><br>From: Pavan Balaji &lt;balaji@mcs.anl.gov&gt;<br>Subject: Re: [mpich-discuss] I wonder if my mpdboot is the cause of problem...help me!<br>To: mpich-discuss@mcs.anl.gov<br>Date: Saturday, July 18, 2009, 1:17 PM<br><br><div class="plainMail"><br>You have a few options:<br><br>1. Check if your processor supports 64-bit Operating Systems -- most modern processors do. If it does, just reinstall a 64-bit OS on
the machine. This is the most efficient option.<br><br>2. Pass "-m32" in CFLAGS to your MPICH2 configure -- this will build MPICH2 in 32-bit mode even on the 64-bit platform. Even your applications that are built with mpicc and friends will be built as 32-bit binaries. This will work, but you won't be using the 64-bit capabilities of one of your machines, so the performance will not be optimal.<br><br>3. You could use MPICH-1 instead of MPICH2, though I wouldn't suggest doing that. In this case MPICH will internally do the data conversion for you, which will eat up some performance as well.<br><br> -- Pavan<br><br>On 07/18/2009 03:08 PM, Gra zeus wrote:<br>> Hello Rajeev,<br>> <br>> Sorry about my last email; the OSes on my two machines are different.<br>> <br>> The quad-core machine is 64-bit and its OS is "Linux myquadcore_machine 2.6.18-128.1.1.el5 #1 SMP Tue Feb 10 11:36:29 EST 2009 x86_64 x86_64 x86_64 GNU/Linux"<br>> <br>> <br>>
The dual-core machine is 32-bit and its OS is "Linux mydualcore_machine 2.6.18-128.1.6.el5PAE #1 SMP Wed Apr 1 07:24:39 EDT 2009 i686 i686 i386 GNU/Linux"<br>> <br>> Is this the cause of my problem? Do I need to run MPI on machines that are both 32-bit? Are there any configuration options I need to set to make them work together?<br>> <br>> Thank you very much, and sorry again about the wrong OS info in my last email.<br>> <br>> Regards,<br>> Gra <br>> --- On *Sat, 7/18/09, Rajeev Thakur /<<a ymailto="mailto:thakur@mcs.anl.gov" href="/mc/compose?to=thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>>/* wrote:<br>> <br>> <br>> From: Rajeev Thakur <<a ymailto="mailto:thakur@mcs.anl.gov" href="/mc/compose?to=thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>><br>> Subject: Re: [mpich-discuss] I wonder if my mpdboot is the<br>> cause of problem...help me!<br>>
To: <a ymailto="mailto:mpich-discuss@mcs.anl.gov" href="/mc/compose?to=mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>> Date: Saturday, July 18, 2009, 8:42 AM<br>> <br>> Are the CPUs identical on them? Is one 32-bit, the other 64-bit?<br>> <br>> ------------------------------------------------------------------------<br>> *From:* <a ymailto="mailto:mpich-discuss-bounces@mcs.anl.gov" href="/mc/compose?to=mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a><br>> [mailto:<a ymailto="mailto:mpich-discuss-bounces@mcs.anl.gov" href="/mc/compose?to=mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a>] *On Behalf Of *Gra zeus<br>> *Sent:* Saturday,
July 18, 2009 10:27 AM<br>> *To:* <a ymailto="mailto:mpich-discuss@mcs.anl.gov" href="/mc/compose?to=mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>> *Subject:* Re: [mpich-discuss] I wonder if my mpdboot is the<br>> cause of problem...help me!<br>> <br>> One of them is quad-core and the other is dual-core. However, the<br>> OS, account, password, and install path are all the same.<br>> I used this configuration, "./configure<br>> --prefix=/opt/localhomes/myname/mpich2-install", on both machines.<br>> <br>> --- On *Sat, 7/18/09, Rajeev Thakur /<<a ymailto="mailto:thakur@mcs.anl.gov"
href="/mc/compose?to=thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>>/* wrote:<br>> <br>> <br>> From: Rajeev Thakur <<a ymailto="mailto:thakur@mcs.anl.gov" href="/mc/compose?to=thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>><br>> Subject: Re: [mpich-discuss] I wonder if my mpdboot is the<br>> cause of problem...help me!<br>> To: <a ymailto="mailto:mpich-discuss@mcs.anl.gov" href="/mc/compose?to=mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>> Date: Saturday, July 18, 2009, 7:02 AM<br>> <br>> What are the exact parameters you passed to configure when<br>>
building MPICH2? Are the two machines identical?<br>> Rajeev<br>> <br>> ------------------------------------------------------------------------<br>> *From:* <a ymailto="mailto:mpich-discuss-bounces@mcs.anl.gov" href="/mc/compose?to=mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a><br>> [mailto:<a ymailto="mailto:mpich-discuss-bounces@mcs.anl.gov" href="/mc/compose?to=mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a>] *On Behalf Of<br>> *Gra zeus<br>>
*Sent:* Saturday, July 18, 2009 12:06 AM<br>> *To:* <a ymailto="mailto:mpich-discuss@mcs.anl.gov" href="/mc/compose?to=mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>> *Subject:* [mpich-discuss] I wonder if my mpdboot is the<br>> cause of problem...help me!<br>> <br>> Hello,<br>> <br>> Thanks for the answer yesterday.<br>> I tested my code on one machine (with "mpiexec -n 2<br>> ./myprog"), and everything works fine - my program can use<br>>
MPI_Send and MPI_Recv without any problems.<br>> <br>> Today, I set up mpich2 on two machines. Both machines can<br>> communicate with each other, ssh is tested on both<br>> machines, mpd works, and mpdringtest works.<br>> <br>> However, when I run my program that uses MPI_Send and<br>> MPI_Recv, MPI_Recv blocks forever.<br>> So I wrote a new simple program to test MPI_Send and MPI_Recv,<br>> like this:<br>> <br>>
int myrank;<br>> MPI_Status status;<br>> MPI_Init( &argc, &argv );<br>> MPI_Comm_rank( MPI_COMM_WORLD, &myrank );<br>> if (myrank == 0) {<br>> int senddata = 1;<br>> MPI_Send(&senddata, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);<br>> }<br>>
else if (myrank == 1) {<br>> int recvdata = 0;<br>> MPI_Recv(&recvdata, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,<br>> &status);<br>> printf("received :%d:\n", recvdata);<br>> }<br>> MPI_Finalize();<br>> <br>> <br>> i got this error<br>> <br>> <br>> Assertion failed in file ch3_progress.c at line 489:<br>>
pkt->type >= 0 && pkt->type < MPIDI_NEM_PKT_END<br>> internal ABORT - process 1<br>> Fatal error in MPI_Finalize: Other MPI error, error stack:<br>> MPI_Finalize(315)..................: MPI_Finalize failed<br>> MPI_Finalize(207)..................: MPID_Finalize(92)..................: PMPI_Barrier(476)..................:<br>> MPI_Barrier(comm=0x44000002) failed<br>>
MPIR_Barrier(82)...................: MPIC_Sendrecv(164).................: MPIC_Wait(405).....................: MPIDI_CH3I_Progress(150)...........: MPID_nem_mpich2_blocking_recv(1074): MPID_nem_tcp_connpoll(1667)........: state_commrdy_handler(1517)........: MPID_nem_tcp_recv_handler(1413)....: socket closed<br>> <br>> ////////////////////////////////////////////////////////////////<br>>
<br>> I also tried examples/cpi, which comes with the install package<br>> -> the result is that the example program froze, without any<br>> errors. (I assume it stopped at MPI_Bcast().)<br>> <br>> Can anyone help me with this?<br>> This code and my program run smoothly when I use 1<br>> machine (with options -n 2, -n 4, etc.), but<br>> whenever I start mpdboot with 2 machines, MPI processes<br>> can't
communicate with other mpi processes via<br>> MPI_Send,MPI_Recv.<br>> <br>> thx,<br>> gra<br>> <br>> <br>> <br>> <br>> <br><br>-- Pavan Balaji<br><a href="http://www.mcs.anl.gov/~balaji" target="_blank">http://www.mcs.anl.gov/~balaji</a><br></div></blockquote></div></td></tr></table><br>