[MPICH] mpich and pbs

Rajeev Thakur thakur at mcs.anl.gov
Wed Jun 20 12:47:22 CDT 2007


> However, now it appears that the program being run is in serial. 
> For example, an 8 cpu job gets stared on two nodes (each node
> has 4 cpu's - 2 dual core opterons). We see all 8 processes running on
> the nodes. But in looking at the output it appears like a 
> serial job. I get the same results trying to use vasp and amber. 

What do you mean by "it appears like a serial job"? Do you mean
performance-wise?

Try running the cpi example from the examples directory on 8 processes. If
you see 4 hostnames from 1 machine and 4 from the other, the job should be
running ok. It's up to the OS to schedule the 4 processes on each machine.
MPI doesn't do that.

Rajeev

   

> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Steve Young
> Sent: Wednesday, June 20, 2007 10:58 AM
> To: mpich-discuss at mcs.anl.gov
> Subject: [MPICH] mpich and pbs
> 
> Hello everyone,
> 	I still seem to be having an issue with getting mpich 
> to work properly.
> I have version mpich2-1.0.5 compiled. This works as expected 
> when I use
> mpiexec or mpirun. However, the nodes that jobs run on aren't in sync
> with the nodes that PBS allocates to the job. In posting to the list
> before I was informed to use the mpiexec from OSC that works 
> with PBS. I
> installed that and jobs now get started on the proper nodes that PBS
> allocates. However, now it appears that the program being run is in
> serial. For example, an 8 cpu job gets stared on two nodes (each node
> has 4 cpu's - 2 dual core opterons). We see all 8 processes running on
> the nodes. But in looking at the output it appears like a 
> serial job. I
> get the same results trying to use vasp and amber. So I'm not 
> sure what
> I could do to correct this. Any ideas?
> 
> -Steve
> 
> 
> 




More information about the mpich-discuss mailing list