[mpich-discuss] installing and using MPI_PUBLISH_NAME and MPI_LOOKUP_NAME with mpich2

Pavan Balaji balaji at mcs.anl.gov
Tue Jun 28 21:09:31 CDT 2011


Hi Bernard,

Sorry about the delayed response. Could you try the nightly snapshot
to see whether the problem still occurs? We fixed a few issues in this
area, which might be related.

Thanks,

  -- Pavan

On 06/20/2011 03:38 AM, Bernard Secher - SFME/LGLS wrote:
> The problem occurs after I launch only the first executable:
>
> mpiexec -np 2 -nameserver is209011 testMPI2
>
> Bernard
>
>
>
> Pavan Balaji wrote:
>>
>> Can you tell me the mpiexec command line you are using for the two
>> executables?
>>
>> -- Pavan
>>
>> On 06/17/2011 01:53 AM, Bernard Secher - SFME/LGLS wrote:
>>> Hi Pavan,
>>>
>>> I can run my application on a system with a shared file-system, and
>>> also on a system without one.
>>>
>>> The use case is coupling parallel programs, each running with more
>>> than 1 process.
>>>
>>> My test program uses the same executable for both applications, but in
>>> general they can be different executables.
>>>
>>> Bernard
>>>
>>> Pavan Balaji wrote:
>>>> Hi Bernard,
>>>>
>>>> You didn't answer my first question. Can you please let us know about
>>>> this:
>>>>
>>>> >> Do you have a shared file-system? If yes, you don't need to modify
>>>> >> anything. It should just work. If no, then read below:
>>>>
>>>> With respect to the test program, what's the exact usage? It looks
>>>> like it needs to be run with more than 1 process. You mentioned that
>>>> there are two applications? Are both using the same executable (with
>>>> different input arguments)?
>>>>
>>>> -- Pavan
>>>>
>>>> On 06/16/2011 04:12 AM, Bernard Secher - SFME/LGLS wrote:
>>>>> When I try to use hydra_nameserver, I get the following error message
>>>>> when I launch my application:
>>>>>
>>>>> [secher at is209011 essai]$ [mpiexec at is209011] HYDU_sock_read
>>>>> (/data/tmplgls/secher/mpich2-1.3.2p1-src/src/pm/hydra/utils/sock/sock.c:223):
>>>>> read errno (Input/output error)
>>>>> [mpiexec at is209011] control_cb
>>>>> (/data/tmplgls/secher/mpich2-1.3.2p1-src/src/pm/hydra/pm/pmiserv/pmiserv_cb.c:249):
>>>>> assert (!closed) failed
>>>>> [mpiexec at is209011] HYDT_dmxu_poll_wait_for_event
>>>>> (/data/tmplgls/secher/mpich2-1.3.2p1-src/src/pm/hydra/tools/demux/demux_poll.c:77):
>>>>> callback returned error status
>>>>> [mpiexec at is209011] HYD_pmci_wait_for_completion
>>>>> (/data/tmplgls/secher/mpich2-1.3.2p1-src/src/pm/hydra/pm/pmiserv/pmiserv_pmci.c:206):
>>>>> error waiting for event
>>>>> [mpiexec at is209011] main
>>>>> (/data/tmplgls/secher/mpich2-1.3.2p1-src/src/pm/hydra/ui/mpich/mpiexec.c:404):
>>>>> process manager error waiting for completion
>>>>>
>>>>> What is wrong?
>>>>>
>>>>> My application is just a little test program. I have attached it to
>>>>> this message.
>>>>>
>>>>> Bernard
>>>>>
>>>>> Pavan Balaji wrote:
>>>>>>
>>>>>> [please keep mpich-discuss cc'ed]
>>>>>>
>>>>>> On 06/16/2011 02:18 AM, Bernard Secher - SFME/LGLS wrote:
>>>>>>> How can I use the default publish mechanism? My publish/lookup is
>>>>>>> happening between two different applications. Is there an option
>>>>>>> to give to mpiexec for that?
>>>>>>
>>>>>> Do you have a shared file-system? If yes, you don't need to modify
>>>>>> anything. It should just work. If no, then read below:
>>>>>>
>>>>>>> If I use the pmi option, mpiexec -nameserver -help gives the
>>>>>>> following message:
>>>>>>> -nameserver: Nameserver to use for publish/lookup (format is host:port)
>>>>>>>
>>>>>>> I don't understand what I have to do with this option.
>>>>>>> First I launch hydra_nameserver. Do I need to give it any options?
>>>>>>> Do I then need to give any options to the two mpiexec commands
>>>>>>> that launch my two applications?
>>>>>>
>>>>>> There are no options required for hydra_nameserver. The two mpiexec's
>>>>>> will need the -nameserver {hostname} option, where {hostname} is the
>>>>>> name of the host where the nameserver is running.
>>>>>>
>>>>>> -- Pavan
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>         _\\|//_
>>>>>        (' 0 0 ')
>>>>> ____ooO  (_) Ooo______________________________________________________
>>>>>   Bernard Sécher  DEN/DM2S/SFME/LGLS    mailto :bsecher at cea.fr
>>>>>   CEA Saclay, Bât 454, Pièce 114        Phone  : 33 (0)1 69 08 73 78
>>>>>   91191 Gif-sur-Yvette Cedex, France    Fax    : 33 (0)1 69 08 10 87
>>>>> ------------Oooo---------------------------------------------------
>>>>>         oooO (   )
>>>>>         (   ) ) /
>>>>>          \ ( (_/
>>>>>           \_)
>>>>>
>>>>>
>>>>> This e-mail and any files transmitted with it are confidential and
>>>>> intended solely for the use of the individual to whom they are
>>>>> addressed.
>>>>> If you have received this e-mail in error please inform the sender
>>>>> immediately, without keeping any copy thereof.
>>>>>
>>>>
>>>
>>
>
>
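
For reference, the nameserver workflow described in this thread (start
hydra_nameserver with no options, then pass -nameserver {hostname} to each
mpiexec) can be sketched as a small shell script. This is only a sketch
under assumptions: the host name is209011 and the executable ./testMPI2 are
taken from the thread as placeholders, and the NS_HOST and DRY_RUN variables
and the mpiexec_cmd and run helpers are hypothetical names introduced here.
By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of the nameserver workflow from this thread.
# NS_HOST and the executable are placeholders; DRY_RUN=1 (the default)
# only prints the commands instead of running them.
NS_HOST="${NS_HOST:-is209011}"
DRY_RUN="${DRY_RUN:-1}"

# Print the mpiexec command line for one application.
# Usage: mpiexec_cmd NPROCS EXECUTABLE
mpiexec_cmd() {
    echo "mpiexec -np $1 -nameserver $NS_HOST $2"
}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$1"
    else
        eval "$1"
    fi
}

# Step 1 (on NS_HOST): start the name server in the background; no options.
run "hydra_nameserver &"

# Step 2: launch both applications against the same name server.
# The first runs in the background so the second can start alongside it.
run "$(mpiexec_cmd 2 ./testMPI2) &"
run "$(mpiexec_cmd 2 ./testMPI2)"
```

Setting DRY_RUN=0 would actually launch the processes. On a system without
a shared file-system, both mpiexec invocations need to name the host where
hydra_nameserver is running.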

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji

