[MPICH] setup and testing of smpd and mpiexec for MPICH2 under WinXP
Kim Parnell
kim.parnell at mscsoftware.com
Fri Jan 11 13:20:04 CST 2008
Let me provide the last items first:
Can you provide us the output of "smpd -status -d" ? Can you try
"mpiexec -n 1 dir" ?
mpiexec -n 1 dir
hangs at the command prompt and does not produce a directory listing;
if I include -timeout 10, it times out instead
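Since several of these commands hang rather than fail, it can help to run each one under an external time limit so the console always comes back. The following is only a minimal sketch, assuming Python 3.7 or later is installed on the machine; it is not part of MPICH2, and the helper name run_with_timeout is made up for illustration.

```python
import subprocess


def run_with_timeout(cmd, seconds=10):
    """Run cmd (a list of strings) and return (exit_code, output).

    exit_code is None when the command had to be killed because it
    did not finish within `seconds`.
    """
    try:
        done = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # keep debug output and errors together
            timeout=seconds,
            text=True,
        )
        return done.returncode, done.stdout
    except subprocess.TimeoutExpired as exc:
        # Whatever was printed before the hang may still be useful.
        partial = exc.stdout or ""
        if isinstance(partial, bytes):
            partial = partial.decode(errors="replace")
        return None, partial
```

For example, run_with_timeout(["smpd", "-status"], 15) would return (None, ...) for the hang described here instead of blocking the prompt. Note that killing the command this way does not stop any smpd service that is already running.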
smpd -status -d
[-1:5056]..\smpd_get_opt_int
[-1:5056]../smpd_get_opt_int
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_int
[-1:5056]../smpd_get_opt_int
[-1:5056]..\smpd_get_opt_int
[-1:5056]../smpd_get_opt_int
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_hostname
[-1:5056]../smpd_get_hostname
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt_string
[-1:5056]../smpd_get_opt_string
[-1:5056]..\smpd_get_opt
[-1:5056]../smpd_get_opt
[-1:5056]..\smpd_do_console
^C to kill it (does not return to command prompt)
I just used "smpd -install" to restart the process after I had
killed it from the Task Manager.
I am not using the latest versions just to avoid possibly introducing
another variable with the MPICH2 applications that I ultimately want to
run. The version used is:
mpich2-1.0.3-1-win32-ia32.msi
There are not multiple versions, and I am running smpd and mpiexec from
the local directory.
Thanks!
________________________________
From: Jayesh Krishna [mailto:jayesh at mcs.anl.gov]
Sent: Friday, January 11, 2008 11:01 AM
To: Kim Parnell
Cc: mpich-discuss at mcs.anl.gov
Subject: RE: [MPICH] setup and testing of smpd and mpiexec for MPICH2
under WinXP
Hi,
Are you using the latest version of MPICH2 (1.0.6p1)? I would
recommend that you try installing the latest version of MPICH2 on your
machine. (Are there multiple versions of smpd/mpiexec on your machine ?
Check the PATH and make sure that you are running the right
smpd/mpiexec.)
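One way to carry out the PATH check is to list every copy of smpd and mpiexec that the PATH would resolve, in search order. This is a minimal sketch, assuming Python 3 is available; the function name find_on_path is hypothetical and nothing here comes from MPICH2 itself.

```python
import os


def find_on_path(name, path=None, exts=(".exe", "")):
    """Return every file named `name` (plus one of `exts`) along `path`.

    `path` defaults to the PATH environment variable; results keep
    the order in which the directories would be searched.
    """
    if path is None:
        path = os.environ.get("PATH", "")
    hits = []
    for folder in path.split(os.pathsep):
        if not folder:
            continue  # skip empty PATH entries
        for ext in exts:
            candidate = os.path.join(folder, name + ext)
            if os.path.isfile(candidate) and candidate not in hits:
                hits.append(candidate)
    return hits
```

Calling find_on_path("smpd") and find_on_path("mpiexec") and seeing more than one entry for either would suggest multiple installations. Keep in mind that the Windows command prompt looks in the current directory before the PATH, so a copy in the working directory wins regardless of what this lists.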
Is there a reason you chose to run "smpd -install" (instead of letting
the MPICH2 installer install SMPD as a service, which gets started when
you log on the next time)?
Can you provide us the output of "smpd -status -d" ? Can you try
"mpiexec -n 1 dir" ?
Regards,
Jayesh
________________________________
From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Kim Parnell
Sent: Friday, January 11, 2008 12:25 PM
To: mpich-discuss at mcs.anl.gov
Subject: [MPICH] setup and testing of smpd and mpiexec for MPICH2 under
WinXP
Windows XP SP2 machine with Dual Core CPU
I have an MPICH2 application that was working correctly and now does not
get submitted.
I am trying to go back and test the smpd and mpiexec installations and
hitting some problems that I do not understand.
smpd -install
starts an smpd process under user SYSTEM that seems to be running
smpd -status
starts another smpd process under my username but never returns a
status; basically just hangs at the command prompt
smpd -stop
Stopping MPICH2 Process Manager, Argonne National
Lab...................................
MPICH2 Process Manager, Argonne National Lab failed to stop.
I can kill the smpd process from the Windows Task Manager
I will put the Debug output from "smpd -d" below
mpiexec -validate -user USERNAME
hangs at the command prompt without any further output
mpiexec -register
prompts for a username and password, which I can register; this is
confirmed with "Password encrypted into the Registry."
a simple test like:
mpiexec -timeout 20 -verbose -n 1 ping
will just terminate due to the timeout
I will put the verbose output below under the smpd debug
What am I missing? Thanks for any assistance.
Regards,
Kim
smpd -install -d
[-1:5076]..\smpd_get_opt_int
[-1:5076]../smpd_get_opt_int
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt_int
[-1:5076]../smpd_get_opt_int
[-1:5076]..\smpd_get_opt_int
[-1:5076]../smpd_get_opt_int
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
MPICH2 Process Manager, Argonne National Lab removed.
[-1:5076]..\smpd_get_opt_string
[-1:5076]../smpd_get_opt_string
[-1:5076]..\smpd_get_win_opt_string
[-1:5076]../smpd_get_win_opt_string
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
[-1:5076]..\smpd_get_opt_string
[-1:5076]../smpd_get_opt_string
[-1:5076]..\smpd_get_opt
[-1:5076]../smpd_get_opt
MPICH2 Process Manager, Argonne National Lab installed.
[-1:5076]..\smpd_set_smpd_data
[-1:5076]../smpd_set_smpd_data
C:\MSC.Software\Marc\2007r1\marc2007r1\mpich2\bin>mpiexec -timeout 20
-verbose -n 1 ping
..\smpd_get_full_path_name
...fixing up exe name: 'ping' -> '(null)'
../smpd_get_full_path_name
..handling executable:
ping.exe
..\smpd_get_next_host
...\smpd_get_next_hostname
.../smpd_get_next_hostname
...\smpd_get_host_id
.../smpd_get_host_id
../smpd_get_next_host
..\smpd_create_cliques
...\next_launch_node
.../next_launch_node
...\next_launch_node
.../next_launch_node
../smpd_create_cliques
..\smpd_fix_up_host_tree
../smpd_fix_up_host_tree
./mp_parse_command_args
.host tree:
. host: l30544.na.mscsoftware.com, parent: 0, id: 1
.launch nodes:
. iproc: 0, id: 1, exe: ping.exe
.\smpd_create_context
..\smpd_init_context
...\smpd_init_command
.../smpd_init_command
../smpd_init_context
./smpd_create_context
.\smpd_make_socket_loop
..\smpd_get_hostname
../smpd_get_hostname
./smpd_make_socket_loop
.\smpd_create_context
..\smpd_init_context
...\smpd_init_command
.../smpd_init_command
../smpd_init_context
./smpd_create_context
.\smpd_enter_at_state
..sock_waiting for the next event.
..SOCK_OP_CONNECT
..\smpd_handle_op_connect
...connect succeeded, posting read of the challenge string
../smpd_handle_op_connect
..sock_waiting for the next event.
..
mpiexec terminated job due to 20 second timeout.
mpiexec -timeout 100 -verbose -n 1 ping
..\smpd_get_full_path_name
...fixing up exe name: 'ping' -> '(null)'
../smpd_get_full_path_name
..handling executable:
ping.exe
..\smpd_get_next_host
...\smpd_get_next_hostname
.../smpd_get_next_hostname
...\smpd_get_host_id
.../smpd_get_host_id
../smpd_get_next_host
..\smpd_create_cliques
...\next_launch_node
.../next_launch_node
...\next_launch_node
.../next_launch_node
../smpd_create_cliques
..\smpd_fix_up_host_tree
../smpd_fix_up_host_tree
./mp_parse_command_args
.host tree:
. host: l30544.na.xxx.com, parent: 0, id: 1
.launch nodes:
. iproc: 0, id: 1, exe: ping.exe
.\smpd_create_context
..\smpd_init_context
...\smpd_init_command
.../smpd_init_command
../smpd_init_context
./smpd_create_context
.\smpd_make_socket_loop
..\smpd_get_hostname
../smpd_get_hostname
./smpd_make_socket_loop
.\smpd_create_context
..\smpd_init_context
...\smpd_init_command
.../smpd_init_command
../smpd_init_context
./smpd_create_context
.\smpd_enter_at_state
..sock_waiting for the next event.
..SOCK_OP_CONNECT
..\smpd_handle_op_connect
...connect succeeded, posting read of the challenge string
../smpd_handle_op_connect
..sock_waiting for the next event.
..
mpiexec terminated job due to 100 second timeout.
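Both traces stop at the same place: the connection succeeds, mpiexec posts a read for the challenge string, and nothing arrives before the timeout. The same condition can be checked without mpiexec by connecting to the listener and waiting for the first bytes. This is a minimal sketch, assuming Python 3; probe_listener is a made-up helper, and the port to pass is an assumption to verify locally (I believe smpd listens on 8676 by default, but check your installation).

```python
import socket


def probe_listener(host, port, wait=5.0):
    """Connect to host:port and wait up to `wait` seconds for data.

    Returns a (status, payload) pair where status is one of:
      "unreachable" - the connection could not be opened
      "silent"      - connected, but nothing was sent in time
      "closed"      - the peer closed or reset without sending data
      "data"        - payload holds the first bytes received
    """
    try:
        sock = socket.create_connection((host, port), timeout=wait)
    except OSError:
        return "unreachable", b""
    with sock:
        sock.settimeout(wait)
        try:
            data = sock.recv(256)
        except socket.timeout:
            return "silent", b""
        except OSError:
            return "closed", b""
    if not data:
        return "closed", b""
    return "data", data
```

A "silent" result from probe_listener("localhost", 8676) would match the traces above: something accepts the connection but never sends anything. That would point at the smpd process rather than at mpiexec, though it does not say why smpd is stuck.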