locked
Problem - Aborting: smpd on host1 is unable to connect to the smpd service on host2:8677 RRS feed

  • Question

  • I have two PCs, one running Win10 pro and another running Win 10 home, they are connceted with a 5-ports netwrok switch.

    On each of the PCs, I created a user account with a same username and no password.

    Both pc installed MS-MPI(msmpisdk and MSMpiSetup) and can do MPI computing locally.

    I created a shared folder ”MPI_DIR" on host1 and the MPI parallel application is in it.

    Firewall on both PCs are disabled.

    First, I run "smpd -d" in cmd line on each host. Second, on host1 I input "mpiexec -hosts 2 host1 2 host2 2 host1\\C:\MPI_DIR\MPI_TEST.exe" in a new cmd line window, but it shows the error message "Aborting: smpd on host1 is unable to connect to the smpd service on host2:8677“. 

    Could someone help me with the problem??


    W.Zhou


    • Edited by CLR_cn Thursday, June 15, 2017 11:33 PM
    Thursday, June 15, 2017 12:05 PM

All replies

  • Hi,

    What is the version of MS-MPI that you're using? You can run mpiexec -help and the very first line should list the version number

    Can you run the following steps and provide us the output

    1) Run "smpd -d 3" on host2

    2) On host1 run "mpiexec -d 3 -host host2 mpi_test.exe"

    Anh

    Thursday, June 15, 2017 1:28 PM
  • Hi,

    What is the version of MS-MPI that you're using? You can run mpiexec -help and the very first line should list the version number

    Can you run the following steps and provide us the output

    1) Run "smpd -d 3" on host2

    2) On host1 run "mpiexec -d 3 -host host2 mpi_test.exe"

    Anh

    Thanks!  The version of MS-MPI is "8.0.12438.0"

    The output on host1 is as follows:

    C:\MPI_DIR>mpiexec -d 3 -host host2 MPI_TEST.exe
    [00:10612] host tree:
    [00:10612]  host: host2, parent: 0, id: 1
    [00:10612] mpiexec started smpd manager listening on port 3914
    [00:10612] using spn msmpi/host2 to contact server
    [00:10612] Previous attempt failed with error 5, trying to authenticate without Kerberos
    [00:10612] ERROR: Failed RpcCliCreateContext error 5
    Aborting: mpiexec on host1 is unable to connect to the smpd service on host2:8677
    Other MPI error, error stack:
    connect failed - 鎷掔粷璁块棶銆? (errno 5)
    [00:10612] smpd manager successfully stopped listening.

    The output on host2 is just "Launching SMPD services" and "listening on port 8677"


    W.Zhou

    Thursday, June 15, 2017 2:26 PM
  • If you use the IP address of host2 does it work? 

    Thursday, June 15, 2017 4:30 PM
  • If you use the IP address of host2 does it work? 


    It doesn't work, and it shows the same error message.

    W.Zhou

    Thursday, June 15, 2017 11:23 PM
  • Can you provide the output of 

    1) ipconfig /all for both host1 and host2

    2) the content of the hosts file (it is in %windir%\system32\drivers\etc\hosts) for both host1 and host2

    3) If you are to revert the direction of the test (i.e., start "smpd -d 3" on host1, and run "mpiexec -d 3 -host host1..." on host2, does the same issue happen?

    If the output is too long or not convenient for you to post them in the forum - please send an email to askmpi at Microsoft dot com with the provided output and include a link to this post and we'll be able to continue the investigation there

    Thanks

    Anh

    Thursday, June 15, 2017 11:49 PM
  • Can you provide the output of 

    1) ipconfig /all for both host1 and host2

    2) the content of the hosts file (it is in %windir%\system32\drivers\etc\hosts) for both host1 and host2

    3) If you are to revert the direction of the test (i.e., start "smpd -d 3" on host1, and run "mpiexec -d 3 -host host1..." on host2, does the same issue happen?

    If the output is too long or not convenient for you to post them in the forum - please send an email to askmpi at Microsoft dot com with the provided output and include a link to this post and we'll be able to continue the investigation there

    Thanks

    Anh

    Thank you!  I have fixed the problem by setting a password on each of the hosts.  Maybe the MS-MPI doesn't allow account with no password.

    W.Zhou

    Friday, June 16, 2017 11:37 AM
  • Can you provide the output of 

    1) ipconfig /all for both host1 and host2

    2) the content of the hosts file (it is in %windir%\system32\drivers\etc\hosts) for both host1 and host2

    3) If you are to revert the direction of the test (i.e., start "smpd -d 3" on host1, and run "mpiexec -d 3 -host host1..." on host2, does the same issue happen?

    If the output is too long or not convenient for you to post them in the forum - please send an email to askmpi at Microsoft dot com with the provided output and include a link to this post and we'll be able to continue the investigation there

    Thanks

    Anh

    Thank you!  I have fixed the problem by setting a password on each of the hosts.  Maybe the MS-MPI doesn't allow account with no password.

    W.Zhou

    I used to run msmpi normally,but i don't know why i have the same problem.Can you tell me specific steps to solve this problem.Thank you.

    Q.Sun

    Tuesday, July 30, 2019 12:16 PM