locked
MS-MPI connection error RRS feed

  • Question

  • I install mpi 10.1.2 (https://www.microsoft.com/en-us/download/details.aspx?id=100593).

    However, when I use mpiexec to run program, I get some error:

    my commond:

    > (cmd window1) smpd -d 3

    > (cmd window2) mpiexec -d 3 -host localhost -n 1 hostname

    the error message:

    (cmd window1)

    C:\Users\Administrator>smpd -d 3
    [-1:5328] Launching SMPD service.
    [-1:5328] smpd listening on port 8677
    [-1:5328] Authentication completed. Successfully obtained Context for Client.
    [-1:5328] version check complete, using PMP version 4.
    [-1:5328] create manager process (using smpd daemon credentials)
    [-1:5328] Launching smpd as 'C:\Program Files\Microsoft MPI\Bin\smpd.exe "C:\Pro
    gram Files\Microsoft MPI\Bin\smpd.exe" -p 8677 -d 11 -mgr 568 "job"'
    [-1:5328] smpd reading the port string from the manager
    [-1:5292] Launching smpd manager instance.
    [-1:5292] created set for manager listener 116
    [-1:5292] smpd manager listening on port 6931
    [-1:5292] smpd will not assign child processes to job object
    [-1:5328] closing the pipe to the manager
    [-1:5292] Authentication completed. Successfully obtained Context for Client.
    [-1:5292] Authorization completed.
    [-1:5292] version check complete, using PMP version 4.
    [-1:5292] Received session header from parent id=1, parent=0, level=0
    [01:5292] Connecting back to parent using host BUAA-LYZ and endpoint 6929
    [01:5292] Previous attempt failed with error 5, trying to authenticate without K
    erberos
    [01:5292] Failed to connect back to parent error 5.
    [01:5292] ERROR: Failed to connect back to parent 'ncacn_ip_tcp:BUAA-LYZ:6929' e
    rror 5
    [01:5292] smpd manager successfully stopped listening.
    [01:5292] SMPD exiting with error code 4294967293.

    (cmd window2)

    C:\Users\Administrator>mpiexec -d 3 -host localhost -n 1 hostname
    [00:5804] host tree:
    [00:5804]  host: localhost, parent: 0, id: 1
    [00:5804] mpiexec started smpd manager listening on port 6929
    [00:5804] using spn msmpi/localhost to contact server
    [00:5804] BUAA-LYZ posting a re-connect to localhost:6931 in left child context.

    [00:5804] ERROR: Failed to connect to SMPD Manager Instance error 1726

    Aborting: mpiexec on BUAA-LYZ is unable to connect to the smpd manager on localh
    ost:6931 error 1726
    [00:5804] smpd manager successfully stopped listening.

    Wednesday, December 4, 2019 5:30 AM