Very slow initialisation

  • Question

  • Hello,

       We are currently moving from MPICH2 to MS-MPI. The port has been quite simple, but initialisation of the computation is much slower with MS-MPI (about 30 s, regardless of the number of cores used) than it was with MPICH2 (<1 s). After that, the computation runs fine.

       The problem can be reproduced with the sample program "fpi.f90" from MPICH. The log is below.

       Has anyone encountered a similar issue? I am using MS-MPI v7.

    Thanks,

      Guillaume

    C:\drsvn\testMPI\x64\Release_msmpi>mpiexec -debug 2 -n 2 fpi.exe

    ------------------------------------------

    [00:9400] host tree:
    [00:9400]  host: WW7SMS01DRGDH, parent: 0, id: 1
    [00:9400] mpiexec started smpd manager listening on port 60642
    [00:9400] create manager process (using mpiexec credentials)
    [00:9400] Command C:\Program Files\Microsoft MPI\\bin\smpd.exe "C:\Program Files\Microsoft MPI\\bin\smpd.exe" -p 8677 -d 10 -mgr 420 "job" -localonly
    [00:9400] smpd reading the port string from the manager
    [-1:12700] Launching smpd manager instance.
    [-1:12700] created set for manager listener, 108
    [-1:12700] smpd manager listening on port 60643
    [-1:12700] manager writing port back to smpd.
    [00:9400] closing the pipe to the manager
    [00:9400] WW7SMS01DRGDH posting a re-connect to WW7SMS01DRGDH:60643 in left child context.

    -----> Here is the 30s waiting time

    [-1:12700] Authentication completed. Successfully obtained Context for Client.
    [-1:12700] Authorization completed.
    [-1:12700] version check complete, using PMP version 2.
    [-1:12700] Received session header from parent id=1, parent=0, level=0
    [01:12700] Connecting back to parent using host 10.67.6.54 and endpoint 60642
    [00:9400] Authentication completed. Successfully obtained Context for Client.
    [00:9400] Authorization completed.

    -----------------------------------------------------------------------------


    Friday, January 8, 2016 3:32 PM

Answers

  • Hi Guillaume,

    Thanks for providing the log. Looking at the IP configuration and the log, my guess is that this might be due to a DNS resolution issue. Your connection has several possible DNS suffixes:

    DNS Suffix Search List. . . . . . : bvcorp.corp
                                        eua.bvcorp.corp
                                        bureauveritas.com
                                        fr.bureauveritas.com
                                        ho.bureauveritas.com

    Here are a few things to try; it would be great if you could get back to us with the results:

    1) Can you try MS-MPI v6 and see whether the issue persists? You can get v6 from this link: http://www.microsoft.com/en-us/download/details.aspx?id=47259

    2) Can you try running smpd -d 3 and then mpiexec -host 10.67.6.31 (the IP address of the box)?

    3) I am not sure which DNS suffix your machine name resolves to. If you ping WW7SMS01DRGDH, what does the fully qualified domain name (FQDN) for the machine look like? Can you try passing the FQDN to the -host option?

    Thanks

    Anh

    • Marked as answer by Guillaume_dh Tuesday, February 23, 2016 9:21 AM
    Wednesday, January 20, 2016 5:35 PM
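A rough way to follow up on point 3 is to time how long each candidate name takes to resolve. The sketch below is an illustration in Python and not part of MS-MPI: `fqdn_candidates`, `time_resolution` and `report` are hypothetical helper names, and it assumes that a slow or failing lookup for the short machine name would also show up as a long elapsed time in an ordinary `getaddrinfo` call.

```python
import socket
import time


def fqdn_candidates(host, suffixes):
    """Return the fully qualified names obtained by appending each DNS suffix."""
    return [f"{host}.{suffix}" for suffix in suffixes]


def time_resolution(name):
    """Resolve `name` once and return (seconds, sorted addresses or error text)."""
    start = time.perf_counter()
    try:
        infos = socket.getaddrinfo(name, None)
        # sockaddr[0] is the address string for both IPv4 and IPv6 entries.
        result = sorted({info[4][0] for info in infos})
    except OSError as exc:
        result = f"lookup failed: {exc}"
    return time.perf_counter() - start, result


def report(names):
    """Print one line per name with the elapsed time and the lookup result."""
    for name in names:
        seconds, result = time_resolution(name)
        print(f"{name:45s} {seconds:6.2f}s  {result}")
```

For example, `report(["localhost", "WW7SMS01DRGDH"] + fqdn_candidates("WW7SMS01DRGDH", ["bvcorp.corp", "eua.bvcorp.corp"]))` prints one line per name. A name that resolves quickly would be a reasonable value to try with the -host option.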

All replies

  • This might happen if you have multiple active network interfaces on your machine but only one of them allows the TCP/IP traffic that mpiexec and smpd use to communicate. Can you paste the output of "ipconfig /all"?

    Another user previously had a similar problem. I'm not sure whether this is the same issue, but the symptom does look familiar:

    https://social.microsoft.com/Forums/en-US/31348586-8665-4e22-8bcf-1b5ca382f7f2/mpireduce-is-blocking-for-minutes-in-msmpi-v7?forum=windowshpcmpi

    Anh


    Friday, January 8, 2016 9:34 PM
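One quick check for the multiple-interface theory is to see which addresses the machine's own name resolves to. The following Python sketch is only an illustration (the helper name `addresses_by_family` is made up here). It assumes that extra active interfaces would show up as extra addresses for the host name, which may not hold on every setup.

```python
import socket


def addresses_by_family(host):
    """Map 'IPv4' / 'IPv6' to the sorted addresses that `host` resolves to."""
    labels = {socket.AF_INET: "IPv4", socket.AF_INET6: "IPv6"}
    found = {}
    for family, _type, _proto, _canon, sockaddr in socket.getaddrinfo(host, None):
        label = labels.get(family)
        if label is not None:
            # sockaddr[0] holds the textual address for both families.
            found.setdefault(label, set()).add(sockaddr[0])
    return {label: sorted(addrs) for label, addrs in found.items()}
```

Calling `addresses_by_family(socket.gethostname())` on the affected machine and comparing the result with the "ipconfig /all" output would show whether the name maps to more than one IPv4 address.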
  • Thanks for your help.

      Below is the output from ipconfig /all (I did have an additional "virtual box" adapter; I removed it, but that did not solve the issue). I also tried "-env MPICH_NETMASK=10.67.6.31/255.255.255.255" to limit the network to my IP, without success.

    Regards,

      Guillaume

    --------------------------------------------------------------------------------------

    Windows IP Configuration

       Host Name . . . . . . . . . . . . : WW7SMS01DRGDH
       Primary Dns Suffix  . . . . . . . : eua.bvcorp.corp
       Node Type . . . . . . . . . . . . : Hybrid
       IP Routing Enabled. . . . . . . . : No
       WINS Proxy Enabled. . . . . . . . : No
       DNS Suffix Search List. . . . . . : bvcorp.corp
                                           eua.bvcorp.corp
                                           bureauveritas.com
                                           fr.bureauveritas.com
                                           ho.bureauveritas.com

    Ethernet adapter Local Area Connection:

       Connection-specific DNS Suffix  . : eua.bvcorp.corp
       Description . . . . . . . . . . . : Intel(R) 82579LM Gigabit Network Connection
       Physical Address. . . . . . . . . : 18-03-73-3D-B2-54
       DHCP Enabled. . . . . . . . . . . : Yes
       Autoconfiguration Enabled . . . . : Yes
       Link-local IPv6 Address . . . . . : fe80::dde0:3a9b:6ee0:f8f1%11(Preferred)
       IPv4 Address. . . . . . . . . . . : 10.67.6.31(Preferred)
       Subnet Mask . . . . . . . . . . . : 255.255.255.0
       Lease Obtained. . . . . . . . . . : Tuesday, January 19, 2016 11:01:59
       Lease Expires . . . . . . . . . . : Thursday, January 21, 2016 11:01:58
       Default Gateway . . . . . . . . . : 10.67.6.1
       DHCP Server . . . . . . . . . . . : 10.67.24.31
       DHCPv6 IAID . . . . . . . . . . . : 236454771
       DHCPv6 Client DUID. . . . . . . . : 00-01-00-01-18-07-87-CD-18-03-73-3D-B2-54
       DNS Servers . . . . . . . . . . . : 10.67.24.31
                                           10.64.16.201
                                           10.64.32.8
       NetBIOS over Tcpip. . . . . . . . : Enabled

    Tunnel adapter Local Area Connection* 12:

       Media State . . . . . . . . . . . : Media disconnected
       Connection-specific DNS Suffix  . :
       Description . . . . . . . . . . . : Teredo Tunneling Pseudo-Interface
       Physical Address. . . . . . . . . : 00-00-00-00-00-00-00-E0
       DHCP Enabled. . . . . . . . . . . : No
       Autoconfiguration Enabled . . . . : Yes

    Tunnel adapter isatap.eua.bvcorp.corp:

       Media State . . . . . . . . . . . : Media disconnected
       Connection-specific DNS Suffix  . : eua.bvcorp.corp
       Description . . . . . . . . . . . : Microsoft ISATAP Adapter #3
       Physical Address. . . . . . . . . : 00-00-00-00-00-00-00-E0
       DHCP Enabled. . . . . . . . . . . : No
       Autoconfiguration Enabled . . . . : Yes

    --------------------------------------------------------------------------------------

    Tuesday, January 19, 2016 1:52 PM
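To make the netmask attempt above concrete, here is a small Python sketch of the subnet arithmetic only. It does not show how MS-MPI itself interprets MPICH_NETMASK, and the helper name `addresses_allowed` is hypothetical. The sample addresses are taken from this thread: 10.67.6.31 from the ipconfig output, 10.67.6.54 from the first log in the question, plus the loopback address.

```python
import ipaddress


def addresses_allowed(netmask_spec, addresses):
    """Return the addresses that fall inside an 'address/mask' specification."""
    # strict=False tolerates host bits in the address part,
    # e.g. 10.67.6.31/255.255.255.0 is treated as the 10.67.6.0/24 network.
    network = ipaddress.ip_network(netmask_spec, strict=False)
    return [a for a in addresses if ipaddress.ip_address(a) in network]


seen = ["10.67.6.31", "10.67.6.54", "127.0.0.1"]
print(addresses_allowed("10.67.6.31/255.255.255.255", seen))
print(addresses_allowed("10.67.6.31/255.255.255.0", seen))
```

A mask of 255.255.255.255 matches exactly one address, so such a filter would stop matching the machine if its DHCP lease changed to a different address.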
  • Hi Guillaume,

    Can you run the following and send us the output?

    1) Start a console and run "smpd -d 3"

    2) Start another console and run mpiexec -host WW7SMS01DRGDH -d 3 -n 2 fpi.exe

    If you repeat the experiment above, but replace the -host WW7SMS01DRGDH with -host localhost, does the issue go away?

    Thanks

    • Marked as answer by Guillaume_dh Tuesday, February 23, 2016 9:20 AM
    Tuesday, January 19, 2016 11:46 PM
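The comparison suggested above can also be scripted so that the two runs are timed the same way. The Python sketch below is an assumption-laden illustration: it uses only the mpiexec options quoted in this thread (-host, -d, -n), expects "smpd -d 3" to be running already in another console as in step 1, and sends "0" on standard input because fpi.exe prompts for a number of intervals and quits on 0. The helper names `mpiexec_command` and `time_command` are hypothetical.

```python
import subprocess
import time


def mpiexec_command(host, program="fpi.exe", ranks=2, debug_level=3):
    """Build the mpiexec command line used in the experiment above."""
    return ["mpiexec", "-host", host, "-d", str(debug_level), "-n", str(ranks), program]


def time_command(command, stdin_data=b"0\n"):
    """Run `command`, discard its output, and return (seconds, exit code)."""
    start = time.perf_counter()
    completed = subprocess.run(
        command,
        input=stdin_data,            # answers the "number of intervals" prompt
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return time.perf_counter() - start, completed.returncode
```

Running `time_command(mpiexec_command("WW7SMS01DRGDH"))` and `time_command(mpiexec_command("localhost"))` from the directory that contains fpi.exe would give two wall-clock times to compare. A difference of roughly 30 s would match the delay described in the question.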
  • Hi Anh,

      Below are the logs. "-host localhost" does indeed make the problem disappear (I thought it was the default when no host was provided).

    Thanks,

      Guillaume

    smpd log 1
    
    C:\Users\gdehauteclocque.EUA>"C:\Program Files\Microsoft MPI\Bin\smpd.exe" -d 3
    [-1:9716] Launching SMPD service.
    [-1:9716] smpd listening on port 8677
    [-1:9716] Authentication completed. Successfully obtained Context for Client.
    [-1:9716] version check complete, using PMP version 2.
    [-1:9716] create manager process (using smpd daemon credentials)
    [-1:9716] smpd reading the port string from the manager
    [-1:4528] Launching smpd manager instance.
    [-1:4528] created set for manager listener, 108
    [-1:4528] smpd manager listening on port 54931
    [-1:4528] manager writing port back to smpd.
    [-1:9716] closing the pipe to the manager
    [-1:4528] Authentication completed. Successfully obtained Context for Client.
    [-1:4528] Authorization completed.
    [-1:4528] version check complete, using PMP version 2.
    [-1:4528] Received session header from parent id=1, parent=0, level=0
    [01:4528] Connecting back to parent using host 10.67.6.31 and endpoint 54912
    [01:4528] Authentication completed. Successfully obtained Context for Client.
    [01:4528] Authorization completed.
    [01:4528] handling command SMPD_COLLECT src=0
    [01:4528] handling command SMPD_STARTDBS src=0
    [01:4528] sending start_dbs result command kvs = 450200b4-b31d-4dc8-9498-4d8fc1fec0d1.
    [01:4528] handling command SMPD_LAUNCH src=0
    [01:4528] Successfully handled bcast nodeids command.
    [01:4528] setting environment variable: <MPIEXEC_HOSTNAME> = <WW7SMS01DRGDH>
    [01:4528] env: PMI_SIZE=2
    [01:4528] env: PMI_KVS=450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [01:4528] env: PMI_DOMAIN=5b13f1b3-a3ff-481a-9299-c5453221af3c
    [01:4528] env: PMI_HOST=localhost
    [01:4528] env: PMI_PORT=4f52d924-5648-49f0-9020-4a2c3c84d0a9
    [01:4528] env: PMI_SMPD_ID=1
    [01:4528] env: PMI_APPNUM=0
    [01:4528] env: PMI_NODE_IDS=s
    [01:4528] env: PMI_RANK_AFFINITIES=a
    [01:4528] searching for 'fpi.exe' in workdir 'C:\drsvn\testMPI\x64\Release_msmpi'
    [01:4528] C:\drsvn\testMPI\x64\Release_msmpi>CreateProcess(C:\drsvn\testMPI\x64\Release_msmpi\fpi.exe fpi.exe)
    [01:4528] env: PMI_RANK=1
    [01:4528] env: PMI_SMPD_KEY=0
    [01:4528] C:\drsvn\testMPI\x64\Release_msmpi>CreateProcess(C:\drsvn\testMPI\x64\Release_msmpi\fpi.exe fpi.exe)
    [01:4528] env: PMI_RANK=0
    [01:4528] env: PMI_SMPD_KEY=1
    [01:4528] Authentication completed. Successfully obtained Context for Client.
    [01:4528] Authentication completed. Successfully obtained Context for Client.
    [01:4528] Authorization completed.
    [01:4528] version check complete, using PMP version 2.
    [01:4528] Authorization completed.
    [01:4528] 1 -> 0 : returning parent_context: 0 < 1
    [01:4528] forwarding command SMPD_INIT to 0
    [01:4528] version check complete, using PMP version 2.
    [01:4528] posting command SMPD_INIT to parent, src=1, ctx_key=0, dest=0.
    [01:4528] 1 -> 0 : returning parent_context: 0 < 1
    [01:4528] forwarding command SMPD_INIT to 0
    [01:4528] posting command SMPD_INIT to parent, src=1, ctx_key=1, dest=0.
    [01:4528] Handling cmd=SMPD_INIT result
    [01:4528] forward SMPD_INIT result to dest=1 ctx_key=0
    [01:4528] Handling cmd=SMPD_INIT result
    [01:4528] forward SMPD_INIT result to dest=1 ctx_key=1
    [01:4528] handling command SMPD_BCPUT src=1 ctx_key=0
    [01:4528] Handling SMPD_BCPUT command from smpd 1
            ctx_key=0
            rank=1
            value=port=54943 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=10864:248
            result=success
    [01:4528] handling command SMPD_BCPUT src=1 ctx_key=1
    [01:4528] Handling SMPD_BCPUT command from smpd 1
            ctx_key=1
            rank=0
            value=port=54944 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=10848:248
            result=success
    [01:4528] handling command SMPD_BARRIER src=1 ctx_key=0
    [01:4528] Handling SMPD_BARRIER src=1 ctx_key=0
    [01:4528] initializing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1): in=1 size=2
    [01:4528] incrementing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1) incount from 0 to 1 out of 2
    [01:4528] handling command SMPD_BARRIER src=1 ctx_key=1
    [01:4528] Handling SMPD_BARRIER src=1 ctx_key=1
    [01:4528] incrementing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1) incount from 1 to 2 out of 2
    [01:4528] all in barrier, release the barrier.
    [01:4528] sending reply to barrier command '450200b4-b31d-4dc8-9498-4d8fc1fec0d1'.
    [01:4528] sending reply to barrier command '450200b4-b31d-4dc8-9498-4d8fc1fec0d1'.
    [01:4528] read 50 bytes from stdout
    [01:4528] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:4528] read 50 bytes from stdout
    [01:4528] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:4528] Handling cmd=SMPD_STDOUT result
    [01:4528] cmd=SMPD_STDOUT result will be handled locally
    [01:4528] read 42 bytes from stdout
    [01:4528] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:4528] Handling cmd=SMPD_STDOUT result
    [01:4528] cmd=SMPD_STDOUT result will be handled locally
    [01:4528] Handling cmd=SMPD_STDOUT result
    [01:4528] cmd=SMPD_STDOUT result will be handled locally
    [01:4528] handling command SMPD_STDIN src=0
    [01:4528] wrote 3 bytes to stdin of rank 0
    [01:4528] handling command SMPD_BCGET src=1 ctx_key=1
    [01:4528] Handling SMPD_BCGET command from smpd 1
            ctx_key=1
            rank=1
            value=port=54943 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=10864:248
            result=success
    [01:4528] handling command SMPD_BCGET src=1 ctx_key=0
    [01:4528] Handling SMPD_BCGET command from smpd 1
            ctx_key=0
            rank=0
            value=port=54944 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=10848:248
            result=success
    [01:4528] handling command SMPD_BCGET src=1 ctx_key=0
    [01:4528] Handling SMPD_BCGET command from smpd 1
            ctx_key=0
            rank=0
            value=port=54944 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=10848:248
            result=success
    [01:4528] 1 -> 0 : returning parent_context: 0 < 1
    [01:4528] 1 -> 0 : returning parent_context: 0 < 1
    [01:4528] forwarding command SMPD_FINALIZE to 0
    [01:4528] forwarding command SMPD_FINALIZE to 0
    [01:4528] posting command SMPD_FINALIZE to parent, src=1, ctx_key=1, dest=0.
    [01:4528] posting command SMPD_FINALIZE to parent, src=1, ctx_key=0, dest=0.
    [01:4528] Handling cmd=SMPD_FINALIZE result
    [01:4528] forward SMPD_FINALIZE result to dest=1 ctx_key=0
    [01:4528] handling command SMPD_BARRIER src=1 ctx_key=0
    [01:4528] Handling SMPD_BARRIER src=1 ctx_key=0
    [01:4528] initializing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1): in=1 size=2
    [01:4528] incrementing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1) incount from 0 to 1 out of 2
    [01:4528] Handling cmd=SMPD_FINALIZE result
    [01:4528] forward SMPD_FINALIZE result to dest=1 ctx_key=1
    [01:4528] handling command SMPD_BARRIER src=1 ctx_key=1
    [01:4528] Handling SMPD_BARRIER src=1 ctx_key=1
    [01:4528] incrementing barrier(450200b4-b31d-4dc8-9498-4d8fc1fec0d1) incount from 1 to 2 out of 2
    [01:4528] all in barrier, release the barrier.
    [01:4528] sending reply to barrier command '450200b4-b31d-4dc8-9498-4d8fc1fec0d1'.
    [01:4528] sending reply to barrier command '450200b4-b31d-4dc8-9498-4d8fc1fec0d1'.
    [01:4528] process_id=0 process refcount == 2, pmi client closed.
    [01:4528] process_id=1 process refcount == 2, pmi client closed.
    [01:4528] reading failed, assuming stdout is closed. error 0xc000014b
    [01:4528] process_id=0 process refcount == 1, stdout closed.
    [01:4528] reading failed, assuming stderr is closed. error 0xc000014b
    [01:4528] process_id=0 process refcount == 0, stderr closed.
    [01:4528] process_id=0 process refcount == 0, waiting for the process to finish exiting.
    [01:4528] creating an exit command for rank 1, pid 10864, exit code 0.
    [01:4528] posting command SMPD_EXIT to parent, src=1, dest=0.
    [01:4528] reading failed, assuming stdout is closed. error 0xc000014b
    [01:4528] process_id=1 process refcount == 1, stdout closed.
    [01:4528] reading failed, assuming stderr is closed. error 0xc000014b
    [01:4528] process_id=1 process refcount == 0, stderr closed.
    [01:4528] process_id=1 process refcount == 0, waiting for the process to finish exiting.
    [01:4528] creating an exit command for rank 0, pid 10848, exit code 0.
    [01:4528] posting command SMPD_EXIT to parent, src=1, dest=0.
    [01:4528] Handling cmd=SMPD_EXIT result
    [01:4528] cmd=SMPD_EXIT result will be handled locally
    [01:4528] Handling cmd=SMPD_EXIT result
    [01:4528] cmd=SMPD_EXIT result will be handled locally
    [01:4528] handling command SMPD_CLOSE src=0
    [01:4528] sending 'closed' command to parent context
    [01:4528] posting command SMPD_CLOSED to parent, src=1, dest=0.
    [01:4528] Handling cmd=SMPD_CLOSED result
    [01:4528] cmd=SMPD_CLOSED result will be handled locally
    [01:4528] smpd manager successfully stopped listening.
    [01:4528] SMPD exiting with error code 0.
    

    mpiexec log 1
    C:\drsvn\testMPI\x64\Release_msmpi>"C:\Program Files\Microsoft MPI\Bin\mpiexec" -host WW7SMS01DRGDH -d 3 -n 2 fpi.exe
    [00:5008] host tree:
    [00:5008]  host: WW7SMS01DRGDH, parent: 0, id: 1
    [00:5008] mpiexec started smpd manager listening on port 54912
    [00:5008] WW7SMS01DRGDH posting a re-connect to WW7SMS01DRGDH:54931 in left child context.
    [00:5008] Authentication completed. Successfully obtained Context for Client.
    [00:5008] Authorization completed.
    [00:5008] version check complete, using PMP version 2.
    [00:5008] posting command SMPD_COLLECT to left child, src=0, dest=1.
    [00:5008] Handling cmd=SMPD_COLLECT result
    [00:5008] cmd=SMPD_COLLECT result will be handled locally
    [00:5008] Finished collecting hardware summary.
    [00:5008] posting command SMPD_STARTDBS to left child, src=0, dest=1.
    [00:5008] Handling cmd=SMPD_STARTDBS result
    [00:5008] cmd=SMPD_STARTDBS result will be handled locally
    [00:5008] start_dbs succeeded, kvs_name: '450200b4-b31d-4dc8-9498-4d8fc1fec0d1', domain_name: '5b13f1b3-a3ff-481a-9299-c5453221af3c'
    [00:5008] creating a process group of size 2 on node 0 called 450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [00:5008] launching the processes.
    [00:5008] posting command SMPD_LAUNCH to left child, src=0, dest=1.
    [00:5008] Handling cmd=SMPD_LAUNCH result
    [00:5008] cmd=SMPD_LAUNCH result will be handled locally
    [00:5008] successfully launched process 1
    [00:5008] successfully launched process 0
    [00:5008] root process launched, starting stdin redirection.
    [00:5008] Authentication completed. Successfully obtained Context for Client.
    [00:5008] Authorization completed.
    [00:5008] handling command SMPD_INIT src=1 ctx_key=0
    [00:5008] init: 1:2:450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [00:5008] handling command SMPD_INIT src=1 ctx_key=1
    [00:5008] init: 0:2:450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [00:5008] handling command SMPD_STDOUT src=1
    [00:5008] Handling SMPD_STDOUT
    [00:5008] Decoding stdout/stderr buffer 2050726F636573732020202020202020202020203120206F66202020202020202020202020322020697320616C6976650D0A
     Process            1  of            2  is alive
    [00:5008] handling command SMPD_STDOUT src=1
    [00:5008] Handling SMPD_STDOUT
    [00:5008] Decoding stdout/stderr buffer 2050726F636573732020202020202020202020203020206F66202020202020202020202020322020697320616C6976650D0A
     Process            0  of            2  is alive
    [00:5008] handling command SMPD_STDOUT src=1
    [00:5008] Handling SMPD_STDOUT
    [00:5008] Decoding stdout/stderr buffer 456E74657220746865206E756D626572206F6620696E74657276616C733A202830207175697473290D0A
    Enter the number of intervals: (0 quits)
    0
    [00:5008] posting command SMPD_STDIN to left child, src=0, dest=1.
    [00:5008] Handling cmd=SMPD_STDIN result
    [00:5008] cmd=SMPD_STDIN result will be handled locally
    [00:5008] Authentication completed. Successfully obtained Context for Client.
    [00:5008] Authorization completed.
    [00:5008] handling command SMPD_FINALIZE src=1 ctx_key=0
    [00:5008] finalize: 1:450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [00:5008] handling command SMPD_FINALIZE src=1 ctx_key=1
    [00:5008] finalize: 0:450200b4-b31d-4dc8-9498-4d8fc1fec0d1
    [00:5008] handling command SMPD_EXIT src=1
    [00:5008] saving exit code: rank 1, exitcode 0, pg <450200b4-b31d-4dc8-9498-4d8fc1fec0d1>
    [00:5008] handling command SMPD_EXIT src=1
    [00:5008] saving exit code: rank 0, exitcode 0, pg <450200b4-b31d-4dc8-9498-4d8fc1fec0d1>
    [00:5008] last process exited, tearing down the job tree.
    [00:5008] posting command SMPD_CLOSE to left child, src=0, dest=1.
    [00:5008] Handling cmd=SMPD_CLOSE result
    [00:5008] cmd=SMPD_CLOSE result will be handled locally
    [00:5008] handling command SMPD_CLOSED src=1
    [00:5008] closed command received from left child.
    [00:5008] smpd manager successfully stopped listening.
    
    C:\drsvn\testMPI\x64\Release_msmpi>
    
    smpd log 2
    C:\Users\gdehauteclocque.EUA>"C:\Program Files\Microsoft MPI\Bin\smpd.exe" -d 3
    [-1:4700] Launching SMPD service.
    [-1:4700] smpd listening on port 8677
    [-1:4700] Authentication completed. Successfully obtained Context for Client.
    [-1:4700] version check complete, using PMP version 2.
    [-1:4700] create manager process (using smpd daemon credentials)
    [-1:4700] smpd reading the port string from the manager
    [-1:11080] Launching smpd manager instance.
    [-1:11080] created set for manager listener, 108
    [-1:11080] smpd manager listening on port 54997
    [-1:11080] manager writing port back to smpd.
    [-1:4700] closing the pipe to the manager
    [-1:11080] Authentication completed. Successfully obtained Context for Client.
    [-1:11080] Authorization completed.
    [-1:11080] version check complete, using PMP version 2.
    [-1:11080] Received session header from parent id=1, parent=0, level=0
    [01:11080] Connecting back to parent using host 127.0.0.1 and endpoint 54995
    [01:11080] Authentication completed. Successfully obtained Context for Client.
    [01:11080] Authorization completed.
    [01:11080] handling command SMPD_COLLECT src=0
    [01:11080] handling command SMPD_STARTDBS src=0
    [01:11080] sending start_dbs result command kvs = af26e0a6-3228-4fd3-9661-58d9f7849c0b.
    [01:11080] handling command SMPD_LAUNCH src=0
    [01:11080] Successfully handled bcast nodeids command.
    [01:11080] setting environment variable: <MPIEXEC_HOSTNAME> = <WW7SMS01DRGDH>
    [01:11080] env: PMI_SIZE=2
    [01:11080] env: PMI_KVS=af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [01:11080] env: PMI_DOMAIN=6793b38d-a234-414a-a434-718ccfc3ed9e
    [01:11080] env: PMI_HOST=localhost
    [01:11080] env: PMI_PORT=0ec7822b-9b7e-4cf7-a32d-f211a37c3ac7
    [01:11080] env: PMI_SMPD_ID=1
    [01:11080] env: PMI_APPNUM=0
    [01:11080] env: PMI_NODE_IDS=s
    [01:11080] env: PMI_RANK_AFFINITIES=a
    [01:11080] searching for 'fpi.exe' in workdir 'C:\drsvn\testMPI\x64\Release_msmpi'
    [01:11080] C:\drsvn\testMPI\x64\Release_msmpi>CreateProcess(C:\drsvn\testMPI\x64\Release_msmpi\fpi.exe fpi.exe)
    [01:11080] env: PMI_RANK=1
    [01:11080] env: PMI_SMPD_KEY=0
    [01:11080] C:\drsvn\testMPI\x64\Release_msmpi>CreateProcess(C:\drsvn\testMPI\x64\Release_msmpi\fpi.exe fpi.exe)
    [01:11080] env: PMI_RANK=0
    [01:11080] env: PMI_SMPD_KEY=1
    [01:11080] Authentication completed. Successfully obtained Context for Client.
    [01:11080] Authorization completed.
    [01:11080] version check complete, using PMP version 2.
    [01:11080] Authentication completed. Successfully obtained Context for Client.
    [01:11080] Authorization completed.
    [01:11080] 1 -> 0 : returning parent_context: 0 < 1
    [01:11080] forwarding command SMPD_INIT to 0
    [01:11080] version check complete, using PMP version 2.
    [01:11080] posting command SMPD_INIT to parent, src=1, ctx_key=1, dest=0.
    [01:11080] 1 -> 0 : returning parent_context: 0 < 1
    [01:11080] forwarding command SMPD_INIT to 0
    [01:11080] posting command SMPD_INIT to parent, src=1, ctx_key=0, dest=0.
    [01:11080] Handling cmd=SMPD_INIT result
    [01:11080] forward SMPD_INIT result to dest=1 ctx_key=1
    [01:11080] Handling cmd=SMPD_INIT result
    [01:11080] forward SMPD_INIT result to dest=1 ctx_key=0
    [01:11080] handling command SMPD_BCPUT src=1 ctx_key=1
    [01:11080] Handling SMPD_BCPUT command from smpd 1
            ctx_key=1
            rank=0
            value=port=55002 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=8620:248
            result=success
    [01:11080] handling command SMPD_BCPUT src=1 ctx_key=0
    [01:11080] Handling SMPD_BCPUT command from smpd 1
            ctx_key=0
            rank=1
            value=port=55003 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=9264:248
            result=success
    [01:11080] handling command SMPD_BARRIER src=1 ctx_key=1
    [01:11080] Handling SMPD_BARRIER src=1 ctx_key=1
    [01:11080] initializing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b): in=1 size=2
    [01:11080] incrementing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b) incount from 0 to 1 out of 2
    [01:11080] handling command SMPD_BARRIER src=1 ctx_key=0
    [01:11080] Handling SMPD_BARRIER src=1 ctx_key=0
    [01:11080] incrementing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b) incount from 1 to 2 out of 2
    [01:11080] all in barrier, release the barrier.
    [01:11080] sending reply to barrier command 'af26e0a6-3228-4fd3-9661-58d9f7849c0b'.
    [01:11080] sending reply to barrier command 'af26e0a6-3228-4fd3-9661-58d9f7849c0b'.
    [01:11080] read 50 bytes from stdout
    [01:11080] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:11080] read 50 bytes from stdout
    [01:11080] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:11080] read 42 bytes from stdout
    [01:11080] posting command SMPD_STDOUT to parent, src=1, dest=0.
    [01:11080] Handling cmd=SMPD_STDOUT result
    [01:11080] cmd=SMPD_STDOUT result will be handled locally
    [01:11080] Handling cmd=SMPD_STDOUT result
    [01:11080] cmd=SMPD_STDOUT result will be handled locally
    [01:11080] Handling cmd=SMPD_STDOUT result
    [01:11080] cmd=SMPD_STDOUT result will be handled locally
    [01:11080] handling command SMPD_STDIN src=0
    [01:11080] wrote 3 bytes to stdin of rank 0
    [01:11080] handling command SMPD_BCGET src=1 ctx_key=1
    [01:11080] Handling SMPD_BCGET command from smpd 1
            ctx_key=1
            rank=1
            value=port=55003 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=9264:248
            result=success
    [01:11080] handling command SMPD_BCGET src=1 ctx_key=0
    [01:11080] Handling SMPD_BCGET command from smpd 1
            ctx_key=0
            rank=0
            value=port=55002 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=8620:248
            result=success
    [01:11080] handling command SMPD_BCGET src=1 ctx_key=0
    [01:11080] Handling SMPD_BCGET command from smpd 1
            ctx_key=0
            rank=0
            value=port=55002 description="10.67.6.31 WW7SMS01DRGDH " shm_host=WW7SMS01DRGDH shm_queue=8620:248
            result=success
    [01:11080] 1 -> 0 : returning parent_context: 0 < 1
    [01:11080] forwarding command SMPD_FINALIZE to 0
    [01:11080] 1 -> 0 : returning parent_context: 0 < 1
    [01:11080] posting command SMPD_FINALIZE to parent, src=1, ctx_key=0, dest=0.
    [01:11080] forwarding command SMPD_FINALIZE to 0
    [01:11080] posting command SMPD_FINALIZE to parent, src=1, ctx_key=1, dest=0.
    [01:11080] Handling cmd=SMPD_FINALIZE result
    [01:11080] forward SMPD_FINALIZE result to dest=1 ctx_key=1
    [01:11080] handling command SMPD_BARRIER src=1 ctx_key=1
    [01:11080] Handling SMPD_BARRIER src=1 ctx_key=1
    [01:11080] initializing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b): in=1 size=2
    [01:11080] incrementing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b) incount from 0 to 1 out of 2
    [01:11080] Handling cmd=SMPD_FINALIZE result
    [01:11080] forward SMPD_FINALIZE result to dest=1 ctx_key=0
    [01:11080] handling command SMPD_BARRIER src=1 ctx_key=0
    [01:11080] Handling SMPD_BARRIER src=1 ctx_key=0
    [01:11080] incrementing barrier(af26e0a6-3228-4fd3-9661-58d9f7849c0b) incount from 1 to 2 out of 2
    [01:11080] all in barrier, release the barrier.
    [01:11080] sending reply to barrier command 'af26e0a6-3228-4fd3-9661-58d9f7849c0b'.
    [01:11080] sending reply to barrier command 'af26e0a6-3228-4fd3-9661-58d9f7849c0b'.
    [01:11080] process_id=1 process refcount == 2, pmi client closed.
    [01:11080] process_id=0 process refcount == 2, pmi client closed.
    [01:11080] reading failed, assuming stdout is closed. error 0xc000014b
    [01:11080] process_id=0 process refcount == 1, stdout closed.
    [01:11080] reading failed, assuming stderr is closed. error 0xc000014b
    [01:11080] process_id=0 process refcount == 0, stderr closed.
    [01:11080] process_id=0 process refcount == 0, waiting for the process to finish exiting.
    [01:11080] creating an exit command for rank 1, pid 9264, exit code 0.
    [01:11080] posting command SMPD_EXIT to parent, src=1, dest=0.
    [01:11080] Handling cmd=SMPD_EXIT result
    [01:11080] cmd=SMPD_EXIT result will be handled locally
    [01:11080] reading failed, assuming stdout is closed. error 0xc000014b
    [01:11080] process_id=1 process refcount == 1, stdout closed.
    [01:11080] reading failed, assuming stderr is closed. error 0xc000014b
    [01:11080] process_id=1 process refcount == 0, stderr closed.
    [01:11080] process_id=1 process refcount == 0, waiting for the process to finish exiting.
    [01:11080] creating an exit command for rank 0, pid 8620, exit code 0.
    [01:11080] posting command SMPD_EXIT to parent, src=1, dest=0.
    [01:11080] Handling cmd=SMPD_EXIT result
    [01:11080] cmd=SMPD_EXIT result will be handled locally
    [01:11080] handling command SMPD_CLOSE src=0
    [01:11080] sending 'closed' command to parent context
    [01:11080] posting command SMPD_CLOSED to parent, src=1, dest=0.
    [01:11080] Handling cmd=SMPD_CLOSED result
    [01:11080] cmd=SMPD_CLOSED result will be handled locally
    [01:11080] smpd manager successfully stopped listening.
    [01:11080] SMPD exiting with error code 0.
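The two smpd logs above are nearly identical. One visible difference is the "Connecting back to parent" line, which names 10.67.6.31 in the first run and 127.0.0.1 in the second. A minimal Python sketch for pulling that line out of a saved log is shown below. The helper name `connect_back_targets` is hypothetical, the pattern is based only on the log lines quoted in this thread, and the extracted host does not by itself explain the delay.

```python
import re

# Matches lines such as:
# [01:4528] Connecting back to parent using host 10.67.6.31 and endpoint 54912
CONNECT_BACK = re.compile(
    r"Connecting back to parent using host (?P<host>\S+) and endpoint (?P<port>\d+)"
)


def connect_back_targets(log_text):
    """Return (host, port) pairs from smpd 'Connecting back to parent' lines."""
    return [
        (match.group("host"), int(match.group("port")))
        for match in CONNECT_BACK.finditer(log_text)
    ]
```

Applied to the text of each smpd log, this returns the address the manager used to reach mpiexec, which makes it easy to compare runs started with different -host values.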
    mpiexec log 2
    C:\drsvn\testMPI\x64\Release_msmpi>"C:\Program Files\Microsoft MPI\Bin\mpiexec" -host localhost -d 3 -n 2 fpi.exe
    [00:1356] host tree:
    [00:1356]  host: localhost, parent: 0, id: 1
    [00:1356] mpiexec started smpd manager listening on port 54995
    [00:1356] WW7SMS01DRGDH posting a re-connect to localhost:54997 in left child context.
    [00:1356] Authentication completed. Successfully obtained Context for Client.
    [00:1356] Authorization completed.
    [00:1356] version check complete, using PMP version 2.
    [00:1356] posting command SMPD_COLLECT to left child, src=0, dest=1.
    [00:1356] Handling cmd=SMPD_COLLECT result
    [00:1356] cmd=SMPD_COLLECT result will be handled locally
    [00:1356] Finished collecting hardware summary.
    [00:1356] posting command SMPD_STARTDBS to left child, src=0, dest=1.
    [00:1356] Handling cmd=SMPD_STARTDBS result
    [00:1356] cmd=SMPD_STARTDBS result will be handled locally
    [00:1356] start_dbs succeeded, kvs_name: 'af26e0a6-3228-4fd3-9661-58d9f7849c0b', domain_name: '6793b38d-a234-414a-a434-718ccfc3ed9e'
    [00:1356] creating a process group of size 2 on node 0 called af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [00:1356] launching the processes.
    [00:1356] posting command SMPD_LAUNCH to left child, src=0, dest=1.
    [00:1356] Handling cmd=SMPD_LAUNCH result
    [00:1356] cmd=SMPD_LAUNCH result will be handled locally
    [00:1356] successfully launched process 1
    [00:1356] successfully launched process 0
    [00:1356] root process launched, starting stdin redirection.
    [00:1356] Authentication completed. Successfully obtained Context for Client.
    [00:1356] Authorization completed.
    [00:1356] handling command SMPD_INIT src=1 ctx_key=1
    [00:1356] init: 0:2:af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [00:1356] handling command SMPD_INIT src=1 ctx_key=0
    [00:1356] init: 1:2:af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [00:1356] handling command SMPD_STDOUT src=1
    [00:1356] Handling SMPD_STDOUT
    [00:1356] Decoding stdout/stderr buffer 2050726F636573732020202020202020202020203020206F66202020202020202020202020322020697320616C6976650D0A
     Process            0  of            2  is alive
    [00:1356] handling command SMPD_STDOUT src=1
    [00:1356] Handling SMPD_STDOUT
    [00:1356] Decoding stdout/stderr buffer 2050726F636573732020202020202020202020203120206F66202020202020202020202020322020697320616C6976650D0A
     Process            1  of            2  is alive
    [00:1356] handling command SMPD_STDOUT src=1
    [00:1356] Handling SMPD_STDOUT
    [00:1356] Decoding stdout/stderr buffer 456E74657220746865206E756D626572206F6620696E74657276616C733A202830207175697473290D0A
    Enter the number of intervals: (0 quits)
    0
    [00:1356] posting command SMPD_STDIN to left child, src=0, dest=1.
    [00:1356] Handling cmd=SMPD_STDIN result
    [00:1356] cmd=SMPD_STDIN result will be handled locally
    [00:1356] Authentication completed. Successfully obtained Context for Client.
    [00:1356] Authorization completed.
    [00:1356] handling command SMPD_FINALIZE src=1 ctx_key=1
    [00:1356] finalize: 0:af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [00:1356] handling command SMPD_FINALIZE src=1 ctx_key=0
    [00:1356] finalize: 1:af26e0a6-3228-4fd3-9661-58d9f7849c0b
    [00:1356] handling command SMPD_EXIT src=1
    [00:1356] saving exit code: rank 1, exitcode 0, pg <af26e0a6-3228-4fd3-9661-58d9f7849c0b>
    [00:1356] handling command SMPD_EXIT src=1
    [00:1356] saving exit code: rank 0, exitcode 0, pg <af26e0a6-3228-4fd3-9661-58d9f7849c0b>
    [00:1356] last process exited, tearing down the job tree.
    [00:1356] posting command SMPD_CLOSE to left child, src=0, dest=1.
    [00:1356] Handling cmd=SMPD_CLOSE result
    [00:1356] cmd=SMPD_CLOSE result will be handled locally
    [00:1356] handling command SMPD_CLOSED src=1
    [00:1356] closed command received from left child.
    [00:1356] smpd manager successfully stopped listening.
    
    C:\drsvn\testMPI\x64\Release_msmpi>

    Wednesday, January 20, 2016 9:17 AM
  • Hi Guillaume,

    Thanks for providing us with the log. Looking at the IP configuration and the log, my guess is that this is caused by a DNS resolution issue. Your connection has several possible DNS suffixes:

    DNS Suffix Search List. . . . . . : bvcorp.corp
                                           eua.bvcorp.corp
                                           bureauveritas.com
                                           fr.bureauveritas.com
                                           ho.bureauveritas.com

    Here are a few things to try; it would be great if you could get back to us with the results:

    1) Can you try MS-MPI v6 and see whether the issue persists? You can get v6 from this link: http://www.microsoft.com/en-us/download/details.aspx?id=47259

    2) Can you try running smpd -d 3 and then mpiexec -host 10.67.6.31 (the IP address of the box)?

    3) I am not sure which DNS suffix your machine name resolves to. If you ping WW7SMS01DRGDH, what does the fully qualified domain name (FQDN) for the machine look like? Can you try passing the FQDN to the -host option?
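
    To compare the candidates in step 3 outside of MS-MPI, the resolution time for each name can be measured directly. The following is a minimal Python sketch, not part of MS-MPI: the helper name time_resolution is hypothetical, and it only exercises the operating system resolver, which may or may not be the exact code path where smpd spends its 30 seconds.

    ```python
    import socket
    import time


    def time_resolution(host, port=0):
        """Return (elapsed_seconds, addresses) for resolving `host`.

        `addresses` is a sorted list of IP strings, or None if resolution fails.
        """
        start = time.perf_counter()
        try:
            infos = socket.getaddrinfo(host, port, proto=socket.IPPROTO_TCP)
            addresses = sorted({info[4][0] for info in infos})
        except socket.gaierror:
            addresses = None
        elapsed = time.perf_counter() - start
        return elapsed, addresses


    if __name__ == "__main__":
        candidates = [
            "localhost",
            "127.0.0.1",
            socket.gethostname(),  # short machine name as the OS reports it
            socket.getfqdn(),      # FQDN as the resolver reports it
        ]
        for name in candidates:
            elapsed, addresses = time_resolution(name)
            print(f"{name:40s} {elapsed:8.3f}s  {addresses}")
    ```

    If the short name or the FQDN takes much longer than the IP address, that would be consistent with the DNS suffix guess above.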

    Thanks

    Anh

    • Marked as answer by Guillaume_dh Tuesday, February 23, 2016 9:21 AM
    Wednesday, January 20, 2016 5:35 PM
  • Hello Anh,

    1) I have already tried to downgrade to v6, but unfortunately the installation failed (even with v7 uninstalled).

    2) Using the IP address, there is no problem: the computation starts immediately.

    3) Using the FQDN ("mpiexec -host WW7SMS01DRGDH.eua.bvcorp.corp -n 2 fpi") does not solve the issue.

    Thanks,

    Guillaume

    Thursday, January 21, 2016 2:55 PM
  • One workaround you could use is to edit the C:\windows\system32\drivers\etc\hosts file and add the entry:

    127.0.0.1 WW7SMS01DRGDH

    This routes name resolution for the machine name to the loopback adapter, which should make startup faster. We're investigating the issue and will try to address it in the next release of MS-MPI.
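
    To confirm that the entry is actually picked up, the hosts file can be parsed and checked for the machine name. This is a minimal Python sketch under the assumption that the file uses the usual "address name [aliases] # comment" layout; the function name find_hosts_entries is hypothetical.

    ```python
    def find_hosts_entries(hosts_text, name):
        """Return the addresses that a hosts-file text maps `name` to.

        Host names are compared case-insensitively; '#' starts a comment.
        """
        matches = []
        wanted = name.lower()
        for line in hosts_text.splitlines():
            line = line.split("#", 1)[0].strip()  # drop comments and padding
            if not line:
                continue
            fields = line.split()
            address, aliases = fields[0], fields[1:]
            if wanted in (alias.lower() for alias in aliases):
                matches.append(address)
        return matches


    if __name__ == "__main__":
        sample = """
        # sample hosts content
        127.0.0.1   WW7SMS01DRGDH
        """
        print(find_hosts_entries(sample, "ww7sms01drgdh"))
    ```

    For the real file, read C:\windows\system32\drivers\etc\hosts and pass its contents as hosts_text.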

    Do you have more detail on the failure of the v6 installation? You can also try the command-line -force option to install it (this works even if v7 is not yet uninstalled).

    From an administrator command window, run MSMPISetup.exe -force -unattend


    Thursday, January 21, 2016 10:04 PM
  • Hello Anh,

      Here is the message I get when installing v6: "Microsoft MPI (6.0.12436.10) setup wizard ended prematurely because of an error" (-force does not help). Anyway, thanks for your help: the suggested workaround solves my issue (and providing IPs instead of names is not too bad either).

    Regards,

        Guillaume

    Friday, January 22, 2016 8:47 AM
  • Hi Guillaume,

    You can try adding the option -log c:\msmpiv6_install.log to the command and see whether that gives better information about why the v6 installation fails. The hosts-file workaround should also let you run mpiexec without specifying the -host option: without it, mpiexec defaults to using the local computer's name as the host, and with the workaround that name is now routed through loopback.

    Thanks

    Friday, January 22, 2016 6:58 PM
  • Hi Anh,

       Most of our use is on the local host only, so the option "-host localhost" fixes the issue in 95% of cases. To launch on several nodes, I will use IP addresses (although this is not tested yet).

      About v6, below is the log:

    === Logging started: 28/01/2016  17:15:05 ===
    Action 17:15:05: INSTALL. 
    Action start 17:15:05: INSTALL.
    Action 17:15:05: FindRelatedProducts. Searching for related applications
    Action start 17:15:05: FindRelatedProducts.
    Action ended 17:15:05: FindRelatedProducts. Return value 1.
    Action 17:15:05: PrepareDlg. 
    Action start 17:15:05: PrepareDlg.
    Info 2898.For WixUI_Font_Normal textstyle, the system created a 'Tahoma' font, in 0 character set, of 13 pixels height.
    Info 2898.For WixUI_Font_Bigger textstyle, the system created a 'Tahoma' font, in 0 character set, of 19 pixels height.
    Action 17:15:05: PrepareDlg. Dialog created
    Action ended 17:15:05: PrepareDlg. Return value 1.
    Action 17:15:05: AppSearch. Searching for installed applications
    Action start 17:15:05: AppSearch.
    Action ended 17:15:05: AppSearch. Return value 0.
    Action 17:15:05: LaunchConditions. Evaluating launch conditions
    Action start 17:15:05: LaunchConditions.
    Action ended 17:15:05: LaunchConditions. Return value 1.
    Action 17:15:05: ValidateProductID. 
    Action start 17:15:05: ValidateProductID.
    Action ended 17:15:05: ValidateProductID. Return value 1.
    Action 17:15:05: CostInitialize. Computing space requirements
    Action start 17:15:05: CostInitialize.
    Action ended 17:15:05: CostInitialize. Return value 1.
    Action 17:15:05: FileCost. Computing space requirements
    Action start 17:15:05: FileCost.
    Action ended 17:15:05: FileCost. Return value 1.
    Action 17:15:05: CostFinalize. Computing space requirements
    Action start 17:15:05: CostFinalize.
    Action ended 17:15:05: CostFinalize. Return value 1.
    Action 17:15:05: WelcomeDlg. 
    Action start 17:15:05: WelcomeDlg.
    Action 17:15:05: WelcomeDlg. Dialog created
    Info 2898.For WixUI_Font_Title textstyle, the system created a 'Tahoma' font, in 0 character set, of 14 pixels height.
    Action 17:15:07: LicenseAgreementDlg. Dialog created
    Action 17:15:09: InstallDirDlg. Dialog created
    Action 17:15:10: WixUIValidatePath. 
    Action start 17:15:10: WixUIValidatePath.
    Action ended 17:15:10: WixUIValidatePath. Return value 1.
    Action 17:15:10: VerifyReadyDlg. Dialog created
    Action ended 17:15:11: WelcomeDlg. Return value 1.
    Action 17:15:11: ProgressDlg. 
    Action start 17:15:11: ProgressDlg.
    Action 17:15:11: ProgressDlg. Dialog created
    Action ended 17:15:11: ProgressDlg. Return value 1.
    Action 17:15:11: ExecuteAction. 
    Action start 17:15:11: ExecuteAction.
    Action 17:15:24: INSTALL. 
    Action start 17:15:24: INSTALL.
    Action 17:15:24: FindRelatedProducts. Searching for related applications
    Action start 17:15:24: FindRelatedProducts.
    Action ended 17:15:24: FindRelatedProducts. Return value 0.
    Action 17:15:24: LaunchConditions. Evaluating launch conditions
    Action start 17:15:24: LaunchConditions.
    Action ended 17:15:24: LaunchConditions. Return value 1.
    Action 17:15:24: ValidateProductID. 
    Action start 17:15:24: ValidateProductID.
    Action ended 17:15:24: ValidateProductID. Return value 1.
    Action 17:15:24: CostInitialize. Computing space requirements
    Action start 17:15:24: CostInitialize.
    Action ended 17:15:24: CostInitialize. Return value 1.
    Action 17:15:24: FileCost. Computing space requirements
    Action start 17:15:24: FileCost.
    Action ended 17:15:24: FileCost. Return value 1.
    Action 17:15:24: CostFinalize. Computing space requirements
    Action start 17:15:24: CostFinalize.
    Action ended 17:15:24: CostFinalize. Return value 1.
    Action 17:15:24: SetARPINSTALLLOCATION. 
    Action start 17:15:24: SetARPINSTALLLOCATION.
    Action ended 17:15:24: SetARPINSTALLLOCATION. Return value 1.
    Action 17:15:24: InstallValidate. Validating install
    Action start 17:15:24: InstallValidate.
    Action ended 17:15:24: InstallValidate. Return value 1.
    Action 17:15:24: InstallInitialize. 
    Action start 17:15:24: InstallInitialize.
    Action ended 17:15:25: InstallInitialize. Return value 1.
    Action 17:15:25: ProcessComponents. Updating component registration
    Action start 17:15:25: ProcessComponents.
    Action 17:15:25: GenerateScript. Generating script operations for action:
    GenerateScript: Updating component registration
    Action ended 17:15:25: ProcessComponents. Return value 1.
    Action 17:15:25: UnpublishFeatures. Unpublishing Product Features
    Action start 17:15:25: UnpublishFeatures.
    Action ended 17:15:25: UnpublishFeatures. Return value 1.
    Action 17:15:25: StopServices. Stopping services
    Action start 17:15:25: StopServices.
    Action ended 17:15:25: StopServices. Return value 1.
    Action 17:15:25: RemoveMpiexecFwRuleCmdLine. 
    Action start 17:15:25: RemoveMpiexecFwRuleCmdLine.
    Action ended 17:15:25: RemoveMpiexecFwRuleCmdLine. Return value 1.
    Action 17:15:25: RemoveMpiexecFwRule. 
    Action start 17:15:25: RemoveMpiexecFwRule.
    RemoveMpiexecFwRule: 
    Action ended 17:15:25: RemoveMpiexecFwRule. Return value 1.
    Action 17:15:25: RemoveSmpdFwRuleCmdLine. 
    Action start 17:15:25: RemoveSmpdFwRuleCmdLine.
    Action ended 17:15:25: RemoveSmpdFwRuleCmdLine. Return value 1.
    Action 17:15:25: RemoveSmpdFwRule. 
    Action start 17:15:25: RemoveSmpdFwRule.
    RemoveSmpdFwRule: 
    Action ended 17:15:25: RemoveSmpdFwRule. Return value 1.
    Action 17:15:25: RemoveRegistryValues. Removing system registry values
    Action start 17:15:25: RemoveRegistryValues.
    Action ended 17:15:25: RemoveRegistryValues. Return value 1.
    Action 17:15:25: ConfigureEventManifestUnregister. 
    Action start 17:15:25: ConfigureEventManifestUnregister.
    ConfigureEventManifestUnreg:  Entering ConfigureEventManifestUnreg in C:\Windows\Installer\MSIEBE.tmp, version 3.6.3303.0
    Action ended 17:15:25: ConfigureEventManifestUnregister. Return value 1.
    Action 17:15:25: RemoveEnvironmentStrings. Updating environment strings
    Action start 17:15:25: RemoveEnvironmentStrings.
    Action ended 17:15:25: RemoveEnvironmentStrings. Return value 1.
    Action 17:15:25: RemoveFiles. Removing files
    Action start 17:15:25: RemoveFiles.
    Action ended 17:15:25: RemoveFiles. Return value 1.
    Action 17:15:25: RemoveFolders. Removing folders
    Action start 17:15:25: RemoveFolders.
    Action ended 17:15:25: RemoveFolders. Return value 1.
    Action 17:15:25: CreateFolders. Creating folders
    Action start 17:15:25: CreateFolders.
    CreateFolders: Folder: Creating folders
    Action ended 17:15:25: CreateFolders. Return value 1.
    Action 17:15:25: MoveFiles. Moving files
    Action start 17:15:25: MoveFiles.
    MoveFiles: File: Moving files,  Directory: ,  Size: 
    Action ended 17:15:25: MoveFiles. Return value 1.
    Action 17:15:25: InstallFiles. Copying new files
    Action start 17:15:25: InstallFiles.
    InstallFiles: File: Copying new files,  Directory: ,  Size: 
    Action ended 17:15:25: InstallFiles. Return value 1.
    Action 17:15:25: DuplicateFiles. Creating duplicate files
    Action start 17:15:25: DuplicateFiles.
    Action ended 17:15:25: DuplicateFiles. Return value 1.
    Action 17:15:25: SchedXmlFile. 
    Action start 17:15:25: SchedXmlFile.
    SchedXmlFile:  Entering SchedXmlFile in C:\Windows\Installer\MSIF3C.tmp, version 3.6.3303.0
    SchedXmlFile:  Skipping SchedXmlFile because XmlFile table not present
    Action ended 17:15:25: SchedXmlFile. Return value 1.
    Action 17:15:25: ConfigureEventManifestRegister. 
    Action start 17:15:25: ConfigureEventManifestRegister.
    ConfigureEventManifestReg:  Entering ConfigureEventManifestReg in C:\Windows\Installer\MSIF6C.tmp, version 3.6.3303.0
    Action 17:15:25: RollbackRegisterEventManifest. 
    Action start 17:15:25: RollbackRegisterEventManifest.
    RollbackRegisterEventManifest: 
    Action ended 17:15:25: RollbackRegisterEventManifest. Return value 1.
    ConfigureEventManifestReg:  RegisterEventManifest's CustomActionData: '"wevtutil.exe" im "C:\Program Files\Microsoft MPI\Bin\mpitrace.man"'
    Action 17:15:25: RegisterEventManifest. 
    Action start 17:15:25: RegisterEventManifest.
    RegisterEventManifest: 
    Action ended 17:15:25: RegisterEventManifest. Return value 1.
    Action ended 17:15:25: ConfigureEventManifestRegister. Return value 1.
    Action 17:15:25: WriteRegistryValues. Writing system registry values
    Action start 17:15:25: WriteRegistryValues.
    WriteRegistryValues: Key: Writing system registry values, Name: , Value: 
    Action ended 17:15:25: WriteRegistryValues. Return value 1.
    Action 17:15:25: WriteEnvironmentStrings. Updating environment strings
    Action start 17:15:25: WriteEnvironmentStrings.
    WriteEnvironmentStrings: Name: Updating environment strings, Value: , Action 
    Action ended 17:15:25: WriteEnvironmentStrings. Return value 1.
    Action 17:15:25: StartServices. Starting services
    Action start 17:15:25: StartServices.
    Action ended 17:15:25: StartServices. Return value 1.
    Action 17:15:25: InstallMpiexecFwRuleInCmdLine. 
    Action start 17:15:25: InstallMpiexecFwRuleInCmdLine.
    Action ended 17:15:25: InstallMpiexecFwRuleInCmdLine. Return value 1.
    Action 17:15:25: InstallMpiexecFwRuleIn. 
    Action start 17:15:25: InstallMpiexecFwRuleIn.
    InstallMpiexecFwRuleIn: 
    Action ended 17:15:25: InstallMpiexecFwRuleIn. Return value 1.
    Action 17:15:25: InstallMpiexecFwRuleOutCmdLine. 
    Action start 17:15:25: InstallMpiexecFwRuleOutCmdLine.
    Action ended 17:15:25: InstallMpiexecFwRuleOutCmdLine. Return value 1.
    Action 17:15:25: InstallMpiexecFwRuleOut. 
    Action start 17:15:25: InstallMpiexecFwRuleOut.
    InstallMpiexecFwRuleOut: 
    Action ended 17:15:25: InstallMpiexecFwRuleOut. Return value 1.
    Action 17:15:25: InstallSmpdFwRuleInCmdLine. 
    Action start 17:15:25: InstallSmpdFwRuleInCmdLine.
    Action ended 17:15:25: InstallSmpdFwRuleInCmdLine. Return value 1.
    Action 17:15:25: InstallSmpdFwRuleIn. 
    Action start 17:15:25: InstallSmpdFwRuleIn.
    InstallSmpdFwRuleIn: 
    Action ended 17:15:25: InstallSmpdFwRuleIn. Return value 1.
    Action 17:15:25: InstallSmpdFwRuleOutCmdLine. 
    Action start 17:15:25: InstallSmpdFwRuleOutCmdLine.
    Action ended 17:15:25: InstallSmpdFwRuleOutCmdLine. Return value 1.
    Action 17:15:25: InstallSmpdFwRuleOut. 
    Action start 17:15:25: InstallSmpdFwRuleOut.
    InstallSmpdFwRuleOut: 
    Action ended 17:15:25: InstallSmpdFwRuleOut. Return value 1.
    Action 17:15:25: RegisterUser. Registering user
    Action start 17:15:25: RegisterUser.
    Action ended 17:15:25: RegisterUser. Return value 1.
    Action 17:15:25: RegisterProduct. Registering product
    Action start 17:15:25: RegisterProduct.
    RegisterProduct: Registering product
    Action ended 17:15:25: RegisterProduct. Return value 1.
    Action 17:15:25: PublishFeatures. Publishing Product Features
    Action start 17:15:25: PublishFeatures.
    PublishFeatures: Feature: Publishing Product Features
    Action ended 17:15:25: PublishFeatures. Return value 1.
    Action 17:15:25: PublishProduct. Publishing product information
    Action start 17:15:25: PublishProduct.
    PublishProduct: 
    Action ended 17:15:25: PublishProduct. Return value 1.
    Action 17:15:25: InstallFinalize. 
    Action start 17:15:25: InstallFinalize.
    Action 17:15:25: ProcessComponents. Updating component registration
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {633437D9-FF06-557A-A060-A16444C4EEA3} 3: 02:\SOFTWARE\Microsoft\MPI\Version 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {E5A0287C-B1B9-5F9E-B8AF-AEFF8BF5E610} 3: 02:\SOFTWARE\Microsoft\MPI\InstallRoot 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {9B72CB57-DB10-504F-89A7-F289833BA961} 3: 02:\SOFTWARE\Microsoft\MPI\MSPMSProvider 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {7027117D-398B-5059-B0B7-2A80BD69858F} 3: 02:\SOFTWARE\Microsoft\MPI\RedistPath 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {F5E135BC-80C8-56AB-8F5E-7FDF8492DCE0} 3: 22:\SOFTWARE\Microsoft\MPI\Version 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {DE616141-DECD-5A77-99B4-B5EE9B40D582} 3: 22:\SOFTWARE\Microsoft\MPI\InstallRoot 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {182CBEB8-1D5C-5404-8C1F-B342C4B64ADF} 3: 22:\SOFTWARE\Microsoft\MPI\RedistPath 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {C58F7139-9534-5A31-9270-571418A39CE0} 3: 22:\SOFTWARE\Microsoft\MPI\MSPMSProvider 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {368CB894-4A80-57D0-BF48-062DB606768F} 3: 21:\Software\Microsoft\MPI\InstallState 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {61FF4387-D224-4C6D-A342-8D0346EEDF94} 3: C:\Program Files\Microsoft MPI\ 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {D39680DD-1EBA-5AEF-BAF2-4FB9478C6FED} 3: C:\Program Files\Microsoft MPI\Bin\smpd.exe 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {A7AEB4EE-D971-5F63-9072-9A827CAFCE63} 3: C:\Program Files\Microsoft MPI\Bin\mpiexec.exe 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {5DDBF673-33FD-5B38-A000-A4A52DC97857} 3: C:\Program Files\Microsoft MPI\Bin\mpitrace.man 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {209C2B6D-8D07-48AC-8C84-9D08F05E59F0} 3: C:\Program Files\Microsoft MPI\Redist\ 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {4922A1BE-0134-587E-B6E7-9974CF8B0C1C} 3: C:\Program Files\Microsoft MPI\License\license.rtf 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {9B673534-CE5D-5339-980D-6F6694B42D5E} 3: C:\Windows\SysWOW64\msmpi.dll 
    1: {00000000-0000-0000-0000-000000000000} 2: {9B673534-CE5D-5339-980D-6F6694B42D5E} 3: C:\Windows\SysWOW64\msmpi.dll 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {78D10FF4-ECA5-5617-B1E8-207A229369B7} 3: C:\Windows\SysWOW64\msmpires.dll 
    1: {00000000-0000-0000-0000-000000000000} 2: {78D10FF4-ECA5-5617-B1E8-207A229369B7} 3: C:\Windows\SysWOW64\msmpires.dll 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {FB4853CA-A8A8-58B9-A417-0495116CC794} 3: C:\Windows\system32\msmpi.dll 
    1: {00000000-0000-0000-0000-000000000000} 2: {FB4853CA-A8A8-58B9-A417-0495116CC794} 3: C:\Windows\system32\msmpi.dll 
    1: {8499ACD3-C1E3-45AB-BF96-DA491727EBE1} 2: {CCE9E917-8A14-5A38-90D3-2BF46E02456B} 3: C:\Windows\system32\msmpires.dll 
    1: {00000000-0000-0000-0000-000000000000} 2: {CCE9E917-8A14-5A38-90D3-2BF46E02456B} 3: C:\Windows\system32\msmpires.dll 
    Action 17:15:25: RemoveMpiexecFwRule. 
    CAQuietExec:  Entering CAQuietExec in C:\Windows\Installer\MSIFEB.tmp, version 3.6.3303.0
    CAQuietExec:  "C:\Windows\SysWOW64\netsh.exe" advfirewall firewall delete rule name=MSMPI-MPIEXEC
    CAQuietExec:  
    CAQuietExec:  An error occurred while attempting to contact the
    CAQuietExec:  Windows Firewall service. Make sure that the service is
    CAQuietExec:  running and try your request again.
    CAQuietExec:  
    CAQuietExec:  Error 0x80070001: Command line returned an error.
    CAQuietExec:  Error 0x80070001: CAQuietExec Failed
    CustomAction RemoveMpiexecFwRule returned actual error code 1603 but will be translated to success due to continue marking
    Action 17:15:26: RemoveSmpdFwRule. 
    CAQuietExec:  Entering CAQuietExec in C:\Windows\Installer\MSI1307.tmp, version 3.6.3303.0
    CAQuietExec:  "C:\Windows\SysWOW64\netsh.exe" advfirewall firewall delete rule name=MSMPI-SMPD
    CAQuietExec:  
    CAQuietExec:  An error occurred while attempting to contact the
    CAQuietExec:  Windows Firewall service. Make sure that the service is
    CAQuietExec:  running and try your request again.
    CAQuietExec:  
    CAQuietExec:  Error 0x80070001: Command line returned an error.
    CAQuietExec:  Error 0x80070001: CAQuietExec Failed
    CustomAction RemoveSmpdFwRule returned actual error code 1603 but will be translated to success due to continue marking
    Action 17:15:26: CreateFolders. Creating folders
    CreateFolders: Folder: C:\Program Files\Microsoft MPI\
    Action 17:15:26: MoveFiles. Moving files
    MoveFiles: File: MSMpiSetup.exe,  Directory: C:\Program Files\Microsoft MPI\Redist\,  Size: 5219952
    Action 17:15:26: InstallFiles. Copying new files
    InstallFiles: File: license.rtf,  Directory: C:\Program Files\Microsoft MPI\License\,  Size: 178744
    InstallFiles: File: mpiexec.exe,  Directory: C:\Program Files\Microsoft MPI\Bin\,  Size: 510160
    InstallFiles: File: mpitrace.man,  Directory: C:\Program Files\Microsoft MPI\Bin\,  Size: 980404
    InstallFiles: File: msmpi.dll,  Directory: C:\Windows\SysWOW64\,  Size: 1229008
    InstallFiles: File: msmpi.dll,  Directory: C:\Windows\system32\,  Size: 1460432
    InstallFiles: File: msmpires.dll,  Directory: C:\Windows\SysWOW64\,  Size: 397008
    InstallFiles: File: msmpires.dll,  Directory: C:\Windows\system32\,  Size: 397008
    InstallFiles: File: smpd.exe,  Directory: C:\Program Files\Microsoft MPI\Bin\,  Size: 486096
    Action 17:15:26: RollbackRegisterEventManifest. 
    Action 17:15:26: RegisterEventManifest. 
    CAQuietExec:  Entering CAQuietExec in C:\Windows\Installer\MSI1431.tmp, version 3.6.3303.0
    CAQuietExec:  "wevtutil.exe" im "C:\Program Files\Microsoft MPI\Bin\mpitrace.man"
    Action 17:15:27: WriteRegistryValues. Writing system registry values
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: Version, Value: 6.0.12436.10
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: InstallRoot, Value: C:\Program Files\Microsoft MPI\
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: MSPMSProvider, Value: C:\Windows\SysWOW64\msmpi.dll
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: RedistPath, Value: C:\Program Files\Microsoft MPI\Redist\MSMpiSetup.exe
    WriteRegistryValues: Key: \Software\Microsoft\MPI, Name: InstallState, Value: ProgramMenu
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: Version, Value: 6.0.12436.10
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: InstallRoot, Value: C:\Program Files\Microsoft MPI\
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: RedistPath, Value: C:\Program Files\Microsoft MPI\Redist\MSMpiSetup.exe
    WriteRegistryValues: Key: \SOFTWARE\Microsoft\MPI, Name: MSPMSProvider, Value: C:\Windows\system32\msmpi.dll
    Action 17:15:27: WriteEnvironmentStrings. Updating environment strings
    WriteEnvironmentStrings: Name: PATH, Value: C:\Program Files\Microsoft MPI\Bin\, Action -1610612735
    WriteEnvironmentStrings: Name: MSMPI_BIN, Value: C:\Program Files\Microsoft MPI\Bin\, Action 536870913
    Action 17:15:27: InstallMpiexecFwRuleIn. 
    CAQuietExec:  Entering CAQuietExec in C:\Windows\Installer\MSI170F.tmp, version 3.6.3303.0
    CAQuietExec:  "C:\Windows\SysWOW64\netsh.exe" advfirewall firewall add rule name=MSMPI-MPIEXEC dir=in action=allow program="C:\Program Files\Microsoft MPI\Bin\mpiexec.exe" profile=any
    CAQuietExec:  
    CAQuietExec:  An error occurred while attempting to contact the
    CAQuietExec:  Windows Firewall service. Make sure that the service is
    CAQuietExec:  running and try your request again.
    CAQuietExec:  
    CAQuietExec:  Error 0x80070001: Command line returned an error.
    CAQuietExec:  Error 0x80070001: CAQuietExec Failed
    CustomAction InstallMpiexecFwRuleIn returned actual error code 1603 (note this may not be 100% accurate if translation happened inside sandbox)
    Action ended 17:15:27: InstallFinalize. Return value 3.
    Action 17:15:27: Rollback. Rolling back action:
    Rollback: InstallMpiexecFwRuleIn
    Rollback: Updating environment strings
    Rollback: Writing system registry values
    Rollback: RegisterEventManifest
    Rollback: RollbackRegisterEventManifest
    CAQuietExec:  Entering CAQuietExec in C:\Windows\Installer\MSI17DB.tmp, version 3.6.3303.0
    CAQuietExec:  "wevtutil.exe" um "C:\Program Files\Microsoft MPI\Bin\mpitrace.man"
    Rollback: Copying new files
    Rollback: Moving files
    Rollback: Creating folders
    Rollback: RemoveSmpdFwRule
    Rollback: RemoveMpiexecFwRule
    Rollback: Updating component registration
    Action ended 17:15:28: INSTALL. Return value 3.
    Action ended 17:15:28: ExecuteAction. Return value 3.
    Action 17:15:28: FatalError. 
    Action start 17:15:28: FatalError.
    Action 17:15:28: FatalError. Dialog created
    Action ended 17:15:31: FatalError. Return value 2.
    Action ended 17:15:31: INSTALL. Return value 3.
    === Logging stopped: 28/01/2016  17:15:31 ===
    MSI (c) (54:5C) [17:15:31:536]: Product: Microsoft MPI (6.0.12436.10) -- Installation failed.
    
    MSI (c) (54:5C) [17:15:31:537]: Windows Installer installed the product. Product Name: Microsoft MPI (6.0.12436.10). Product Version: 6.0.12436.10. Product Language: 1033. Manufacturer: Microsoft Corporation. Installation success or error status: 1603.

      Thanks,

    Guillaume

     

    Thursday, January 28, 2016 4:20 PM
  • Hi Guillaume,

    Do you happen to have a third-party firewall running instead of the Windows Firewall? We do have an issue with the V6 installation when Windows Firewall is disabled. In V7 we chose not to abort the installation when the firewall setup step fails. However, this means that if you have third-party firewall rules set up, you will need to add the proper rules/exceptions to that firewall for smpd and mpiexec. (I'm not sure whether this can cause the 30-second delay, but it might be worth taking a look.)
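
    The firewall-related failures can be pulled out of an installer log like the one posted above with a short script. This is a minimal Python sketch, assuming the log uses the same "CustomAction ... returned actual error code ..." wording as that log; the function name failed_custom_actions is hypothetical.

    ```python
    import re

    # Matches lines such as:
    #   CustomAction InstallMpiexecFwRuleIn returned actual error code 1603 (...)
    CUSTOM_ACTION_RE = re.compile(
        r"CustomAction (?P<name>\S+) returned actual error code "
        r"(?P<code>\d+)(?P<rest>.*)"
    )


    def failed_custom_actions(log_text):
        """Return (action_name, error_code, ignored) tuples from an MSI log.

        `ignored` is True when the installer reports that the error
        "will be translated to success", i.e. it did not abort the install.
        """
        results = []
        for line in log_text.splitlines():
            match = CUSTOM_ACTION_RE.search(line)
            if match:
                ignored = "translated to success" in match.group("rest")
                results.append(
                    (match.group("name"), int(match.group("code")), ignored)
                )
        return results


    if __name__ == "__main__":
        # Path taken from the -log option suggested earlier in the thread.
        with open(r"c:\msmpiv6_install.log", errors="replace") as handle:
            for name, code, ignored in failed_custom_actions(handle.read()):
                status = "ignored" if ignored else "fatal"
                print(f"{name}: error {code} ({status})")
    ```

    In the log above, the two Remove...FwRule actions are reported as translated to success, while InstallMpiexecFwRuleIn is not and is followed by the rollback.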

    Anh

    Thursday, January 28, 2016 8:16 PM