Unanswered MPI communication problems with multiple nodes

  • יום שלישי 30 אוגוסט 2011 14:19
     
     

    Hello!

    Could you help me? I try to launch my MPI application, using Microsoft HPC Pack 2008 SDK and two virtual machines, provided Hyper-V. I launch smpd.exe -d via the command lines of the both virtual machines and then I execute the next command via command line of the one of the virtual machines:

     >"C:\Program Files\Microsoft HPC Pack 2008 SDK\Bin\mpiexec.exe" -machi nefile file.txt GlimpseOfMPI.exe

    In case file.txt is:

    10.0.16.6 5 

    everything is OK

    In case file.txt is:

    10.0.16.7 5 

    everything is OK

    But in case file.txt is:

    10.0.16.6 5

    10.0.16.7 5

    I see:

    Running 10 MPI processes, n = 1000

     

    job aborted:

    [ranks] message

     

    [0-4] terminated

     

    [5] fatal error

    Fatal error in MPI_Send: Other MPI error, error stack:

    MPI_Send(175)..................: MPI_Send(buf=0x00267880, count=101, MPI_DOUBLE,

     dest=0, tag=0, MPI_COMM_WORLD) failed

    MPIDI_EagerContigSend(177).....: failure occurred while attempting to send an ea

    ger message

    MPIDI_CH3_iStartMsgv(240)......:

    MPIDI_CH3I_VC_post_connect(405): MPIDI_CH3I_Shm_connect failed in VC_post_connec

    t

    MPIDI_CH3I_Shm_connect(194)....: failed to attach to a bootstrap queue

     

    [6] fatal error

    Fatal error in MPI_Send: Other MPI error, error stack:

    MPI_Send(175)..................: MPI_Send(buf=0x021D7880, count=101, MPI_DOUBLE,

     dest=0, tag=0, MPI_COMM_WORLD) failed

    MPIDI_EagerContigSend(177).....: failure occurred while attempting to send an ea

    ger message

    MPIDI_CH3_iStartMsgv(240)......:

    MPIDI_CH3I_VC_post_connect(405): MPIDI_CH3I_Shm_connect failed in VC_post_connec

    t

    MPIDI_CH3I_Shm_connect(194)....: failed to attach to a bootstrap queue

     

    [7] fatal error

    Fatal error in MPI_Send: Other MPI error, error stack:

    MPI_Send(175)..................: MPI_Send(buf=0x020E7880, count=101, MPI_DOUBLE,

     dest=0, tag=0, MPI_COMM_WORLD) failed

    MPIDI_EagerContigSend(177).....: failure occurred while attempting to send an ea

    ger message

    MPIDI_CH3_iStartMsgv(240)......:

    MPIDI_CH3I_VC_post_connect(405): MPIDI_CH3I_Shm_connect failed in VC_post_connec

    t

    MPIDI_CH3I_Shm_connect(194)....: failed to attach to a bootstrap queue

     

    [8] fatal error

    Fatal error in MPI_Send: Other MPI error, error stack:

    MPI_Send(175)..................: MPI_Send(buf=0x02267880, count=101, MPI_DOUBLE,

     dest=0, tag=0, MPI_COMM_WORLD) failed

    MPIDI_EagerContigSend(177).....: failure occurred while attempting to send an ea

    ger message

    MPIDI_CH3_iStartMsgv(240)......:

    MPIDI_CH3I_VC_post_connect(405): MPIDI_CH3I_Shm_connect failed in VC_post_connec

    t

    MPIDI_CH3I_Shm_connect(194)....: failed to attach to a bootstrap queue

     

    [9] fatal error

    Fatal error in MPI_Send: Other MPI error, error stack:

    MPI_Send(175)..................: MPI_Send(buf=0x008B7880, count=100, MPI_DOUBLE,

     dest=0, tag=0, MPI_COMM_WORLD) failed

    MPIDI_EagerContigSend(177).....: failure occurred while attempting to send an ea

    ger message

    MPIDI_CH3_iStartMsgv(240)......:

    MPIDI_CH3I_VC_post_connect(405): MPIDI_CH3I_Shm_connect failed in VC_post_connec

    t

    MPIDI_CH3I_Shm_connect(194)....: failed to attach to a bootstrap queue

     

    ---- error analysis -----

     

    [5-9] on 10.0.16.7

    mpi has detected a fatal error and aborted GlimpseOfMPI.exe

     

    ---- error analysis -----

     

    Any ideas?I would be very much obliged to you. Thank you!