RemotingCommunicator: An unexpected error occurred

Вопрос RemotingCommunicator: An unexpected error occurred

  • 2011年10月13日 9:32
     
     

    Hi,


    I have lots of this error message in the Event Viewer -> Microsoft - HPC - Scheduler - Operational : "RemotingCommunicator: An unexpected error occurred trying to start task RemotingCommunicator.1663 on node xxx".

    I wrote xxx because I have found lots of this kind of error with a different number after "on node". Note that I have only two nodes in my cluster (one is head node and also compute node).

    Please, can someone tell me why this error occurs ?
    Thank you very much

全部回复

  • 2011年10月14日 16:53
     
     

    Hi,

    Could you share details of your cluster configuration? Here's what we are interested in:

    - Number, type of nodes and their hardware specs,

    - Networking topology and hardware,

    - Location of the database and version of the SQL Server.

    Thank you,
    Łukasz

  • 2011年10月18日 18:46
     
     

    Hi,

    This type of error messages is usually related to some sort of problems in communication between Scheduler and cluster nodes. These issues might be network related or could be caused by, for example, nodes being rebooted during job/task execution.

    Regards,
    Łukasz

  • 2011年10月19日 13:46
     
     
    I will ask to a colleague more details to post here.
  • 2011年10月24日 14:46
     
     
    Hi,
    We have a cluster of 3 servers (one is head/compute node, the other 2 are compute nodes).
    Servers are HP BL685 with HP on board NICs. They are in different subnets 02,03 and 04, connected to 1 Gig switches, running 1 gig full duplex.
    The database is local to the same server marked as head node. We use the SQL Express that comes with HPC to host the database.