How do set some type of monitoring if the compute node is offline? RRS feed

  • Question

  • I wanted to be alerted if one of the compute node happens to be offline or unreachable?  We have SCOM in place.   Thanks all!
    Thursday, April 8, 2010 5:46 PM


All replies

  • Have you imported Windows HPC Management Pack?

    If Yes, then we do have a "Compute Node Reachable" monitor for each compute node. It will be in critical state if the compute node is unreachable. The alert is not automatically on, but you can easily turn on it by overriding the parameter for the monitor. Please let me know if you still have question on it. Thanks.

    Friday, April 9, 2010 10:52 AM
  • I just downloaded the MP for HPC.  Will look into it, but that'll seem do the trick. 

    Thank you!

    Friday, April 9, 2010 8:25 PM