Headnode alternate between unreachable and then ready<p align=left><font face=Arial size=2>Hi,</font></p> <p align=left> </p> <p align=left>I MS RDP into the Headnode and the compute maangement MMC to manage the compute clusters of two compute nodes + one headnode. However, after operating for a while, the headnode will become unreachable and my MMC will disconnect. Then, after a while, about 5 minutes, the headnode become reachable again. Can anyone advise what when wrong?</p> <p align=left> </p> <p align=left>I have three networks.</p> <p align=left>Public network</p> <p align=left>MPI network</p> <p align=left>Private network</p> <p align=left> </p> <p align=left>1. What should be the network binding order? Currently, mine is public, MPI and private. Could this be the caused of the problem?</p> <p align=left> </p> <p align=left>2. All my three nodes (1 HN and 2 CN) have 8 cores, total: 24 cores. When I submit a job and choose 16 cores, everything run less than 1 minutes. When it goes beyond 16 cores, it seem to run forever? Anyway to tell where the compute cluster is hung at?</p> <p align=left> </p>© 2009 Microsoft Corporation. All rights reserved.Fri, 22 May 2009 20:52:45 Zfb2e8823-eecc-4621-a8cf-4b6628554547http://social.microsoft.com/Forums/en-US/windowshpcitpros/thread/fb2e8823-eecc-4621-a8cf-4b6628554547#fb2e8823-eecc-4621-a8cf-4b6628554547http://social.microsoft.com/Forums/en-US/windowshpcitpros/thread/fb2e8823-eecc-4621-a8cf-4b6628554547#fb2e8823-eecc-4621-a8cf-4b6628554547Soon Henghttp://social.microsoft.com/Profile/en-US/?user=Soon%20HengHeadnode alternate between unreachable and then ready<p align=left><font face=Arial size=2>Hi,</font></p> <p align=left> </p> <p align=left>I MS RDP into the Headnode and the compute maangement MMC to manage the compute clusters of two compute nodes + one headnode. However, after operating for a while, the headnode will become unreachable and my MMC will disconnect. Then, after a while, about 5 minutes, the headnode become reachable again. Can anyone advise what when wrong?</p> <p align=left> </p> <p align=left>I have three networks.</p> <p align=left>Public network</p> <p align=left>MPI network</p> <p align=left>Private network</p> <p align=left> </p> <p align=left>1. What should be the network binding order? Currently, mine is public, MPI and private. Could this be the caused of the problem?</p> <p align=left> </p> <p align=left>2. All my three nodes (1 HN and 2 CN) have 8 cores, total: 24 cores. When I submit a job and choose 16 cores, everything run less than 1 minutes. When it goes beyond 16 cores, it seem to run forever? Anyway to tell where the compute cluster is hung at?</p> <p align=left> </p>Wed, 04 Jun 2008 15:25:12 Z2008-06-04T15:25:12Zhttp://social.microsoft.com/Forums/en-US/windowshpcitpros/thread/fb2e8823-eecc-4621-a8cf-4b6628554547#9c936117-2509-467e-8dfa-1e1e7e752c68http://social.microsoft.com/Forums/en-US/windowshpcitpros/thread/fb2e8823-eecc-4621-a8cf-4b6628554547#9c936117-2509-467e-8dfa-1e1e7e752c68Don Patteehttp://social.microsoft.com/Profile/en-US/?user=Don%20PatteeHeadnode alternate between unreachable and then readyHopefully your issue has been resolved since it was posted so long ago. If you are still encountering the problem please start a new thread on the forum. We weren't that great at managing our forum in the past, I apologize for that, but we've made serious improvements in handling it and will get to all the new posts now.Fri, 22 May 2009 20:52:40 Z2009-05-22T20:52:40Z