IT 专业人士的资源 > 论坛主页 > Windows HPC Server Job Submission and Scheduling > Cluster Manager Job State stuck on Configuring
提出问题提出问题
 

已答复Cluster Manager Job State stuck on Configuring

  • 2009年6月26日 18:45Phil Molzer 用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     
    Hi,
    I am testing a SOA style service on HPC 2008 Server sp1 cluster.  I have one head node and 2 compute nodes.  All nodes have State=Online and Node Health=OK in the Node Management pane of Cluster Manager.  I ran diagnostics without any issues.

    When my client tries to create a session, the cluster manager's Job Management pane shows 2 items.  The first is Job ID 4 WCF service, the second is Job ID 5 WCF service - Broker for service job 4.  The State for both is 'Configuring'.  There is no furher detail in the Job Details.  My client just hangs.  I let it run overnight and it never got past the configuring stage. 

    I need advice on how to troubleshoot.  I can't find any more information on what is happening during Configuring state.
    thanks
    Phil

答案

  • 2009年6月29日 12:59Phil Molzer 用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     已答复

    I can answer my own question.  I had not put credentials in my SessionStartInfo. I'd assumed that if client and server were running as same user, it would be ok.  For whatever reason, the Cluster Manager does not complain, but it does hang.

全部回复

  • 2009年6月29日 12:59Phil Molzer 用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     已答复

    I can answer my own question.  I had not put credentials in my SessionStartInfo. I'd assumed that if client and server were running as same user, it would be ok.  For whatever reason, the Cluster Manager does not complain, but it does hang.

  • 2009年7月22日 21:52Josh BarnardMSFT, 所有者:用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     
    Thanks.  I'll bring this up and hopefully we can get better error handling in a future version.

    -J
    -Josh
  • 2009年8月5日 20:57tosa.yasunari 用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     
    I'm getting a different exception for my job stuck at "configuring" for job submitted from
    Job Management after HPCPack Sp1.   The error is

    Database Exception
    Procedure or function 'Schd_NextTaskId' expects parameter '@numTasks', which was not supplied.

    Interestingly, if I copy the finished job, it will not stuck at "Configuring" stage.
    Looks like a bug introduced in HPC Pack SP1.

  • 2009年8月7日 17:12Steve Chilcoat - HPC Microsoft 用户奖牌用户奖牌用户奖牌用户奖牌用户奖牌
     

    I don't think I understand the problem.

    Is this a SOA job?

    If the job is stuck in Configuring state with the error:

    Database Exception
    Procedure or function 'Schd_NextTaskId' expects parameter '@numTasks', which was not supplied.

    What did you to get it running so that it is in the Finished state? Did you add numTasks? If so then I would guess that copying the finished job would have numTasks & therefore would work if resubmitted.