locked
Compute nodes get stuck in "provisioning" state if I add any node template tasks. RRS feed

  • Question

  • I can assign nodes to the "Default ComputeNode" template or even to my own custom template but only if there are no tasks associated with this template.

    As soon as I add any node template tasks, even ones I can successfully invoke on that node using "Run Command..." on that machine, then when I assign that node to the template it gets stuck in the "provisioning" state for house before timing out:

    Time Message
    2014-11-17 4:17:09 PM Reverted
    2014-11-17 4:17:09 PM Disassociating template from node NORSE\MACHINE01
    2014-11-17 4:17:09 PM The operation failed due to errors during execution.
    2014-11-17 4:17:09 PM The operation failed and will not be retried.
    2014-11-17 4:17:09 PM The operation failed due to errors during execution.
    2014-11-17 4:17:09 PM The operation has timed out.
    2014-11-17 12:16:48 PM Initiating deployment operations for template: My Test 01
    2014-11-17 12:16:48 PM Associating template My Test 01 with node NORSE\MACHINE01
    2014-11-17 12:16:48 PM Moving node NORSE\MACHINE01 from state Offline to state Provisioning
    2014-11-17 12:16:48 PM Assigning template My Test 01 to node MACHINE01


    Is there some way I can find out what is blocking on this call? I've searched all the Event Logs on all the machines and nothing is turning up more information.

    Tuesday, November 18, 2014 6:56 PM

Answers

  • Hi Paineroo,

    You can "Provisioning", "Configuration" or "Deployment“ type tasks only when the node template is with operating system (i.e. node template for bare metal deployment).  If the node template is without operating system, you can only add "Maintanence" type tasks.

    According to the node template you pasted, you are using a node template without operating system, so you can add a custom "Post Installation Command" task if you want to run your custom command when assigning node template. The custom command will also be run when you maintain a node in the node template.

    You can import below node template xml for your reference.

    <?xml version="1.0" encoding="utf-8"?>
    <Template xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      Description=""
      Version="6">
      <Item
        Name="PostInstallationCommand">
        <Parameter
          Name="Command"
          Value="sc query" />
        <Parameter
          Name="WorkingDirectory"
          Value="" />
        <Parameter
          Name="Description"
          Value="Runs a command on the node after HPC Pack has been installed." />
      </Item>
      <Item
        Name="ActivateOsItem">
        <Parameter
          Name="Description"
          Value="Activates the operating system on the node." />
      </Item>
    </Template>

    • Marked as answer by Paineroo Thursday, November 20, 2014 9:46 PM
    Thursday, November 20, 2014 4:27 AM

All replies

  • Can you export the node template and share the content?

    You can right click the node template you want to export and then click "Export" to export it.

    Wednesday, November 19, 2014 3:11 AM
  • <?xml version="1.0" encoding="utf-8"?>
    <Template xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      Description=""
      Version="6">
      <Item
        Name="ExecuteOSCommand">
        <Parameter
          Name="Command"
          Value="sc query" />
        <ParameterList
          Name="ErrorWhiteList" />
        <Parameter
          Name="Description"
          Value="Runs a command as Administrator." />
      </Item>
      <Item
        Name="ActivateOsItem">
        <Parameter
          Name="Description"
          Value="Activates the operating system on the node." />
      </Item>
    </Template>

    (Note I've just used "sc query" as an attempt to invoke *anything* on the target machine. I get the same behavior if I try things like a unicast copy, etc.)

    • Edited by Paineroo Wednesday, November 19, 2014 3:26 PM
    Wednesday, November 19, 2014 3:25 PM
  • Hi Paineroo,

    You can "Provisioning", "Configuration" or "Deployment“ type tasks only when the node template is with operating system (i.e. node template for bare metal deployment).  If the node template is without operating system, you can only add "Maintanence" type tasks.

    According to the node template you pasted, you are using a node template without operating system, so you can add a custom "Post Installation Command" task if you want to run your custom command when assigning node template. The custom command will also be run when you maintain a node in the node template.

    You can import below node template xml for your reference.

    <?xml version="1.0" encoding="utf-8"?>
    <Template xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      Description=""
      Version="6">
      <Item
        Name="PostInstallationCommand">
        <Parameter
          Name="Command"
          Value="sc query" />
        <Parameter
          Name="WorkingDirectory"
          Value="" />
        <Parameter
          Name="Description"
          Value="Runs a command on the node after HPC Pack has been installed." />
      </Item>
      <Item
        Name="ActivateOsItem">
        <Parameter
          Name="Description"
          Value="Activates the operating system on the node." />
      </Item>
    </Template>

    • Marked as answer by Paineroo Thursday, November 20, 2014 9:46 PM
    Thursday, November 20, 2014 4:27 AM
  • Thanks, that fixed it.

    But is there a place I should have been looking for errors that I don't know about? That is, I'm learning this and know I'll make a bunch of silly mistakes, but the problem was that there was nothing indicating what my silly mistake was...

    Thursday, November 20, 2014 9:47 PM
  • You can refer to the online technet library of Microsoft HPC Pack http://technet.microsoft.com/en-us/library/jj899572.aspx

    For the node template, you can refer to http://msdn.microsoft.com/en-us/library/ff919332.aspx

    Monday, November 24, 2014 2:21 AM