HPC Pack 2016 Installation error.

  • Question

  • I am installing HPC Pack 2016 on a brand-new Windows Server 2016 VM. I selected the head node itself to host the Service Fabric cluster (and everything else). The install runs fine until the very end, where it waits for a very long time while displaying the following messages:

    1) Configuring the fabric cluster

    2) Powershell Dialog with "Copying application to image store"

    and finally giving up like so:

    Component Microsoft HPC Pack 2016 Head node application cannot be installed with error code 1.Failed to create the application for HPC Head node to the Service Fabric cluster..

    Please advise.

    Install log files from C:\Windows\Temp\HPCSetupLogs folder

    Content of ServiceFabricConfig_0.json:

    {
      "name": "HpcCluster",
      "clusterManifestVersion": "1.0.0",
      "apiVersion": "2015-01-01-alpha",
      "nodes": [
        {
          "nodeName": "_Node_0",
          "iPAddress": "MYNODE",
          "nodeTypeRef": "NodeType0",
          "faultDomain": "fd:/dc1/r0",
          "upgradeDomain": "UD0"
        },
        {
          "nodeName": "_Node_1",
          "iPAddress": "MYNODE",
          "nodeTypeRef": "NodeType1",
          "faultDomain": "fd:/dc2/r0",
          "upgradeDomain": "UD1"
        },
        {
          "nodeName": "_Node_2",
          "iPAddress": "MYNODE",
          "nodeTypeRef": "NodeType2",
          "faultDomain": "fd:/dc3/r0",
          "upgradeDomain": "UD2"
        }
      ],
      "properties": {
        "reliabilityLevel": "Bronze",
        "diagnosticsStore": {
          "metadata": "Please replace the diagnostics file share with an actual file share accessible from all cluster machines.",
          "dataDeletionAgeInDays": "7",
          "storeType": "FileShare",
          "IsEncrypted": "false",
          "connectionstring": "c:\\ProgramData\\SF\\DiagnosticsStore"
        },
        "security": {
          "metadata": "The Credential type X509 indicates this is cluster is secured using X509 Certificates. The thumbprint format is - SOME FORMAT.",
          "ClusterCredentialType": "X509",
          "ServerCredentialType": "X509",
          "CertificateInformation": {
            "ClusterCertificate": {
              "Thumbprint": "ALPHANUMERICSTRING",
              "X509StoreName": "My"
            },
            "ServerCertificate": {
              "Thumbprint": "ALPHANUMERICSTRING",
              "X509StoreName": "My"
            }
          }
        },
        "nodeTypes": [
          {
            "name": "NodeType0",
            "clientConnectionEndpointPort": "10100",
            "clusterConnectionEndpointPort": "10101",
            "leaseDriverEndpointPort": "10200",
            "serviceConnectionEndpointPort": "10300",
            "httpGatewayEndpointPort": "10400",
            "applicationPorts": {
              "startPort": "20001",
              "endPort": "20100"
            },
            "ephemeralPorts": {
              "startPort": "20101",
              "endPort": "25100"
            },
            "isPrimary": true
          },
          {
            "name": "NodeType1",
            "clientConnectionEndpointPort": "10102",
            "clusterConnectionEndpointPort": "10103",
            "leaseDriverEndpointPort": "10201",
            "serviceConnectionEndpointPort": "10301",
            "httpGatewayEndpointPort": "10401",
            "applicationPorts": {
              "startPort": "25101",
              "endPort": "25200"
            },
            "ephemeralPorts": {
              "startPort": "25201",
              "endPort": "30200"
            },
            "isPrimary": false
          },
          {
            "name": "NodeType2",
            "clientConnectionEndpointPort": "10104",
            "clusterConnectionEndpointPort": "10105",
            "leaseDriverEndpointPort": "10202",
            "serviceConnectionEndpointPort": "10302",
            "httpGatewayEndpointPort": "10402",
            "applicationPorts": {
              "startPort": "30201",
              "endPort": "30300"
            },
            "ephemeralPorts": {
              "startPort": "30301",
              "endPort": "35300"
            },
            "isPrimary": false
          }
        ],
        "fabricSettings": [
          {
            "name": "Setup",
            "parameters": [
              {
                "name": "FabricDataRoot",
                "value": "C:\\ProgramData\\SF"
              },
              {
                "name": "FabricLogRoot",
                "value": "C:\\ProgramData\\SF\\Log"
              }
            ]
          }
        ]
      }
    }

    Content of ServiceFabricCluster.err file:

    Copy-ServiceFabricApplicationPackage : Operation timed out.
    At C:\Users\a-sriram\Desktop\HPC Pack 2016\ServiceFabric\ServiceFabricSDK\Publish-NewServiceFabricApplication.ps1:230 
    char:9
    +         Copy-ServiceFabricApplicationPackage -ApplicationPackagePath  ...
    +         ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
        + CategoryInfo          : OperationTimeout: (:) [Copy-ServiceFabricApplicationPackage], TimeoutException
        + FullyQualifiedErrorId : CopyApplicationPackageErrorId,Microsoft.ServiceFabric.Powershell.CopyApplicationPackage
     

    • Edited by SRIRAM R Wednesday, January 4, 2017 12:32 AM
    Wednesday, January 4, 2017 12:16 AM

All replies

  • Hi, SRIRAM:

    First of all, thanks for choosing and trying HPC Pack 2016.

    As the error message shows, the problem is that the Copy-ServiceFabricApplicationPackage operation timed out.

    This can happen when installing a single-head-node cluster on a VM whose host is under heavy I/O load.

    To mitigate the problem, please edit the script Publish-NewServiceFabricApplication.ps1 in the ServiceFabric\ServiceFabricSDK folder of the setup files and set the timeout to 1800 seconds, which is the upper bound mentioned in the Service Fabric guidance: https://docs.microsoft.com/en-us/powershell/servicefabric/vlatest/copy-servicefabricapplicationpackage. After the change, the Copy-ServiceFabricApplicationPackage call in Publish-NewServiceFabricApplication.ps1 should look like this:

    Copy-ServiceFabricApplicationPackage -ApplicationPackagePath $AppPkgPathToUse -ImageStoreConnectionString $imageStoreConnectionString -ApplicationPackagePathInImageStore $applicationPackagePathInImageStore -TimeoutSec 1800

    And try the setup again.
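For anyone who would rather script the change than edit the file by hand, here is a minimal sketch in Python. It appends `-TimeoutSec` to any single-line call of `Copy-ServiceFabricApplicationPackage` that does not already have it. The helper name `add_timeout` is hypothetical, and the sketch assumes the call sits on one line (as the truncated error output suggests); lines that end in a PowerShell backtick continuation, or that are commented out, are left untouched. Back up the script before writing any result to it.

```python
import re

CMDLET = "Copy-ServiceFabricApplicationPackage"


def add_timeout(script_text: str, timeout_sec: int = 1800) -> str:
    """Return script_text with -TimeoutSec appended to single-line cmdlet calls.

    Hypothetical helper: only lines that begin (after indentation) with the
    cmdlet name are treated as calls, so commented-out lines are ignored.
    """
    patched = []
    for line in script_text.splitlines(keepends=True):
        body = line.rstrip("\r\n")          # line content without its newline
        ending = line[len(body):]           # preserve the original line ending
        stripped = body.strip()
        is_call = re.match(re.escape(CMDLET) + r"(\s|$)", stripped, re.IGNORECASE) is not None
        has_timeout = re.search(r"-TimeoutSec\b", body, re.IGNORECASE) is not None
        continues = body.rstrip().endswith("`")  # multi-line call: skip, edit by hand
        if is_call and not has_timeout and not continues:
            body = body.rstrip() + f" -TimeoutSec {timeout_sec}"
        patched.append(body + ending)
    return "".join(patched)


if __name__ == "__main__":
    sample = (
        "        Copy-ServiceFabricApplicationPackage "
        "-ApplicationPackagePath $AppPkgPathToUse "
        "-ImageStoreConnectionString $imageStoreConnectionString "
        "-ApplicationPackagePathInImageStore $applicationPackagePathInImageStore\r\n"
    )
    print(add_timeout(sample), end="")
```

Running the function twice on the same text changes nothing the second time, so it is safe to re-apply. To use it on the real file, read Publish-NewServiceFabricApplication.ps1 from the setup folder, pass its text through `add_timeout`, and write the result back.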

    In addition, a head node with 8 cores, 16 GB of memory, and good I/O performance is recommended. (For more detail, please refer to the "Get Started Guide for HPC Pack 2016".)

    Best Regards,

    Jason 

    Wednesday, January 4, 2017 2:46 AM