none
Problem launching simple jobs (calc.exe)

    Question

  • In order to become familiar with the HPC pack, I decided to launch simple jobs. Here, I planned to launch calc.exe via the job manager. All I did was to set the command in edit section to %windir%\system32\calc.exe. As you can see in the pictures below, the manager says the job is running on the head node, however, I don't see the calculator.

    Thursday, 26 April 2018 10:06 AM

All replies

  • You should see through the task manager. By default it will not be executed on your logged on session.

    You could check this doc https://docs.microsoft.com/en-us/previous-versions/windows/it-pro/windows-hpc-server-2008R2/gg247477(v=ws.10) to see how to run the calc.exe in your session


    Qiufang Shi

    Thursday, 26 April 2018 7:00 PM
  • I did that but it doesn't work. According to the document you mentioned, I created two registry keys (HpcConsoleSupport and SoftwareSASGeneration) on all nodes including the head node. The I rebooted all machines and then submitted the calc.exe on compute01. As you can see in the picture, although I used remote desktop to connect to compute01, I don't see any calculator. Though the job manager says the job is running


    • Edited by mahmoodn Friday, 27 April 2018 10:03 AM
    Friday, 27 April 2018 10:02 AM
  • Did you set the right environment variable in your job?

    Qiufang Shi

    Monday, 30 April 2018 4:30 PM
  • When I create a job, in the environment section, I add HPC_CREATECONSOLE and set it to TRUE. By explicitly specifying compute01, and after clicking Submit, it asks for account password (WINCLUSTER\administrator) and the job eventually fails. See the screen shot please.

    Wednesday, 2 May 2018 12:43 PM
  • Hi

    Isn't there any idea? i have stuck at this problem.

    Friday, 4 May 2018 9:39 AM
  • Could you share the failed error message?

    Qiufang Shi

    Friday, 4 May 2018 5:19 PM
  • I can not find the error message. Where is that?
    Saturday, 5 May 2018 6:14 AM
  • try run command:

    job view <jobid> /detailed

    and

    task view <jobid>.<taskId> /detailed

    Or you could view the job/task detail through job GUI. Usually the error message is printed in the output 


    Qiufang Shi

    Monday, 7 May 2018 2:47 AM
  • OK here is the output

    PS C:\Users\Administrator> job view 75 /detailed
    Id                               : 75
    Name                             : r
    SubmitTime                       : 5/9/2018 9:53:21 AM
    CreateTime                       : 5/9/2018 9:52:29 AM
    StartTime                        : 5/9/2018 9:53:22 AM
    EndTime                          : 5/9/2018 9:53:22 AM
    ChangeTime                       : 5/9/2018 9:53:22 AM
    UnitType                         : Core
    MinCores                         : 1
    MaxCores                         : 1
    MinSockets                       : 1
    MaxSockets                       : 1
    MinNodes                         : 1
    MaxNodes                         : 1
    RunUntilCanceled                 : False
    IsExclusive                      : False
    ErrorCode                        : -2147218981
    ErrorParams                      : COMPUTE01|||HPC_CREATECONSOLE is set to TRUE, but the job owner is already logged in
    to the console on this node. To attach to an existing console, use HPC_ATTACHTOCONSOLE.
    State                            : Failed
    PreviousState                    : Running
    JobType                          : Batch
    Priority                         : Normal
    RequestedNodes                   : COMPUTE01
    RequiredNodes                    :
    IsBackfill                       : False
    NextTaskNiceID                   : 2
    HasGrown                         : False
    HasShrunk                        : False
    OrderBy                          :
    TaskLevelUpdateTime              : 5/9/2018 9:53:22 AM
    MinMaxUpdateTime                 : 5/9/2018 9:53:22 AM
    ComputedMinCores                 : 1
    ComputedMaxCores                 : 1
    RequeueCount                     : 3
    AutoRequeueCount                 : 3
    FailureReason                    : None
    PendingReason                    :
    AutoCalculateMax                 : True
    AutoCalculateMin                 : True
    NumberOfCalls                    : 0
    NumberOfOutstandingCalls         : 0
    CallDuration                     : 0
    CallsPerSecond                   : 0
    FailOnTaskFailure                : False
    Preemptable                      : True
    Progress                         : 100
    ProgressMessage                  :
    ExpandedPriority                 : 2000
    NotifyOnStart                    : False
    NotifyOnCompletion               : False
    RuntimeType                      : 0
    JobValidExitCodes                : 0
    FailDependentTasks               : False
    EstimatedProcessMemory           : 0
    TaskExecutionFailureRetryLimit   : 0
    HasNodePrepTask                  : False
    HasNodeReleaseTask               : False
    HasServiceTask                   : False
    ProfileMinResources              : 1
    ProfileMaxResources              : 2147483647
    ProjectId                        : 1
    JobTemplateId                    : 1
    OwnerId                          : 3
    RunAsId                          : 3
    ClientSourceId                   : 3
    CredentialId                     : 3
    IsAuthenticated                  : True
    IsMultiPhase                     : False
    ProxyNode                        : COMPUTE01
    PoolId                           : 1
    SelectedComputeNodeListId        : -1
    HasUnfinishedParentJobs          : False
    HasParentJobs                    : False
    Project                          :
    JobTemplate                      : Default
    DefaultTaskGroupId               : 75
    Owner                            : WINCLUSTER\Administrator
    TaskCount                        : 1
    ConfiguringTaskCount             : 0
    SubmittedTaskCount               : 0
    ValidatingTaskCount              : 0
    QueuedTaskCount                  : 0
    DispatchingTaskCount             : 0
    RunningTaskCount                 : 0
    FinishingTaskCount               : 0
    FinishedTaskCount                : 0
    FailedTaskCount                  : 1
    CanceledTaskCount                : 0
    CancelingTaskCount               : 0
    ClientSource                     : HpcClusterManager
    RequestCancel                    : None
    Pool                             : Default
    OfflineResourceCount             : 0
    IdleResourceCount                : 0
    ReservedResourceCount            : 0
    JobScheduledResourceCount        : 0
    ReadyForTaskResourceCount        : 0
    TaskScheduledResourceCount       : 0
    JobTaskScheduledResourceCount    : 0
    TaskDispatchedResourceCount      : 0
    TaskRunningResourceCount         : 0
    CloseTaskResourceCount           : 0
    CloseTaskDispatchedResourceCount : 0
    TaskClosedResourceCount          : 0
    CloseJobResourceCount            : 0
    AllocatedCores                   : COMPUTE01 1
    AllocatedNodes                   : COMPUTE01 1
    AllocatedSockets                 : COMPUTE01 1
    ParentJobIds                     :
    ChildJobIds                      :
    ErrorMessage                     : Job failed to start on some nodes or some nodes became unreachable.
    NodeGroupOp                      : Intersect
    Environment Variables            : HPC_CREATECONSOLE=TRUE
    PS C:\Users\Administrator>
    PS C:\Users\Administrator>
    PS C:\Users\Administrator>
    PS C:\Users\Administrator> task view 75
    Task Id                         : 75.1
    State                           : Failed
    Task Name                       : My Task
    Command Line                    : calc.exe
    Resource Request                : 1-1 cores
    Allocated Nodes                 : COMPUTE01
    Exit Code                       :
    Error Message                   :
    Error from node: COMPUTE01:Microsoft.Hpc.Activation.NodeManagerException: HPC_CREATECONSOLE is set to TRUE, but the job
    owner is already logged in to the console on this node. To attach to an existing console, use HPC_ATTACHTOCONSOLE.
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.LogonToConsole(SecureString password)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.Init(String userAccount, SecureString password, Byte[] certifi
    cate, CreateConsoleConnection createConsole, ConsoleConnection connectConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntryFactory.GetJobEntry(Int32 jobId, String userAccount, Cipher cip
    her, Byte[] cipherText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection conne
    ctConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJob(Int32 jobId, String userAccount, Byte[] cip
    herText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection connectConsole, Sess
    ionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJobAndTask(Int32 jobId, String userAccount, Byt
    e[] cipherText, Byte[] iv, Byte[] certificate, Int32 taskId, ProcessStartInfo startInfo)
    Output                          :
    Start Time                      : 5/9/2018 9:53:22 AM
    End Time                        : 5/9/2018 9:53:22 AM
    Total Kernel Time               :
    Total User Time                 :

    PS C:\Users\Administrator> task view 75.1 /detailed
    Id                               : 83
    SubmitTime                       : 5/9/2018 9:53:21 AM
    CreateTime                       : 5/9/2018 9:53:17 AM
    StartTime                        : 5/9/2018 9:53:22 AM
    EndTime                          : 5/9/2018 9:53:22 AM
    ChangeTime                       : 5/9/2018 9:53:22 AM
    ErrorCode                        : -2147218990
    ErrorParams                      : COMPUTE01:Microsoft.Hpc.Activation.NodeManagerException: HPC_CREATECONSOLE is set to
    TRUE, but the job owner is already logged in to the console on this node. To attach to an existing console, use HPC_ATTA
    CHTOCONSOLE.
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.LogonToConsole(SecureString password)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.Init(String userAccount, SecureString password, Byte[] certifi
    cate, CreateConsoleConnection createConsole, ConsoleConnection connectConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntryFactory.GetJobEntry(Int32 jobId, String userAccount, Cipher cip
    her, Byte[] cipherText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection conne
    ctConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJob(Int32 jobId, String userAccount, Byte[] cip
    herText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection connectConsole, Sess
    ionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJobAndTask(Int32 jobId, String userAccount, Byt
    e[] cipherText, Byte[] iv, Byte[] certificate, Int32 taskId, ProcessStartInfo startInfo)
    State                            : Failed
    PreviousState                    : Dispatching
    ParentJobId                      : 75
    Closed                           : False
    RequeueCount                     : 3
    AutoRequeueCount                 : 3
    FailureReason                    : ResourceFailure
    PendingReason                    :
    FailedNodeID                     : 2
    InstanceId                       : 0
    Output                           :
    ExecutionFailureRetryCount       : 0
    RecordId                         : 83
    Name                             : My Task
    MinCores                         : 1
    MaxCores                         : 1
    NiceId                           : 1
    CommandLine                      : calc.exe
    StdOutFilePath                   : c:\output.txt
    StdErrFilePath                   : c:\err.txt
    HasCustomProps                   : False
    StartValue                       : 0
    EndValue                         : 0
    IncrementValue                   : 1
    GroupId                          : 75
    Type                             : Basic
    IsServiceConcluded               : False
    FailJobOnFailure                 : False
    CurrentInstanceId                : 0
    ParentJobState                   : Failed
    UnitType                         : Core
    CredentialId                     : 3
    ParametricRunningCount           : 0
    ParametricCanceledCount          : 0
    ParametricFailedCount            : 0
    ParametricQueuedCount            : 0
    ParametricFinishedCount          : 0
    ParametricConfiguringCount       : 0
    ParametricTotalCount             : 0
    ParametricDispatchingCount       : 0
    ParametricFinishingCount         : 0
    ParametricCancelingCount         : 0
    ParametricSubmittedCount         : 0
    ParametricValidatingCount        : 0
    ParametricFailedNonPreemptionCount : 0
    RequestCancel                    : None
    OfflineResourceCount             : 0
    IdleResourceCount                : 0
    ReservedResourceCount            : 0
    JobScheduledResourceCount        : 0
    ReadyForTaskResourceCount        : 0
    TaskScheduledResourceCount       : 0
    JobTaskScheduledResourceCount    : 0
    TaskDispatchedResourceCount      : 0
    TaskRunningResourceCount         : 0
    CloseTaskResourceCount           : 0
    CloseTaskDispatchedResourceCount : 0
    TaskClosedResourceCount          : 0
    CloseJobResourceCount            : 0
    AllocatedCores                   : COMPUTE01 1
    AllocatedNodes                   : COMPUTE01 1
    AllocatedSockets                 : COMPUTE01 1
    ProcessIds                       :
    AllocatedCoreIds                 : COMPUTE01:0
    ErrorMessage                     : Error from node: COMPUTE01:Microsoft.Hpc.Activation.NodeManagerException: HPC_CREATEC
    ONSOLE is set to TRUE, but the job owner is already logged in to the console on this node. To attach to an existing cons
    ole, use HPC_ATTACHTOCONSOLE.
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.LogonToConsole(SecureString password)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntry.Init(String userAccount, SecureString password, Byte[] certifi
    cate, CreateConsoleConnection createConsole, ConsoleConnection connectConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.JobEntryFactory.GetJobEntry(Int32 jobId, String userAccount, Cipher cip
    her, Byte[] cipherText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection conne
    ctConsole, SessionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJob(Int32 jobId, String userAccount, Byte[] cip
    herText, Byte[] iv, Byte[] certificate, CreateConsoleConnection createConnection, ConsoleConnection connectConsole, Sess
    ionConnection connectSession)
       at Microsoft.Hpc.NodeManager.RemotingExecutor.RemotingNMExecImpl.StartJobAndTask(Int32 jobId, String userAccount, Byt
    e[] cipherText, Byte[] iv, Byte[] certificate, Int32 taskId, ProcessStartInfo startInfo)
    Environment Variables            :
    PS C:\Users\Administrator> ping compute01

    Pinging COMPUTE01 [10.0.0.4] with 32 bytes of data:
    Reply from 10.0.0.4: bytes=32 time<1ms TTL=128
    Reply from 10.0.0.4: bytes=32 time<1ms TTL=128

    Ping statistics for 10.0.0.4:
        Packets: Sent = 2, Received = 2, Lost = 0 (0% loss),
    Approximate round trip times in milli-seconds:
        Minimum = 0ms, Maximum = 0ms, Average = 0ms
    Control-C
    PS C:\Users\Administrator>


    • Edited by mahmoodn Friday, 11 May 2018 6:28 AM
    Friday, 11 May 2018 6:18 AM
  • I noticed the following error from the previous post

    ErrorMessage                     : Error from node: COMPUTE01:Microsoft.Hpc.Activation.NodeManagerException: HPC_CREATEC
    ONSOLE is set to TRUE, but the job owner is already logged in to the console on this node. To attach to an existing cons
    ole, use HPC_ATTACHTOCONSOLE.

    So, instead of HPC_CREATECONSOLE, I only set HPC_ATTACHCONSOLE to TRUE. As you can see in the picture, the job is said to be running, however, when I remote desktop to compute01, there is no calc.exe

    I also, ran the command to inspect the details. But the output of the command looks fine.

    PS C:\Users\Administrator> job view 76 /detailed
    Id                               : 76
    Name                             : e
    SubmitTime                       : 5/9/2018 10:08:41 AM
    CreateTime                       : 5/9/2018 10:08:14 AM
    StartTime                        : 5/9/2018 10:08:42 AM
    ChangeTime                       : 5/9/2018 10:08:48 AM
    UnitType                         : Core
    MinCores                         : 1
    MaxCores                         : 1
    MinSockets                       : 1
    MaxSockets                       : 1
    MinNodes                         : 1
    MaxNodes                         : 1
    RunUntilCanceled                 : False
    IsExclusive                      : False
    ErrorCode                        : 0
    ErrorParams                      :
    State                            : Running
    PreviousState                    : Queued
    JobType                          : Batch
    Priority                         : Normal
    RequestedNodes                   : COMPUTE01
    RequiredNodes                    :
    IsBackfill                       : False
    NextTaskNiceID                   : 2
    HasGrown                         : False
    HasShrunk                        : False
    OrderBy                          :
    TaskLevelUpdateTime              : 5/9/2018 10:08:42 AM
    MinMaxUpdateTime                 : 5/9/2018 10:08:42 AM
    ComputedMinCores                 : 1
    ComputedMaxCores                 : 1
    RequeueCount                     : 0
    AutoRequeueCount                 : 0
    FailureReason                    : None
    PendingReason                    :
    AutoCalculateMax                 : True
    AutoCalculateMin                 : True
    NumberOfCalls                    : 0
    NumberOfOutstandingCalls         : 0
    CallDuration                     : 0
    CallsPerSecond                   : 0
    FailOnTaskFailure                : False
    Preemptable                      : True
    Progress                         : 0
    ProgressMessage                  :
    ExpandedPriority                 : 2000
    NotifyOnStart                    : False
    NotifyOnCompletion               : False
    RuntimeType                      : 0
    JobValidExitCodes                : 0
    FailDependentTasks               : False
    EstimatedProcessMemory           : 0
    TaskExecutionFailureRetryLimit   : 0
    HasNodePrepTask                  : False
    HasNodeReleaseTask               : False
    HasServiceTask                   : False
    ProfileMinResources              : 1
    ProfileMaxResources              : 2147483647
    ProjectId                        : 1
    JobTemplateId                    : 1
    OwnerId                          : 3
    RunAsId                          : 3
    ClientSourceId                   : 3
    CredentialId                     : 3
    IsAuthenticated                  : True
    IsMultiPhase                     : False
    ProxyNode                        : COMPUTE01
    PoolId                           : 1
    SelectedComputeNodeListId        : -1
    HasUnfinishedParentJobs          : False
    HasParentJobs                    : False
    Project                          :
    JobTemplate                      : Default
    DefaultTaskGroupId               : 76
    Owner                            : WINCLUSTER\Administrator
    TaskCount                        : 1
    ConfiguringTaskCount             : 0
    SubmittedTaskCount               : 0
    ValidatingTaskCount              : 0
    QueuedTaskCount                  : 0
    DispatchingTaskCount             : 0
    RunningTaskCount                 : 1
    FinishingTaskCount               : 0
    FinishedTaskCount                : 0
    FailedTaskCount                  : 0
    CanceledTaskCount                : 0
    CancelingTaskCount               : 0
    ClientSource                     : HpcClusterManager
    RequestCancel                    : None
    Pool                             : Default
    OfflineResourceCount             : 0
    IdleResourceCount                : 0
    ReservedResourceCount            : 0
    JobScheduledResourceCount        : 0
    ReadyForTaskResourceCount        : 0
    TaskScheduledResourceCount       : 0
    JobTaskScheduledResourceCount    : 0
    TaskDispatchedResourceCount      : 0
    TaskRunningResourceCount         : 1
    CloseTaskResourceCount           : 0
    CloseTaskDispatchedResourceCount : 0
    TaskClosedResourceCount          : 0
    CloseJobResourceCount            : 0
    TotalKernelTime                  : 31
    TotalUserTime                    : 15
    MemoryUsed                       : 6124
    AllocatedCores                   : COMPUTE01 1
    AllocatedNodes                   : COMPUTE01 1
    AllocatedSockets                 : COMPUTE01 1
    ParentJobIds                     :
    ChildJobIds                      :
    ErrorMessage                     :
    NodeGroupOp                      : Intersect
    Environment Variables            : HPC_ATTACHCONSOLE=TRUE
    PS C:\Users\Administrator>

    PS C:\Users\Administrator> task view 76.1 /detailed
    Id                               : 84
    SubmitTime                       : 5/9/2018 10:08:41 AM
    CreateTime                       : 5/9/2018 10:08:38 AM
    StartTime                        : 5/9/2018 10:08:42 AM
    ChangeTime                       : 5/9/2018 10:08:42 AM
    ErrorCode                        : 0
    ErrorParams                      :
    State                            : Running
    PreviousState                    : Dispatching
    ParentJobId                      : 76
    Closed                           : False
    RequeueCount                     : 0
    AutoRequeueCount                 : 0
    FailureReason                    : None
    PendingReason                    :
    FailedNodeID                     : 0
    InstanceId                       : 0
    ExecutionFailureRetryCount       : 0
    RecordId                         : 84
    Name                             : My Task
    MinCores                         : 1
    MaxCores                         : 1
    NiceId                           : 1
    CommandLine                      : calc.exe
    StdOutFilePath                   : c:\output.txt
    StdErrFilePath                   : c:\err.txt
    HasCustomProps                   : False
    StartValue                       : 0
    EndValue                         : 0
    IncrementValue                   : 1
    GroupId                          : 76
    Type                             : Basic
    IsServiceConcluded               : False
    FailJobOnFailure                 : False
    CurrentInstanceId                : 0
    ParentJobState                   : Running
    UnitType                         : Core
    CredentialId                     : 3
    ParametricRunningCount           : 0
    ParametricCanceledCount          : 0
    ParametricFailedCount            : 0
    ParametricQueuedCount            : 0
    ParametricFinishedCount          : 0
    ParametricConfiguringCount       : 0
    ParametricTotalCount             : 0
    ParametricDispatchingCount       : 0
    ParametricFinishingCount         : 0
    ParametricCancelingCount         : 0
    ParametricSubmittedCount         : 0
    ParametricValidatingCount        : 0
    ParametricFailedNonPreemptionCount : 0
    RequestCancel                    : None
    TotalKernelTime                  : 31
    TotalUserTime                    : 15
    MemoryUsed                       : 6124
    OfflineResourceCount             : 0
    IdleResourceCount                : 0
    ReservedResourceCount            : 0
    JobScheduledResourceCount        : 0
    ReadyForTaskResourceCount        : 0
    TaskScheduledResourceCount       : 0
    JobTaskScheduledResourceCount    : 0
    TaskDispatchedResourceCount      : 0
    TaskRunningResourceCount         : 1
    CloseTaskResourceCount           : 0
    CloseTaskDispatchedResourceCount : 0
    TaskClosedResourceCount          : 0
    CloseJobResourceCount            : 0
    AllocatedCores                   : COMPUTE01 1
    AllocatedNodes                   : COMPUTE01 1
    AllocatedSockets                 : COMPUTE01 1
    ProcessIds                       : COMPUTE01:3536 1736 5592
    AllocatedCoreIds                 : COMPUTE01:0
    ErrorMessage                     :
    Environment Variables            :
    PS C:\Users\Administrator>


    • Edited by mahmoodn Friday, 11 May 2018 6:30 AM
    Friday, 11 May 2018 6:24 AM