locked
feature req: mark cancelled tasks as such instead of failed RRS feed

  • Question

  • I am running HPC 2008 R2, homogeneous environment.

    I have 16000 tasks.  I have my job set to fail if any one of the individual tasks fails.

    The problem is the failed task is co-mingled with all the other "failed" tasks with message "The task's job has been cancelled. Please see the Job details for additional information."

     

    I think a cancelled task state would be very helpful to distinguish these two states.

     

    One of them fails to connect to the scheduler and times out.

    Exception was:

    Could not register with the server. Try again later.

    stack was    at Microsoft.Hpc.Scheduler.Store.StoreServer._Connect()

       at Microsoft.Hpc.Scheduler.Store.StoreServer.Connect(String server, Int32 port)

       at Microsoft.Hpc.Scheduler.Store.SchedulerStoreSvc..ctor(String server, Int32 port)

       at Microsoft.Hpc.Scheduler.Store.SchedulerStore.Connect(String server)

       at Microsoft.Hpc.Scheduler.Scheduler.Connect(String cluster)

     

     


    Saturday, April 23, 2011 3:19 PM