Regarding correct usage of Windows HPC Pack

  • Question

  • Dear all,

    We have been evaluating Windows HPC Pack for quite some time, and its features look promising. In our company, we develop optimization and simulation engines with very long run times, on the order of 24 hours.

    We would like to use Windows HPC Pack to schedule our jobs and resources and to run these long-running processes. Is this the right use case for Windows HPC Pack? I know that HPC Pack can run any executable, but will it work well with these long-running processes too? What could go wrong? Is HPC Pack designed to handle these types of jobs?

    In addition, these long-running processes can be either batch or interactive. Can HPC Pack interactive sessions support long-running services that consume a lot of memory? Are there any limitations for these long-running interactive jobs?

    Thanks,

    Puneet


    Puneet Sharma


    Wednesday, March 8, 2017 10:56 PM

Answers

  • Hi Puneet,

      Supporting long-running jobs won't be a problem for HPC Pack. A few things you might need to take care of for long-running jobs:

    1. As the job runs under a user account, you need to prevent the credential from expiring during job execution.

    2. Disable automatic Windows Update. You need to patch the systems (head node and compute nodes) manually to avoid interrupting running jobs.

    3. It is better to set up high availability (HA).

      By "interactive type", are you referring to SOA jobs? If yes, those should also be supported.


    Qiufang Shi

    Thursday, March 9, 2017 1:11 AM

All replies

  • Thanks a lot Qiufang.

    1. So when using Windows HPC Pack for these long-running jobs, scalability shouldn't be an issue. We are looking at a cluster of around 1000 nodes, where each computational unit runs 2-3 of these long-running jobs. As you mentioned in the previous thread, you have tested the system with 2500 servers and 10k cores.

    2. Yes, by "interactive type" jobs I was referring to SOA jobs. For these SOA jobs we will have long-running sessions of 4-5 hours or so. I believe the SOA session will be sustained over this period with good connectivity. Is there anything else we should look out for? We are also testing this today.
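
As a rough sanity check of the numbers above, here is a minimal Python sketch. It assumes one computational unit per node, which the thread does not confirm; the function name `concurrent_jobs` and the constant `TESTED_SERVERS` are hypothetical names used only for illustration, and the 2500 figure is the tested server count quoted in point 1.

```python
# Rough capacity estimate for the planned cluster.
# Assumption: one computational unit per node (not confirmed in this thread).

TESTED_SERVERS = 2500  # server count quoted from the earlier thread


def concurrent_jobs(nodes, jobs_per_unit_min, jobs_per_unit_max):
    """Return the (low, high) range of jobs running at the same time."""
    return nodes * jobs_per_unit_min, nodes * jobs_per_unit_max


low, high = concurrent_jobs(1000, 2, 3)
print(low, high)               # prints: 2000 3000
print(1000 <= TESTED_SERVERS)  # prints: True
```

So the plan implies roughly 2000 to 3000 long-running jobs in flight at once, on a node count below the tested figure. This only compares node counts; it says nothing about scheduler load from the jobs themselves.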

    Thanks,

    Puneet


    Puneet Sharma

    Thursday, March 9, 2017 11:38 PM