locked
Job Submission very slow RRS feed

  • Question

  • We have a cluster comprising a virtualised head node (a single VM on a 2xdual-core processor server with 16GB memory) and 2 compute nodes (each node is a physical server with 2 x quad core processors and 32GB memory).  We have a job with 5000 tasks which we submit to the cluster, the submission takes around 2.5 minutes.  Our Microsoft contact for this (we are doing a proof of concept of HPC) has built his own cluster of similar spec except that he is using a physical head node (running on a blade I believe); on his cluster it is taking 6 seconds.
    We have therefore built a new, physcial, head node using the fastest spare machine we have - it has 4 x dual core processors and 64GB memory.  The submission time has now gone up to over 4 minutes !
    All the machines are using HPC pack SP1 and we are using SQL Express for the database.  This matches the configuration of our Microsoft contacts cluster.

    Any assistance on how we may be able to improve the submission performance would be much appreciated.

    Thanks,
    Trevor Davies. 
    Thursday, January 21, 2010 11:01 AM

Answers

  • We have just discovered what was causing the issue - may be useful for other new users of HPC.  We downloaded the Windows 2008 HPC Server (service pack'd version) with HPC pack.  We had 'assumed' that because the download included the OS and HPC pack and the OS was service pack'd that the HPC pack was too.  Turns out it was not.  As soon as we applied SP1 to the HPC pack, submission times reduced from 4+ minutes to 8 seconds !
    Monday, January 25, 2010 9:43 AM

All replies

  • If you can change your job from 5000 tasks to 1 parametric task with 5000 subtasks it should submity a lot faster.
    Friday, January 22, 2010 6:31 PM
  • We have just discovered what was causing the issue - may be useful for other new users of HPC.  We downloaded the Windows 2008 HPC Server (service pack'd version) with HPC pack.  We had 'assumed' that because the download included the OS and HPC pack and the OS was service pack'd that the HPC pack was too.  Turns out it was not.  As soon as we applied SP1 to the HPC pack, submission times reduced from 4+ minutes to 8 seconds !
    Monday, January 25, 2010 9:43 AM