Answered by:
Loading XML & submitting takes lot of time

Question
-
Hello,
We run stochastic simulations on our cluster; this kind of simulations require the submission of a lot of runs with several different parameters, passed at the command line. So, we generate and save an XML job file, load it, and submit it. With CCS 2003 loading was as fast as possible (just 1/2 seconds for a job with 30k tasks). On HPC 2008 it takes almost half an hour, on the default Job template (no additional validation required).
Is that expected or is there something wrong with our deployment?
Which are the exact steps going on during submission from an XML file? (does it do schema validation? other validations? Is it possible to turn them off in some config file/reg key?)
Thanks,
LorenzoMonday, March 30, 2009 7:26 AM
Answers
-
They are 30000 individual tasks. We cannot use parametric tasks for this particular application (it acts on different files and parameter combinations, which can not be summarized by a single integer value).
Lorenzo- Proposed as answer by Josh BarnardModerator Thursday, April 2, 2009 10:03 PM
- Marked as answer by Lorenzo Friday, April 3, 2009 7:12 AM
Wednesday, April 1, 2009 2:55 PM -
Lorenzo,
We have some fixes around this that should be included in our first Serivce Pack; please install that SP on your client and head node when it comes out and hopefully times will be significantly reduced!
Thanks,
Josh
-Josh- Proposed as answer by Josh BarnardModerator Thursday, May 28, 2009 9:32 PM
- Marked as answer by Lorenzo Friday, May 29, 2009 7:12 AM
Thursday, May 28, 2009 9:32 PMModerator
All replies
-
Lorenzo,
Are the 30,000 tasks individual tasks or 30,000 parametric tasks?
ThanksWednesday, April 1, 2009 1:56 PM -
They are 30000 individual tasks. We cannot use parametric tasks for this particular application (it acts on different files and parameter combinations, which can not be summarized by a single integer value).
Lorenzo- Proposed as answer by Josh BarnardModerator Thursday, April 2, 2009 10:03 PM
- Marked as answer by Lorenzo Friday, April 3, 2009 7:12 AM
Wednesday, April 1, 2009 2:55 PM -
Slow performance when adding a large number of tasks to a job is a known issue that we are looking into and hope to address soon. In the meantime I suggest you use Parametric Sweeps as they will be much more performant to create.
You can work around the problem you mentoined above by wrapping you task in a batch (or PSH, or ...) script that starts your application with the correct set of parameters. You can use the * in your sweep to represent the index into the array of parameter sets, for example.
Thanks!
Josh
-JoshThursday, April 2, 2009 10:03 PMModerator -
Hello,
I'd like to avoid the creation of batch; it's cumbersome and error prone.
I suppose that we'll either write our own submission application (so that at least the user has to not wait minutes to press the "submit" button) or go back to CCS 2003 till the issue is fixed.
Thanks!
LorenzoFriday, April 3, 2009 7:15 AM -
Lorenzo,
We are working on this issue right now. Could you please send me your XML file? It would be helpful for us to see how it is constructed/what your job contains.
My e-mail: JBarnard AT Microsoft DOT com
Thanks!
Josh
-JoshThursday, April 9, 2009 12:03 AMModerator -
Lorenzo,
We have some fixes around this that should be included in our first Serivce Pack; please install that SP on your client and head node when it comes out and hopefully times will be significantly reduced!
Thanks,
Josh
-Josh- Proposed as answer by Josh BarnardModerator Thursday, May 28, 2009 9:32 PM
- Marked as answer by Lorenzo Friday, May 29, 2009 7:12 AM
Thursday, May 28, 2009 9:32 PMModerator -
Will try it asap!
Thank you!
LorenzoFriday, May 29, 2009 7:12 AM