Hello Together,
i have following Problem with my Microsoft HPC (SP2).
I want to submitt a job, where i start a application, which is programmed ourself.
This application runs well on the HPC Cluster, when we run the Application with the JobUnitType set to Node.
When i switch the JobUnitType to Core, and Create a Job, which has 3 Tasks with the same Application inside, it works also very well. When Creating a Job with 4 Tasks inside (all with same Application) and this Tasks will start on the Same Node (JobUnitType=Core)
in parallel, one Application will fail. i belive currently the Application which starts as last one will Fail. When i check teh Error Log of the Application, i find the Error Code "8" which was returned from "GetLastError()", and this Happens, when the
Application wants to Create a Dialog. But please Remember, the Application runs fine with different JobUnitSettings, or starting only 3 Tasks in parallel on one node.
Following Details i can give you here:
- The Application is programmed in Native C++ (unmanaged code).
- The Application is a 32 Bit Application.
- The Workstation Nodes have installed Win7 64 Bit.
- We are using Microsoft HPC SP2.
Additional to this, we found following out: When we run the Application on this Workstation Node in a Remote Desktop multiple Times (to be precise here, i tested the Application 4x parallel and 8x parallel) it worked well.
So it seems for me, that Microsoft HPC does something different when starting multiple times the Application at one Workstation Node.
Currently i would have following questions:
- How does Microsoft HPC start a Application on a computing Node? Is it realy the same like a remote connection with a RemoteDesktopConnection to a second pc? (LogOn User ... Desktop will be created....Start Application) Or is this different? What happens
here realy in detail, when Microsoft HPC starts my Application?
- What do i have to change in my Application, so that this Application is able to run more than 3 times in parallel on a workstation node?
- Is there a possibility to change something on my Workstation Node (Win 7 machine) (registry key,......) so that my Application is able to run more than 3 times on a Workstation Node?
It would be great, if somebody could give me some hints, what i should do to get the Application running, either in changing someting inside the Application, or changing something in our complet Environment (HPC, OS on Workstation Nodes, etc., ...)
Thanks everybody in advance,
Bobby