Answered by:
exit code -1073741515

Question
-
Hello
I am getting this error when I am executing my application on 2 nodes through the HPC Cluster Manager (mpiexec -n 2 ****.exe).
-----------
Task failed during execution with exit code -1073741515. Please check task's output for error details.
-----------
There is nothing in the output. Can you please let me know what is going wrong ?
Thanks & Regards,
Kunal
Friday, August 6, 2010 1:33 AM
Answers
-
Hello Kunal,
My suggestion is to find out in all the compute nodes, which dependent dlls are missing? You only need copy all those missing ones. It looks to me that those pgf* dlls may be the missing ones. You need double check that.
Also, could you try to run your mpi app by just running in Headnode. For example, using command
mpiexec -n 2 [path to your app]\your app]
If you have all required dlls copied to the same location as your MPI app, it shouldn't give that error code anymore. Then you can validate what kind of dlls are missing in your compute nodes.
Thanks,
James
- Marked as answer by Kunal Rao Tuesday, August 10, 2010 5:57 PM
Monday, August 9, 2010 7:33 PM
All replies
-
Hello,
The error code can be found in the ntstatus.h with the following definition:
STATUS_DLL_NOT_FOUND
# {Unable To Locate Component}
# This application has failed to start because %hs was not
# found. Re-installing the application may fix this problem.I guess the problem may occur during you compile your MPI application. It is hard to tell which dll is missing from just the error code. I suggest you delete all copies of your app, and do a clear compile and try again.
If problem still exists, please give more details about the versions of compiler, msmpi.dll (I assume you use Microsoft MPI), and the dependent dlls you needed for your app.
Thanks,
James
Friday, August 6, 2010 5:51 PM -
Hi James,
Thanks for your reply.
I am using PGI compilers (10.5) to compile my code. The dependent dlls for the application are as follows (got it through dumpbin.exe):
-----------
Image has the following dependencies:
PSAPI.DLLmsmpi.dllpgftnrtl.dllpgf90.dllpgf90rtl.dllMSVCR90.dllpgc.dllKERNEL32.dll---------------
I have copied all these dlls in the //Head-Node/public folder where I have kept the executable.
Even, then I am getting that error.
(msmpi.dll that I am using is version 2.0.1551.0)
For more details..here is the info from: job view JOBID /detailed
---------------c:\Program Files (x86)\Microsoft Visual Studio 9.0\VC>job view 336 /detailedId: : 336Name: : FlashSubmitTime: : 8/6/2010 9:43:36 PMCreateTime: : 8/6/2010 9:43:22 PMStartTime: : 8/6/2010 9:43:36 PMEndTime: : 8/6/2010 9:43:51 PMChangeTime: : 8/6/2010 9:43:22 PMUnitType: : CoreMinCores: : 1MaxCores: : 1MinSockets: : 1MaxSockets: : 1MinNodes: : 1MaxNodes: : 1RunUntilCanceled: : FalseIsExclusive: : FalseErrorCode: : -2147218980ErrorParams: : 336.1State: : FailedPreviousState: : RunningUserName: : HPC\AdministratorJobType: : BatchPriority: : NormalRequestedNodes: : COMPUTE-NODE-2,HEAD-NODERequiredNodes: :IsBackfill: : FalseNextTaskNiceID: : 2HasGrown: : FalseHasShrunk: : FalseOrderBy: :TaskLevelUpdateTime: : 8/6/2010 9:43:36 PMMinMaxUpdateTime: : 8/6/2010 9:43:36 PMComputedMinCores: : 1ComputedMaxCores: : 2RequestCancel: : NoneRequeueCount: : 0AutoRequeueCount: : 0FailureReason: : NonePendingReason: :ComputedNodeList: : COMPUTE-NODE-2,HEAD-NODEAutoCalculateMax: : TrueAutoCalculateMin: : TrueParentJobId: : 0ChildJobId: : 0NumberOfCalls: : 0NumberOfOutstandingCalls: : 0CallDuration: : 0CallsPerSecond: : 0FailOnTaskFailure: : FalsePreemptable: : TrueProjectId: : 1JobTemplateId: : 1OwnerId: : 3ClientSourceId: : 3Project: :JobTemplate: : DefaultDefaultTaskGroupId: : 336Owner: : HPC\AdministratorId: : 336TaskCount: : 1ConfiguringTaskCount: : 0SubmittedTaskCount: : 0ValidatingTaskCount: : 0QueuedTaskCount: : 0DispatchingTaskCount: : 0RunningTaskCount: : 0FinishingTaskCount: : 0FinishedTaskCount: : 0FailedTaskCount: : 1CanceledTaskCount: : 0CancelingTaskCount: : 0ClientSource: : HpcClusterManagerOfflineResourceCount: : 0IdleResourceCount: : 0ReservedResourceCount: : 0JobScheduledResourceCount: : 0ReadyForTaskResourceCount: : 0TaskScheduledResourceCount: : 0JobTaskScheduledResourceCount: : 0TaskDispatchedResourceCount: : 0TaskRunningResourceCount: : 0CloseTaskResourceCount: : 0CloseTaskDispatchedResourceCount: : 0TaskClosedResourceCount: : 0CloseJobResourceCount: : 0TotalKernelTime: : 156TotalUserTime: : 15MemoryUsed: : 239772AllocatedCores: : COMPUTE-NODE-2 2AllocatedNodes: : COMPUTE-NODE-2 1AllocatedSockets: : COMPUTE-NODE-2 1ProcessIds: :-----------------
Kindly let me know if you have any suggestions.
Thanks & Regards,Kunal
P.S. I am using -Bdynamic and -nodefaultlib=msvcrtd.lib flags during compiling the code.Saturday, August 7, 2010 2:00 AM -
Hello Kunal,
My suggestion is to find out in all the compute nodes, which dependent dlls are missing? You only need copy all those missing ones. It looks to me that those pgf* dlls may be the missing ones. You need double check that.
Also, could you try to run your mpi app by just running in Headnode. For example, using command
mpiexec -n 2 [path to your app]\your app]
If you have all required dlls copied to the same location as your MPI app, it shouldn't give that error code anymore. Then you can validate what kind of dlls are missing in your compute nodes.
Thanks,
James
- Marked as answer by Kunal Rao Tuesday, August 10, 2010 5:57 PM
Monday, August 9, 2010 7:33 PM -
Hi James,
Thanks a lot. That helped me resolve the issue. As you guessed those pgf* dll's were missing (checked that using the Dependency Walker).
I initially got it working on the head node and then compute node.
Thank you very much !!
Thanks & Regards,
Kunal
Tuesday, August 10, 2010 5:57 PM