none
Cannot find the service registration file: . File only found when running HPC SOA job as HPC Administrator

    Question

  • I am attempting to run HPC SOA jobs running under an authenticated user as a windows service. However, whenever I attempt to run a SOA job through a user which has Job Admin/Operator/User priviledges through an AD group, I get the following error messages in the Tasks view of the job:

    fo.Throw()
       at Microsoft.Hpc.RetryManager.<InvokeWithRetryAsync>d__33`1.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.HttpClientExtension.<GetHttpApiCallAsync>d__5.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.Rest.HpcRestClient.<GetHttpApiCallAsync>d__16`1.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.Rest.ServiceRegistrationRestClient.<GetMd5Async>d__14.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.Rest.ServiceRegistrationRestClient.<GetMd5Async>d__15.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.Rest.ServiceRegistrationRestClient.<ExportToTempFileAsync>d__22.MoveNext()
    --- End of stack trace from previous location where exception was thrown ---
       at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
       at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
       at Microsoft.Hpc.Scheduler.Session.Internal.ServiceRegistrationRepo.GetServiceRegistrationPath(String filename)
       at Microsoft.Hpc.CcpServiceHosting.Program.GetServiceInfo(String serviceConfigFileName)
    [Main]: Cannot find the service registration file: . Service cannot be activated. Redeploy the service.
    HpcSoa Information: 11 : [Session:2482] Open dummy service...
    HpcSoa Information: 1002 : Servicehost is started.
    HpcSoa Verbose: 10 : [Session:2482] [HpcServiceHost]: Task Id = 15281
    HpcSoa Verbose: 10 : [Session:2482] [HpcServiceHost]: Number of processors (service capability) = 1
    HpcSoa Information: 11 : [Session:2482] [HpcServiceHost]: Cancel Task Grace Period = 15000
    HpcSoa Information: 11 : [Session:2482] [HpcServiceHost]: First Allocated CoreId = 1
    HpcSoa Information: 11 : [Session:2482] [HpcServiceHost]: EnableMessageLevelPreemption = True
    HpcSoa Error: 13 : [Session:2482] [HpcServiceHost]: Cannot find service registration file.
    HpcSoa Verbose: 10 : [Session:2482] [HpcServiceHost]: WCF network prefix is not set.
    HpcSoa Verbose: 10 : [Session:2482] [HpcServiceHost]: ServiceOperationTimeout = 86400000, MaxMessageSize = 65536000
    HpcSoa Information: 11 : [Session:2482] [HpcServiceHost]: BrokerNodeAuthManager initialized. AllowerUser = APLTEST\FCC2-Diane-HPC, JobOwner = 
    HpcSoa Information: 11 : [Session:2482] defaultBaseAddr of HostController is net.tcp://HPCHN:9101/2482/15281
    HpcSoa Information: 11 : [Session:2482] Created ServiceHost for controller.
    HpcSoa Information: 11 : [Session:2482] Added endpoint to controller.
    HpcSoa Information: 11 : [Session:2482] [HpcServiceHost]: BrokerNodeAuthManager initialized. AllowerUser = APLTEST\FCC2-Diane-HPC, JobOwner = 
    HpcSoa Information: 11 : [Session:2482] Try to call _hostController.Open() below.
    HpcSoa Information: 11 : [Session:2482] Controller opened.
    HpcSoa Verbose: 10 : [Session:2482] [HpcServiceHost]: Dummy service opened on net.tcp://hpchn:9101/2482/15281/_defaultEndpoint
    ^C

    The user in question should have sufficient priviledges granted, and the service name is being correctly passed when run through an interactive session.

    Now, when I give the AD group Administrator permissions through HPC, the jobs seem to run without issue. Based on the HPC guide online, accounts with User should be able to submit jobs [https://docs.microsoft.com/en-us/powershell/high-performance-computing/understanding-user-roles?view=hpc16-ps]. Is there a way to diagnose what might be wrong with my system's permissions? I am not seeing anything obvious in cosmos logging traces or in the log files under Microsoft HPC Pack\data\LogFiles


    Tuesday, June 26, 2018 5:45 PM

All replies