HPC Session Service will not start after install, causes installer rollback (2012 R2) RRS feed

  • Question

  • As part of the HPC Pack 2012 R2 install, it's reaching the installation of the "Server Components", performs the installation and reports the various phases complete. At the "Starting Services" phase, it takes a very long time and eventually returns the error that the HPC Session Service failed to start and to check the event viewer.

    While in the failed state (before clicking OK, which triggers a rollback of that component),  I can see that most of the services are started, except for the Scheduler Service and the Session Service.  The Session Service is in the starting state, but never seems to exit that state, even after a very long delay.  Even after starting the Scheduler service, the Session service never seems to be able to get running, even after a manual kill of the process and an attempt to restart the service.

    The event log has many entries from the Diagnostics and Hpcservice about being unable to connect to the scheduler.

    Exception: System.Net.Sockets.SocketException (0x80004005): An existing connection was forcibly closed by the remote host

    Server stack trace: 
       at System.Net.Sockets.Socket.Receive(Byte[] buffer, Int32 offset, Int32 size, SocketFlags socketFlags)
       at System.Runtime.Remoting.Channels.SocketStream.Read(Byte[] buffer, Int32 offset, Int32 size)
       at System.Net.StreamFramer.ReadMessage()
       at System.Net.Security.NegoState.StartReceiveBlob(LazyAsyncResult lazyResult)
       at System.Net.Security.NegoState.StartSendBlob(Byte[] message, LazyAsyncResult lazyResult)
       at System.Net.Security.NegoState.ProcessAuthentication(LazyAsyncResult lazyResult)
       at System.Net.Security.NegotiateStream.AuthenticateAsClient(NetworkCredential credential, String targetName, ProtectionLevel requiredProtectionLevel, TokenImpersonationLevel allowedImpersonationLevel)
       at System.Runtime.Remoting.Channels.Tcp.TcpClientTransportSink.CreateAuthenticatedStream(Stream netStream, String machinePortAndSid)
       at System.Runtime.Remoting.Channels.Tcp.TcpClientTransportSink.CreateSocketHandler(Socket socket, SocketCache socketCache, String machinePortAndSid)
       at System.Runtime.Remoting.Channels.RemoteConnection.CreateNewSocket()
       at System.Runtime.Remoting.Channels.SocketCache.GetSocket(String machinePortAndSid, Boolean openNew)
       at System.Runtime.Remoting.Channels.Tcp.TcpClientTransportSink.SendRequestWithRetry(IMessage msg, ITransportHeaders requestHeaders, Stream requestStream)
       at System.Runtime.Remoting.Channels.Tcp.TcpClientTransportSink.ProcessMessage(IMessage msg, ITransportHeaders requestHeaders, Stream requestStream, ITransportHeaders& responseHeaders, Stream& responseStream)
       at System.Runtime.Remoting.Channels.BinaryClientFormatterSink.SyncProcessMessage(IMessage msg)

    Exception rethrown at [0]: 
       at System.Runtime.Remoting.Proxies.RealProxy.HandleReturnMessage(IMessage reqMsg, IMessage retMsg)
       at System.Runtime.Remoting.Proxies.RealProxy.PrivateInvoke(MessageData& msgData, Int32 type)
       at Microsoft.Hpc.Scheduler.Store.ISchedulerStoreInternal.Register(String clientSource, String userName, ConnectionRole role, Version clientVersion, ConnectionToken& token, UserPrivilege& privilege, Version& serverVersion, Dictionary`2& serverProps)
       at Microsoft.Hpc.Scheduler.Store.StoreServer.RegisterWithServer()
       at Microsoft.Hpc.Scheduler.Store.StoreServer._Connect()
       at Microsoft.Hpc.Scheduler.Store.StoreServer.Connect(String server, Int32 port, Boolean serviceAsClient, ServiceAsClientIdentityProvider identityProvider, String serviceAsClientUser, String serviceAsClientPass)
       at Microsoft.Hpc.Scheduler.Store.SchedulerStoreSvc..ctor(String server, Int32 port, Boolean serviceAsClient, ServiceAsClientIdentityProvider provider, String serviceAsClientUser, String serviceAsClientPass, Boolean overHttp)
       at Microsoft.Hpc.Scheduler.Store.SchedulerStoreSvc.RemoteConnect(String server, Int32 port, Boolean serviceAsClient, ServiceAsClientIdentityProvider provider, String userName, String password, Boolean overHttp)
       at Microsoft.Hpc.Scheduler.Store.SchedulerStoreSvc.RemoteConnect(String server, Int32 port)
       at Microsoft.Hpc.Scheduler.Store.SchedulerStore.Connect(String server)
       at Microsoft.Hpc.Diagnostics.Controller.DiagnosticsController.ConnectToScheduler(Dictionary`2 args)

    The publisher has been disabled and its resource is not available. This usually occurs when the publisher is in the process of being uninstalled or upgraded

    Wednesday, July 9, 2014 5:45 PM

All replies

  • Hi Msmoritz,

    HPC Session Service depends on HPC Job Scheduler Service to start and running. Please double check from the event logs why HPC Job Scheduler Service did not start correctly.

    Thursday, July 10, 2014 7:29 AM
  • Changed the WCF service to just run as local system account instead of specifying it in the service panel.

    Sunday, November 10, 2019 6:16 PM