none
HPC nodes not sending back calculations

    Question

  • Hello, I have set up an HPC cluster to run Excel workbook offloading jobs. I can run the example spreadsheet (convertible pricing) on the cluster without any problems.

    I have modified the macros in the example spreadsheet, and I can run on the desktop using the standard macro code. However, when I run the macro on the cluster, the calculations are sent to the nodes, but I am not receiving anything back. Here <g class="gr_ gr_936 gr-alert gr_gramm gr_inline_cards gr_run_anim Grammar multiReplace" data-gr-id="936" id="936">are</g> a couple of screenshots of the status bar and HPC Job Manager, and the code of the 'partition' and 'execute' macros are below the pics (ignore the comments in the code). Any help will be greatly appreciated.

    Public Function HPC_Partition() As Variant
    
        ' next check the column; if we're past the last
        ' column, then we're done
        
        If CurrentRun > NoRuns Then
        
            CurrentRun = 1
            CurrentMem = CurrentMem + 1
            
        End If
        
        If CurrentMem > NoMems Then
        
            ' return null to indicate the end of the calculation
            HPC_Partition = Null
            
            ' and exit the function now, we can skip the rest
            Exit Function
        End If
        
        data(1) = CurrentRun
        data(2) = CurrentMem
        
        HPC_Partition = data
        
        ' move to the next row
        
        CurrentRun = CurrentRun + 1
        CurrentRow = CurrentRow + 1
        
        ' update the status bar with the counters
    
        SentRecords = SentRecords + 1
        UpdateStatus
        
    End Function

    Public Function HPC_Execute(data As Variant) As Variant
    
       
        Range("RunNo").Value = data(1)
        Range("MemberNo").Value = data(2)
            
        data = Results.Range(Cells(2, 1), Cells(2, dataSize))
        
        HPC_Execute = data ' return value from the function
                    
        ' make sure to remove the "Next" lines at the end of the
        ' loops - there are two, one for each of the two loops.
    
    End Function

    Wednesday, June 28, 2017 10:11 AM

Answers

  • Hi KMLN,

    If the Excel workbook runs fine locally on destop but not on cluster, first thing to check is whether the HPCWorkbookPath in the HPCControlMacros is pointing to the updated workbook on the network share, so that all compute nodes could access the same workbook for executing the HPC_Execute macro.

    Secondly, you may enable SOA message level tracing for the Microsoft.Hpc.Excel.ExcelService (via Cluster Manager Console ->Configuration->Services->right click the service->Set Event Logging Level -> choose Verbose), after repro the issue, you may check the SOA request message details via Cluster Manager Console ->Job Management->SOA Jobs->right click the job->View Message Details. You may check if there is any error in the SOA request messages.

    Regards,

    Yutong Sun

    • Marked as answer by KMLN Thursday, August 17, 2017 6:21 PM
    Thursday, June 29, 2017 3:40 AM