[MPICH] MPICH IB driver with RDMA getting extra completions?
    Heinz, Michael 
    mheinz at silverstorm.com
       
    Thu Sep  1 10:45:11 CDT 2005
    
    
  
Hi,
I'm validating MPICH2 1.02 with our InfiniBand drivers and I've run into an odd problem. Simple tests (bandwidth, latency, etc.) built with CH3/IB with RDMA enabled work fine, but HPL/Linpack fails with what appears to be an "extra" RDMA write completion. I say "extra" because, after poring over the debug logs, it looks to me like ibu_wait is properly waiting for each completion before going on to the next.
Has anyone seen behavior like this? I noticed a line of commented-out code that would cause the IB device to discard the extra completion, but uncommenting it causes lots of other problems and eventually results in a VAPI_LOC_PROT_ERR.
Any ideas?
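
For anyone who wants to reproduce the log check, here is a minimal sketch of the bookkeeping involved. It assumes the debug log can be reduced to a sequence of "post" and "complete" events, each tagged with a work request id. The function name, the event tuples, and the sample log are all hypothetical illustrations and are not part of MPICH or VAPI.

```python
from collections import Counter


def find_unmatched_completions(events):
    """Return the ids of completions that had no outstanding posted request.

    `events` is an iterable of (kind, wr_id) tuples in log order, where
    kind is "post" or "complete". This event format is an assumption made
    for illustration; real debug logs would need to be parsed into it.
    """
    outstanding = Counter()  # wr_id -> number of posts not yet completed
    extra = []
    for kind, wr_id in events:
        if kind == "post":
            outstanding[wr_id] += 1
        elif kind == "complete":
            if outstanding[wr_id] > 0:
                outstanding[wr_id] -= 1
            else:
                # A completion arrived with nothing posted to match it.
                extra.append(wr_id)
    return extra


# Hypothetical log: request 2 completes twice but was posted only once.
log = [
    ("post", 1), ("complete", 1),
    ("post", 2), ("complete", 2), ("complete", 2),
]
print(find_unmatched_completions(log))
```

If a check like this reports no unmatched ids for the MPI-level requests, the extra completion may be coming from a work request that the log does not record, though that is only a guess.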
    
    