mpxyd: DTO completion ERR: status 12, op RDMA_WRITE running MPI alltoall test
Running MIC scale-up configuration with mcm provider on a MXS node
instead of shm causes DTO error due to heavy use of proxy-in buffer pools.
Hit corner case where proxy buffer management hd ptr crossed tl
ptr due to 64 byte alignment on start when hd < 64 bytes behind tl.
Add additional checking on PO and PI buffer management to handle
the case of HD passing TL on start locations. Also changed PO
processing to hold lock until hd ptr is registered with buf_wc slot
management to preserve order of memory usage across threads.
Reduced the size of WC queue for PO and PI buffer management.
Profiling, via MCM_PROFILE, was added to monitor and trigger buffer
management errors.
Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>