+commit 963e5d793867644c770c087f1ef443550779ca8c
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Tue Sep 29 09:05:27 2015 -0700
+
+ dtest: add -a -i options, all data sizes, incremental size
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit 5410203cf0f5908540b43bfa2a219de4b9042a01
+Author: Bharat Potnuri <bharat@chelsio.com>
+Date: Tue Sep 29 08:49:10 2015 -0700
+
+ dapl: Fix segfault while freeing qp
+
+ In function dapls_ib_qp_free(), pointers qp and cm_ptr->cm_id->qp are pointing to the same qp
+ structure, initialized in function dapls_ib_qp_alloc(). The memory pointed by these pointers are freed
+ twice in function dapls_ib_qp_free(), using rdma_destroy_qp() for the case _OPENIB_CMA defined and
+ then further using ibv_destroy_qp(), causing a segmentation fault while freeing the qp. Therefore
+ assigned NULL value to qp to avoid freeing illegal memory.
+
+ Fixes: 7ff4f840bf11 ("common: add CM-EP linking to support mutiple CM's and proper protection during
+ destruction")
+
+ Signed-off-by: Bharat Potnuri <bharat@chelsio.com>
+ Acked-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit fb64e157b9dd741ba942db00ceee37ea0f4ddcab
+Author: Amir Hanania <amir.hanania@intel.com>
+Date: Wed Sep 23 14:43:38 2015 -0700
+
+ mpxyd: add P2P inline support for data size <= 96 bytes
+
+ Improve small message latency for proxy to proxy service
+ by including data with the proxy work request. Necessary
+ changes made to preservie order across WR's regardless
+ of size. Additional logging included. Improves single byte
+ one-way latency of about 27% on MFO configurations.
+
+ Changes made to avoid forwarding 0-byte rdma write to
+ scif_writeto, remove CPU hand copies, and order.
+
+ Changes for numa_node == -1 such that mic0 assumes MSS
+ and mic1 assumes MXS modes.
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+ Signed-off-by: Amir Hanania <amir.hanania@intel.com>
+
+commit 2f7f25a808d9e6b3ee613c5b8b9a9e2f5abe5c55
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Mon Sep 21 15:48:15 2015 -0700
+
+ dtest: change rdma_write_ping_pong so client is always last receiver
+
+ server always waits after test loops for DREQ event so in order
+ to gracefully shutdown client should always receive last handshake
+ message and issue DREQ. Remove logging in loop.
+
+ Always init data and increase min rdma buffer size to 4KB.
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit 453373f018a1c823125f6dd95952b343987b1480
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Mon Sep 21 08:24:01 2015 -0700
+
+ ucm: add DAPL_NETWORK_PROCESS_NUM option for total ranks
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit 779dfdfe4ebc6f287544e8aad589e1578a58537a
+Author: Amir Hanania <amir.hanania@intel.com>
+Date: Wed Sep 16 17:31:13 2015 -0700
+
+ ucm: fca create group incorrectly using IB addr instead of socket address.
+
+ need the socket address for socket based create group info exchange.
+
+ Signed-off-by: Amir Hanania <amir.hanania@intel.com>
+
+commit 7476da8d3c523f9a719748a046d339ea29f41aef
+Author: Amir Hanania <amir.hanania@intel.com>
+Date: Wed Sep 16 17:27:27 2015 -0700
+
+ ucm: fca_comm_destroy called with NULL
+
+ In some cases dapli_free_collective_group is called without the comm was initialized.
+ fca_comm_destroy call in this func seg fault.
+
+ Signed-off-by: Amir Hanania <amir.hanania@intel.com>
+
+commit f0d97457ba86bdc13901dc37996d2f7419f64360
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Tue Sep 15 08:45:03 2015 -0700
+
+ dtest: add -W option for rdma write pinpong, similiar to ib_write_lat
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit 547d1fe1257bf4709bf38eae8c8013d320a04432
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Mon Aug 31 15:14:46 2015 -0700
+
+ docs: update release notes for collective build
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
+commit 9b52aa90fefb69d1ba8fdd689f618c2ce250825d
+Author: Amir Hanania <amir.hanania@intel.com>
+Date: Mon Aug 24 13:22:53 2015 -0700
+
+ mpxyd: reduce log level for rcv message flush
+
+ Signed-off-by: Amir Hanania <amir.hanania@intel.com>
+
+commit 5a1d9e7f386335dcae676b372269a061e6f5294b
+Author: Carol L Soto <clsoto@linux.vnet.ibm.com>
+Date: Mon Aug 24 12:58:58 2015 -0700
+
+ dapltest: dapltest with no argument not working in ppc64 arch
+
+ If dapltest is run with no args then the client was getting
+ Warning: conn_event_wait DAT_CONNECTION_EVENT_NON_PEER_REJECTED
+ Reference to RH1056487- dapltest Read and Write performance
+ tests are not working
+
+ Signed-off-by: Carol L Soto <clsoto@linux.vnet.ibm.com>
+
+commit 91febc42f0070b2b9eaa81c0c113c6ff7ab8ea60
+Author: Arlin Davis <arlin.r.davis@intel.com>
+Date: Thu Aug 13 09:55:47 2015 -0700
+
+ Release 2.1.6
+
+ Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
+
commit ad43b8d3ca9f67d3231525b2808776719686deba
Author: Arlin Davis <arlin.r.davis@intel.com>
Date: Wed Aug 12 17:30:23 2015 -0700
README/Release Notes
- OFED 3.18 DAPL Release 2.1.6
- August 2015
+ OFED 3.18 DAPL Release 2.1.7
+ Sept 2015
User space libraries/utilities for Direct Access Transport (DAT) v2.0. DAT is
a transport-independent, platform-independent Application Programming
1.0 Release Notes
=================
+ dapl-2.1.7 changes include dtest improvement and CCL Proxy P2P inline support:
+
+ - Add dtest -W option for rdma write pinpong test,
+ new options with -W include -a (all data sizes) -i (incremental size)
+ - CCL Proxy small message latency improvment with Proxy2Proxy inline support
+ for message sizes < 96 bytes, reduces MPI pingpong single byte latency
+ for MFO devices by 27%.
+
+ dapl-2.1.6 changes include MIC support for full offload mode
+
+ - Add support for Truescale qib devices with no CCL Direct verbs support on MIC.
+ - Enhancement for inside the box transfers without IB adapter via ibscif.
+ - Add DAPL_NETWORK_NODES, DAPL_NETWORK_PPN environment variables.
+
dapl-2.1.5 changes include improvements for large scale UD communication management:
- AH caching, reduced memory footprint (grows as needed)
Tested on 1200n 28ppn cluster, AlltoAll Intel MPI, UD mode.
Both static and dynamic modes, over 500m UD QP connections.
- dapl-2.1.6 changes include MIC support for full offload mode
-
- - Add support for Truescale qib devices with no CCL Direct verbs support on MIC.
- - Enhancement for inside the box transfers without IB adapter via ibscif.
- - Add DAPL_NETWORK_NODES, DAPL_NETWORK_PPN environment variables.
-
==========
2.0 BUILD:
==========
9.0 Summary of Fixes/Changes:
=============================
+ Release 2.1.7 (OFED 3.18-1 GA)
+ dtest: add -a -i options, all data sizes, incremental size
+ dapl: Fix segfault while freeing qp
+ mpxyd: add P2P inline support for data size <= 96 bytes
+ dtest: change rdma_write_ping_pong so client is always last receiver
+ ucm: add DAPL_NETWORK_PROCESS_NUM option for total ranks
+ ucm: fca create group incorrectly using IB addr instead of socket address.
+ ucm: fca_comm_destroy called with NULL
+ dtest: add -W option for rdma write pinpong, similiar to ib_write_lat
+ docs: update release notes for collective build
+ mpxyd: reduce log level for rcv message flush
+ dapltest: dapltest with no argument not working in ppc64 arch
+
Release 2.1.6 (OFED 3.18-1)
ucm: add cluster size environments to adjust CM timers
mpxyd: proxy_in data transfers can improperly start before RTU received