]> git.openfabrics.org - ~ardavis/dapl.git/log
~ardavis/dapl.git
11 years agodapltest: add -n parameter to override default server port number (45278)
Arlin Davis [Tue, 16 Jul 2013 23:12:37 +0000 (16:12 -0700)]
dapltest: add -n parameter to override default server port number (45278)

Modify all tests and commands to take a new -n parameter option for server
listen port. The default port, when running multiple EP's and threads,
will sometimes collide and fail with EADDRINUSE on iWARP configurations
using rdma_bind_addr with sin_port=0.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoucm,scm: UD mode creates many CR objects per EP that needs cleaned up
Arlin Davis [Fri, 12 Jul 2013 18:52:33 +0000 (11:52 -0700)]
ucm,scm: UD mode creates many CR objects per EP that needs cleaned up

After connection is established and the AH is provided to consumer
on UD connect establishment there is no need to keep the CR object
on the SP. For large clusters this results in a growing memory
footprint for CR objects and long cleanup times on device close.

Change ucm and scm providers to unlink and free CR resources
during CM object free if this is a UD QP and CONN_EST state.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.9-1
Arlin Davis [Wed, 26 Jun 2013 23:58:12 +0000 (16:58 -0700)]
Release intel-mic-ofed-dapl-2.0.36.9-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: fix build with/without scif, init.d directory conflict, and missing -lpthread
Arlin Davis [Wed, 26 Jun 2013 23:45:12 +0000 (16:45 -0700)]
mpxyd: fix build with/without scif, init.d directory conflict, and missing -lpthread

Signed-off-by: Patrick Mccormick <patrick.m.mccormick@intel.com>
Acked-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: add support for dynamic affinity support
Arlin Davis [Fri, 21 Jun 2013 20:43:17 +0000 (13:43 -0700)]
mpxyd: add support for dynamic affinity support

Add query feature via mic sysfs files numa_node and local_cpulist
for proper thread bindings - host to device.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoconfig: update dat.conf with more consistent naming conventions for device/provider...
Arlin Davis [Fri, 21 Jun 2013 20:41:00 +0000 (13:41 -0700)]
config: update dat.conf with more consistent naming conventions for device/provider types

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoinitialize new DAPL_DBG_LEVEL default to zero
Arlin Davis [Thu, 20 Jun 2013 18:13:07 +0000 (11:13 -0700)]
initialize new DAPL_DBG_LEVEL default to zero

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoadd DAPL_DBG_LEVEL for more debug log control
Arlin Davis [Thu, 20 Jun 2013 18:10:54 +0000 (11:10 -0700)]
add DAPL_DBG_LEVEL for more debug log control

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocma: long delays when opening cma provider with no IPoIB configured
Arlin Davis [Wed, 12 Jun 2013 16:45:45 +0000 (09:45 -0700)]
cma: long delays when opening cma provider with no IPoIB configured

The rdma_cm provider (ofa-v2-ib0) can take netdev, ip address, or hostname
for local address bindings. When trying to open a non-existent netdev (ib0)
the provider will fall through and use the getaddrinfo sys call assuming
dat.conf parameter is either an IP address or hostname and not a netdev.

When trying hostname option it will attempt to resolve the name via the
name services. On a KNC this can result in long timeouts depending on the
configuration. This changes the error handling when opening the cma provider
on a non-existant netdev and will only call getaddrinfo with AI_CANONNAME
hints after checking the dat.conf parameter for a valid hostname.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.8-1
Arlin Davis [Wed, 5 Jun 2013 22:40:19 +0000 (15:40 -0700)]
Release intel-mic-ofed-dapl-2.0.36.8-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoupdate README.mcm
Arlin Davis [Wed, 5 Jun 2013 22:22:48 +0000 (15:22 -0700)]
update README.mcm

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoconfig: set eager_completion default setting to disabled
Arlin Davis [Wed, 5 Jun 2013 22:21:09 +0000 (15:21 -0700)]
config: set eager_completion default setting to disabled

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoconfig: add mlx5 entries to dat.conf
Arlin Davis [Wed, 5 Jun 2013 22:16:35 +0000 (15:16 -0700)]
config: add mlx5 entries to dat.conf

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: only dump cm_lists with debug build
Arlin Davis [Wed, 29 May 2013 23:12:09 +0000 (16:12 -0700)]
mpxyd: only dump cm_lists with debug build

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: CM optimizations for MIC clients, improved checking on inbound CM messages
Arlin Davis [Wed, 29 May 2013 23:00:32 +0000 (16:00 -0700)]
mpxyd: CM optimizations for MIC clients, improved checking on inbound CM messages

allow CM operations to be received on OP or EV channels from
MIC clients and provide each SMD channel with aligned message buffer
for scif_recv processing.

add checking for NO match at MD level after checking all SMD children
for inbound CM message match and add dump_cm_lists function for debug.

add check for inline message threshold, DAT_MIX_INLINE_MAX

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: scif_recv err on mpxyd when scaling up on MPI IMB scatter benchmark
Arlin Davis [Wed, 29 May 2013 22:36:29 +0000 (15:36 -0700)]
mcm: scif_recv err on mpxyd when scaling up on MPI IMB scatter benchmark

The inline send changes incorporated fragmented scif_send options which
de-serialized the stream operation on the scif endpoint. This can result
in a CM operation from the CM thread to interleave with the post_send
inline operation that sends a hdr and inline data separately.

Modify the post_send to use only one scif_send operation for inline.
Also optimize CM and Operations by moving all CM message to the
scif_ev_ep. Cleanup operation log messages to include op strings
for easier debug.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: add inline data threshold definition, 256 bytes
Arlin Davis [Wed, 29 May 2013 22:14:18 +0000 (15:14 -0700)]
mcm: add inline data threshold definition, 256 bytes

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: change debug level of EP free warning
Arlin Davis [Wed, 29 May 2013 17:31:34 +0000 (10:31 -0700)]
common: change debug level of EP free warning

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: increase DCM RTU time from 400 to 800ms
Arlin Davis [Tue, 28 May 2013 23:20:36 +0000 (16:20 -0700)]
common: increase DCM RTU time from 400 to 800ms

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: increase DCM retry from 1 to 5
Arlin Davis [Tue, 28 May 2013 23:03:53 +0000 (16:03 -0700)]
common: increase DCM retry from 1 to 5

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: increase level of ASYNC error logging, include in free builds
Arlin Davis [Tue, 28 May 2013 23:02:29 +0000 (16:02 -0700)]
common: increase level of ASYNC error logging, include in free builds

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: improved logging with CM retry/poll errors
Arlin Davis [Tue, 28 May 2013 22:59:39 +0000 (15:59 -0700)]
mcm: improved logging with CM retry/poll errors

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: MPI IMB scatter on 12 ranks, 2 KNCs + 2 Hosts, fails
Arlin Davis [Wed, 22 May 2013 21:27:54 +0000 (14:27 -0700)]
mpxyd: MPI IMB scatter on 12 ranks, 2 KNCs + 2 Hosts, fails

multiple QP's processing multiple completions
hit bug in mix_dto_event when copying multiple
cq wc entries into a single dat_mix_dto_comp_t msg.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: segfault with counters, no CM object when not found and duplicate case
Arlin Davis [Tue, 21 May 2013 21:11:07 +0000 (14:11 -0700)]
mcm: segfault with counters, no CM object when not found and duplicate case

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoEnable ccl-proxy support if possible by default: yes, if nothing specified and scif...
pmmccorm [Mon, 13 May 2013 21:03:04 +0000 (14:03 -0700)]
Enable ccl-proxy support if possible by default: yes, if nothing specified and scif.h is present no, if scif.h is not present and nothing specfied no, if --enable-mcm=no is specified yes, if --enable-mcm=yes and scif.h is present error, if --enable-mcm=yes and scif.h missing

Make the corresponding changes to the spec file so that whatever
options are specified, the RPM will contain the right files (before
we were shipping the mpxyd service and conf regardless).

11 years agoclean up configure.in and make consistent patch
pmmccorm [Mon, 13 May 2013 20:47:36 +0000 (13:47 -0700)]
clean up configure.in and make consistent patch

11 years agoupdate for dapl-2.0.36.7
Arlin Davis [Fri, 17 May 2013 19:33:26 +0000 (12:33 -0700)]
update for dapl-2.0.36.7

11 years agoSCM: getifaddrs modfications for better out of the box experience with MIC
Arlin Davis [Fri, 17 May 2013 19:27:36 +0000 (12:27 -0700)]
SCM: getifaddrs modfications for better out of the box experience with MIC

socket cm will now walk list of interfaces and ignore loopback
and ignore IB devices, unless the IB netdev is the only device.
Works better in a heterogenous environment with a mix of MICs.
Tested with br0, mic0, and mic0:ib netdev mixes.
Overriding with DAPL_SCM_NETDEV still works as is.

Signed-off-by: Patrick Mccormick <patrick.m.mccormick@intel.com>
Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: mr_create and mr_free not returning error
Arlin Davis [Tue, 7 May 2013 21:56:39 +0000 (14:56 -0700)]
mpxyd: mr_create and mr_free not returning error

change mr_create and mr_free to return error
to MIC client if incorrectly called. Unsupported
feature at this time.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: cleanup port space, qp and cm objects
Arlin Davis [Mon, 29 Apr 2013 19:00:39 +0000 (12:00 -0700)]
mpxyd: cleanup port space, qp and cm objects

Port space leak during close, and CM disconnect.
Changes to link and unlink CM and QP during
QP create/destruction and CM disconnect states.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: memory leak of scif EP's, cleanup listen errors
Arlin Davis [Mon, 29 Apr 2013 18:55:17 +0000 (11:55 -0700)]
mcm: memory leak of scif EP's, cleanup listen errors

The new ev and tx EP's created for performance
were not destroyed properly during close.

Listen returned incorrect error instead of EADDRINUSE
so consumers didn't retry appropriately.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodapltest: fix endian adjustments for different platform types
Arlin Davis [Wed, 17 Apr 2013 22:12:34 +0000 (15:12 -0700)]
dapltest: fix endian adjustments for different platform types

if local and remote endpoints are different endian then swap meminfo
and key information for RDMA. Was only swapping big endian side.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agobuild/packaging: auto scan cleanup
Arlin Davis [Wed, 17 Apr 2013 17:53:03 +0000 (10:53 -0700)]
build/packaging: auto scan cleanup

Signed-off-by: Patrick Mccormick <patrick.m.mccormick@intel.com>
Acked-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: Correct some error logging, avoids err=Success messages when failing to open...
Arlin Davis [Wed, 17 Apr 2013 17:43:51 +0000 (10:43 -0700)]
mcm: Correct some error logging, avoids err=Success messages when failing to open a device

Signed-off-by: Patrick Mccormick <patrick.m.mccormick@intel.com>
Acked-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: allow separate CPU affinity bindings for MIC and IB
Arlin Davis [Wed, 17 Apr 2013 17:19:08 +0000 (10:19 -0700)]
mpxyd: allow separate CPU affinity bindings for MIC and IB

allow configuration of different CPU bindings as follow:
mcm_affinity_base_hca 1
mcm_affinity_base_mic 8

Note: when set to 0, mpxyd will dynamically set affinity
based on locality of HCA and MIC adapter specified
during device open.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodtest: set rdma_read_in/out attributes to 0 on write only
Arlin Davis [Wed, 17 Apr 2013 17:05:21 +0000 (10:05 -0700)]
dtest: set rdma_read_in/out attributes to 0 on write only

when running in write_only mode, create the EP with rdma read
attributes set to 0.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodat.conf: add new device definitions for 2nd mlx4 adapter
Arlin Davis [Wed, 17 Apr 2013 16:57:01 +0000 (09:57 -0700)]
dat.conf: add new device definitions for 2nd mlx4 adapter

add mlx4_1 entries for ucm, scm, and mcm providers
in dat.conf. New entries appended to existing list for
backward compatibility.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: returning device rdma read depth of 0 causes MPI to fail
Arlin Davis [Mon, 15 Apr 2013 18:58:59 +0000 (11:58 -0700)]
mcm: returning device rdma read depth of 0 causes MPI to fail

return the RDMA read support via provider query and simply return
the rdma_read values from device via dat_ia_query/dat_ep_query.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: add error logging on ep_create attribute checking
Arlin Davis [Mon, 15 Apr 2013 18:56:36 +0000 (11:56 -0700)]
common: add error logging on ep_create attribute checking

add logging to help distinguish between transport and general EP
attribute failures for ARG6.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocma: user reject is IB specific, should be transport agnostic
Arlin Davis [Thu, 11 Apr 2013 22:34:57 +0000 (15:34 -0700)]
cma: user reject is IB specific, should be transport agnostic

remove check for IB type, private data is enough
context for user specific reject type.

Signed-off-by: Jianxin Xiong <jianxin.xiong@intel.com>
Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoopenib: add new provider specific attributes
Arlin Davis [Mon, 25 Mar 2013 17:24:54 +0000 (10:24 -0700)]
openib: add new provider specific attributes

DAT_IB_PROVIDER_NAME = MCM/UCM/CMA/SCM
DAT_IB_DEVICE_NAME = mlx4_0/scif0/ipath0/etc
DAT_IB_CONNECTIVITY_MODE = DIRECT/PROXY
DAT_IB_RDMA_READ = TRUE/FALSE
DAT_IB_NODE_GUID = xxxx:xxxx:xxxx:xxxx

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: add support for full work request or memory pool
Arlin Davis [Mon, 25 Mar 2013 17:18:05 +0000 (10:18 -0700)]
mpxyd: add support for full work request or memory pool

Current implemention will fail when WR or memory is full. Change to
throttle and retry mix post_send opertions during full work queue.
New wr_pp (pst pending) added to m_qp for tracking outstanding
IB work request in flight.

Add counters for full wr and mem pool cases. Print mix-version on
startup.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.7-1
Arlin Davis [Mon, 18 Mar 2013 18:49:50 +0000 (11:49 -0700)]
Release intel-mic-ofed-dapl-2.0.36.7-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: reducing logging and add configuration for counters
Arlin Davis [Mon, 18 Mar 2013 18:46:16 +0000 (11:46 -0700)]
mpxyd: reducing logging and add configuration for counters

reduce logging level of resource allocation to reduce noise.
mcm_counters setting added to mpxyd.conf to control counters

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoconfig: add dat.conf entry for RDMA CM provider for IB SCIF
Arlin Davis [Fri, 15 Mar 2013 23:22:46 +0000 (16:22 -0700)]
config: add dat.conf entry for RDMA CM provider for IB SCIF

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: mpxyd does not start on SLES11SP2
Arlin Davis [Fri, 15 Mar 2013 21:55:19 +0000 (14:55 -0700)]
mpxyd: mpxyd does not start on SLES11SP2

add checking for SUSE and RH and process
accordingly.

Signed-off-by: Patrick Mccormick <patrick.m.mccormick@intel.com>
Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: Intel MPI Library functional tests failed with CCL-proxy enabled
Arlin Davis [Fri, 15 Mar 2013 21:45:35 +0000 (14:45 -0700)]
mpxyd: Intel MPI Library functional tests failed with CCL-proxy enabled

fix for too many open files on SCIF. CCL proxy missed cleanup
of new scif_ev_ep for CM processing. Every MIC client open/close
leaked a scif EP.

Signed-off-by: Jianxin Xiong <jianxin.xiong@intel.com>
Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodtest: fix test to ack data before time stamp for accurate perf results
Arlin Davis [Fri, 15 Mar 2013 21:36:21 +0000 (14:36 -0700)]
dtest: fix test to ack data before time stamp for accurate perf results

Add rdma_write with immediat on last message and returned message to
insure all data is received. The existing write time was not accurate.

Fix signaling rate support, default = 10.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: add inline support, eager completion, improve proxy resource management
Arlin Davis [Fri, 15 Mar 2013 21:27:17 +0000 (14:27 -0700)]
mpxyd: add inline support, eager completion, improve proxy resource management

Add inline support for MIX and IB dma channels

Add eager completion, configurable, to signal writes or sends
after scif_readfrom is signaled and all data is local to proxy
instead of waiting for IB signal. User data on MIC is available
for reuse.

Combine sends and writes to mix_post_send command, provide
ordering guarantees between inline and dma data. Allow's
direct posting from OP thread is head of queue.

Add new counters for inline and signaled IO.

Extend m_wr to include flags for controlling eager completions
and proxy buffer and work request management.

cq event FD is now non-blocking and processed via TX thread
instead of OP thread. Allows for polling > 1 event at a time.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd config: add new options for inline and eager completions
Arlin Davis [Fri, 15 Mar 2013 21:17:24 +0000 (14:17 -0700)]
mpxyd config: add new options for inline and eager completions

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomix: protocol change to v3, reduce sge add inline options
Arlin Davis [Fri, 15 Mar 2013 21:14:26 +0000 (14:14 -0700)]
mix: protocol change to v3, reduce sge add inline options

post send changes to reduce sge entries to 4,
add inline options, and remove hard coded
wr size.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoopenib: add m_inline field to provider QP object
Arlin Davis [Fri, 15 Mar 2013 19:29:30 +0000 (12:29 -0700)]
openib: add m_inline field to provider QP object

add mix to mpxyd inline configuration

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoopenib: cleanup warnings/logging in dto post send function
Arlin Davis [Fri, 15 Mar 2013 19:27:59 +0000 (12:27 -0700)]
openib: cleanup warnings/logging in dto post send function

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: add support for mix inline data, improve mix_poll events
Arlin Davis [Fri, 15 Mar 2013 19:23:16 +0000 (12:23 -0700)]
mcm: add support for mix inline data, improve mix_poll events

mpxyd can be configured for inline data for posted
writes and sends. This will use scif_send/recv instead
of scif_readfrom based on threashold set in mpxyd.conf

change the mix_poll command to NOT issue the request
on scif and simply wait for mpxyd to write completion
back to EVD. This removes unneccesary SCIF command
traffic.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agobuild: update makefile to disable mpxyd builds
Arlin Davis [Fri, 15 Mar 2013 19:20:31 +0000 (12:20 -0700)]
build: update makefile to disable mpxyd builds

--disable-mcm will now disable both mcm provider
and mpxyd build to support non SCIF platforms

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.6-1
Arlin Davis [Fri, 8 Feb 2013 20:25:09 +0000 (12:25 -0800)]
Release intel-mic-ofed-dapl-2.0.36.6-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodtest: add signal rate option -S to dtest
Arlin Davis [Fri, 8 Feb 2013 20:19:43 +0000 (12:19 -0800)]
dtest: add signal rate option -S to dtest

Allow user to change the signal rate on rdma write
test to adjust completion rates per IO

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: fix memory leak during smd proxy device close
Patrick Mccormick [Thu, 7 Feb 2013 21:04:28 +0000 (13:04 -0800)]
mpxyd: fix memory leak during smd proxy device close

Need to cleanup the proxy buffer created during device open.
unregister the proxy buffer with both scif and IB and free memory.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.5-1
Arlin Davis [Wed, 6 Feb 2013 20:33:50 +0000 (12:33 -0800)]
Release intel-mic-ofed-dapl-2.0.36.5-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoinstall: problem with rpm update when updating from 2.0.34 or older
Arlin Davis [Wed, 6 Feb 2013 19:49:29 +0000 (11:49 -0800)]
install: problem with rpm update when updating from 2.0.34 or older

The postun will remove entries on older packages that incorrectly
add and remove entries instead of updating the file. When updating
from these older version we end up with an empty /etc/dat.conf.

In order to fix we have to save the dat.conf and restore during
the upgrade process with the triggerpostun

%triggerpostun -- dapl < 2.0.35-1

11 years agodtest: incorrectly setting completions on each rdma write
Arlin Davis [Wed, 6 Feb 2013 19:33:21 +0000 (11:33 -0800)]
dtest: incorrectly setting completions on each rdma write

Fix setting of completion flags based on signal rates.
Change default burst to 100 from 50
Add size of IO to final bandwidth results

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: fix query of mcm provider to indicate no rdma read support
Arlin Davis [Wed, 6 Feb 2013 19:29:20 +0000 (11:29 -0800)]
mcm: fix query of mcm provider to indicate no rdma read support

Set the rdma_in, rdma_out attributes to 0 to indicate NO RDMA READ
support when running on a MIC with CCL Proxy

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoREAME.mcm: update to include non-root execution instructions
Arlin Davis [Mon, 4 Feb 2013 18:14:27 +0000 (10:14 -0800)]
REAME.mcm: update to include non-root execution instructions

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: cm scaling bug fixes and profiling
Arlin Davis [Sat, 2 Feb 2013 01:33:17 +0000 (17:33 -0800)]
mpxyd: cm scaling bug fixes and profiling

New CM thread to help with CM scale out. Testing with dtestcm
with 1000's of connections. MPI testing up to 60ppn on KNC nodes.

Add new disc timers and disconnect logging for debug.
Add cleanup for IB device during service termination.
Add profiling of device and CM operations to help debug scaling issues

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoREADME: update package and mcm readme files
Arlin Davis [Sat, 2 Feb 2013 01:07:29 +0000 (17:07 -0800)]
README: update package and mcm readme files

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodtestcm: add more detailed debug during disconnect phase
Arlin Davis [Sat, 2 Feb 2013 00:54:58 +0000 (16:54 -0800)]
dtestcm: add more detailed debug during disconnect phase

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: add debug logging to dat_lmr_create
Arlin Davis [Sat, 2 Feb 2013 00:53:27 +0000 (16:53 -0800)]
common: add debug logging to dat_lmr_create

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: add ia stats logging and logging bit
Arlin Davis [Sat, 2 Feb 2013 00:49:57 +0000 (16:49 -0800)]
common: add ia stats logging and logging bit

added new DAPL_DBG_TYPE level of 0x2000000
that will print IA stats (non-zero) during dat_ia_close

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoscif: add scif ucm provider entry in dat.conf
Arlin Davis [Sat, 2 Feb 2013 00:48:23 +0000 (16:48 -0800)]
scif: add scif ucm provider entry in dat.conf

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.4-1
Arlin Davis [Mon, 21 Jan 2013 22:56:44 +0000 (14:56 -0800)]
Release intel-mic-ofed-dapl-2.0.36.4-1

11 years agoREADME: add environment variable settings/defaults
Arlin Davis [Mon, 21 Jan 2013 22:55:17 +0000 (14:55 -0800)]
README: add environment variable settings/defaults

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: TX thread can miss pending requests with multiple clients
Arlin Davis [Mon, 21 Jan 2013 20:51:42 +0000 (12:51 -0800)]
mpxyd: TX thread can miss pending requests with multiple clients

Pending data variable is overwritten with multiple SCIF clients
bound to one HCA causing rdma_write to stall and not posted
on IB device. MPI running multiple ranks on a KNC can stall.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.3-1
Arlin Davis [Wed, 16 Jan 2013 22:26:32 +0000 (14:26 -0800)]
Release intel-mic-ofed-dapl-2.0.36.3-1

11 years agopackage: add README and README.mcm and update content
Arlin Davis [Wed, 16 Jan 2013 22:11:09 +0000 (14:11 -0800)]
package: add README and README.mcm and update content

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm,mpxyd: add multi-enpoint, multi-threaded, and CPU affinity support for mpxyd...
Arlin Davis [Wed, 16 Jan 2013 21:39:59 +0000 (13:39 -0800)]
mcm,mpxyd: add multi-enpoint, multi-threaded, and CPU affinity support for mpxyd and mcm clients

For performance reasons separate EP's and separate threads have been incorporated.
3 scif eps. operation, events, and transmit are created for every device open
2 threads per MIC adapter, one for operations and one for RDMA operations
CPU affinity support as been added to to assist in HCA to MIC locality
for optimum performance. This fixes some performance issues seen at scale
on HT systems.

Also added some performance profiling to help with future tunining on
various platforms.

The CPU affinity and profiling are set via new mpxyd.conf parameters.
defaults are affinity=1, affinity base cpu_id=0, profiling=0

mcm_affinity, mcm_affinity_base, mcm_profile

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: add support for ia name during dat_ia_query
Arlin Davis [Wed, 16 Jan 2013 21:38:18 +0000 (13:38 -0800)]
common: add support for ia name during dat_ia_query

the device name was not being updated during a query. Copy
the hca name into ia_attr->adapter_name for consumers.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodtest: changes to support signaling rates during rdma_write testing
Arlin Davis [Wed, 16 Jan 2013 21:36:18 +0000 (13:36 -0800)]
dtest: changes to support signaling rates during rdma_write testing

To support larger iterations without huge TX queues we need
to signal. Also add unidirectional and bidirectional performance
results.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: register memory on tx endpoint.
Arlin Davis [Wed, 16 Jan 2013 21:32:23 +0000 (13:32 -0800)]
mcm: register memory on tx endpoint.

separated processing across multiple EP's.
operation and scif dma on different EP's so
register on the tx_ep for proxy enabled providers.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: fix proxy cq_poll, return only empty or number of completions
Arlin Davis [Wed, 16 Jan 2013 21:30:28 +0000 (13:30 -0800)]
mcm: fix proxy cq_poll, return only empty or number of completions

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoucm: reduce log level of CM warning on UNKNOWN state message
Arlin Davis [Thu, 10 Jan 2013 17:36:55 +0000 (09:36 -0800)]
ucm: reduce log level of CM warning on UNKNOWN state message

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: proxy segfaulting running as daemon, IB recv len incorrect
sean.hefty@intel.com [Mon, 10 Dec 2012 16:56:40 +0000 (08:56 -0800)]
mpxyd: proxy segfaulting running as daemon, IB recv len incorrect

didn't account for IB UD packet length being larger
than actual CM message due to added 40 byte GRH
with each receive packet. Adjust each recv pkt
len according to size of GRH.

Change default log to /tmp from stdout
Cleanup debug logs

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.2-1
Arlin Davis [Thu, 6 Dec 2012 22:30:43 +0000 (14:30 -0800)]
Release intel-mic-ofed-dapl-2.0.36.2-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agodecrease debug log level on modify qp during close
Arlin Davis [Thu, 6 Dec 2012 20:28:30 +0000 (12:28 -0800)]
decrease debug log level on modify qp during close

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: scif_connect return checking incorrect
Arlin Davis [Thu, 6 Dec 2012 20:27:22 +0000 (12:27 -0800)]
mpxyd: scif_connect return checking incorrect

check for -1 on errors, port_id returned on success.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm: support for rdma writes with immediate plus associated fixes
Arlin Davis [Thu, 6 Dec 2012 01:27:17 +0000 (17:27 -0800)]
mcm: support for rdma writes with immediate plus associated fixes

Segmented writes fixed to return proper length in work completion
Mpxyd segmentation size sync'ed with configuration file
Reduced proxy WR depth for segmentation, limit to x8.
Add debug info to help profile performance stalls

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: add common scif_send_msg inline function to handle partial writes
Arlin Davis [Tue, 4 Dec 2012 17:52:22 +0000 (09:52 -0800)]
mpxyd: add common scif_send_msg inline function to handle partial writes

scif_send blocking mode will not always block until
entire message is sent as documented. If will sometimes
return with partial sends. Create a common inline send
function what will handle condition and only return failure
with errors and not partial writes.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agomcm/mpxyd: add string print helper function for QP states
Arlin Davis [Tue, 4 Dec 2012 17:50:30 +0000 (09:50 -0800)]
mcm/mpxyd: add string print helper function for QP states

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoconfig: add scif device to dat.conf static configuration, remove v1.2 devices
Arlin Davis [Tue, 4 Dec 2012 17:46:31 +0000 (09:46 -0800)]
config: add scif device to dat.conf static configuration, remove v1.2 devices

v1.2 is no longer supported.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agospecfile: add definition to disable debug package
Arlin Davis [Thu, 29 Nov 2012 23:09:27 +0000 (15:09 -0800)]
specfile: add definition to disable debug package

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agospecfile: remove debug package
Arlin Davis [Thu, 29 Nov 2012 22:15:54 +0000 (14:15 -0800)]
specfile: remove debug package

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: add exposed cm timers to configuration file
Arlin Davis [Thu, 29 Nov 2012 22:14:46 +0000 (14:14 -0800)]
mpxyd: add exposed cm timers to configuration file

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease intel-mic-ofed-dapl-2.0.36.1-1
Arlin Davis [Mon, 26 Nov 2012 18:56:46 +0000 (10:56 -0800)]
Release intel-mic-ofed-dapl-2.0.36.1-1

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoadd new package spec file - intel-mic-ofed-dapl.spec.in
Arlin Davis [Mon, 26 Nov 2012 18:29:40 +0000 (10:29 -0800)]
add new package spec file - intel-mic-ofed-dapl.spec.in

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agopackage update: rename package for Intel MPSS release
Arlin Davis [Mon, 26 Nov 2012 18:23:29 +0000 (10:23 -0800)]
package update: rename package for Intel MPSS release

dapl- renamed to intel-mic-ofed-dapl-

new package definitions will obsolete the previous dapl version

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoRelease 2.0.36.1
Arlin Davis [Thu, 15 Nov 2012 22:14:14 +0000 (14:14 -0800)]
Release 2.0.36.1

Change version to include sub-minor numbers. Add options
to specfile to include CCFLAGS and LDFLAG options via rpmbuild.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agoinstall: correct the path to /etc
Patrick Mccormick [Thu, 15 Nov 2012 02:49:16 +0000 (18:49 -0800)]
install: correct the path to /etc

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agompxyd: dont segfault if pid file is not found when attempting to kill running daemon
Patrick Mccormick [Thu, 15 Nov 2012 02:46:07 +0000 (18:46 -0800)]
mpxyd: dont segfault if pid file is not found when attempting to kill running daemon

11 years agompxyd: misc cleanups for consistency
Patrick Mccormick [Thu, 15 Nov 2012 02:44:34 +0000 (18:44 -0800)]
mpxyd: misc cleanups for consistency

11 years agompxyd: expose CM request and reply timers and retry count
Arlin Davis [Thu, 15 Nov 2012 02:36:56 +0000 (18:36 -0800)]
mpxyd: expose CM request and reply timers and retry count

add entries in the mpxyd.conf for timers and retry. Start with
larger default timers given small cores are processing messages
and they are proxied via SCIF.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
11 years agocommon: 2 new debug logging levels for low system memory and package info
Arlin Davis [Mon, 29 Oct 2012 18:54:46 +0000 (11:54 -0700)]
common: 2 new debug logging levels for low system memory and package info

DAPL_DBG_TYPE_SYS_WARN = 0x800000
DAPL_DBG_TYPE_VER = 0x1000000

export DAPL_DBG_SYS_MEM = 5 will set the checking for memory less than 5%
when DAPL_DBG_TYPE is set with bit DAPL_DBG_TYPE_SYS_WARN.

The package must be built with --enable-counters for memory checking to
be enabled.

In addition, if DAPL_DBG_TYPE -s set with bit DAPL_DBG_TYPE_VER than
the package rev and build date will be sent to stdout during library
init.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>