git.openfabrics.org - ~shefty/libmlx4.git/commit

author	Sebastien Dugue <sebastien.dugue@bull.net>
	Wed, 29 Jul 2009 18:45:55 +0000 (11:45 -0700)
committer	Roland Dreier <rolandd@cisco.com>
	Wed, 29 Jul 2009 18:54:28 +0000 (11:54 -0700)
commit	87750d1db4bc73933e92d862ff73d10d06ff4f5d
tree	5c9d0785df94a0f53f8430f0eb08343e33172836	tree \| snapshot
parent	09f8dcd3769a52eebceeae15e2c5a844e6ee4586	commit \| diff

Use mmap(MAP_ANONYMOUS) to allocate queue buffers

Internal buffers for QPs, CQs, SRQs etc. are allocated with
mlx4_alloc_buf(), which rounds the buffer's size to the page size and
then allocates page aligned memory using posix_memalign().

However, this allocation is quite wasteful on architectures using 64K
pages (ia64 for example) because we then hit glibc's MMAP_THRESHOLD
malloc parameter and chunks are allocated using mmap.  Thus we end up
allocating:

  (requested size rounded to the page size) + (page size) + (malloc overhead)

rounded internally to the page size.

So for example, if we request a buffer of page_size bytes, we end up
consuming 3 pages.  In short, for each buffer we allocate, there is an
overhead of 2 pages.  This is quite visible on large clusters where
the number of QPs can reach several thousands.

This patch replaces the call to posix_memalign() in mlx4_alloc_buf()
with a direct call to mmap().

Signed-off-by: Sebastien Dugue <sebastien.dugue@bull.net>
Signed-off-by: Roland Dreier <rolandd@cisco.com>