10 releases
0.7.0 | May 16, 2024 |
---|---|
0.6.6 | Jan 23, 2024 |
0.6.1 | Dec 22, 2023 |
0.5.6 | Sep 7, 2023 |
0.4.4 |
|
#498 in Network programming
Used in rrppcc
215KB
5K
SLoC
rrddmma
A Rust RDMA library.
This library is more for academic use than for industry. It is highly specialized to Mellanox/NVIDIA ConnectX network adapter series.
WARNING: the interfaces are unstable and under continuous change!
Linkage
This library supports multiple linkage types to the ibverbs
library.
-
First, this library respects existing MLNX_OFED installations. It works on both v4.9-x and v5.x versions.
MLNX_OFED v4.9-x will enable experimental verbs.(TODO)MLNX_OFED v5.x will enable(TODO)mlx5dv_*
features.
-
Otherwise,
rrddmma
will try to find an existinglibibverbs
installation viapkg-config
.This will enable enable(TODO)mlx5dv_*
features.
-
Otherwise,
rrddmma
will try to download rdma-core and build from source. You need to ensure that the dependencies are properly installed. In Ubuntu and other Debian-derived OSs, these are:sudo apt install -y build-essential cmake gcc libclang-dev libudev-dev libsystemd-dev \ libnl-3-dev libnl-route-3-dev ninja-build pkg-config valgrind \ python3-dev cython3 python3-docutils pandoc
Building from source is different from the previous two approaches in that
libibverbs
is linked statically and cannot detect providers at runtime. This library currently only allows themlx5
provider.This will enable enable(TODO)mlx5dv_*
features.
Some Design Principles
Panics in Fallible Methods
It is widely recognized as a bad design pattern to panic in fallible methods (i.e., methods that return a Result
).
However, in RDMA, this is not the case because there are actually two different types of errors:
- Programming errors: These are logic errors caused by the programmers who do not follow the instructions of the
manual.
Examples include forgetting to bind a QP to a local port before making or binding a peer for it.
These errors should be caught during development and testing, should never happen in production, and is never
recoverable.
rrddmma
panics for these errors. - Runtime errors: These are errors reported by the
libibverbs
library, such as failing to create a QP due to resource exhaustion. These errors are recoverable and should be handled by the caller.rrddmma
returnsErr
for these errors.
Dependencies
~3.5–7MB
~131K SLoC