Installing and Configuring Mellanox InfiniBand Driver
To use InfiniBand (IB) networking, you must install the Mellanox OFED (OpenFabrics Enterprise Distribution) driver, which supports RDMA and other IB features.
Step 1: Verify InfiniBand Hardware
Before installation, check if the InfiniBand adapter is detected:
If you see an output like:
Then your IB card is detected.
Step 2: Download Mellanox OFED Driver
Get the latest Mellanox OFED driver from NVIDIA's official website:
🔗 Mellanox OFED Downloads
Alternatively, use wget
:
(Change the version based on your OS.)
Step 3: Install the Mellanox OFED Driver
1️⃣ Extract the downloaded file:
2️⃣ . Run the installer:
(The --without-fw-update
flag prevents accidental firmware updates.)
3️⃣ Reboot the system:
Step 4: Verify Installation
Check if the InfiniBand modules are loaded:
You should see output like:
Check the status of the IB interfaces:
It will display details about the InfiniBand HCA (Host Channel Adapter) if properly installed.
Step 5: Configure the InfiniBand Network
Enable IB Interface
1️⃣ List network interfaces:
Example output:
2️⃣ Assign an IP address (if using IPoIB):
Edit /etc/sysconfig/network-scripts/ifcfg-ib0
(for RHEL/AlmaLinux) or /etc/netplan/
(for Ubuntu):
Then restart the network service:
Step 6: Test InfiniBand Communication
1️⃣ Check the InfiniBand link status:
2️⃣ . Run a bandwidth test:
3️⃣ Run an RDMA ping test:
(on another node)
Conclusion
✅ Mellanox OFED installation enables InfiniBand communication in HPC systems.
✅ Use RDMA for low-latency, high-speed networking.
✅ Ensure IB devices are configured and active using ibstat
, ibping
, and ib_send_bw
.
Comments
Post a Comment