A collection of the best methods, practices, and configuration values known for delivering the best possible performance in a consistent and predictable way for the workloads built on the librpma library.
General notes
The Performance - Tuning aims to collect all tested and proven procedures with known beneficial effects on the performance of the workloads built on the librpma library. Nonetheless, each change of configuration should be set up and tested in a testing environment before being applied to a production system. Backing up all data and configuration settings prior to tuning is also recommended.
BIOS settings
Cascade Lake
- Platform and CPU related [1.1]
- Disable lower CPU power states: C-states [1.2], C1E, and memory and PCI-e power saving states. Settings vary from vendor to vendor so some of them may not be available for you e.g.:
- Power and Performance - CPU C State Control - Package C-State - C0/C1 state
- Power and Performance - CPU C State Control - C1E - Disabled
- Check for other settings that might influence performance. This varies greatly by OEM, but should include anything power related, such as fan speed settings (more is better) e.g.:
- Power and Performance - CPU Power and Performance Policy - Performance
- System Acoustic and Performance Configuration - Set Fan Profile - Performance
- PMem-related
- Configure maximum available operating power for your PMem devices [XXX source and details are missing]. Note: Different sizes of PMem devices have different performance capabilities. If it is important for you, pick the right one for your application e.g.: [1.3]
- Memory Configuration - Average Power Budget - 18 mW
- Memory Configuration - NVM Performance Setting - Latency Optimized
Not yet confirmed [1.1]
- Ensure that Intel® Turbo Boost [1.4] is on.
- Power and Performance - CPU P State Control - Intel Turbo Boost Technology - Enabled
- Power and Performance - CPU P State Control - Energy Efficient Turbo - Disabled
- Disable hyper-threading to reduce variations in latency (jitter).
- Processor Configuration - Intel(R) Hyper-Threading Tech - Disabled
- Disable any virtualization options.
- Processor Configuration - Intel(R) Virtualization Technology - Disabled
- Disable any monitoring options.
- Memory Configuration - Thermal Monitor - Disabled
- Disable Hardware Power Management, introduced in the Intel® Xeon® processor E5-2600 v4 product family. It provides more control over power management, but it can cause jitter and so is not recommended for latency-sensitive applications.
Ice Lake
- Platform and CPU related [1.1]
- Disable lower CPU power states: C-states [1.2] and memory and PCI-e power saving states. Settings vary from vendor to vendor so some of them may not be available for you e.g.:
- Advanced Power Management Configuration - Package C State Control - Package C-State - C0/C1 state
- Check for other settings that might influence performance. This varies greatly by OEM, but should include anything power related, such as fan speed settings (more is better) e.g.:
- System Acoustic and Performance Configuration - Set Fan Profile - Performance
- PMem-related
- configure maximum available operating power for your PMem devices [XXX source and details are missing]. Note: Different sizes of PMem devices have different performance capabilities. If it is important for you, pick the right one for your application e.g.: [1.3]
- Memory Configuration - PMem Configuration - 200 Series PMem Average Power Limit (in mW) - 15 mW
- Memory Configuration - PMem Configuration - PMem Performance Setting - BW Optimized
Not yet confirmed [1.1]
- Disable hyper-threading to reduce variations in latency (jitter).
- Processor Configuration - Intel(R) Hyper-Threading Tech - Disabled
- Disable any monitoring options.
- Advanced Power Management Configuration - CPU Thermal Management - Thermal Monitor - Disable
- Disable Hardware Power Management, introduced in the Intel® Xeon® processor E5-2600 v4 product family. It provides more control over power management, but it can cause jitter and so is not recommended for latency-sensitive applications.
- Advanced Power Management Configuration - CPU P State Control - Energy Efficient Turbo - Disable
- Advanced Power Management Configuration - CPU P State Control - Turbo Mode - Disable
References
CPU
Disabling intel_pstate
is recommended for latency sensitive workloads. [2.1]
CPU driver
To disable intel_pstate
CPU driver edit your /etc/default/grub
and add the following to the kernel command line:
GRUB_CMDLINE_LINUX_DEFAULT="intel_pstate=disable"
Rebuild the grub.cfg file as follows (make sure you use the path adjusted to your OS):
- On BIOS-based machines, issue the following command:
sudo grub2-mkconfig -o /boot/grub2/grub.cfg
- On UEFI-based machines, issue the following command:
sudo grub2-mkconfig -o /boot/efi/EFI/redhat/grub.cfg
This change requires a reboot to take effect.
CPU scaling governor
Set the most performant scaling governor:
sudo cpupower frequency-set --governor performance
CPU frequency
List available frequencies and pick the biggest one.
cpupower frequency-info | grep steps
available frequency steps: 2.30 GHz, 2.30 GHz, 2.20 GHz, 2.10 GHz, 2.00 GHz, 1.90 GHz, 1.80 GHz, 1.70 GHz, 1.60 GHz, 1.50 GHz, 1.40 GHz, 1.30 GHz, 1.20 GHz, 1.10 GHz, 1000 MHz
Set the frequency range to the single biggest available value:
FREQ=2.3Ghz
sudo cpupower frequency-set --min $FREQ
sudo cpupower frequency-set --max $FREQ
References
RDMA-capable network interface (RNIC)
PCIe attributes [3.1]
- Width - provide for the RNIC the maximum number of PCIe lanes it is capable of using
- Speed - attach the RNIC to PCIe interface of maximum supported speed
- Max Payload Size - learn what the Max Payload Size for your system is and adjust expectations accordingly
- Max Read Request - configure the maximum available read request value
Knowing all of these attributes allows calculating the RNIC bandwidth limitation. For details please see “Understanding PCIe Configuration for Maximum Performance” Chapter “Calculating PCIe Limitations”.
List supported link modes:
$ ethtool ens785f0
Settings for ens785f0:
Supported ports: [ Backplane ]
Supported link modes: 1000baseKX/Full
# ...
100000baseKR4/Full
100000baseSR4/Full
100000baseCR4/Full
100000baseLR4_ER4/Full
# ...
Set the maximum available value:
$ sudo ethtool -s ens785f0 speed 100000 autoneg off
Maximum Transmission Unit (MTU) [3.3]
Set MTU to the maximum recommended value. In most cases, it is 4200.
$ ifconfig ens785f0 mtu 4200
References
Running workloads
In order to learn how to run the individual workloads, please visit: https://github.com/pmem/rpma/main/tools/perf/BENCHMARKING.md
Disclaimer
Performance varies by use, configuration and other factors.
No product or component can be absolutely secure.
Your costs and results may vary.
Intel technologies may require enabled hardware, software or service activation.
Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade.