MLXCX(4D) Devices MLXCX(4D)
NAME
mlxcx - Mellanox ConnectX-4/5/6 Ethernet controller driver
SYNOPSIS
/dev/net/mlxcx*DESCRIPTION
The
mlxcx driver is a GLDv3 NIC driver for the ConnectX-4, ConnectX-4
Lx, ConnectX-5 and ConnectX-6 families of ethernet controllers from
Mellanox. It supports the Data Link Provider Interface,
dlpi(4P).
This driver supports:
- Jumbo frames up to 9000 bytes.
- Checksum offload for TCP, UDP, IPv4 and IPv6.
- Group support with VLAN and MAC steering to avoid software
classification when using VNICs.
- Promiscuous access via
snoop(8) and
dlpi(4P) - LED control
- Transceiver information
- Internal temperature sensors
At this time, the driver does not support Large Send Offload (LSO),
energy efficient Ethernet (EEE), or the use of flow control through
hardware pause frames.
CONFIGURATION
The
mlxcx.conf file contains user configurable parameters, including
the ability to set the number of rings and groups advertised to MAC,
the sizes of rings and groups, and the maximum number of MAC address
filters available.
PROPERTIES
The driver supports the following device properties which may be tuned
through its driver.conf file,
/kernel/drv/mlxcx.conf. These properties
cannot be changed after the driver has been attached.
These properties are not considered stable at this time, and may
change.
eq_size_shift Minimum:
2 | Maximum:
device dependent (up to 255) The
eq_size_shift property determines the number of entries on
Event Queues for the device. The number of entries is
calculated as (1 << eq_size_shift), so a value of 9 would mean
512 entries are created on each Event Queue. The default value
is
9.
cq_size_shift Minimum:
2 | Maximum:
device dependent (up to 255) The
cq_size_shift property determines the number of entries on
Completion Queues for the device. The number of entries is
calculated as (1 << cq_size_shift), so a value of 9 would mean
512 entries are created on each Event Queue. The default value
is device dependent,
10 for devices with maximum supported
speed of 10Gb/s or less and
12 for devices with higher
supported speeds. This should be kept very close to the value
set for
rq_size_shift and
sq_size_shift.
rq_size_shift Minimum:
2 | Maximum:
device dependent (up to 255) The
rq_size_shift property determines the number of descriptors
on Receive Queues for the device. The number of descriptors is
calculated as (1 << rq_size_shift), so a value of 9 would mean
512 descriptors are created on each Receive Queue. This sets
the number of packets on RX rings advertised to MAC. The
default value is device dependent,
10 for devices with maximum
supported speed of 10Gb/s or less and
12 for devices with
higher supported speeds.
sq_size_shift Minimum:
2 | Maximum:
device dependent (up to 255) The
sq_size_shift property determines the number of descriptors
on Send Queues for the device. The number of descriptors is
calculated as (1 << sq_size_shift), so a value of 9 would mean
512 descriptors are created on each Send Queue. This sets the
number of packets on RX rings advertised to MAC. The default
value is device dependent,
11 for devices with maximum
supported speed of 10Gb/s or less and
13 for devices with
higher supported speeds. Note that large packets often occupy
more than one descriptor slot on the SQ, so it is sometimes a
good idea to increase this if using a large MTU.
tx_ngroups Minimum:
1 | Maximum:
device dependent The
tx_ngroups property determines the number of TX groups
advertised to MAC. The default value is
1.
tx_nrings_per_group Minimum:
1 | Maximum:
device dependent The
tx_nrings_per_group property determines the number of rings
in each TX group advertised to MAC. The default value is
64.
rx_ngroups_large Minimum:
1 | Maximum:
device dependent The
rx_ngroups_large property determines the number of "large"
RX groups advertised to MAC. The size of "large" RX groups is
set by the
rx_nrings_per_large_group property. The default
value is
2.
rx_nrings_per_large_group Minimum:
1 | Maximum:
device dependent The
rx_nrings_per_large_group property determines the number of
rings in each "large" RX group advertised to MAC. The number
of such groups is determined by the
rx_ngroups_large property.
The default value is
16.
rx_ngroups_small Minimum:
1 | Maximum:
device dependent The
rx_ngroups_small property determines the number of "small"
RX groups advertised to MAC. The size of "small" RX groups is
set by the
rx_nrings_per_small_group property. It is
recommended to use many small groups when using a large number
of VNICs on top of the NIC (e.g. on a system with many zones).
The default value is
256.
rx_nrings_per_small_group Minimum:
1 | Maximum:
device dependent The
rx_nrings_per_small_group property determines the number of
rings in each "small" RX group advertised to MAC. The number
of such groups is determined by the
rx_ngroups_small property.
The default value is
4.
ftbl_root_size_shift Minimum:
4 | Maximum:
device dependent The
ftbl_root_size_shift property determines the number of flow
table entries on the root flow table, and therefore how many
MAC addresses can be filtered into groups across the entire
NIC. The number of flow entries is calculated as (1 <<
ftbl_root_size_shift), so a value of 9 would mean 512 entries
are created in the root flow table. The default value is
12.
cqemod_period_usec Minimum:
1 | Maximum:
65535 The
cqemod_period_usec property determines the maximum delay
after a completion event has occurred before an event queue
entry (and thus an interrupt) is generated. The delay is
measured in microseconds. The default value is
50.
cqemod_count Minimum:
1 | Maximum:
65535 The
cqemod_count property determines the maximum number of
completion events that can have occurred before an event queue
entry (and thus an interrupt) is generated. The default value
is
80% of the CQ size.
intrmod_period_usec Minimum:
1 | Maximum:
65535 The
intrmod_period_usec property determines the maximum delay
after an event queue entry has been generated before an
interrupt is raised. The delay is measured in microseconds.
The default value is
10.
tx_bind_threshold Minimum:
1 | Maximum:
65535 The
tx_bind_threshold property determines the minimum number of
bytes in a packet before the driver uses
ddi_dma_addr_bind_handle(9F) to bind the packet memory for DMA,
rather than copying the memory as it does for small packets.
DMA binds are expensive and involve taking locks in the PCI
nexus driver, so it is seldom worth using them for small
packets. The default value is
2048.
rx_limit_per_completion Minimum:
16 | Maximum:
4096 The
rx_limit_per_completion property determines the maximum
number of packets that will be processed on a given completion
ring during a single interrupt. This is done to try and
guarantee some amount of liveness in the system. The default
value is
256.
rx_p50_loan_min_size Minimum:
0 | Maximum:
MTU The
rx_p50_loan_min_size property determines the minimum size
of packet buffers allowed to be loaned to MAC when the ring has
reached >=50% of its buffers already on loan. Packet buffers
larger than this value will be copied. At <50% of ring buffers
on loan, all buffers will be loaned. At >=75% of buffers on
loan, all packets will be copied instead to ensure ring
availability. The default value is
256.
FILES
/kernel/drv/amd64/mlxcx Device driver (x86)
/kernel/drv/mlxcx.conf Driver configuration file containing
user-configurable options
SEE ALSO
dlpi(4P),
driver.conf(5),
dladm(8),
snoop(8)illumos August 27, 2020 illumos