100 строки
3.7 KiB
Plaintext
100 строки
3.7 KiB
Plaintext
SYSFS FILES
|
|
|
|
For each InfiniBand device, the InfiniBand drivers create the
|
|
following files under /sys/class/infiniband/<device name>:
|
|
|
|
node_type - Node type (CA, switch or router)
|
|
node_guid - Node GUID
|
|
sys_image_guid - System image GUID
|
|
|
|
In addition, there is a "ports" subdirectory, with one subdirectory
|
|
for each port. For example, if mthca0 is a 2-port HCA, there will
|
|
be two directories:
|
|
|
|
/sys/class/infiniband/mthca0/ports/1
|
|
/sys/class/infiniband/mthca0/ports/2
|
|
|
|
(A switch will only have a single "0" subdirectory for switch port
|
|
0; no subdirectory is created for normal switch ports)
|
|
|
|
In each port subdirectory, the following files are created:
|
|
|
|
cap_mask - Port capability mask
|
|
lid - Port LID
|
|
lid_mask_count - Port LID mask count
|
|
rate - Port data rate (active width * active speed)
|
|
sm_lid - Subnet manager LID for port's subnet
|
|
sm_sl - Subnet manager SL for port's subnet
|
|
state - Port state (DOWN, INIT, ARMED, ACTIVE or ACTIVE_DEFER)
|
|
phys_state - Port physical state (Sleep, Polling, LinkUp, etc)
|
|
|
|
There is also a "counters" subdirectory, with files
|
|
|
|
VL15_dropped
|
|
excessive_buffer_overrun_errors
|
|
link_downed
|
|
link_error_recovery
|
|
local_link_integrity_errors
|
|
port_rcv_constraint_errors
|
|
port_rcv_data
|
|
port_rcv_errors
|
|
port_rcv_packets
|
|
port_rcv_remote_physical_errors
|
|
port_rcv_switch_relay_errors
|
|
port_xmit_constraint_errors
|
|
port_xmit_data
|
|
port_xmit_discards
|
|
port_xmit_packets
|
|
symbol_error
|
|
|
|
Each of these files contains the corresponding value from the port's
|
|
Performance Management PortCounters attribute, as described in
|
|
section 16.1.3.5 of the InfiniBand Architecture Specification.
|
|
|
|
The "pkeys" and "gids" subdirectories contain one file for each
|
|
entry in the port's P_Key or GID table respectively. For example,
|
|
ports/1/pkeys/10 contains the value at index 10 in port 1's P_Key
|
|
table.
|
|
|
|
There is an optional "hw_counters" subdirectory that may be under either
|
|
the parent device or the port subdirectories or both. If present,
|
|
there are a list of counters provided by the hardware. They may match
|
|
some of the counters in the counters directory, but they often include
|
|
many other counters. In addition to the various counters, there will
|
|
be a file named "lifespan" that configures how frequently the core
|
|
should update the counters when they are being accessed (counters are
|
|
not updated if they are not being accessed). The lifespan is in milli-
|
|
seconds and defaults to 10 unless set to something else by the driver.
|
|
Users may echo a value between 0 - 10000 to the lifespan file to set
|
|
the length of time between updates in milliseconds.
|
|
|
|
MTHCA
|
|
|
|
The Mellanox HCA driver also creates the files:
|
|
|
|
hw_rev - Hardware revision number
|
|
fw_ver - Firmware version
|
|
hca_type - HCA type: "MT23108", "MT25208 (MT23108 compat mode)",
|
|
or "MT25208"
|
|
|
|
HFI1
|
|
|
|
The hfi1 driver also creates these additional files:
|
|
|
|
hw_rev - hardware revision
|
|
board_id - manufacturing board id
|
|
tempsense - thermal sense information
|
|
serial - board serial number
|
|
nfreectxts - number of free user contexts
|
|
nctxts - number of allowed contexts (PSM2)
|
|
chip_reset - diagnostic (root only)
|
|
boardversion - board version
|
|
ports/1/
|
|
CCMgtA/
|
|
cc_settings_bin - CCA tables used by PSM2
|
|
cc_table_bin
|
|
cc_prescan - enable prescaning for faster BECN response
|
|
sc2v/ - 32 files (0 - 31) used to translate sl->vl
|
|
sl2sc/ - 32 files (0 - 31) used to translate sl->sc
|
|
vl2mtu/ - 16 (0 - 15) files used to determine MTU for vl
|