I’m also not seeing the card recognised in `lspci`. I’m adding a post to this thread to keep all the info in one place for others, but let me know if it should be a different thread.
I’ve enabled `pcie-check.service` and get the following when I check the status;
alsutton@svr220:~$ sudo systemctl status pcie-check.service
● pcie-check.service - Check for PCIe devices with vendor ID 1f9d and reboot once if not found
Loaded: loaded (/etc/systemd/system/pcie-check.service; enabled; vendor preset: enabled)
Active: active (exited) since Wed 2025-08-06 09:33:01 UTC; 11min ago
Process: 1709 ExecStart=/usr/local/bin/check_pcie_device.sh (code=exited, status=0/SUCCESS)
Main PID: 1709 (code=exited, status=0/SUCCESS)
CPU: 6ms
Aug 06 09:33:01 svr220 systemd[1]: Starting Check for PCIe devices with vendor ID 1f9d and reboot once if not found...
Aug 06 09:33:01 svr220 check_pcie_device.sh[1709]: Reboot already performed; skipping further checks.
Aug 06 09:33:01 svr220 systemd[1]: Finished Check for PCIe devices with vendor ID 1f9d and reboot once if not found.
`dmesg` contains the following lines;
[  519.220595] triton: root directory for triton
[  844.197296] triton: debugfs root directory triton removed
`lspci -tv` contains;
+-[0000:80]-+-02.0-[81]--
|           +-02.3-[82]----00.0 Micron/Crucial Technology P2 [Nick P2] / P3 / P3 Plus NVMe PCIe SSD (DRAM-less)
(The NVMe PCIe SSD adapter is on the same riser card as the Axelera card, and when the card isn’t present there is no [81] entry)
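For anyone wanting to double-check the same thing without relying on the `lspci` names database, here is a minimal sketch that looks for the vendor ID directly in sysfs. It assumes the vendor ID `1f9d` from the service description above and the standard `/sys/bus/pci/devices` layout; the function name `find_vendor_devices` is just something I made up for illustration.

```shell
#!/bin/sh
# Sketch: print the address of every PCI device whose vendor ID matches.
# The default ID 1f9d comes from the pcie-check.service description;
# the function name and the second (directory) argument are hypothetical.
find_vendor_devices() {
    vendor="${1:-1f9d}"
    root="${2:-/sys/bus/pci/devices}"
    found=1
    for dev in "$root"/*; do
        # Skip anything without a readable vendor attribute
        [ -r "$dev/vendor" ] || continue
        # sysfs reports the vendor as a 0x-prefixed hex string, e.g. 0x1f9d
        if [ "$(cat "$dev/vendor")" = "0x$vendor" ]; then
            basename "$dev"
            found=0
        fi
    done
    # Exit status 0 if at least one device matched, 1 otherwise
    return "$found"
}
```

Calling `find_vendor_devices` with no arguments checks the live system; it prints nothing and returns non-zero when no matching device has enumerated.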
The metis module doesn’t autoload, so I’m loading it manually, and after doing that, any attempt to run `triton_multi_ctx` results in;
(venv) alsutton@svr220:~/Utils/voyager-sdk$ triton_multi_ctx --cold-boot 3
[libtriton_linux.c:985] Could not open directory '/sys/class/metis/': No such file or directory
Fail to get device name
(venv) alsutton@svr220:~/Utils/voyager-sdk$ sudo modprobe metis
[sudo] password for alsutton:
(venv) alsutton@svr220:~/Utils/voyager-sdk$ triton_multi_ctx --cold-boot 3
Fail to get device name
I’ve run `update-pciids`, added `pcie_aspm=off` to the grub cmdline (and run `update-grub`), reinstalled the kernel modules (`metis-dkms`), and tried `d3cold_allowed` set to both 0 and 1, triggering a PCI rescan after each change.
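In case it helps anyone retracing the `d3cold_allowed` and rescan steps, this is a rough sketch of the sysfs side. It assumes the attribute is set on the root port `0000:80:02.0` (the port above bus 81 in the `lspci -tv` output), which may not be the right device on other boards. The function name and the optional sysfs-root argument are hypothetical and only there so it can be dry-run against a scratch directory.

```shell
#!/bin/sh
# Sketch: set d3cold_allowed on one PCI device, then trigger a bus rescan.
# Needs root on a real system. The third argument (sysfs root) is a
# hypothetical convenience for dry runs; it defaults to /sys.
set_d3cold_and_rescan() {
    bdf="$1"            # device address, e.g. 0000:80:02.0 (an assumption)
    value="$2"          # 0 or 1
    sysfs="${3:-/sys}"
    attr="$sysfs/bus/pci/devices/$bdf/d3cold_allowed"
    if [ ! -e "$attr" ]; then
        echo "no d3cold_allowed attribute for $bdf" >&2
        return 1
    fi
    echo "$value" > "$attr" || return 1
    # Writing 1 to the bus-level rescan file asks the kernel to re-enumerate
    echo 1 > "$sysfs/bus/pci/rescan"
}
```

Run as root, for example `set_d3cold_and_rescan 0000:80:02.0 0`, then check `lspci -tv` again.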
Does anyone have any thoughts?
[Machine Details; Supermicro X10DRU-i+ with two Intel Xeon E5-2673 v4 CPUs]