Hi folks,
after I resolved some of the issues I had with my Metis PCIe device, I stopped working with it since it was so loud that I got headache after 30min of runtime.
Now I realized that there is a new FW which silences the fan and I upgraded the SDK and the FW:
(venv) root@holodeck7:/voyager-sdk# axdevice -v
INFO: Found PCI device: 01:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
INFO: Found AIPU driver: metis 90112 0
INFO: Firmware version matches: v1.3.1
INFO: Using device metis-0:1:0
Device 0: metis-0:1:0 4GiB pcie flver=1.2.0-rc2 bcver=1.0 clock=800MHz(0-3:800MHz) mvm=0-3:100%
device_runtime_firmware=v1.3.1
board_controller_board_type=matterhorn
sw_throttling: 200°C, hysteresis 5°C, throttle rate:12%
hw_throttling: 105°C, hysteresis 10°C
pvt_warning_threshold: 95°C
BUT the fan keeps on spinning at max although the Metis is not being used at all.
Tried to find means to read-out the actual temperature, which does not seem to work:
(venv) root@holodeck7:/voyager-sdk# triton_multi_ctx --board-temp
blibtriton_linux.c:1082] Device communication timed out: device did not respond within 1 seconds. (705)
Failed to read sensor temperature from the board controller
I also tried to reduce the chip clock to the minimum:
(venv) root@holodeck7:/voyager-sdk# axdevice -v
INFO: Found PCI device: 01:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
INFO: Found AIPU driver: metis 90112 0
INFO: Firmware version matches: v1.3.1
INFO: Using device metis-0:1:0
Device 0: metis-0:1:0 4GiB pcie flver=1.2.0-rc2 bcver=1.0 clock=100MHz(0-3:100MHz) mvm=0-3:100%
device_runtime_firmware=v1.3.1
board_controller_board_type=matterhorn
sw_throttling: 200°C, hysteresis 5°C, throttle rate:12%
hw_throttling: 105°C, hysteresis 10°C
pvt_warning_threshold: 95°C
Still, Metis is idle but the fan is spinning at max. I am ok when the fan boosts when the card is being used, but when it is idle I expect the fan to just stop. I can’t use this PC unless I unplug the Metis PCIe card, which is a pita and might wear out the PCIe connector over time…
So simple question: how can I control the fan? Most relevant: I want the fan to stop when device is idle.
Thanks