Skip to main content
Question

Axelera Metis PCIe AI Accelerator not recognized by lspci

  • April 4, 2025
  • 31 replies
  • 1229 views

Show first post

31 replies

  • Author
  • Cadet
  • April 11, 2025

Hello Victor,

I followed your instructions and retested the setup. Here are the details for completeness:

  • Kernel Parameters:
    Initially, running:

    cat /proc/cmdline

    produced:

    BOOT_IMAGE=/boot/vmlinuz-6.8.0-57-generic root=UUID=60bd6ac3-6af8-4ba0-9722-935f61fb73a3 ro quiet splash amd_iommu=off vt.handoff=7

    I then noticed that the pcie_aspm parameter was missing, so I updated my configuration. The updated output is now:

    BOOT_IMAGE=/boot/vmlinuz-6.8.0-57-generic root=UUID=60bd6ac3-6af8-4ba0-9722-935f61fb73a3 ro quiet splash amd_iommu=off pcie_aspm=off vt.handoff=7
  • Refresh Command Output:
    With the corrected kernel parameter in place, I executed the refresh command (using axdevice --refresh -v) several times. Here’s a representative output:

    INFO:axelera.runtime.axdevice:Removing 0000:03:00.0
    INFO:axelera.runtime.axdevice:PCIE rescan
    0000:04:00.0 : Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found PCI device: 04:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found AIPU driver: metis 90112 0
    WARNING:axelera.runtime:4PCI device count mismatch: lspci=1, triton=0
    Traceback (most recent call last):
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/bin/axdevice", line 8, in <module>
    sys.exit(entrypoint_main())
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 625, in entrypoint_main
    main(args)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 608, in main
    devices = _find_devices(found_devices, device_id)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 187, in _find_devices
    raise RuntimeError("No devices found, use -v for more information")
    RuntimeError: No devices found, use -v for more information
    (venv) tripton@tripton-ubuntu:~/repos/voyager-sdk$ axdevice --refresh -v
    INFO:axelera.runtime.axdevice:Removing 0000:03:00.0
    INFO:axelera.runtime.axdevice:PCIE rescan
    0000:04:00.0 : Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found PCI device: 04:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found AIPU driver: metis 90112 0
    WARNING:axelera.runtime:4PCI device count mismatch: lspci=1, triton=0
    Traceback (most recent call last):
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/bin/axdevice", line 8, in <module>
    sys.exit(entrypoint_main())
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 625, in entrypoint_main
    main(args)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 608, in main
    devices = _find_devices(found_devices, device_id)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 187, in _find_devices
    raise RuntimeError("No devices found, use -v for more information")
    RuntimeError: No devices found, use -v for more information
    (venv) tripton@tripton-ubuntu:~/repos/voyager-sdk$ axdevice --refresh -v
    INFO:axelera.runtime.axdevice:Removing 0000:03:00.0
    INFO:axelera.runtime.axdevice:PCIE rescan
    0000:04:00.0 : Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found PCI device: 04:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found AIPU driver: metis 90112 0
    WARNING:axelera.runtime:4PCI device count mismatch: lspci=1, triton=0
    Traceback (most recent call last):
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/bin/axdevice", line 8, in <module>
    sys.exit(entrypoint_main())
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 625, in entrypoint_main
    main(args)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 608, in main
    devices = _find_devices(found_devices, device_id)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 187, in _find_devices
    raise RuntimeError("No devices found, use -v for more information")
    RuntimeError: No devices found, use -v for more information
    (venv) tripton@tripton-ubuntu:~/repos/voyager-sdk$ axdevice --refresh -v
    INFO:axelera.runtime.axdevice:Removing 0000:03:00.0
    INFO:axelera.runtime.axdevice:PCIE rescan
    0000:04:00.0 : Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found PCI device: 04:00.0 Processing accelerators: Axelera AI Metis AIPU (rev 02)
    INFO:axelera.runtime:Found AIPU driver: metis 90112 0
    WARNING:axelera.runtime:4PCI device count mismatch: lspci=1, triton=0
    Traceback (most recent call last):
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/bin/axdevice", line 8, in <module>
    sys.exit(entrypoint_main())
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 625, in entrypoint_main
    main(args)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 608, in main
    devices = _find_devices(found_devices, device_id)
    File "/home/tripton/.cache/axelera/venvs/93f45ae3/lib/python3.10/site-packages/axelera/runtime/axdevice.py", line 187, in _find_devices
    raise RuntimeError("No devices found, use -v for more information")
    RuntimeError: No devices found, use -v for more information

The output shows that although the device is being detected by the system (as reflected in the logs), it still isn’t found correctly by the driver, and the device count mismatch remains.

Please let me know if you require any additional details or further tests.

Thanks,

Tripton


Forum|alt.badge.img+2
  • Community Manager
  • April 14, 2025

Hello ​@tripton ,

Just to double check, did you do sudo update-grub  and then added pcie_aspm=off and then did sudo reboot?

 

I have some additional requests:

  • Can you share with us the information of which host systems you are testing?
  • I see you are now testing your AMD host. Can you try also all the suggestions from my previous messages for the Intel host? Note that for intel it is intel_iommu=off.
  • In your Metis card there should be a serial number. Can you share that with us?

Thank you beforehand,

Victor


Forum|alt.badge.img+2
  • Community Manager
  • April 15, 2025

Hi ​@tripton ,

Do you have any updates on:

  • Can you share with us the information of which host systems you are testing?
  • I see you are now testing your AMD host. Can you try also all the suggestions from my previous messages for your Intel host? Note that for intel it is intel_iommu=off.
  • In your Metis card there should be a serial number. Can you share that with us?

@Spanner Please keep track of this post while I am off if possible ( I am back from Wednesday next week). Thanks!


  • Author
  • Cadet
  • April 16, 2025

Hello Victor and team,

Sorry for the late response; I’ve been busy with other tasks.

  • Host System Information:
    I’m currently testing on an AMD host with an ASUS B550-F motherboard, an AMD 5950X CPU, and an NVIDIA RTX 3080. The system is still running Ubuntu 22.04. Let me know if you need any additional details about this host.

  • Kernel Parameter Confirmation:
    I did run sudo update-grub after adding pcie_aspm=off and rebooted. I can confirm the parameter is active, as seen in my dmesg output:

    [    0.059275] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-6.8.0-57-generic root=UUID=60bd6ac3-6af8-4ba0-9722-935f61fb73a3 ro quiet splash amd_iommu=off pcie_aspm=off vt.handoff=7
  • Intel Host Testing:
    I will test the Intel host next week and share the results with you.

  • Serial Number on the Metis Card:
    I’m not completely sure where the serial number is located. If it’s on the sticker with the barcode, it appears as either “AD-PEG-AM1A” or “A96BD002489.”

Please let me know if any more information or tests are needed. I’ll keep you posted about the intel system.

Thanks,
Tripton


  • Cadet
  • August 6, 2025
Hi everyone, I just bought my board and have been having trouble with the same issue.The firmware on my board was trouble - "4PCI device count mismatch: lspci=1, triton=0" in my AMD environment and I was having problems with it not being recognized.I switched to an Intel environment and connected it, and the firmware update was successful and it was recognized without any problems.Next I changed it to AMD environment and it worked fine.It seems that the Metis AIPU board with the initial firmware does not work well in an AMD environment.

Spanner
Axelera Team
Forum|alt.badge.img+3
  • Axelera Team
  • August 6, 2025

Thanks for sharing the update ​@FUKU55, great to hear your board is up and running. Good workaround there. The firmware update was likely the key step here. Once that’s done, things usually run smoothly across different systems. 👍