I created a custom model based on yolo26n, 'yolo26n-aya'.
The performance results for two input sources (256x256 bitmap images and an mp4 video) are shown below.
(a) images
(venv) ubuntu@antelao-3588:~/voyager-sdk$ AXELERA_USE_CL_DOUBLE_BUFFER=0 ./inference.py yolo26n-aya ./data/aya100/images --show-stats --no-display
========================================================================
Element Time(μs) Effective FPS
========================================================================
axinplace-addstreamid0 141 7,076.4
inference-task0:libtransform_resize_cl_0 371 2,692.7
inference-task0:libtransform_padding_0 844 1,184.0
inference-task0:inference 2,961 337.7
inference-task0:Inference latency 208,874 n/a
inference-task0:libdecode_yolov8_0 653 1,531.2
inference-task0:Postprocessing latency 25,097 n/a
inference-task0:Total latency 308,970 n/a
========================================================================
End-to-end average measurement 0.0
========================================================================
(b) mp4
(venv) ubuntu@antelao-3588:~/voyager-sdk$ AXELERA_USE_CL_DOUBLE_BUFFER=0 ./inference.py yolo26n-aya ./media/traffic3_720p.mp4 --show-stats --no-display
========================================================================
Element Time(μs) Effective FPS
========================================================================
qtdemux0 117 8,546.7
h264parse0 646 1,546.7
capsfilter0 153 6,516.0
mppvideodec0 6,368 157.0
decodebin-link0 115 8,636.4
inference-task0:libtransform_resize_cl_0 542 1,844.6
inference-task0:libtransform_padding_0 778 1,285.0
inference-task0:inference 550 1,816.0
inference-task0:Inference latency 24,772 n/a
inference-task0:libdecode_yolov8_0 581 1,719.9
inference-task0:Postprocessing latency 2,782 n/a
inference-task0:Total latency 35,448 n/a
========================================================================
End-to-end average measurement 674.6
========================================================================
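For reference, this is how I am reading the two columns. Effective FPS looks to be roughly 1,000,000 divided by the element time in microseconds, although the printed values do not match exactly, so this is my assumption and not something taken from the SDK documentation. The helper name `effective_fps` is just for illustration.

```python
def effective_fps(time_us: float) -> float:
    """Convert a per-frame element time in microseconds to frames per second.

    Assumption: the stats table derives FPS as 1e6 / time_us (approximately).
    """
    return 1_000_000.0 / time_us


# inference-task0:inference times copied from the two runs above
for label, time_us in [("images", 2961), ("mp4", 550)]:
    print(f"{label}: {time_us} us -> {effective_fps(time_us):.1f} FPS")
```

If this reading is right, the gap between 2,961 and 550 is a real difference in per-frame time for the same element, not a display artifact.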
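I have two questions:
(1) Why are the values of inference-task0:inference so different between the two runs (2,961 μs for the images versus 550 μs for the mp4)?
(2) My goal is to measure the end-to-end time from the input (image) buffer in main memory to the output (result) buffer. Which of the values above, if any, corresponds to that, or how should I measure it?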
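To make (2) concrete, below is a minimal sketch of the kind of measurement I have in mind: wall-clock time per frame around a single call that takes an input buffer and returns the result. The `infer` callable and the function name `measure_end_to_end_us` are hypothetical placeholders, not Voyager SDK APIs; here a dummy function stands in for the real pipeline.

```python
import time
from statistics import mean


def measure_end_to_end_us(infer, frames, warmup=5):
    """Return per-frame wall-clock times in microseconds for infer(frame).

    `infer` is a hypothetical callable: input buffer in, result buffer out.
    """
    # Warm-up calls are not timed, so one-off setup cost is excluded.
    for frame in frames[:warmup]:
        infer(frame)

    timings = []
    for frame in frames:
        start = time.perf_counter_ns()
        infer(frame)
        timings.append((time.perf_counter_ns() - start) / 1000.0)
    return timings


# Dummy stand-in for the real model: 20 zero-filled 256x256 single-channel buffers.
frames = [bytes(256 * 256)] * 20
timings = measure_end_to_end_us(lambda frame: sum(frame), frames)
print(f"mean end-to-end time: {mean(timings):.1f} us over {len(timings)} frames")
```

This times each frame synchronously, one at a time, so it would show latency per frame and not the pipelined throughput.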
Thank you in advance for your assistance.
