I am looking for a way to run inference with Qwen VL 32b instruct A3b on the edge using Axelera hardware in the M.2 form factor. I would need a lot of RAM to make this happen, though: at least 48 GB. Could anyone shed some light on what my options are? I need to do video inference and scene understanding on request from the user. My idea is to grab a few frames whenever a request is initiated and then choose the best frame to run inference on.
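To make the frame-selection idea concrete, here is a rough sketch of what I mean by "choose the best frame". I have not settled on what "best" means, so this assumes it means "sharpest" and scores each frame by the variance of a simple 4-neighbour Laplacian, a common blur heuristic. The function names `sharpness` and `pick_best_frame` are made up for illustration, and frames are plain 2D lists of grayscale values (at least 3x3) so the snippet runs without any camera or imaging library.

```python
from statistics import pvariance


def sharpness(frame):
    """Score a grayscale frame (2D list of numbers, at least 3x3).

    Assumption: "best" means "sharpest". The score is the variance of a
    4-neighbour Laplacian over the interior pixels; blurry or flat frames
    produce a low score, frames with crisp edges a high one.
    """
    height, width = len(frame), len(frame[0])
    responses = [
        frame[y - 1][x] + frame[y + 1][x]
        + frame[y][x - 1] + frame[y][x + 1]
        - 4 * frame[y][x]
        for y in range(1, height - 1)
        for x in range(1, width - 1)
    ]
    return pvariance(responses)


def pick_best_frame(frames):
    """Return the index of the frame with the highest sharpness score."""
    return max(range(len(frames)), key=lambda i: sharpness(frames[i]))


if __name__ == "__main__":
    # Synthetic stand-ins for captured frames: a featureless frame and a
    # high-contrast checkerboard.
    flat = [[128] * 8 for _ in range(8)]
    checker = [[255 if (x + y) % 2 == 0 else 0 for x in range(8)]
               for y in range(8)]
    print(pick_best_frame([flat, checker, flat]))
```

Only the selected frame would then be passed to the model, so the expensive inference runs once per request. Other criteria (exposure, motion, whether the subject is in view) could replace or be combined with the sharpness score.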
Question
Qwen 32b instruct A3b on the edge

