I'm evaluating your devices. I really love both the European heart and the great TOPS/Watt ratio.
It would be amazing for edge computing.
On the other hand, our company is waiting for a GX10 from ASUS/NVIDIA with 128 GB of shared memory.
Explaining why I find the Metis form factor more interesting than PCIe would take a very long document :-)
I read that your SDK supports offloading, but from what I have seen on the forum, I think that support is limited to data and does not cover the core of the model.
As you know, with CUDA + PyTorch you can offload part of the model out of GPU memory and execute it a small piece at a time, swapping the weights between GPU memory and host RAM as needed.
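To make clear what kind of offloading I mean, here is a minimal sketch in plain PyTorch. It is only an illustration: the toy model, the layer sizes, and the `run_offloaded` helper are hypothetical names of mine, and nothing here refers to your SDK. The weights stay in host RAM and only one layer at a time is copied to the device.

```python
import torch
import torch.nn as nn

# Use the GPU when present; fall back to CPU so the sketch still runs anywhere.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical toy model: a stack of layers whose weights live in host RAM.
torch.manual_seed(0)
layers = nn.ModuleList([nn.Linear(64, 64) for _ in range(4)]).eval()


@torch.no_grad()
def run_offloaded(layers, x):
    """Run the layers one at a time, keeping only one layer on the device."""
    x = x.to(device)
    for layer in layers:
        layer.to(device)   # copy this layer's weights: host RAM -> device memory
        x = torch.relu(layer(x))
        layer.to("cpu")    # move them back to host RAM to free device memory
    return x.cpu()


x = torch.randn(2, 64)
y = run_offloaded(layers, x)
```

Peak device memory is then roughly one layer plus the activations instead of the whole model, at the cost of a transfer per layer.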
Are you planning to improve your off-loading system in your SDK?
Are you planning to introduce a Metis with more RAM?