Are multimodal models the way forward?

Forum|Forum|1 year ago
April 3, 2025
1 reply
76 views

Tinkers Rucksack
Cadet

There’s a lot of new hype everyday around new models that can do language + images + audio etc all at once. There’s a convenience to it, but is it the best way? Can’t help but think ”jack of all trades” when a new one is announced.

I suppose it could be because of the chase towards the first AGI, but if you’re building an AI tool or platform, don’t you want a specialized model rather than a general one?

Would be interested to hear if anyone here has actually built something on top of this — or tried and hit a wall.

+3

Spanner
Axelera Team
Forum|Forum|1 year ago
April 3, 2025

I don’t disagree at all. Even when you buy a computer or a phone, you pick according to your needs - you don’t just automatically opt for an all-rounder. No reason AI wouldn’t be more effective if it developed in the same way to meet specific needs 👍

Like

Sign up

Log in, or create an Axelera AI account

Login to the community

Log in, or create an Axelera AI account