Meta’s vanilla Maverick AI model ranks below rivals on a popular chat benchmark

nyheder
Posts: 9969
Joined: Tue Sep 22, 2020 3:13 pm

Post by nyheder »

Earlier this week, Meta landed in hot water for using an experimental, unreleased version of its Llama 4 Maverick model to achieve a high score on a crowdsourced benchmark, LM Arena. The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick. Turns out, it’s not very […]

Source: https://techcrunch.com/2025/04/11/metas ... benchmark/