Meta’s vanilla Maverick AI model ranks below rivals on a popular chat benchmark

nyheder · Indlæg af **nyheder** » fre apr 11, 2025 10:46 pm

Earlier this week, Meta landed in hot water for using an experimental, unreleased version of its Llama 4 Maverick model to achieve a high score on a crowdsourced benchmark, LM Arena. The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick. Turns out, it’s not very […]

Source: https://techcrunch.com/2025/04/11/metas ... benchmark/