Meta’s benchmarks for its new AI models are a bit misleading

Besvar
nyheder
Indlæg: 9969
Tilmeldt: tirs sep 22, 2020 3:13 pm

Meta’s benchmarks for its new AI models are a bit misleading

Indlæg af nyheder »

One of the new flagship AI models Meta released on Saturday, Maverick, ranks second on LM Arena, a test that has human raters compare the outputs of models and choose which they prefer. But it seems the version of Maverick that Meta deployed to LM Arena differs from the version that’s widely available to developers. […]

Source: https://techcrunch.com/2025/04/06/metas ... isleading/
Besvar