Discussion about this post

Jurgen Gravestein

Great piece summarizing the core limitations of LLMs.

The incentive for AI companies to keep making progress is so strong that benchmaxxing (training to the test) is too tempting to resist. And you know what they say: when a metric becomes a target, it ceases to be a good metric.

The same is true for benchmarks.
