[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]  Ethan Mollick [@emollick](/creator/twitter/emollick) on x 281.4K followers Created: 2025-07-24 18:36:52 UTC The mitigating factor for the problem with AI benchmarks (errors, saturation, contamination) is that, despite issues, they are all still fairly heavily correlated. So if your AI does well on GPQA or MMLU or HLE it also tends to do well on other benchmarks & on vibes & real work. XXXXXX engagements  **Related Topics** [coins ai](/topic/coins-ai) [Post Link](https://x.com/emollick/status/1948452336675835967)
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]
Ethan Mollick @emollick on x 281.4K followers
Created: 2025-07-24 18:36:52 UTC
The mitigating factor for the problem with AI benchmarks (errors, saturation, contamination) is that, despite issues, they are all still fairly heavily correlated.
So if your AI does well on GPQA or MMLU or HLE it also tends to do well on other benchmarks & on vibes & real work.
XXXXXX engagements
Related Topics coins ai
/post/tweet::1948452336675835967