AI Models Are Getting Smarter. New Tests Are Racing to Catch Up

AI Models Are Getting Smarter. New Tests Are Racing to Catch Up

11 views
1 min read

By Tharin Pillay December 24, 2024 10:05 AM EST Despite their expertise, AI developers don’t always know what their most advanced systems are capable of—at least, not at first. To find out, systems are subjected to a range of tests—often called evaluations, or ‘evals’—designed to tease out their limits. But due to rapid progress in the field, today’s systems regularly achieve top scores on many popular tests, including SATs and the U.S. bar exam, making it harder to judge just how quickly they are improving. A new set of much more challenging evals has emerged in response, created by companies, nonprofits, and governments. Yet even on the most advanced evals, AI systems are making astonishing progress. In November, the nonprofit research institute Epoch AI announced a set of exceptionally […]

Latest from Blog

withemes on instagram