The best-performing AI can now achieve 97.1% on GSM8K (Zhong et al., 2024), up from 74.4% in April 2022 (Wang et al., 2022), and 87.9% on MATH (Lei et al., 2024), up from 64.9% in ...
When most people hear the term ‘artificial intelligence’ (AI), they picture self-driving cars, voice assistants like Siri and Alexa, or chatbots that generate text on demand. These ...