As AI models increasingly ace conventional tests, researchers are looking for new benchmarking methods. Google is betting on games.
As AI models increasingly ace conventional tests, researchers are looking for new benchmarking methods. Google is betting on games.