Abstract: Recently, researchers in the field of math word problem (MWP) solving have reported performance metrics for various large language models (LLMs) on benchmark datasets, with some models ...
An engineer for New York Times Games has been trying to teach artificial intelligence to understand wordplay more like a human. By Shafik Quoraishee Shafik Quoraishee is a machine-learning engineer ...
Undergraduate students across North America sat down on Saturday to write a grueling six-hour math exam, many of them unlikely to solve a single problem. The notoriously brutal William Lowell Putnam ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results