A research team affiliated with UNIST has unveiled a novel AI system capable of grading and providing detailed feedback on ...
SolidGeo is the first large-scale benchmark specifically designed to evaluate the performance of MLLMs on mathematical reasoning tasks in solid geometry. SolidGeo consists of 3,113 real-world K–12 and ...