…models (such as ChatGPT) using your math and analytical skills. You’ll design problems, evaluate how well the models solve them, and work with researchers to build better benchmarks.
Responsibilities:
Design advanced math problems to test AI performance (e.g., multi-step…