The Enigma Eval benchmark is a collection of multi-step puzzles, similar to the MIT Mystery Hunt,..., Sonic AI
“The Enigma Eval benchmark is a collection of multi-step puzzles, similar to the MIT Mystery Hunt, that require group-level intelligence and have a low solve rate for humans.”