“Current language models are capable of solving some Humanities Last Exam (HLE) questions that are non-trivial for human PhD researchers.”