
Humanity's Last Exam: The AI Benchmark for LLM Reasoning
Learn about Humanity's Last Exam (HLE), the Nature-published AI benchmark testing true LLM reasoning with 2,500 expert-level questions. Updated with 2026 leaderboard scores from GPT-5, Claude Opus, and Gemini 3.