Artificial intelligence (AI) is increasingly asked not just to generate answers but also to evaluate them. Whether the task is deciding which chatbot response is better, assessing scientific claims, or even grading essays, AI systems that use large language models (LLMs) are being…