My career began in Natural Language Processing, and I’ve studied the mathematics and theories behind AI extensively. So when ChatGPT took the world by storm, I approached it with skepticism. I viewed Large Language Models (LLMs) as sophisticated “next word predictors” – black boxes that ingest vast amounts of data and generate plausible word sequences from initial prompts, and felt that The hype surrounding LLMs seemed overblown. I struggled to see how these “predictors” could rev...