One term that I have been hearing a lot lately is reward hacking. I have heard it multiple times from folks at OpenAI and Anthropic, and it represents a fundamental challenge in AI alignment…| Shekhar Gulati
I was listening to a talk by Anthropic folks on Claude Code. In the talk, the speaker was asked why they built Claude Code as a CLI tool instead of an IDE. They gave two reasons: Claude Code is built by Anthr…| Shekhar Gulati
Mistral released a new model yesterday. It is designed to excel at agentic coding tasks, meaning it can use tools. It is released under the Apache 2.0 license. It is fine-tuned from Mistral-Small-3.1; therefore it has …| Shekhar Gulati