
The author presented a keynote at the AI Engineer World's Fair in San Francisco, discussing recent advancements in Large Language Models (LLMs). They evaluated 30 models released in the last six months, including Llama 4, GPT-4.5, and Claude 3.7 Sonnet, using a custom benchmark to compare their performance.