Coding agents are getting quite good, and the question everyone asks is: which one should I use? However, agent performance varies considerably by language, task type, and time. When you commit to a single agent, you're predicting it will be best for whatever task you throw at it. That bet might be informed by evals, experience, or word of mouth. But the variance is high enough that ...