Prompt Engineering: A Surprising Switching Cost of Large Language Models

I've been working on some exceptionally long LLM prompts for a couple of projects at work, and I've noticed a fascinating phenomenon: a prompt that works well with one model can perform very differently when applied to another.
This creates a switching cost for developers and businesses. You can spend hours or days crafting a carefully engineered prompt, only to find that moving to another model means investing significant time re-engineering it to avoid a drop in quality.
Here's a simple example I ran this morning comparing responses to the same prompt from ChatGPT, Claude, and Google's Gemini. I gave each model the front page of a newspaper website and asked for a simple sentiment analysis. These responses are from the consumer chat apps, and API responses can vary. Still, it's surprising to see that the Gemini app refused to respond at all.
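The prompt was roughly of this shape. The sketch below is an illustration rather than the exact wording I used, and the headlines are placeholders, not the real Times front page:

```python
# Illustrative only: placeholder headlines and a reconstruction of the kind of
# sentiment-analysis prompt described above, not the exact text I sent.
HEADLINES = [
    "Markets rally after surprise rate decision",
    "Storm damage leaves thousands without power",
    "Local team clinches championship in overtime",
]

PROMPT = (
    "Below are headlines from today's front page of a newspaper website. "
    "Classify the sentiment of each headline as positive, negative, or "
    "neutral, then give an overall sentiment for the page.\n\n"
    + "\n".join(f"- {h}" for h in HEADLINES)
)

print(PROMPT)
```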
Each of these models has a personality of sorts, shaped by whoever created it. That said, API access can give more consistent results, since you can steer the LLM with a system prompt that alters that "personality".
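As a rough sketch of what that looks like in practice, here is the same user prompt sent to two providers with an explicit system prompt. The model names, the system prompt, and the shortened user prompt are all placeholders, and it assumes the official openai and anthropic Python packages are installed with OPENAI_API_KEY and ANTHROPIC_API_KEY set in the environment:

```python
# Sketch: one user prompt sent through two providers' APIs, each with a system
# prompt that steers the model's default behaviour. Model names below are
# placeholders; swap in whatever you actually use.
from openai import OpenAI
import anthropic

SYSTEM = "You are a neutral analyst. Answer directly and do not refuse benign requests."
# Stand-in for the full headline prompt from the earlier sketch.
USER = "Classify the overall sentiment of today's front-page headlines as positive, negative, or neutral."

openai_reply = OpenAI().chat.completions.create(
    model="gpt-4o",  # placeholder model name
    messages=[
        {"role": "system", "content": SYSTEM},
        {"role": "user", "content": USER},
    ],
).choices[0].message.content

anthropic_reply = anthropic.Anthropic().messages.create(
    model="claude-3-5-sonnet-latest",  # placeholder model name
    max_tokens=512,
    system=SYSTEM,  # Anthropic takes the system prompt as a top-level field
    messages=[{"role": "user", "content": USER}],
).content[0].text

print("OpenAI:", openai_reply)
print("Anthropic:", anthropic_reply)
```

Even with an identical user prompt, each provider tends to need its own system prompt tuning, which is exactly the switching cost described above.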
Note: A snippet of the Times page I extracted headlines from.

The LLM summary page: You can view the full version here.
