Building and running AI agents is a messy process. I completed a draft crew of agents that can take a product name and produce a summary of YouTube reviews. You can see a sample output below.
The first image is my terminal command, with the agents running.
Below is the final output, with links to YouTube videos in red.
I also ran the agents (this time without source links) for a review of the BYD Atto 3 electric vehicle. The points below are helpful and are grounded in the agents' research across a number of YouTube reviews.
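For context, here is a minimal sketch of how a crew like this could be wired up with CrewAI. It is my assumption of the setup, not the exact code behind the outputs above: the agent roles, the YouTube search tool, and the task wording are all illustrative.

```python
# Minimal CrewAI-style sketch (assumed setup, not the author's exact code):
# a researcher agent digs through YouTube reviews of a product and a writer
# agent condenses the findings into a short summary.
from crewai import Agent, Task, Crew
from crewai_tools import YoutubeVideoSearchTool  # assumed tool choice

youtube_search = YoutubeVideoSearchTool()

researcher = Agent(
    role="YouTube review researcher",
    goal="Find and extract the key points from YouTube reviews of a product",
    backstory="You dig through video reviews and pull out concrete claims.",
    tools=[youtube_search],
)

writer = Agent(
    role="Review summarizer",
    goal="Condense the research into a short, grounded product summary",
    backstory="You write concise summaries with links to the source videos.",
)

research_task = Task(
    description="Search YouTube for reviews of {product} and list the main points made by reviewers.",
    expected_output="Bullet points of findings, each with a source video link.",
    agent=researcher,
)

summary_task = Task(
    description="Summarize the research on {product} into pros, cons, and an overall verdict.",
    expected_output="A short summary grounded in the researched videos.",
    agent=writer,
)

# Tasks run sequentially by default: research first, then summarization.
crew = Crew(agents=[researcher, writer], tasks=[research_task, summary_task])
result = crew.kickoff(inputs={"product": "BYD Atto 3"})
print(result)
```

Each full run of a crew like this makes several LLM calls per agent, which is where the cost and latency issues below come from.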
The challenges I saw with this process, though, are:
- Costs — about half a dozen full runs of the agents cost me $3.28.
- Inconsistent outputs — Sometimes I get great outputs, and other times the agents fail completely or make up content.
- Slow and inefficient — The agents for this review app can take up to a minute to run fully, and occasionally take unnecessary routes.
All these issues could be fixed with more powerful models, but by that point, would we still need to build the agents ourselves? Or would we just leave a more powerful LLM to figure things out, faster and at a lower cost? That’s the view I wrote about here, and I’m increasingly starting to believe it.