Michael Tefula (Page 2)

3 Learnings from Building a Weekend App with AI

This past long weekend (thanks, Easter break) I dug into an idea I had knocking around. I read Hacker News religiously and the lengthy comments sections can be as insightful as the articles they reference. However, the most interesting stories on Hacker News often have too many comments to digest.

Learn Slow So You Can Move Fast

I learned to code the old-school way: I bought a Python textbook and went through examples and exercises, page by page, writing all the code from scratch. Today, we have AI agents writing code for us. I often use Cursor and LLMs to rapidly generate snippets or whole sections of

The Gen AI Frenzy: What’s Hype, What’s Real, and Where’s the Productivity?

Today, I read two contrasting articles. One posited that we are near the peak of investor hype in Gen AI. It argued that productivity gains from this new technology will be incremental rather than transformative. Another article suggested the opposite. It made the distinction between good bubbles and bad bubbles,

Prompt Engineering: A Surprising Switching Cost of Large Language Models

I've been working on some exceptionally long LLM prompts for a couple of projects at work. I've noticed a fascinating phenomenon: A prompt that works well with one model can diverge in performance when applied to another. This presents switching costs for developers and businesses. You

A Goldilocks Introduction to AI Agents: Opportunities, Challenges, and Everyday Impact

AI agents have quickly evolved from obscure technical experiments to mainstream buzz. In the last 18 months, new automation capabilities have captured the imagination of entrepreneurs and established organizations alike. While it’s possible that we’re in another AI bubble, the early adopters of AI agents are reporting impressive

Open Problems (& Opportunities) for AI: Summary Notes from the Conference

Last week I attended the Algorithmic Innovation & Entrepreneurship Summit on Open Problems for AI. That’s a bit of a mouthful but in essence, it was a 2-day conference covering the challenges and opportunities for AI over the next 5 years. Day 1 was focussed on algorithms and AI

The Software Engineer is Dead, Long Live the Software Engineer!

An undeniable killer use case for large language models is code generation. It’s already helping software developers write code up to twice as fast and with more delight. As LLM models improve with each release, they can write better software and handle more complex architectures at a fraction of

Day 100 of 100 Days of AI 🎉

AI agents have been getting a lot of attention but they can be expensive to run. This is because they use a lot of tokens and make multiple calls to LLMs. Some AI developers have beat performance benchmarks by using exceptionally expensive agentic runs. However, this is unrealistic because in

Day 99 of 100 Days of AI

I switched one of my AI agents projects from GPT-4o to the new Claude 3.5 Sonnet model. The agents now run with a cost that’s roughly 85% cheaper and the final results arrive about 15% faster. To upgrade my AI agents project, all I had to do was

Day 98 of 100 Days of AI

On the way home today I was listening this podcast — “Lessons from a Year of Building with LLMs“. The biggest takeaway? While everyone has been building lots of experimental projects with LLMs and often shipping to production scrappy builds of seemingly impressive demos (I’m guilty of this!), few people

Day 97 of 100 Days of AI

AI Agents are a mixed bag. I tried this morning to get a group of CrewAI agents to complete some Youtube summaries and they kept failing or going off the rails. My deterministic version which I hard coded works significantly better than leaving things to a group of autonomous agents.

Day 96 of 100 Days of AI

There’s an article in the Wall Street Journal I came across today with the headline below. The article ends with a quote from a Google Cloud Chief Evangelist, Richard Seroter: “If you don’t have your data house in order, AI is going to be less valuable than it

Tech

Day 95 of 100 Days of AI

Claude AI is really good at coding. In one shot, I asked it to create streamlit app that I can use to chat to my PDFs with a local, offline, large language model. The first attempt had a roughly working version and with just 2 more follow-on prompts and refinement,

Tech

Day 94 of 100 Days of AI

Just 18 months ago, OpenAI released GPT-3.5 Turbo which had double the input token context window of its predecessor, GPT-3. We went from 2048 tokens to 4096 tokens and that felt like a significant leap. But today, we are enjoying context windows of 128,000 tokens with GPT4o. How

Tech

Day 93 of 100 Days of AI

A couple of experiments today: I completed a simple tutorial that helps you build a RAG app using local models. I also installed and tested https://openwebui.com/. This is a really cool open source project that gives you a ChatGPT-style interface to use with large language models that run

Tech

Day 92 of 100 Days of AI

I’ve just finished listening to this podcast with the founders of https://www.factory.ai/. Their startup automates away the drudgery bits of work in software engineering. That includes writing tests, debugging, documentation, and migrations. I found the views of the founders compelling. For example, they don’t think

Tech

Day 91 of 100 Days of AI

I finally have a Youtube video summarizer working! I’ve deployed the app with Streamlit.io. On the backend I’m using the Gemini 1.5 Flash LLM because it’s cheap and fast! The summaries are “chained” together using Langchain. A 20 minute video can be summarized in around

Tech

Day 90 of 100 Days of AI

I’ve been testing out the new Claude LLM and it’s better at coding than GPT4o. I might actually start using it more given how fast and effective it is. You can see the benchmarks below that Anthropic provided. They probably have some bias as they are self-published but

Tech

Day 89 of 100 Days of AI

Today I got a Youtube summarizer script working! I can input a Youtube URL and I get useful summary of the video using the Langchain “refine” documents chain. I really like this because iterates over portions of text and it refines the summary until something more thorough is generated. I’

Tech

Day 88 of 100 Days of AI

This evening I experimented with Langchain’s summarisation frameworks. This is something LLMs are great at natively, but with Langchain, you can use even more sophisticated summarisation techniques. Here’s a GPT-generated summary of these techniques based on the Langchain documentation: From ChatGPT4o: 1. Stuff Method * Concept: Simply concatenate all

Tech

Day 87 of 100 Days of AI

Given the note I wrote yesterday about the need to be “programmatically” lazy, I’m rewriting my Youtube reviewer system. I’m building it initially without agents, and I’ll only add an agentic layer in areas that are more unstructured. I’m not yet sure what areas will have

Tech

Day 86 of 100 Days of AI

The temptation with any new powerful tool is to apply it in more places than is necessary. What’s that saying…”to a man with a hammer, every problem is a nail”? The same appears to be happening with large language model frameworks and AI agent frameworks. I’ve experienced

Tech

Day 85 of 100 Days of AI

One of the pioneers of how large language models are trained and fine-tuned, Jeremy Howard, has a great article here on “What policy makers need to know about AI (and what goes wrong if they don’t)”. The article is actually very accessible to everyday readers. Here’s an excerpt

Tech

Day 84 of 100 Days of AI

I tested out a few AI agents today with the goal of optimising on cost, but they remain expensive to run if you have tasks that involve multi-step reasoning with chunky bits of content. One group of tasks that I gave to GPT4o cost $0.15 and took 2 minutes

Latest

3 Learnings from Building a Weekend App with AI

Learn Slow So You Can Move Fast

The Gen AI Frenzy: What’s Hype, What’s Real, and Where’s the Productivity?

Prompt Engineering: A Surprising Switching Cost of Large Language Models

A Goldilocks Introduction to AI Agents: Opportunities, Challenges, and Everyday Impact

Open Problems (& Opportunities) for AI: Summary Notes from the Conference

The Software Engineer is Dead, Long Live the Software Engineer!

Day 100 of 100 Days of AI 🎉

Day 99 of 100 Days of AI

Day 98 of 100 Days of AI

Day 97 of 100 Days of AI

Day 96 of 100 Days of AI

Day 95 of 100 Days of AI

Day 94 of 100 Days of AI

Day 93 of 100 Days of AI

Day 92 of 100 Days of AI

Day 91 of 100 Days of AI

Day 90 of 100 Days of AI

Day 89 of 100 Days of AI

Day 88 of 100 Days of AI

Day 87 of 100 Days of AI

Day 86 of 100 Days of AI

Day 85 of 100 Days of AI

Day 84 of 100 Days of AI