Day 4 of 100 Days of AI

Today, I went through a classifier lab in the intro ML course. There were several bits I didn’t quite understand, but GPT helped me get over the basics. For example, I’ll need to review my notes on the Jaccard Index and F1-score (evaluation metrics for classifier models), and on the concept of normalisation, where you rescale your data to a common range without changing the shape of its distribution. This puts features on comparable scales so that distances between points are meaningful, a critical step when trying to make classification predictions.

On the latter point, I’ve included some charting code in the GitHub repo here (see image below), which helped me understand the normalisation concept. The charting code was written by GPT, with some minor tweaks from me.
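The charting code itself lives in the repo, but here’s a minimal sketch of the underlying idea. The customer data and the `min_max_normalise` helper are my own illustration (not from the lab): before normalising, the large-scale feature (income) completely dominates the distance calculation; afterwards, both features contribute on an equal footing.

```python
import numpy as np

# Hypothetical customers described by (age, annual income in £).
# The raw features live on very different scales.
a = np.array([25, 45_000])
b = np.array([60, 47_000])
c = np.array([27, 90_000])

def min_max_normalise(X):
    """Rescale each column to the [0, 1] range. The shape of each
    feature's distribution is preserved; only the scale changes."""
    return (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))

X = np.array([a, b, c])
X_norm = min_max_normalise(X)

# Raw distances: income swamps age, so a looks far closer to b
# (similar income) despite their 35-year age gap.
print(np.linalg.norm(a - b), np.linalg.norm(a - c))

# Normalised distances: age and income now carry similar weight.
print(np.linalg.norm(X_norm[0] - X_norm[1]),
      np.linalg.norm(X_norm[0] - X_norm[2]))
```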

Key takeaways:

  • Classification is a supervised machine learning approach.
  • It predicts which discrete class an item belongs to.
  • Classifiers can be used for spam detection, document classification, speech recognition, or even to predict if a certain customer will churn, based on a variety of characteristics.
  • Classification algorithms include k-nearest neighbours (which I’ve put on GitHub here), decision trees, and logistic regression (which, instead of assigning an item to a class, gives you the probability that it belongs to a particular class).
  • The k-nearest neighbours algorithm was fun to learn about, and the intuition for it is simpler than I expected. The basic notion is as follows: for a given item you want to predict on, look at a select number of neighbours (the k value), and predict the most popular category among those neighbours (or, when predicting a continuous value such as house price from location, square footage, etc., take the mean or median of the neighbours’ values). There’s a toy sketch of this after the list.
  • Classification algorithms can be evaluated with a number of accuracy measures, such as the Jaccard Index, the F1-score, or log loss (see the metrics snippet below). I didn’t cover these in detail, but I did enough to get the very basics.
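Here’s the toy KNN sketch mentioned above. It’s not the version in my repo, just a from-scratch illustration of the majority-vote intuition, with made-up (already-normalised) data and a hypothetical churn label:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    """Predict the class of x_new by majority vote among its
    k nearest training points (Euclidean distance)."""
    distances = np.linalg.norm(X_train - x_new, axis=1)
    nearest = np.argsort(distances)[:k]           # indices of the k closest points
    votes = Counter(y_train[i] for i in nearest)  # tally the neighbours' labels
    return votes.most_common(1)[0][0]             # most popular category wins

# Toy data: two features, two classes.
X_train = np.array([[0.1, 0.2], [0.2, 0.1], [0.8, 0.9], [0.9, 0.8]])
y_train = np.array(["stays", "stays", "churns", "churns"])

print(knn_predict(X_train, y_train, np.array([0.85, 0.85]), k=3))  # -> "churns"
```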
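And since I only scratched the surface of the evaluation metrics, here’s a quick look at how scikit-learn exposes all three. The labels and probabilities below are made up purely for illustration:

```python
from sklearn.metrics import jaccard_score, f1_score, log_loss

# Hypothetical true vs. predicted labels for a binary classifier.
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

# Predicted probabilities of the positive class, needed for log loss.
y_prob = [0.9, 0.2, 0.8, 0.4, 0.1, 0.7, 0.6, 0.3]

print(jaccard_score(y_true, y_pred))  # overlap between predicted and actual positives
print(f1_score(y_true, y_pred))       # harmonic mean of precision and recall
print(log_loss(y_true, y_prob))       # penalises confident wrong probabilities
```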