Tech

Day 11 of 100 Days of AI

Michael Tefula

05 Apr 2024 — 1 min read

Logistic regression continued.

In the lab portion of the intro to ML course today, I went through an exercise of running a logistic regression analysis on fictional customer data. I’ve put the code on Github here.

The model is structured as follows:

logit(p) = -0.2675 + (-0.1526 * tenure) + (-0.0791 * age) + (-0.0721 * address) + (-0.0196 * income) + (0.0519 * ed) + (-0.0950 * employ) + (0.1601 * equip)

A visual representation of the impact of the coefficients on churn is summarized in this chart.

And here’s the performance of the model, illustrated with a confusion matrix.

The basic steps to produce the model were as follows:

Load the dataset from a CSV file.
Select the features we want to use for predictions. These were: tenure, age, address, income, education, employment status, equipment, and churn status.
Preprocesses the data. We did just two bits of preprocessing here: (a) make sure the churn column has just integers and (b) normalize the feature set.
Split the dataset into training and testing sets.
Train a logistic regression model using the training data.
Make predictions on the test data.
Evaluate the performance of the model using a confusion matrix, classification report, and log loss.
I also added a bar graph that charts the coefficients so we can see which features have the greatest impact on churn.

I still find it incredible that if you can write some code, you can build a simple machine learning model with a few lines of code per the example below.

Learn Slow So You Can Move Fast

I learned to code the old-school way: I bought a Python textbook and went through examples and exercises, page by page, writing all the code from scratch. Today, we have AI agents writing code for us. I often use Cursor and LLMs to rapidly generate snippets or whole sections of

The Gen AI Frenzy: What’s Hype, What’s Real, and Where’s the Productivity?

Today, I read two contrasting articles. One posited that we are near the peak of investor hype in Gen AI. It argued that productivity gains from this new technology will be incremental rather than transformative. Another article suggested the opposite. It made the distinction between good bubbles and bad bubbles,

Prompt Engineering: A Surprising Switching Cost of Large Language Models

I've been working on some exceptionally long LLM prompts for a couple of projects at work. I've noticed a fascinating phenomenon: A prompt that works well with one model can diverge in performance when applied to another. This presents switching costs for developers and businesses. You

A Goldilocks Introduction to AI Agents: Opportunities, Challenges, and Everyday Impact

AI agents have quickly evolved from obscure technical experiments to mainstream buzz. In the last 18 months, new automation capabilities have captured the imagination of entrepreneurs and established organizations alike. While it’s possible that we’re in another AI bubble, the early adopters of AI agents are reporting impressive

Read more

Learn Slow So You Can Move Fast

The Gen AI Frenzy: What’s Hype, What’s Real, and Where’s the Productivity?

Prompt Engineering: A Surprising Switching Cost of Large Language Models

A Goldilocks Introduction to AI Agents: Opportunities, Challenges, and Everyday Impact