
Day 22: Data Leakage, ML Workflow & Interview Cheat Sheet
Twenty-one days, fifteen algorithms, six fundamentals. Today we tie everything together: the end-to-end workflow we would actually use in a job, the s...
Parathan ThiyagalingamArticles, tutorials, and insights

Twenty-one days, fifteen algorithms, six fundamentals. Today we tie everything together: the end-to-end workflow we would actually use in a job, the s...
Parathan Thiyagalingam
Three days of clustering grouped rows together. Today we tackle the opposite problem, which is too many columns. PCA is the most famous tool for shrin...
Parathan Thiyagalingam
K-Means and Hierarchical clustering both assume our groups are roughly globular. But what about ring-shaped clusters? Crescent moons? Data scattered w...
Parathan Thiyagalingam
K-Means demanded that we know K up front. Today's algorithm is the opposite. It builds a tree of every possible clustering, from "everything is one bi...
Parathan Thiyagalingam
Seventeen days of supervised learning, where every example came with an answer attached. Today we cross into a different world. The model has to find ...
Parathan Thiyagalingam
Our last supervised algorithm. Before deep learning conquered everything in the 2010s, SVMs were the king of classification. They are still genuinely ...
Parathan Thiyagalingam
Random Forest trained 500 trees independently and averaged them. Today we meet a smarter idea. Train trees one at a time, with each new tree focused o...
Parathan Thiyagalingam
A single decision tree is fragile. Today we fix that with one of the most famous ideas in ML: instead of trusting one tree, trust a thousand. Random F...
Parathan Thiyagalingam
If we have ever played the game Twenty Questions, we have already done decision trees in our head. Today we automate that intuition. Decision Trees ar...
Parathan Thiyagalingam