
Day 5: Sliding Chunks, Token Costs & Processing Real PDFs
Before we get into PDF processing, two more chunking strategies are worth meeting first. Those are sliding chunking and token-based chunking. And then...
Articles, tutorials, and insights

Before we get into PDF processing, two more chunking strategies are worth meeting first. Those are sliding chunking and token-based chunking. And then...

After spending the last few weeks documenting my RAG self-study journey (you can find those posts under the RAG section), I have decided to dive into ...

This is the intro post for my 22 days of Machine Learning self-study series. The plan was simple. To understand the building blocks of classical ML in...

This blog post is a daily learning summary of my 40-day RAG class from Syed Jaffer of Parotta Salna.Small change of plan. Yesterday I told you today i...

Today, we zoom in on the step that happens before embedding: chunking. It quietly decides whether your RAG system is amazing or unusable.You can have ...

This blog post is a daily learning summary of my 40 Day RAG class from Syed Jaffer of Parotta Salna.Why Keyword Search Isn't EnoughYour knowledge base...

This blog post is a daily learning summary of my 40 Day RAG class from Syed Jaffer of Parotta Salna. Try asking ChatGPT:"What were my company's Q3 sal...

I just finished watching Krish Dinesh’s excellent video “Fuel QR System Failure Explained | Why Simple Systems Fail at Scale”, and I couldn’t stop thi...

I’ve always been the guy who gets excited about way too many things.One month, I’m deep into coding. Building React apps, experimenting with new JS li...

The Strands Agents TypeScript SDK lets you build multi-agent AI systems on Amazon Bedrock using typed tool definitions with Zod, async iterator stream...

இன்று (13/02/2026)அழகான திரைப்படம். மனதில் ஓர் மெல்லிய உணர்வு இறுதி வரை இருந்தது. படம் முடிந்த பின்னரும் அது தொடர்கிறது.படத்தின் இறுதியில் இரண்டு 'Yes...

AI didn’t suddenly wake up one day and become “agentic”.What we’re seeing now as autonomous systems that can plan, act, and operate across tools are t...

I built a book tracking app with AWS Amplify Gen 2. It cost me around $0.60/month to run.This isn't a perfect tutorial. Just sharing what worked and w...

Hi everyone,In this article, I’ll share what I’ve learnt about prompt injection. As developers, most of us are familiar with classic vulnerabilities l...

இனிய வணக்கம்,இன்றுடன் 2025 க்கு நன்றிகள் சொல்லி விடையனுப்பும் நாள். ஆண்டின் இறுதி நாள், சற்றே ஓய்வாக இந்த ஆண்டில் நான் செய்தது, இந்த ஆண்டு எனக்கு செய்...

The Problem That Started It AllIf you had to go through each word in order and couldn’t move on to the next one until you fully understood the one bef...

When your DigitalOcean volume runs out of space, here’s the quickest way to expand it. In most cases, you only need one command after increasing the s...

In this post, I would like to share my understanding and the summary I got from the latest research paper published by OpenAI on “Why Language Model H...

For a long time, I used ChatGPT like Google, asking random questions, copying answers, and moving on.It felt productive, but my learning was shallow.O...

In this article, we are going to deploy a web application on AWS serverless, which means we don’t have to manage servers at all, no scaling of servers...

Machine learning often feels full of abstract formulas, but what if we explain it through something familiar? Imagine you’re running a popular tea sho...

The world of software development has changed dramatically. Teams are shipping code faster than ever; sometimes every day, sometimes every hour. Using...

Imagine you have a big box of Legos. With these Legos, you can build lots of different things, like houses, cars, and even spaceships. But to build th...

நீண்ட இடைவெளிக்குப் பிறகு… மீண்டும் எழுத ஆரம்பிக்கிறேன்.முன்பு பல்வேறு blogging platforms-ல நான் எழுதினேன். Medium, மற்ற சில இடங்கள் எல்லாம் ஒரு கட்டத...