EDA in Public (Part 3): RFM Analysis for Customer Segmentation in Pandas

If you’ve been following along, we’ve come a long way. In Part 1, we did the “dirty work” of cleaning and prepping. In Part 2, we zoomed out to a high-altitude view of NovaShop’s world, spotting the big storms (high-revenue countries) and the seasonal patterns (the massive Q4 rush). But here’s the thing: a business …

10 Lesser-Known Python Libraries Every Data Scientist Should Be Using in 2026

Contents: Introduction; 1. Pandera; 2. Vaex; 3. Pyjanitor; 4. D-Tale; 5. Sweetviz; 6. cuDF; 7. ITables; 8. GeoPandas; 9. tsfresh; 10. ydata-profiling (pandas-profiling); Wrapping Up

Introduction: As a data scientist, you’re probably already familiar with libraries like NumPy, pandas, scikit-learn, and Matplotlib. But the Python ecosystem is vast, and there are plenty of lesser-known libraries that can help you make your data science …

FACTS Benchmark Suite: a new way to systematically evaluate LLMs factuality

Large language models (LLMs) are increasingly becoming a primary source for information delivery across diverse use cases, so it’s important that their responses are factually accurate. In order to continue improving their performance on this industry-wide challenge, we have to better understand the types of use cases where models struggle to provide an accurate response …

5 Fun Docker Projects for Absolute Beginners

Contents: Introduction; 1. Static Website Hosting with Nginx and Docker; 2. Multi-Container Docker Applications with Docker Compose; 3. One Database Shared by Multiple Containers; 4. Automated Continuous Integration with Jenkins in Docker; 5. Logging and Monitoring Docker; Wrapping Up

Introduction: Before jumping into the projects, let’s clear up what Docker is and why people care about it. Docker …

Top 7 Open Source OCR Models

Contents: Introduction; 1. olmOCR 2 7B 1025; 2. PP OCR v5 Server Det; 3. OCRFlux 3B; 4. MiniCPM-V 4.5; 5. InternVL 2.5 4B; 6. Granite Vision 3.3 2b; 7. TrOCR Large Printed; Summary

Introduction: OCR (Optical Character Recognition) models are gaining new recognition every day. I am seeing new open-source models pop up on Hugging Face that have crushed previous …

The Machine Learning “Advent Calendar” Day 23: CNN in Excel

CNNs were first introduced for images, and for images they are often easy to understand. A filter slides over pixels and detects edges, shapes, or textures. You can read this article I wrote earlier to understand how CNNs work for images with Excel. For text, the idea is the same. Instead of pixels, we slide filters …
