Deep Dives

The Visual Cortex as a Computer Vision Architecture: Dorsal and Ventral Streams

How the brain's visual cortex uses two parallel streams, feedback loops, and relative perception to process vision. Everything rewritten for engineers who build vision systems.

Temporal and Spatial Attention for 3D Pose Estimation

How temporal and spatial attention mechanisms enable accurate 2D-to-3D human pose estimation, with a deep dive into DSTformer.

Building a Sign Language Recognition System with CNNs and OpenCV

A practical walkthrough of building a real-time ASL alphabet recogniser. Dataset, CNN architecture with the actual math, and live webcam inference using OpenCV.

ROC Curves and AUC: Picking the Right Threshold for Your Model

Why accuracy fails on skewed data, how confusion matrices work, and how to use ROC curves and AUC to evaluate and compare probabilistic classifiers.

Vector Database Management Systems: Architecture and Internals

A technical overview of how Vector Database Management Systems work, covering vectorization, indexing, hardware tradeoffs, and the core challenges in building systems that search by meaning rather than exact match.

Probability Mass Functions, Expectation, and Variance

Relearning probability from a machine learning perspective. What PMFs are, how to compute expectation and variance from first principles, and why any of this matters for ML.