Technical Blogs
Deep dives into computer vision, geometry, and 3D understanding. When something clicks, I try to write it down. These are not tutorials, just my notes on things I finally understood.
The Visual Cortex as a Computer Vision Architecture: Dorsal and Ventral Streams
How the brain's visual cortex uses two parallel streams, feedback loops, and relative perception to process visual input, explained for engineers who build vision systems.
Temporal and Spatial Attention for 3D Pose Estimation
How temporal and spatial attention mechanisms enable accurate 2D-to-3D human pose estimation, with a deep dive into DSTformer.
Building a Sign Language Recognition System with CNNs and OpenCV
A practical walkthrough of building a real-time ASL alphabet recogniser. Dataset, CNN architecture with the actual math, and live webcam inference using OpenCV.
ROC Curves and AUC: Picking the Right Threshold for Your Model
Why accuracy fails on skewed data, how confusion matrices work, and how to use ROC curves and AUC to evaluate and compare probabilistic classifiers.
Vector Database Management Systems: Architecture and Internals
A technical overview of how Vector Database Management Systems work, covering vectorization, indexing, hardware tradeoffs, and the core challenges in building systems that search by meaning rather than exact match.
Probability Mass Functions, Expectation, and Variance
Relearning probability from a machine learning perspective. What PMFs are, how to compute expectation and variance from first principles, and why any of this matters for ML.