Welcome to @naskovai’s blog

Negative Sampling for Embedding-Based Retrieval: An Overview

How production retrieval systems learn to rank a billion items, tracing the evolution of negative sampling from random batches through hard mining, bias correction, and ANCE.

March 26, 2026 4366 words 21 min

Generative Recommendations: A Mechanistic Guide

A mechanistic deep dive into how generative recommender systems work: from Semantic IDs and RQ-VAE to HSTU, M-FALCON, and production deployment at Meta, Kuaishou, and beyond.

March 25, 2026 15746 words 74 min

From RMSProp to AdamW: The Optimizer Evolution Story

Tracing the evolution of modern neural network optimizers through the lens of what each was designed to fix: gradient scale heterogeneity, mini-batch noise, and regularization interference.

August 25, 2025 2644 words 13 min

The Sandwich Framework for Understanding Linear Algebra

Coordinate Translations, Scaling, and State Transitions - A unified approach to linear algebra decompositions