@naskovai

Large ID Embedding Tables in Recommendation Systems

Everything that breaks when you put a 450-million-row lookup table at the center of your model, and how the industry fixes it: cardinality, the one-epoch problem, transfer and enrichment, and distributed serving.

Generative Recsys in Production: Three Lessons from Shopify's Commerce Engine

What Shopify's production generative recommender reveals about building on HSTU: time encoding for seasonality, negative sampling as the primary scaling lever, and training for incremental recall within an ensemble.

Negative Sampling for Embedding-Based Retrieval: An Overview

How production retrieval systems learn to rank a billion items, tracing the evolution of negative sampling from random batches through hard mining, bias correction, and ANCE.

Generative Recommendations: A Mechanistic Guide

A mechanistic deep dive into how generative recommender systems work: from Semantic IDs and RQ-VAE to HSTU, M-FALCON, and production deployment at Meta, Kuaishou, and beyond.

From RMSProp to AdamW: The Optimizer Evolution Story

Tracing the evolution of modern neural network optimizers through the lens of what each was designed to fix: gradient scale heterogeneity, mini-batch noise, and regularization interference.

The Sandwich Framework for Understanding Linear Algebra

Coordinate Translations, Scaling, and State Transitions - A unified approach to linear algebra decompositions