Paper Notes: HAKES — High Availability Key-value Engine for Search
HAKES is a vector search system designed to scale to billions of vectors while maintaining sub-millisecond tail latencies. Unlike monolithic vector databases, HAKES adopts a disaggregated architecture that separates storage, indexing, and search layers to achieve high availability and seamless horizontal scaling. TL;DR HAKES addresses the limitations of traditional vector databases—specifically resource contention (“heat”) and the high cost of graph-based indices. By employing a two-stage Filter-and-Refine architecture based on IVF + PQ, it offloads persistent data to Cloud Storage and uses a unified cluster management approach to handle planet-scale workloads. ...