24–09–2023: VectorDB

Adrien
4 min readSep 24, 2023

VectorDB to buy or not to buy

If you work in data science, chances are that you had more than 5 talks from vendors about vector DB about how fast they are, how scalable they are, and how easy they are

The Key question is

Which Vector DB should you buy?

Some History

Realistically 95% of the Data scientists had never used a VectorDB 1 year ago. Vector DBs have nothing new. They have been used for specialized applications for storing vectors. Such examples included recommendation engines, customer behavior can be represented as vectors and typically can leverage nearest neighbors and other vector-based calculations. Another use was embedding representation when features (text or not) are represented as vectors.

Even then very few companies had datasets justifying investing into specialized databases unless the dataset was massive and required fast computation.

Fast forward 2023, and every enterprise wants one…

Technical aspects

Key Features:

Key features of vector DB are only 3. Many vendors added features on top of this.

  • Ability to store a large number of vector objects
  • Ability to connect to embedding model so that when you push data in the vector store, it will get converted to vector and stored as text too (optional)
  • Ability to retrieve vector…

--

--

Adrien

Strategy/Data/Leadership head of DS at OCBC ~~ exTwitter ~~ ex-gojek