# Matryoshka (Adaptive-Length) Embeddings
Matryoshka embeddings are a class of embedding models introduced in the
2022 paper [_Matryoshka Representation Learning_](https://arxiv.org/abs/2205.13147). They allow one
to truncate excess dimensions in a large vector without sacrificing much quality.
Let's say your embedding model generates 1024-dimensional vectors. If you have 1
million of these 1024-dimensional vectors, they would take up `4.096 GB` of
space! You typically can't just drop dimensions without losing a lot of
quality - if you were to naively truncate them to 512-dimensional vectors,
you could expect to lose 50% or more of the quality of your results. There are
other dimensionality-reduction techniques, like [PCA](#TODO) or [Product Quantization](#TODO), but they typically require
complicated and expensive training processes.
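The `4.096 GB` figure above is simple arithmetic: each dimension is a 4-byte `float32`. A quick check in Python (ignoring any file-format overhead):

```python
# Each float32 dimension takes 4 bytes.
num_vectors = 1_000_000
dimensions = 1024
bytes_per_dimension = 4

total_bytes = num_vectors * dimensions * bytes_per_dimension
print(total_bytes / 1e9)  # → 4.096 (GB, decimal units)
```

Truncating to 256 dimensions would shrink that to a quarter of the size, `1.024 GB`.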

Matryoshka embeddings, on the other hand, _can_ be truncated without losing much
quality. [`mixedbread.ai`](#TODO)'s `mxbai-embed-large-v1` model, for example, was
trained with this objective, so its 1024-dimensional vectors can be truncated to
smaller sizes while preserving most of their retrieval quality.

They are called "Matryoshka" embeddings because, like Russian matryoshka nesting
dolls, smaller embeddings are nested inside larger ones: the model is trained to
concentrate the most important information in the earliest dimensions, so a
prefix of the full vector is itself a usable embedding.
## Matryoshka Embeddings with `sqlite-vec`
You can use a combination of [`vec_slice()`](../api-reference.md#vec_slice) and
[`vec_normalize()`](../api-reference.md#vec_normalize) to truncate Matryoshka
embeddings.
```sql
select
  vec_normalize(
    vec_slice(title_embeddings, 0, 256)
  ) as title_embeddings_256d
from vec_articles;
```
[`vec_slice()` ](../api-reference.md#vec_slice ) will cut down the vector to the first 256 dimensions. Then [`vec_normalize()` ](../api-reference.md#vec_normalize ) will normalize that truncated vector, which is typically a required step for Matryoshka embeddings.
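Outside of SQL, the same slice-then-normalize step is easy to reproduce. Here is a minimal Python sketch (plain stdlib, no `sqlite-vec` required) of what these two functions do:

```python
import math

def vec_slice(vector, start, end):
    # Keep dimensions [start, end), like sqlite-vec's vec_slice().
    return vector[start:end]

def vec_normalize(vector):
    # Scale the vector to unit (L2) length, like vec_normalize().
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]

# Truncate a toy 8-dimensional "embedding" down to 4 dimensions.
embedding = [0.5, 0.5, 0.5, 0.5, 0.01, 0.02, 0.03, 0.04]
truncated = vec_normalize(vec_slice(embedding, 0, 4))
print(truncated)  # → [0.5, 0.5, 0.5, 0.5]
```

After re-normalizing, cosine distance between truncated vectors behaves the same as it does for full-size unit vectors, which is why the normalization step matters.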
## Benchmarks
## Supported Models

Per [this Supabase post](https://supabase.com/blog/matryoshka-embeddings#which-granularities-were-openais-text-embedding-3-models-trained-on):

- `text-embedding-3-small`: 1536, 512
- `text-embedding-3-large`: 3072, 1024, 256

Per [this thread](https://x.com/ZainHasan6/status/1757519325202686255):

- `text-embedding-3-large`: 3072, 1536, 1024, 512

Per [mixedbread.ai's binary MRL post](https://www.mixedbread.ai/blog/binary-mrl):

- `mxbai-embed-large-v1`: 1024, 512, 256, 128, 64
- `nomic-embed-text-v1.5`: 768, 512, 256, 128, 64
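When choosing a truncation size, stick to granularities the model was trained on. As a rough illustration of why Matryoshka truncation works, here is a hedged sketch with toy stand-in vectors (not real model output) whose informative values are concentrated in the earliest dimensions:

```python
import math

def cosine(a, b):
    # Cosine similarity between two vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins for Matryoshka embeddings: most signal in early dims.
full_a = [0.9, 0.8, 0.1, 0.05]
full_b = [0.85, 0.75, -0.1, 0.02]

# Compare similarity at full size vs. a truncated prefix (here 2 dims).
sim_full = cosine(full_a, full_b)
sim_trunc = cosine(full_a[:2], full_b[:2])
```

Because the early dimensions carry most of the signal, the truncated similarity stays close to the full-size one; with real Matryoshka models you would verify this on your own retrieval benchmarks.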
```
# TODO new snowflake model
```