fc.embedding.*.
normalize
Normalize embedding vectors to unit length.Column containing embedding vectors.
A column of normalized embedding vectors with the same embedding type.
- Normalizes each embedding vector to have unit length (L2 norm = 1)
- Preserves the original embedding model in the type
- Null values are preserved as null
- Zero vectors become NaN after normalization
Examples
compute_similarity
Compute similarity between embedding vectors using specified metric.Column containing embedding vectors.
Either:
- Another column containing embedding vectors for pairwise similarity
- A query vector (list of floats or numpy array) for similarity with each embedding
The similarity metric to use:
cosine: Cosine similarity (range: -1 to 1, higher is more similar)dot: Dot product similarity (raw inner product)l2: L2 (Euclidean) distance (lower is more similar)
A column of float values representing similarity scores.
- Cosine similarity normalizes vectors internally, so pre-normalization is not required
- Dot product does not normalize, useful when vectors are already normalized
- L2 distance measures the straight-line distance between vectors
- When using two columns, dimensions must match between embeddings
