What are Embeddings? (Vector Representations)
TL;DR: Embeddings convert text, images, or audio into lists of numbers that capture semantic meaning. They power semantic search, recommendations, and RAG systems.
The Core Idea: Meaning as Numbers
An embedding model converts any piece of text into a vector — a list of hundreds or thousands of floating-point numbers. Similar meanings produce similar vectors. "Cat" and "kitten" are close in vector space; "cat" and "democracy" are far apart.
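A minimal sketch of that idea, using tiny hand-made 4-dimensional vectors (real models output hundreds or thousands of dimensions; these toy numbers are invented purely for illustration):

```python
from math import sqrt

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings", hand-made for illustration only. A real embedding
# model would produce these vectors from the words themselves.
cat       = [0.90, 0.80, 0.10, 0.00]
kitten    = [0.85, 0.75, 0.20, 0.05]
democracy = [0.00, 0.10, 0.90, 0.80]

print(cosine_similarity(cat, kitten))     # high: similar meanings
print(cosine_similarity(cat, democracy))  # low: unrelated meanings
```

The specific numbers don't matter; what matters is that a trained model places related concepts near each other, so a simple geometric measure like cosine similarity recovers semantic relatedness.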
How Similarity Search Works
To find documents similar to a query: embed the query → compare it to the document embeddings using cosine similarity → return the top-K closest matches. (At scale, approximate nearest-neighbor indexes such as HNSW avoid comparing against every document.) Because the comparison happens in meaning-space rather than on exact word overlap, this handles paraphrases and synonyms that keyword matching misses.
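The pipeline above can be sketched in a few lines. The document IDs and vectors here are hypothetical stand-ins for pre-computed embeddings:

```python
from math import sqrt

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b)))

def top_k(query_vec, doc_vecs, k=2):
    """Score every document against the query, return the k best (id, score)."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]

# Hypothetical pre-computed document embeddings (toy 3-d vectors).
docs = {
    "doc_pets":    [0.9, 0.1, 0.0],
    "doc_cooking": [0.1, 0.9, 0.1],
    "doc_vets":    [0.7, 0.3, 0.2],
}
query = [0.85, 0.15, 0.05]  # imagine: the embedding of "how to care for a cat"

print(top_k(query, docs, k=2))
```

This brute-force loop is exact and fine for small corpora; the approximate indexes mentioned above trade a little accuracy for sub-linear search time over millions of documents.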
Embeddings vs LLMs
Embedding models (text-embedding-3-large, sentence-transformers) only encode — they map input to a fixed-size vector and generate no text. LLMs generate text. In a RAG system, an embedding model retrieves the relevant documents, then an LLM synthesizes the answer from them.
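That division of labor can be sketched as follows. Both models are stubbed out here — `embed` fakes an embedding with keyword counts, and `generate_answer` just formats a prompt — so this shows the shape of a RAG pipeline, not a real implementation:

```python
import re

DOCS = [
    "A cat should visit the vet once a year.",
    "Cook pasta in salted boiling water.",
]

def embed(text):
    """Stand-in for an embedding model. A real system would call a model
    such as text-embedding-3-large and receive a long vector of floats;
    here we count three keywords to get a tiny toy vector."""
    vocab = ("cat", "vet", "pasta")
    words = re.findall(r"[a-z]+", text.lower())
    return [sum(w == v for w in words) for v in vocab]

def retrieve(query, docs):
    """The embedding model's job: find the most relevant document."""
    q = embed(query)
    return max(docs, key=lambda d: sum(a * b for a, b in zip(q, embed(d))))

def generate_answer(query, context):
    """The LLM's job (stubbed): a real system would send this prompt
    to a chat model and return its completion."""
    return f"Answer '{query}' using only this context: {context}"

context = retrieve("How often should my cat see a vet?", DOCS)
print(generate_answer("How often should my cat see a vet?", context))
```

The key point the sketch illustrates: retrieval and generation are separate models with separate jobs, glued together by the prompt.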
Where Embeddings Power Products You Use
Spotify recommendations, Netflix suggestions, Google semantic search, Notion AI search, GitHub Copilot code completion — all use embeddings under the hood to find semantically similar content.