documentdb-vector-search
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseVector Search — Azure DocumentDB (cosmosSearch
)
cosmosSearch向量搜索 — Azure DocumentDB(cosmosSearch
)
cosmosSearchAzure DocumentDB's native vector index type is . Pick the sub-type by scale:
cosmosSearch| Index sub-type | Scale sweet spot | Tier |
|---|---|---|
| Up to 500k+ vectors | M30+ |
| Up to ~50k vectors | M30+ |
| Under ~10k vectors | M10+ |
Similarity options: (cosine), (Euclidean), (inner product).
COSL2IPAzure DocumentDB的原生向量索引类型为。可根据规模选择子类型:
cosmosSearch| 索引子类型 | 适用规模区间 | 层级 |
|---|---|---|
| 最多50万+向量 | M30+ |
| 最多约5万向量 | M30+ |
| 少于约1万向量 | M10+ |
相似度选项:(余弦)、(欧几里得)、(内积)。
COSL2IPRules
规则
- vector-choose-index-type — Prefer DiskANN for production; use HNSW up to 50k, IVF under 10k.
- vector-create-diskann-index — Create a index with correct
vector-diskann,dimensions,similarity, andmaxDegree.lBuild - vector-knn-query — Query with +
$search; tunecosmosSearchandlSearch; combine with pre-filters.k - vector-product-quantization — Shrink high-dimensional vectors (up to 16,000 dims) while preserving recall.
- vector-half-precision — Halve vector memory with fp16 indexing and minimal recall loss.
- vector-normalize-embeddings — Normalize embeddings when using cosine similarity; store model + dimensions alongside vectors.
- vector-choose-index-type — 生产环境优先选择DiskANN;向量规模达5万时使用HNSW,少于1万时使用IVF。
- vector-create-diskann-index — 创建索引时需设置正确的
vector-diskann、dimensions、similarity和maxDegree参数。lBuild - vector-knn-query — 使用+
$search进行查询;调优cosmosSearch和lSearch参数;结合预筛选器使用。k - vector-product-quantization — 在保留召回率的前提下压缩高维向量(最高支持16000维)。
- vector-half-precision — 通过fp16索引将向量内存占用减半,且召回率损失极小。
- vector-normalize-embeddings — 使用余弦相似度时需归一化嵌入向量;将模型及维度信息与向量一同存储。