# FAISS Index Types Guide

Complete guide to choosing and using FAISS index types.

## Index selection guide

| Dataset Size | Index Type | Training | Accuracy | Speed |
|--------------|------------|----------|----------|-------|
| > 10M | IVF+PQ | Yes | 90-95% | Fast, low memory |

## Flat indices (exact search)

### IndexFlatL2 - L2 (Euclidean) distance

```python
import faiss
import numpy as np

d = 128  # Dimension
index = faiss.IndexFlatL2(d)

# Add vectors
vectors = np.random.random((1000, d)).astype('float32')
index.add(vectors)

# Search
k = 5
query = np.random.random((1, d)).astype('float32')
distances, indices = index.search(query, k)
```

**Use when:**
- Dataset 10M vectors)
- Can accept ~90-95% accuracy

### IndexIVFPQ - IVF + PQ combined

```python
# Best for very large datasets
nlist = 4096  # Number of IVF clusters
m = 8         # Number of PQ subquantizers (d must be divisible by m)
nbits = 8     # Bits per subquantizer code

quantizer = faiss.IndexFlatL2(d)
index = faiss.IndexIVFPQ(quantizer, d, nlist, m, nbits)

# Train: requires at least nlist training vectors (many more is better)
index.train(vectors)
index.add(vectors)

# Search
index.nprobe = 32  # Number of clusters visited per query
distances, indices = index.search(query, k)
```

**Use when:**
- Dataset > 10M vectors
- Need fast search + low memory
- Can accept 90-95% accuracy

## GPU indices

### Single GPU

```python
import faiss

# Create CPU index
index_cpu = faiss.IndexFlatL2(d)

# Move to GPU
res = faiss.StandardGpuResources()  # GPU resources
index_gpu = faiss.index_cpu_to_gpu(res, 0, index_cpu)  # GPU 0

# Use normally
index_gpu.add(vectors)
distances, indices = index_gpu.search(query, k)
```

### Multi-GPU

```python
# Use all available GPUs
index_gpu = faiss.index_cpu_to_all_gpus(index_cpu)

# Or specific GPUs
gpus = [0, 1, 2, 3]  # Use GPUs 0-3
# Pass the list by keyword: the second positional argument is the cloner options
index_gpu = faiss.index_cpu_to_gpus_list(index_cpu, gpus=gpus)
```

**Speedup:**
- Single GPU: 10-50× faster than CPU
- Multi-GPU: Near-linear scaling

## Index factory

```python
# Easy index creation with string descriptors (each line builds a different index)
index = faiss.index_factory(d, "IVF100,Flat")
index = faiss.index_factory(d, "HNSW32")
index = faiss.index_factory(d, "IVF4096,PQ8")

# Train (required for IVF and PQ indices) and use
index.train(vectors)
index.add(vectors)
```

**Common descriptors:**
- `"Flat"`: Exact search
- `"IVF100,Flat"`: IVF with 100 clusters
- `"HNSW32"`: HNSW with M=32
- `"IVF4096,PQ8"`: IVF + PQ compression

## Performance comparison

### Search speed (1M vectors, k=10)

| Index | Build Time | Search Time | Memory | Recall |
|-------|------------|-------------|--------|--------|
| Flat | 0s | 50ms | 512 MB | 100% |
| IVF100 | 5s | 2ms | 512 MB | 95% |
| HNSW32 | 60s | 1ms | 1 GB | 99% |
| IVF4096+PQ8 | 30s | 3ms | 32 MB | 90% |

*CPU (16 cores), 128-dim vectors*

## Best practices

1. **Start with Flat** - Baseline for comparison
2. **Use IVF for medium datasets** - Good balance
3. **Use HNSW for best quality** - If memory allows
4. **Add PQ for memory savings** - Large datasets
5. **GPU for > 100K vectors** - 10-50× speedup
6. **Tune nprobe/efSearch** - Trade-off speed/accuracy
7. **Train on representative data** - Better clustering
8. **Save trained indices** - Avoid retraining

## Resources

- **Wiki**: https://github.com/facebookresearch/faiss/wiki
- **Paper**: https://arxiv.org/abs/1702.08734
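
## Memory estimate (sketch)

The Memory column in the performance comparison earlier in this guide can be sanity-checked with simple arithmetic. The following is a rough sketch in plain Python, not a FAISS API: the helper names `flat_bytes` and `pq_bytes` are hypothetical. It counts only the stored vector payload (4 bytes per float32 component for Flat, `m * nbits` bits per vector for PQ codes, rounded up to whole bytes) and ignores vector IDs, coarse centroids, and graph links, so a real index will use more.

```python
def flat_bytes(n: int, d: int) -> int:
    """Raw storage for n float32 vectors of dimension d (hypothetical helper)."""
    return n * d * 4  # 4 bytes per float32 component


def pq_bytes(n: int, m: int, nbits: int = 8) -> int:
    """Raw storage for n PQ codes with m subquantizers of nbits bits each."""
    code_size = (m * nbits + 7) // 8  # bytes per vector, rounded up
    return n * code_size


n, d = 1_000_000, 128
print(f"Flat:      {flat_bytes(n, d) / 1e6:.0f} MB")   # 512 MB
print(f"PQ8 codes: {pq_bytes(n, m=8) / 1e6:.0f} MB")   # 8 MB
```

For 1M 128-dim vectors this gives 512 MB for Flat, which matches the table. The PQ figure covers the codes only, so the larger number listed for IVF4096+PQ8 presumably includes overhead that this sketch does not model.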

Source: claude-code-templates (MIT). See About Us for full credits.
