[FEATURE] Multi-model sparse search ensembling #990

martin-gaievski · 2024-11-16T02:26:08Z

Add support for ensemble-based neural sparse search that combines results from multiple sparse models to improve search quality and robustness.

Motivation

Research shows that ensemble of sparse retrievers provides:

Better generalization across different query types
Improved robustness to different document types
Better effectiveness-efficiency trade-off

Key research:

Proposed Functionality

Ensemble Configuration

PUT _neural/sparse_model/ensemble
{
  "name": "sparse_ensemble",
  "models": [
    {
      "model_id": "splade_v2",
      "weight": 0.6
    },
    {
      "model_id": "unicoil",
      "weight": 0.4
    }
  ],
  "combination_method": "weighted_sum",  // or "max", "mean"
  "cache_policy": {
    "enabled": true,
    "ttl": "1h"
  }
}

Search API

GET my-index/_search
{
  "query": {
    "neural_sparse_ensemble": {
      "query_text": "search query",
      "ensemble_id": "sparse_ensemble",
      "k": 100
    }
  }
}

As shown in the configuration section, we can use caching to improve remote call latency.

The text was updated successfully, but these errors were encountered:

heemin32 · 2024-11-16T04:09:05Z

Does this imply that only the query embedding varies across models? Shouldn't the index embedding also differ for each model?

martin-gaievski added untriaged enhancement labels Nov 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Multi-model sparse search ensembling #990

[FEATURE] Multi-model sparse search ensembling #990

martin-gaievski commented Nov 16, 2024

heemin32 commented Nov 16, 2024

[FEATURE] Multi-model sparse search ensembling #990

[FEATURE] Multi-model sparse search ensembling #990

Comments

martin-gaievski commented Nov 16, 2024

Motivation

Proposed Functionality

Ensemble Configuration

Search API

heemin32 commented Nov 16, 2024