HuggingFaceTEIRanker
Ranks documents based on their semantic similarity to the query.
Bear with us while we're working on adding pipeline examples and most common components connections.
Key Features
- Reranks documents by semantic similarity to the query using a Text Embeddings Inference (TEI) service.
- Works with self-hosted TEI services or Hugging Face Inference Endpoints.
- Configurable
top_kto limit the number of returned documents. - Built-in retry logic with configurable retry status codes.
- Optional token authentication for secured TEI endpoints.
Configuration
- Drag the
HuggingFaceTEIRankercomponent onto the canvas from the Component Library. - Click the component to open the configuration panel.
- On the General tab:
- Enter the
urlof your TEI reranking service (for example,https://api.example.com).
- Enter the
- Go to the Advanced tab to configure the token, top_k, raw scores, timeout, max retries, and retry status codes.
Connections
HuggingFaceTEIRanker accepts a query string (query) and a list of Document objects (documents) as inputs. It outputs a list of reranked documents (documents) and optionally supports a top_k override at run time.
In a query pipeline, connect a retriever's documents output to the documents input, and connect the reranked documents output to a prompt builder or answer builder.
Usage Example
components:
HuggingFaceTEIRanker:
type: components.rankers.hugging_face_tei.HuggingFaceTEIRanker
init_parameters:
Parameters
Inputs
| Parameter | Type | Default | Description |
|---|---|---|---|
| query | str | The user query string to guide reranking. | |
| documents | List[Document] | List of Document objects to rerank. | |
| top_k | Optional[int] | None | Optional override for the maximum number of documents to return. |
| truncation_direction | Optional[TruncationDirection] | None | If set, enables text truncation in the specified direction. |
Outputs
| Parameter | Type | Default | Description |
|---|---|---|---|
| documents | List[Document] | A dictionary with the following keys: - documents: A list of reranked documents. |
Init Parameters
These are the parameters you can configure in Pipeline Builder:
| Parameter | Type | Default | Description |
|---|---|---|---|
| url | str | Base URL of the TEI reranking service (for example, "https://api.example.com"). | |
| top_k | int | 10 | Maximum number of top documents to return. |
| raw_scores | bool | False | If True, include raw relevance scores in the API payload. |
| timeout | Optional[int] | 30 | Request timeout in seconds. |
| max_retries | int | 3 | Maximum number of retry attempts for failed requests. |
| retry_status_codes | Optional[List[int]] | None | List of HTTP status codes that will trigger a retry. When None, HTTP 408, 418, 429 and 503 will be retried (default: None). |
| token | Optional[Secret] | Secret.from_env_var(['HF_API_TOKEN', 'HF_TOKEN'], strict=False) | The Hugging Face token to use as HTTP bearer authorization. Not always required depending on your TEI server configuration. Check your HF token in your account settings. |
Run Method Parameters
These are the parameters you can configure for the component's run() method. This means you can pass these parameters at query time through the API, in Playground, or when running a job. For details, see Modify Pipeline Parameters at Query Time.
| Parameter | Type | Default | Description |
|---|---|---|---|
| query | str | The user query string to guide reranking. | |
| documents | List[Document] | List of Document objects to rerank. | |
| top_k | Optional[int] | None | Optional override for the maximum number of documents to return. |
| truncation_direction | Optional[TruncationDirection] | None | If set, enables text truncation in the specified direction. |
Was this page helpful?