FilesInput

The first component in your indexing pipeline.

Basic Information

  • Pipeline type: Indexing
  • Type: Doesn't apply
  • Components it can connect with:
    • FilesInput is the first component in an indexing pipeline.
    • FileTypeRouter: FilesInput can send sources to FileTypeRouter, which then routes them by file type to appropriate Converters. This is useful if you have files of different types.
    • Converters: FilesInput can send sources to Converters for further processing.

Inputs

FilesInput doesn't need you to provide any inputs. It automatically maps to your files.

Outputs

NameTypeDescription
sourcesLIst of strings, paths or ByteStream objectsThe files to process.

Overview

FilesInput represents the files that your indexing pipeline will process. It must always be the first component in the indexing pipeline.

Usage Example

In Builder, drag FilesInput on the canvas and connect it to a Converter or FileTypeRouter, as in this example:

The FilesInput component connected to FileTypeRouter in pipeline builder

In YAML, you simply list files among pipeline's inputs and specify the components' inputs that should receive them:

inputs:
  files:
    - file_classifier.sources # Here you list the components that receive files (sources)

Init Parameters

This component doesn't take any initialization parameters.