Converters
Converters convert data from one format into another. They're most often used in indexing pipelines.
There are several converters available for different file formats:
- AzureOCRDocumentConverter: Converts files of different types using Azure's Document Intelligence service.
- CSVToDocument: Converts CSV files to documents.
- DeepsetDeepLTextTranslator: Uses DeepL's translation services to translate documents into a language you specify.
- DocxToDocument: Converts DOCX files.
- HTMLToDocument Converter: Converts HTML files. You can also convert LinkContentFetcher's output into a proper Document object.
- JSONConverter: Converts JSON files to documents.
- MarkdownToDocument Converter: Converts Markdown files.
- OutputAdapter: Converts a component's output so that it fits as input of another component.
- PPTXToDocument: Converts PPTX files.
- PyPDFToDocument: Converts PDF files.
- TextFileToDocument: Converts TXT files.
- UnstructuredFileConverter: Converts files to documents using Unstructured services.
- XLSXToDocument: Converts XLSX files to documents.
Updated 28 days ago