Multimodal Systems
Multimodal systems can process, understand, and generate information across multiple data types, such as text, images, audio, and video. Learn what's possible in deepset AI Platform.
You can build systems that combine different data types and formats, from simple ones that transcribe speech to text or generate image captions to more complex ones that process and analyze videos. Such systems have a variety of applications: they can give your AI assistants new capabilities and also help people with disabilities.